Iterative Relative Fuzzy Connectedness for Multiple Objects with Multiple Seeds

Krzysztof Chris Ciesielski; Jayaram K Udupa; Punam K Saha; Ying Zhuge

doi:10.1016/j.cviu.2006.10.005

Author manuscript; available in PMC: 2008 Sep 1.

Published in final edited form as: Comput Vis Image Underst. 2007 Sep;107(3):160–182. doi: 10.1016/j.cviu.2006.10.005

PMCID: PMC2442428 NIHMSID: NIHMS28684 PMID: 18769655

Abstract

In this paper we present a new theory and an algorithm for image segmentation based on a strength of connectedness between every pair of image elements. The object definition used in the segmentation algorithm utilizes the notion of iterative relative fuzzy connectedness, IRFC. In previously published research, the IRFC theory was developed only for the case when the segmentation involved just two segments, an object and a background, and each of the segments was indicated by a single seed. (See Udupa, Saha, Lotufo [15] and Saha, Udupa [14].) Our theory, which solves a problem of Udupa and Saha from [13], allows simultaneous segmentation involving an arbitrary number of objects. Moreover, each segment can be indicated by more than one seed, which is often more natural and easier than identifying an object by a single seed.

The first iteration step of the IRFC algorithm gives a segmentation known as relative fuzzy connectedness, RFC, segmentation. Thus, the IRFC technique is an extension of the RFC method. Although the RFC theory, due to Saha and Udupa [19], is developed in the multi object/multi seed framework, the theoretical results presented here are considerably more delicate in nature and do not use the results from [19]. On the other hand, the theoretical results from [19] are immediate consequences of the results presented here. Moreover, the new framework not only subsumes previous fuzzy connectedness descriptions but also sheds new light on them. Thus, there are fundamental theoretical advances made in this paper.

We present examples of segmentations obtained via our IRFC based algorithm in the multi object/multi seed environment, and compare them with the results obtained with the RFC based algorithm. Our results indicate that, in many situations, IRFC outperforms RFC, but there also exist instances where the gain in performance is negligible.

Keywords: image segmentation, path strength, path connectedness, fuzzy connectedness

1 Introduction

Image segmentation, the process of partitioning (in a hard or fuzzy manner) the image domain into meaningful object regions, is perhaps the most challenging and critical problem in image processing and analysis. Research in this area will probably continue indefinitely, because the solution space is infinite dimensional and because any single solution framework is unlikely to produce an optimal solution (in the sense of the best possible precision, accuracy, and efficiency) for a given application domain. It is important to distinguish between two types of activities in segmentation research: the first relates to the development of application domain-independent general solution frameworks, and the second pertains to the construction of domain-specific solutions starting from a known general solution framework. The latter is not a trivial task most of the time. Both these activities are crucial, the former for advancing the theoretical aspects of, and shedding new light on, segmentation research, and the latter for bringing the theoretical advances to actual practice. The topic of this paper pertains to the former.

General segmentation frameworks [1]-[12] may be broadly classified into three groups: boundary-based [1]-[5], region-based [6]-[10], and hybrid [11,12]. As the nomenclature indicates, in the first two groups the focus is on recognizing and delineating the boundary or the region occupied by the object in the image. In the third group, the focus is on exploiting the complementary strengths of each of boundary-based and region-based strategies to overcome their individual shortcomings. The segmentation framework discussed in the present paper belongs to the region-based group and constitutes an extension of the fuzzy connectedness (abbreviated from now on as FC) methodology [9].

In the FC framework [9], a fuzzy topological construct, called fuzzy connectedness, characterizes how the spatial elements (abbreviated as spels) of an image hang together to form an object. This construct is arrived at roughly as follows. A fuzzy relation called affinity is defined on the image domain; the strength of affinity between any two spels depends on how close the spels are spatially and how similar their intensity-based properties are in the image. Affinity is intended to be a local relation. A global fuzzy relation called fuzzy connectedness is induced on the image domain by affinity as follows. For any two spels c and d in the image domain, all possible paths connecting c and d are considered. Each path is assigned a strength of fuzzy connectedness which is simply the minimum of the affinities of consecutive spels along the path. The level of fuzzy connectedness between c and d is considered to be the maximum of the strengths of all paths between c and d. For segmentation purposes, FC is utilized in several ways as described below. See [13] for a review of the different FC definitions and how they are employed in segmentation and applications.

In absolute FC (abbreviated AFC) [9], the support of a segmented object is considered to be the maximal set of spels, containing one or more seed spels, within which the level of FC is at or above a specific threshold. To obviate the need for a threshold, relative FC (or RFC) [19] was developed by letting all objects in the image compete simultaneously via FC to claim membership of spels in their sets. Each co-object is identified by one or more seed spels. Any spel c in the image domain is claimed by that co-object with respect to whose seed spels c has the largest level of FC compared to the level of FC with the seed sets of all other objects. To avoid treating the core aspects of an object (that are very strongly connected to its seeds) and the peripheral subtle aspects (that may be less strongly connected to the seeds) on the same footing, an iterative refinement strategy is devised in iterative RFC (or IRFC) [14]-[16]. The superior performance of IRFC over RFC and the underlying reasons are illustrated in Figures 11 and 9(e-f). Another advantage of IRFC is that the objects it generates are topologically nicer than those generated by RFC or AFC: any IRFC object generated by a single seed has no “holes” (i.e., is simply connected), unless a “hole” contains a seed of another object. This feature is illustrated in Figure 1.

(a) A hand drawn scene with four iso-intensity objects and a dark background. (b) Object labeling in the true scene. (c)-(e) Three phantom scenes generated from (a) at different levels of blur, noise, and inhomogeneity. (f)-(h) Multi-object segmentations of (c)-(e), respectively, by using RFC. (i)-(k) Segmentation of (c)-(e) by using IRFC.

(a) A slice display of the separation of cervical vertebrae by applying RFC for the slice shown in Figure 8(a). White spels are not assigned to any specific vertebra. (b) Color surface rendition of the three vertebrae segmented by RFC. (c)-(d) Same as (a)-(b), respectively, but by using IRFC. (e) Color surface rendition of arterial (red) and venous (blue) trees segmented by RFC. (f) Same as (e) but by using IRFC.

(a) Original image, with seeds s and t indicating the object and the background, respectively. (b) The foreground object (in white) generated by RFC. (c) The foreground object (in white) generated by IRFC. (We used the same homogeneity based affinity in both cases.)

In general, IRFC leads to better object definition than RFC with a theoretical construct similar to that of RFC. The proper design of affinity is crucial to the effectiveness of the segmentations that ensue, no matter what type of FC is used. In scale-based FC [13], which is applicable to all of AFC, RFC, and IRFC, affinity is defined not based just on the properties of the two spels under question but also on the properties of all spels in the local scale region around the two spels. In vectorial FC [27], affinity is constructed in a vectorial manner, allowing spels to assume not just scalar values but any vectorial values, which may come from the original acquisition of the image owing to multiple image properties at every spel or that may arise from vector-valued features estimated from the given scalar or vectorial image. By using S and V to abbreviate “scale-based” and “vectorial,” and by allowing a combination of these indexes with different types of FC referred to above, we may describe the FC family that is developed to date by methods denoted by AFC, SAFC, VAFC, VSAFC, RFC, SRFC, VRFC, VSRFC, IRFC, SIRFC, VIRFC, and VSIRFC. See [13] and the original articles cited therein for further details on each member of this family.

In the present paper, we make two sets of fundamental contributions. (1) The original IRFC was devised, due to theoretical challenges, in a 2-object (foreground-background) scenario. We now overcome this theoretical challenge and generalize its theory to multiple objects. (2) In this process of generalization, several most fundamental properties of AFC, RFC, and IRFC have been uncovered. They allow us to better understand the behavior of the FC process in general, and IRFC in particular, and give us a single unified theoretical framework within which all members of FC family methods can be described elegantly. This may lead us to more effective segmentation strategies in the future. These fundamental theoretical advances are described in Section 2. For ease of reading, most long proofs are pooled together in Section 3, so that skipping this section will not affect the understandability of the new results presented in the paper. The new algorithm is described in Section 4. Some examples and comparison with RFC are presented in Section 5 to demonstrate the behavior of the multi-object strategy of generalized IRFC. Our concluding remarks are stated in Section 6.

2 Theory

In this section we present the theoretical framework of generalized IRFC. The terminology and notation employed in this paper follow in spirit that of previously published FC papers. However, we slightly deviate from the previous notation in several aspects, and we believe that the new approach is more precise and elegant.

2.1 Basic definitions and notation

The most fundamental notion in our theory is that of the strength of connectedness between a pair of image elements. In its definition, we will use a notation that is only a slight modification of that used by Udupa and Samarasekera [9].

In this paper we will use the following interpretation of the notions of (hard) functions and relations, which is standard in set theory and is used in many calculus books. A binary relation R from a set X into a set Y is identified with its graph; that is, R is equal to {〈x, y〉 ∈ X × Y : xRy holds}. Since a function f : X → Y is a (special) binary relation from X to Y, in particular we have f = {〈x, f(x)〉 : x ∈ X}. With this interpretation, handling fuzzy sets and fuzzy relations becomes quite natural and less cumbersome than usual. In particular, let $\mathcal{Z}$ be a fuzzy subset of a hard set X with a membership function $μ_{\mathcal{Z}} : X \to [0, 1]$. For x ∈ X, we interpret $μ_{\mathcal{Z}} (x)$ as the degree to which x belongs to $\mathcal{Z}$. Usually such a fuzzy set $\mathcal{Z}$ is defined [17] as $\{〈 x, μ_{\mathcal{Z}} (x) 〉 : x \in X\}$, which is the graph of $μ_{\mathcal{Z}}$. Thus, according to our interpretation, $\mathcal{Z}$ is actually equal to $μ_{\mathcal{Z}}$. Note that this interpretation also fits quite well the situation when $\mathcal{Z}$ is a hard subset Z of X, as then $\mathcal{Z} = μ_{\mathcal{Z}}$ is equal to the characteristic function χ_Z (defined as χ_Z(x) = 1 for x ∈ Z and χ_Z(x) = 0 for x ∈ X \ Z), and the identification of Z with χ_Z is quite common in analysis and set theory. Notice also that a fuzzy binary relation ρ from X to Y is just a fuzzy subset of X × Y, so it is equal to its membership function μ_ρ : X × Y → [0, 1].

Let n ≥ 2. A binary fuzzy relation α on $\mathbb{Z}^{n}$ is said to be a fuzzy adjacency binary relation if α = μ_α is symmetric (i.e., μ_α(c, d) = μ_α(d, c)) and reflexive (i.e., μ_α(c, c) = 1). The value of μ_α(c, d) depends only on the relative spatial position of c and d. Usually μ_α(c, d) is decreasing with respect to the distance ||c - d||. In most applications, α is just a hard relation, like the 4-adjacency relation for n = 2 or 6-adjacency in the three-dimensional case. By an n-dimensional fuzzy digital space we will understand a pair $〈 \mathbb{Z}^{n}, α 〉$. The elements of a digital space are called spels (also pixels for n = 2 and voxels for n = 3). A scene over a fuzzy digital space $〈 \mathbb{Z}^{n}, α 〉$ is a pair $\mathcal{C} = 〈 C, f 〉$, where $C = \prod_{j = 1}^{n} [- b_{j}, b_{j}] \subset \mathbb{Z}^{n}$, each b_j > 0 being an integer, and $f : C \to \mathbb{R}$ is a scene intensity function. In this paper, the symbols $\mathcal{C}$ and C will always stand for a scene and its domain, respectively, as defined above.

The most fundamental measure of local “hanging togetherness” of any pair of spels is an affinity relation κ. It is a fuzzy binary relation defined on C; that is, μ_κ: C × C → [0, 1]. Affinity relation κ is defined to be symmetric and reflexive. The value of μ_κ(c, d) depends not only on the adjacency strength μ_α(c, d), but also on the intensity function f. There are many methods of finding the affinity relation for a given scene. (See the survey paper [13].) In this paper, we will always assume that an appropriate affinity has already been specified for the segmentation task at hand.

A translation of the local strength of connectedness given by κ into the global strength of connectedness is done with the help of the notion of a path and its strength. A path in A ⊆ C is any sequence p = 〈c₁, . . . , c_l〉, where l > 0 and c_i ∈ A for every i = 1, . . . , l. The family of all paths in A is denoted by $P^{A}$ . If c, d ∈ A, then the family of all paths 〈c₁, . . . , c_l〉 in A from c to d (i.e., such that c₁ = c and c_l = d) is denoted by $P_{c, d}^{A}$ . The strength μ(p) of a path $p = 〈 c_{1}, \dots, c_{l} 〉 \in P^{C}$ is defined as the strength of its κ-weakest link; that is,

μ (p) = \min {μ_{κ} (c_{i - 1}, c_{i}) : 1 < i \leq l},

(1)

when l > 1, and μ(p) = 1 for l = 1. For c, d ∈ A ⊆ C the fuzzy κ-connectedness strength in A between c and d is defined as the strength of a strongest path in A between c and d; that is,

μ^{A} (c, d) = \max {μ (p) : p \in P_{c, d}^{A}} .

(2)

If κ is a hard binary relation, κ: C × C → {0, 1}, then the relation μ^A is known as a transitive closure of κ ∩ (A × A). Note that

μ^{A} (c, d) \leq μ^{B} (c, d) for every c, d \in A \subseteq B \subseteq C .

(3)

Notice also that μ^A(c, d) ≥ μ_κ(c, d). A path $p \in P_{c, d}^{A}$ with μ(p) = μ^A(c, d) is referred to as a strongest path (in A) from c to d.
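Formulas (1) and (2) can be made concrete with a small sketch. The Python code below is an illustration only, not the authors' implementation. It assumes that spels are integers, that the affinity κ is stored in a hypothetical nested dictionary `kappa[c][d]` (pairs that are not listed are treated as having affinity 0), and that μ^A(·, s) is computed by a Dijkstra-style max-min propagation, which is one standard way of solving such "widest path" problems. The helper names `make_affinity`, `affinity`, `path_strength`, and `connectedness` are introduced here for illustration.

```python
import heapq

def make_affinity(edges):
    """Build a symmetric nested dict kappa[c][d] from {(c, d): value} (hypothetical helper)."""
    kappa = {}
    for (c, d), k in edges.items():
        kappa.setdefault(c, {})[d] = k
        kappa.setdefault(d, {})[c] = k
    return kappa

def affinity(kappa, a, b):
    """mu_kappa(a, b): reflexive, and 0 for pairs that are not listed in kappa."""
    return 1.0 if a == b else kappa.get(a, {}).get(b, 0.0)

def path_strength(kappa, path):
    """Formula (1): strength of the weakest link; a one-spel path has strength 1."""
    return min((affinity(kappa, a, b) for a, b in zip(path, path[1:])), default=1.0)

def connectedness(kappa, A, s):
    """Formula (2): return {c: mu^A(c, s)} for every c in A, assuming s is in A."""
    best = {c: 0.0 for c in A}
    best[s] = 1.0
    heap = [(-1.0, s)]                      # max-heap emulated with negated strengths
    while heap:
        neg, c = heapq.heappop(heap)
        if -neg < best[c]:
            continue                        # stale queue entry
        for d, k in kappa.get(c, {}).items():
            cand = min(-neg, k)             # strength of the path extended by the link (c, d)
            if d in A and cand > best[d]:
                best[d] = cand
                heapq.heappush(heap, (-cand, d))
    return best

# Toy scene with four spels: two routes from 0 to 2, one through 1 and one through 3.
kappa = make_affinity({(0, 1): 0.9, (1, 2): 0.3, (0, 3): 0.6, (3, 2): 0.7})
in_C = connectedness(kappa, {0, 1, 2, 3}, 0)   # paths anywhere in C
in_A = connectedness(kappa, {0, 1, 2}, 0)      # paths restricted to A = {0, 1, 2}
```

In this toy scene the strongest path from 0 to 2 in C runs through spel 3, so restricting paths to A = {0, 1, 2} lowers the strength of connectedness between 0 and 2, in agreement with inequality (3).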

It is easy to see that, for every c, d ∈ A ⊆ C and paths $p, q \in P^{A}$ , we have

(i) μ(〈c, d〉) = μ_κ(c, d) and μ(p)≤ μ(q) if p is either an initial or a terminal extension of q; and

(ii) μ^A is reflexive and symmetric on A.

It is also not difficult to see (and it follows easily from Proposition 2.1 below) that

(iii) μ^A is transitive on A; that is, μ^A (c, d) ≥ min{μ^A (c, x), μ^A(x, d)} for every c, d, x ∈ A.

A very interesting fact is that if μ^A is defined from μ via formula (2) and the properties (i)-(iii) hold, then one might as well assume that μ is defined by formula (1), since under these conditions, independent of the actual definition of μ(p), we still have $μ^{A} (c, d) = \max_{〈 c_{1}, \dots, c_{l} 〉 \in P_{c, d}^{A}} \min_{1 < i \leq l} μ_{κ} (c_{i - 1}, c_{i})$ . This was proved by Saha and Udupa in [18].

For paths $p = 〈 c_{1}, \dots, c_{l} 〉 \in P^{A}$ and $q = 〈 d_{1}, \dots, d_{n} 〉 \in P^{A}$ , we will use the symbol p + q to denote the path $〈 c_{1}, \dots, c_{l}, d_{1}, \dots, d_{n} 〉 \in P^{A}$ . We will use this symbol only when c_l = d₁. Notice that in such a situation, by the definition in (1), we have μ(p + q) = min{μ(p), μ(q)}, as μ_κ(c_l, d₁) = 1 by the reflexivity of κ.

The following result is a slight refinement of [15, Prop. 2.3].

Proposition 2.1

For any spels a, b, c ∈ A ⊆ C,

μ^{A} (a, b) > μ^{A} (b, c) \Rightarrow μ^{A} (a, c) = μ^{A} (b, c) .

(4)

Proof

If p_ab and p_bc are strongest paths (in A) from a to b and from b to c, respectively, then the path p_ab + p_bc justifies μ^A(a, c) ≥ μ^A(b, c), as μ^A(a, c) ≥ μ(p_ab + p_bc) = min{μ(p_ab), μ(p_bc)} = min{μ^A(a, b), μ^A(b, c)} = μ^A(b, c). If we had μ^A(a, c) > μ^A(b, c), with p_ca being a strongest path from c to a, then p_ca + p_ab would be a path in A from c to b with μ(p_ca + p_ab) = min{μ(p_ca), μ(p_ab)} = min{μ^A(a, c), μ^A(a, b)} > μ^A(b, c), which is impossible, since no path from c to b can be stronger than μ^A(c, b) = μ^A(b, c).

2.2 Fuzzy connected objects: absolute and relative

By a segmentation of a scene $\mathcal{C} = 〈 C, f 〉$ we will understand any family {P₁, . . . , P_m} of pairwise disjoint hard subsets of C. Although this is a departure from the terminology used in the previous papers on fuzzy connectedness, the change is only superficial. This is the case since the algorithms from all previous papers were also designed to create hard segmentations of C, while, in the last step, each set P from the segmentation was assigned a membership function μ_P: C → [0, 1] of the form μ_P(c) = η(f(c)) · χ_P(c), where η is a function (like a Gaussian) that maps the image intensity function into objectness values. Although this last step could also be done in the case of our segmentation, we will confine ourselves to the step of hard segmentation only since, from the viewpoint of the new theory and algorithms, this is what matters.

To translate the notion of a path strength into an actual segmentation of a given scene $\mathcal{C} = 〈 C, f 〉$ , one must indicate each object with one or more seeds. So, assume that we have a nonempty set S ⊂ C of seeds such that each seed represents a different object. (The case of multiple seeds per object will be discussed later.)

The simplest way to define a segmentation of a scene $\mathcal{C}$ is to choose a threshold θ ∈ (0, 1] and for each seed s ∈ S define an object in $\mathcal{C}$ associated with s as

P_{s θ} = {c \in C : μ^{C} (c, s) \geq θ} .

These objects were first studied by Udupa and Samarasekera in [9]. It is easy to see that s ∈ P_sθ for every s ∈ S. Also, for s, t ∈ S, if θ > μ^C(s, t), then P_sθ and P_tθ are disjoint; on the other hand, if θ ≤ μ^C(s, t), then P_sθ = P_tθ. Thus, to make the objects disjoint, one must choose θ greater than every number μ^C(s, t), for all distinct s, t ∈ S. This is the underlying mechanism of AFC. This phenomenon is illustrated in Figure 2 on a CT slice of a human knee, wherein three seed spels s, t, and u are chosen, one in each of three muscle regions. Since the strength of connectedness between any two seeds is much lower than the strength of connectedness within each object (Figure 2(b)), for the individual muscle regions a threshold can be selected to specify P_sθ.
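The threshold behaviour of the AFC objects P_sθ can be checked on a toy example. The sketch below is only an illustration under stated assumptions: spels are integers on a one-dimensional chain, the affinity is a hypothetical nested dictionary `kappa[c][d]` with unlisted pairs treated as affinity 0, and `connectedness` and `afc_object` are names introduced here, not taken from the paper.

```python
import heapq

def connectedness(kappa, A, s):
    """Return {c: mu^A(c, s)} for c in A via max-min propagation (assumes s is in A)."""
    best = {c: 0.0 for c in A}
    best[s] = 1.0
    heap = [(-1.0, s)]
    while heap:
        neg, c = heapq.heappop(heap)
        if -neg < best[c]:
            continue                        # stale queue entry
        for d, k in kappa.get(c, {}).items():
            cand = min(-neg, k)
            if d in A and cand > best[d]:
                best[d] = cand
                heapq.heappush(heap, (-cand, d))
    return best

def afc_object(kappa, C, s, theta):
    """P_{s,theta} = {c in C : mu^C(c, s) >= theta}."""
    mu = connectedness(kappa, C, s)
    return {c for c in C if mu[c] >= theta}

# Chain 0-1-2-3-4-5 with one weak link between spels 2 and 3.
kappa = {0: {1: 0.9}, 1: {0: 0.9, 2: 0.8}, 2: {1: 0.8, 3: 0.2},
         3: {2: 0.2, 4: 0.9}, 4: {3: 0.9, 5: 0.85}, 5: {4: 0.85}}
C = set(range(6))
s, t = 0, 5
mu_st = connectedness(kappa, C, s)[t]            # strength of connectedness between the seeds

high = (afc_object(kappa, C, s, 0.5), afc_object(kappa, C, t, 0.5))      # theta > mu^C(s, t)
low = (afc_object(kappa, C, s, mu_st), afc_object(kappa, C, t, mu_st))   # theta <= mu^C(s, t)
```

With θ = 0.5, which exceeds μ^C(s, t) = 0.2, the two objects are disjoint; with θ = 0.2 they coincide, as stated above.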

Illustration of AFC segmentation of the muscles of a knee. (a) A CT slice of a human knee. (b) Each pixel has a strength of connectedness with respect to each seed, u, s, and t, chosen within muscle regions. The largest of these strengths is shown as a scene.

A considerably more powerful segmentation tool is that of RFC. For any s ∈ C and T ⊂ C, define

P_{s T} = {c \in C : μ^{C} (c, s) > μ^{C} (c, t) for every t \in T \ {s}} .

Then, the segmentation generated by seeds S ⊂ C is defined as {P_sS: s ∈ S}.

It is easy to see that the objects {P_sS: s ∈ S} are pairwise disjoint. In addition, s ∈ P_sS as long as there is no t ∈ S, t ≠ s, with μ^C(s, t) = 1; if there is such a t, then P_sS is empty. Note also, that if θ > max{μ^C(s, t): s, t ∈ S, s ≠ t} (so that the sets {P_sθ: s ∈ S} are pairwise disjoint), then P_sθ ⊂ P_sS for every s ∈ S. Thus, the RFC method of segmentation is indeed more refined than the AFC method. Again, by using the example in Figure 2, we demonstrate in Figure 3 the results P_sS of RFC. Note that these segmented regions are generally larger than those in Figure 2. Note also that the spels that are not in the muscle regions all have the same strength of connectedness with respect to at least two objects.
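The RFC objects P_sS and the unclaimed spels can be illustrated with a minimal sketch. It is not the authors' implementation: the integer spel labels, the hypothetical nested dictionary `kappa[c][d]` (unlisted pairs have affinity 0), and the function names `connectedness` and `rfc_object` are assumptions made for this example.

```python
import heapq

def connectedness(kappa, A, s):
    """Return {c: mu^A(c, s)} for c in A via max-min propagation (assumes s is in A)."""
    best = {c: 0.0 for c in A}
    best[s] = 1.0
    heap = [(-1.0, s)]
    while heap:
        neg, c = heapq.heappop(heap)
        if -neg < best[c]:
            continue                        # stale queue entry
        for d, k in kappa.get(c, {}).items():
            cand = min(-neg, k)
            if d in A and cand > best[d]:
                best[d] = cand
                heapq.heappush(heap, (-cand, d))
    return best

def rfc_object(kappa, C, s, T):
    """P_{sT} = {c in C : mu^C(c, s) > mu^C(c, t) for every t in T \\ {s}}."""
    own = connectedness(kappa, C, s)
    rivals = [connectedness(kappa, C, t) for t in T if t != s]
    return {c for c in C if all(own[c] > r[c] for r in rivals)}

# Chain 0-1-2-3-4: two strong end regions joined through spel 2 by equally weak links.
kappa = {0: {1: 0.9}, 1: {0: 0.9, 2: 0.5}, 2: {1: 0.5, 3: 0.5},
         3: {2: 0.5, 4: 0.9}, 4: {3: 0.9}}
C = set(range(5))
S = {0, 4}
objects = {s: rfc_object(kappa, C, s, S) for s in S}
unclaimed = C - set().union(*objects.values())
```

Here spel 2 is connected to both seeds with the same strength 0.5, so neither object claims it; this is the kind of tie described in the last sentence above.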

RFC segmentation of the knee muscles from Figure 2, where the same seed points were used as in the AFC segmentation shown in Figure 2(b).

One of the important properties of the above described methods of segmentation (AFC and RFC) is known as robustness. This property states that the segmentation does not change if different seeds are chosen within the same objects, which, for the practice of these segmentation methods, is a very desirable property to have. The following result, due to Saha and Udupa [19], is the precise statement of this property in case of RFC segmentation. (This result follows also from our Corollary 2.7.)

Proposition 2.2 (Robustness)

Let S = {s₁, . . . , s_m} ⊂ C and for every i ∈ {1, . . . , m} let t_i ∈ P_siS. If T = {t₁, . . . , t_m}, then P_tiT = P_siS for every i ∈ {1, . . . , m}.

The objects P_sS are often referred to as connected components. The following fact justifies the word connected in this term. Moreover, this fact will be used in what follows as a motivational tool and in the actual proofs.

Fact 2.3

If p = 〈c₁, . . . , c_l〉 is a strongest path from c ∈ P_sS to an s ∈ S, then c_i ∈ P_sS for every i ∈ {1, . . . , l}; that is, $p \in P^{P_{s S}}$ .

Proof

Fix an i ∈ {1, . . . , l} and a t ∈ S\ {s}. Since c ∈ P_sS, we know that μ^C(c, s) > μ^C(c, t). We need to show that μ^C(c_i, s) > μ^C(c_i, t). But, by (4), we have μ^C(s, t) = μ^C(c, t). Since also

μ^{C} (c_{i}, s) \geq μ (〈 c_{i}, \dots, c_{l} 〉) \geq μ (p) = μ^{C} (c, s) > μ^{C} (c, t) = μ^{C} (s, t),

by (4) we have μ^C(c_i, s) > μ^C(s, t) = μ^C(c_i, t).

It is sometimes difficult to pinpoint a single seed in a desired object, and often it is convenient, or becomes necessary, to choose multiple seeds for each object under consideration. So, let $\mathcal{S}$ be a family of nonempty pairwise disjoint sets of seeds. For each $S \in \mathcal{S}$ , we would like to find an object $P_{S \mathcal{S}}$ containing S in a way similar to that described above. To define $P_{S \mathcal{S}}$ , it is convenient to have the following notation for every c ∈ A ⊂ C and D ⊂ A:

μ^{A} (c, D) = \max_{d \in D} μ^{A} (c, d) .

(Note that μ^A(c, ∅) = -∞, as max ∅ = -∞ according to a convention that, for a finite $Z \subset \mathbb{R}$ , max Z is the smallest b ∈ [-∞, ∞] for which z ≤ b for every z ∈ Z.) We define

\begin{matrix} P_{S \mathcal{S}} = & {c \in C : μ^{C} (c, S) > μ^{C} (c, T) for every T \in \mathcal{S} \ {S}} \\ = & {c \in C : \max_{s \in S} μ^{C} (c, s) > μ^{C} (c, t) for every t \in W}, \end{matrix}

where $W = ⋃ (\mathcal{S} \ {S})$ . Although this multi seed approach is useful in practice, it is worth noting that this theory is quite close to, and readily ensues from, the single seed theory, as each $P_{S \mathcal{S}}$ can be easily expressed in terms of objects generated by singleton seeds:

P_{S \mathcal{S}} = ⋃_{s \in S} P_{s W},

(5)

since $P_{S \mathcal{S}} = ⋃_{s \in S} {c \in C : μ^{C} (c, s) > μ^{C} (c, t) for every t \in W} = ⋃_{s \in S} P_{s W}$ .
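Identity (5) can be checked numerically on a small example. The following sketch is an illustration under assumptions made here: spels are integers on a chain, the affinity is a hypothetical nested dictionary `kappa[c][d]` with unlisted pairs treated as affinity 0, and `strength_to_seeds`, `rfc_multi`, and `rfc_single` are illustrative names. The multi seed object is computed once directly from its definition and once as the union of the single seed objects P_sW.

```python
import heapq

def strength_to_seeds(kappa, A, seeds):
    """Return {c: mu^A(c, D)} with D = seeds, i.e. the max over d in D of mu^A(c, d)."""
    sources = [d for d in seeds if d in A]
    # max over an empty seed set is -infinity, following the convention in the text
    best = {c: (0.0 if sources else float("-inf")) for c in A}
    heap = []
    for d in sources:
        best[d] = 1.0
        heap.append((-1.0, d))
    heapq.heapify(heap)
    while heap:
        neg, c = heapq.heappop(heap)
        if -neg < best[c]:
            continue                        # stale queue entry
        for e, k in kappa.get(c, {}).items():
            cand = min(-neg, k)
            if e in A and cand > best[e]:
                best[e] = cand
                heapq.heappush(heap, (-cand, e))
    return best

def rfc_multi(kappa, C, S, family):
    """Multi seed RFC object of S, computed directly from its definition."""
    own = strength_to_seeds(kappa, C, S)
    rivals = [strength_to_seeds(kappa, C, T) for T in family if T != S]
    return {c for c in C if all(own[c] > r[c] for r in rivals)}

def rfc_single(kappa, C, s, W):
    """Single seed object P_{sW} for a seed s that is not in W."""
    own = strength_to_seeds(kappa, C, {s})
    rivals = [strength_to_seeds(kappa, C, {t}) for t in W]
    return {c for c in C if all(own[c] > r[c] for r in rivals)}

# Chain 0-1-...-6; object A is marked by two seeds, object B by one.
links = [0.9, 0.4, 0.8, 0.3, 0.9, 0.7]
kappa = {}
for i, k in enumerate(links):
    kappa.setdefault(i, {})[i + 1] = k
    kappa.setdefault(i + 1, {})[i] = k
C = set(range(7))
A, B = frozenset({0, 2}), frozenset({5})
family = [A, B]

direct = {S: rfc_multi(kappa, C, S, family) for S in family}
via_union = {}
for S in family:
    W = set().union(*(T for T in family if T != S))
    via_union[S] = set().union(*(rfc_single(kappa, C, s, W) for s in S))
```

Both computations give the same two objects on this example, which is what (5) predicts.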

2.3 Iterative Relative Fuzzy Connectedness: motivation, definition, and properties

The RFC segmentation {P_sS : s ∈ S} of a scene can still leave quite a sizable “boundary” set B = C \ ⋃_{s∈S} P_sS; that is, the set of all spels c outside any of the objects P_sS, at which the largest strength of connectedness is attained with respect to more than one seed. An example is provided in Figure 4 to illustrate this concept of “boundary” spels left unclaimed. The goal of what follows is to find a way to naturally redistribute some of the spels from B among the object regions in a new generation (iteration) of segmentation. Another motivation for IRFC, also explained in Figure 4, is to overcome the problem of “path strength dilution within the same object,” which affects paths that reach the peripheral, subtle, and thin aspects of the object.

Illustration of the phenomenon of “path strength dilution within the same object.” The strongest paths from s₁ to t₁, s₁ to t₂, s₂ to t₁, and s₂ to t₂ are likely to have the same strength because of partial volume effects.

In Figure 4, two object regions A and B, each with its core and peripheral subtle parts, are shown. Owing to blur and other artifacts introduced into the scene by the imaging device due to partial volume effect and other shortcomings, the strongest paths from s₁ to t₁, s₁ to t₂, s₂ to t₁, and s₂ to t₂ are all likely to assume similar strengths. As a consequence, the spels in the dark areas may fall in B, the unclaimed “boundary” set.

A basic idea behind the definition of relative fuzzy connected objects P_sS, s ∈ S, is that each seed s ∈ S competes for each spel: a spel c goes to the object P_sS provided c is connected to s in a stronger way than to any other seed t ∈ S. Here the strength of connectedness between c and d is expressed by a number μ^C(c, d), the strength of a strongest path (in C) between c and d. Thus, the fact that a spel c belongs to P_sS means that

c is connected to s within the object P_sS with a strength μ^C(c, s) and any appropriate path between c and t ∈ S\ {s} is weaker than μ^C(c, s).

Although the clause “within the object P_sS” may not seem obvious from the definition of P_sS, it is justified both by intuition and by Fact 2.3. It is also not clear what we have in mind by an “appropriate path,” but at this stage it does not matter, since the strength inequality holds for any path between c and t ∈ S\ {s}.

The importance of the clause “appropriate path” comes to light when we examine the spels c from the “boundary” set B = C \ ⋃_{s∈S} P_sS. If we would like to refine our definition and to extend each object P_sS, s ∈ S, to a possibly larger object $P_{s S}^{'}$ , what would be the “appropriate” paths between c ∈ B and s ∈ S that we should consider? Since the “strongest path” justifying $c \in P_{t S}^{'}$ , for t ∈ S, should be contained in $P_{t S}^{'} \subset B \cup P_{t S}$ , it seems that we should restrict our attention to the paths between c and t only from B ∪ P_tS. Thus, to obtain a definition of $P_{s S}^{'}$ , we should modify the definition of P_sS by replacing each number μ^C(c, t), t ∈ S, with $μ^{B \cup P_{t S}} (c, t)$ . This leads to

P_{s S}^{'} = P_{s S} \cup {c \in B : μ^{B \cup P_{s S}} (c, s) > μ^{B \cup P_{t S}} (c, t) for every t \in S \ {s}} .

Although this definition could be used as the engine for the iteration described below, it turns out that it will be more convenient to use its equivalent form:

P_{s S}^{+} = P_{s S} \cup {c \in C \ P_{s S} : μ^{C} (c, s) > μ^{C \ P_{s S}} (c, t) for every t \in S \ {s}} .

The equality $P_{s S}^{'} = P_{s S}^{+}$ follows from Theorem 3.7.

Figure 5 illustrates these ideas pictorially. The initial segmentation is defined by RFC conservatively, so that P_sS corresponds to the core aspects of the object identified by seed s (illustrated by the hatched area containing s in Figure 5). This leaves a large boundary set B where the strengths of connectedness with respect to the different seeds are equal (illustrated by the shaded area containing c in Figure 5). In the next iteration, the segmentation is improved incrementally by grabbing those spels of B that are connected more strongly to P_sS than to P_tS. When considering the object associated with s, the “appropriate” path from s to any c ∈ B is any path in C. However, all other objects have to compete with the object associated with s by using only paths from their respective seeds t to c that do not go through P_sS, since this set has already been declared to be part of the object of s.

Pictorial illustration of IRFC advantages over RFC.

The advantage of the formula for $P_{s S}^{+}$ over that for $P_{s S}^{'}$ comes from the fact that, unlike the case of $P_{s S}^{'}$ , we can compute $P_{s S}^{+}$ without knowing the sets P_tS for t ≠ s. This makes the implementation of the algorithm easier and more efficient. In addition, the two object IRFC theory in earlier papers discussing this subject [15] was done in the format of $P_{s S}^{+}$ , which makes our ( $P_{s S}^{+}$ based) theory its natural generalization. However, the formalism underlying $P_{s S}^{'}$ also has its advantages. First of all, it is more intuitive from the connectedness point of view. Also, the disjointness of the new generation of segments (see Theorem 2.4) is obvious in the $P_{s S}^{'}$ setting, while it requires a complicated argument in the $P_{s S}^{+}$ formalism.

Now, the iterative version of sets P_sS can be defined as follows. For each s ∈ C let $P_{s S}^{0}$ be the empty set and define iteratively sets P^j_sS by a formula $P_{s S}^{j + 1} = P_{s S}^{j} \cup Q_{s S}^{j}$ , where

Q_{s S}^{j} = {c \in C \ P_{s S}^{j} : μ^{C} (c, s) > μ^{C \ P_{s S}^{j}} (c, t) for every t \in S \ {s}} .

This definition works fine if we assume that each object is connected and is generated by a single seed. However, we would like to develop this theory also in the case when each object in the segmentation is generated by a set S of seeds, where the different resulting segments may be disconnected. So, let $\mathcal{S}$ be a nonempty family of nonempty pairwise disjoint sets of seeds. For every A ⊂ C let $P_{A \mathcal{S}}^{0} = \emptyset$ and for j = 0, 1, 2, . . . define $P_{A \mathcal{S}}^{j + 1} = P_{A \mathcal{S}}^{j} \cup Q_{A \mathcal{S}}^{j}$ , where

\begin{matrix} Q_{A \mathcal{S}}^{j} = & {c \in C \ P_{A \mathcal{S}}^{j} : μ^{C} (c, A) > μ^{C \ P_{A \mathcal{S}}^{j}} (c, T) for every T \in \mathcal{S} \ {A}} \\ = & {c \in C \ P_{A \mathcal{S}}^{j} : μ^{C} (c, A) > μ^{C \ P_{A \mathcal{S}}^{j}} (c, t) for every t \in ⋃ (\mathcal{S} \ {A})} \\ = & {c \in C \ P_{A \mathcal{S}}^{j} : μ^{C} (c, A) > μ^{C \ P_{A \mathcal{S}}^{j}} (c, ⋃ (\mathcal{S} \ {A}))} . \end{matrix}

The equality between the sets defining $Q_{A \mathcal{S}}^{j}$ follows immediately from the definition μ^A(c, D) = max_{d∈D} μ^A(c, d). (For alternative definitions of $P_{A \mathcal{S}}^{j + 1}$ see also Subsection 3.2.)

Clearly $P_{A \mathcal{S}}^{j} \subseteq P_{A \mathcal{S}}^{j + 1}$ for every j and for any A ⊂ C. Since the scene domain C is finite, the growth must stop at some stage j. In particular, there is a k for which $P_{S \mathcal{S}}^{k + 1} = P_{S \mathcal{S}}^{k}$ for all $S \in \mathcal{S}$ . We will denote such terminal iterative $P_{S \mathcal{S}}^{k}$ as $P_{S \mathcal{S}}^{I}$ . The IRFC segmentation (of $\mathcal{C}$ with respect to $\mathcal{S}$ ) is defined as ${P_{S \mathcal{S}}^{I} : S \in \mathcal{S}}$ .
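The iteration just defined can be sketched directly from the formulas. The code below is a naive illustration, not the efficient algorithm of Section 4. It assumes integer spel labels, a hypothetical nested dictionary `kappa[c][d]` for the affinity (unlisted pairs have affinity 0), and nonempty, pairwise disjoint seed sets; the names `strength_to_seeds` and `irfc` and the parameter `max_iter` are introduced here for illustration. Since each object's sequence of iterates depends only on its own previous iterate, the objects are iterated one at a time.

```python
import heapq

def strength_to_seeds(kappa, A, seeds):
    """Return {c: mu^A(c, D)} with D = seeds, using only paths that stay inside A."""
    sources = [d for d in seeds if d in A]
    best = {c: (0.0 if sources else float("-inf")) for c in A}
    heap = []
    for d in sources:
        best[d] = 1.0
        heap.append((-1.0, d))
    heapq.heapify(heap)
    while heap:
        neg, c = heapq.heappop(heap)
        if -neg < best[c]:
            continue                        # stale queue entry
        for e, k in kappa.get(c, {}).items():
            cand = min(-neg, k)
            if e in A and cand > best[e]:
                best[e] = cand
                heapq.heappush(heap, (-cand, e))
    return best

def irfc(kappa, C, family, max_iter=None):
    """Iterate P^{j+1} = P^j ∪ Q^j for each seed set; max_iter=1 yields the RFC objects."""
    family = [frozenset(S) for S in family]
    result = {}
    for A in family:
        rivals = frozenset().union(*(T for T in family if T != A))
        own = strength_to_seeds(kappa, C, A)          # mu^C(c, A): paths anywhere in C
        P, j = set(), 0                               # P^0 is the empty set
        while max_iter is None or j < max_iter:
            rest = C - P
            rival = strength_to_seeds(kappa, rest, rivals)   # rival paths must avoid P^j
            Q = {c for c in rest if own[c] > rival[c]}
            if not Q:
                break                                 # P^{j+1} = P^j: the growth has stopped
            P |= Q
            j += 1
        result[A] = P
    return result

# Spel 2 is reachable from seed 3 only through spel 1, which RFC assigns to seed 0.
kappa = {0: {1: 0.9}, 1: {0: 0.9, 2: 0.6, 3: 0.6}, 2: {1: 0.6}, 3: {1: 0.6}}
C = {0, 1, 2, 3}
seeds = [{0}, {3}]
rfc_result = irfc(kappa, C, seeds, max_iter=1)
irfc_result = irfc(kappa, C, seeds)
```

In this toy scene RFC leaves spel 2 unclaimed, because it is connected to both seeds with strength 0.6. In the second iteration the only strong path from seed 3 to spel 2 is blocked by the already claimed spel 1, so the object of seed 0 absorbs spel 2, while the object of seed 3 does not grow.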

Note that the result of the first iteration $P_{S \mathcal{S}}^{1}$ is equal to $P_{S \mathcal{S}}$ as defined by the RFC formula. Thus, $P_{S \mathcal{S}} \subseteq P_{S \mathcal{S}}^{I}$ . In particular, the iterative technique is a refinement of the RFC method. Also, for every s ∈ S ⊂ C and j, we have

P_{s S}^{j} = P_{{s} \mathcal{S}}^{j},

where $\mathcal{S} = {{s} : s \in S}$ . Thus, every family ${P_{s S}^{j} : s \in S}$ of single seed generated IRFC segmentation can be easily represented in the formalism of multi seed generated IRFC segmentations. In other words, the theory of IRFC segmentations ${P_{S \mathcal{S}}^{j} : S \in \mathcal{S}}$ contains, as special cases, the theories of RFC and IRFC segmentations generated by singleton seeds, as well as the theory of RFC in the case of multi seed generated objects.

Equation (5) shows a beautiful relation between the RFC objects, P_sS, generated by singleton seeds and their multi seed generated counterparts $P_{S \mathcal{S}}$ . Could we also prove its iterative analog? This certainly would give hope that a large part of the multi seed IRFC theory could be easily deduced from its single seed counterpart. However, the iterative analog of (5) is false, as can be seen in Example 3.14. Thus, we need to prove our results in a full multi seed setting.

The most fundamental property of any segmentation is that the objects it creates are pairwise disjoint. For IRFC segmentation, this is given by the following theorem.

Theorem 2.4

For any family $\mathcal{S}$ of subsets of a scene C, we have $P_{S \mathcal{S}}^{I} \cap P_{U \mathcal{S}}^{I} = \emptyset$ for every distinct $S, U \in \mathcal{S}$ .

Since the iteration leading to the sets $P_{S S}^{j}$ uses the formula as in $P_{s S}^{+}$ rather than as in $P_{s S}^{'}$ , the proof of Theorem 2.4 is rather complicated and it will be postponed until the next section. It is also worth noticing that, in our proof of the equation $P_{s S}^{+} = P_{s S}^{'}$ (see Theorem 3.7), we need to use Theorem 2.4 in the $P_{s S}^{+}$ formalism.

Notice that, in the formulation of Theorem 2.4, we assumed almost nothing about the family $S$ of sets of seeds. We will continue with these minimal assumptions about $S$ throughout most of the theoretical development that follows, since this does not make the proofs any more difficult. Moreover, in some cases (e.g., when we modify $S$ to form another family of seeds $T$ to compare the $S$ -segmentation with $T$ -segmentation), it saves us the trouble of checking any extra properties we could impose on the generating families of seeds. However, in practical applications, we will apply our algorithm only when the sets in $S$ are nonempty and pairwise disjoint.

Notice that allowing the empty set to be in $S$ does not change much, since $P_{\emptyset S}^{j}$ is empty for any $S$ and j. The fact that allowing overlapping sets in $S$ also changes little is more subtle. It is true that if a seed s belongs to distinct $S, T \in S$ , then s does not belong to $P_{S S}^{I}$ , or any other $P_{U S}^{I}$ . This is certainly an undesirable situation, since we would like the generating seeds S to be in the object $P_{S S}^{I}$ they generate. Unfortunately, a simple assumption that the sets in $S$ be pairwise disjoint does not solve the problem: if $S, T \in S$ are distinct and there are s ∈ S and t ∈ T with μ^C(s, t) = 1, then neither s nor t belongs to $⋃_{V \in S} P_{V S}^{I}$ . Then the question arises as to what part of S belongs to $P_{S S}^{I}$ . In Lemma 3.2, we will show that the missing seeds are precisely those from the above example: if $E_{S} = ⋃_{S \in S} {s \in S : μ^{C} (s, t) = 1 for some t \in T \in S \ {S}}$ , then $S \ E_{S} \subseteq P_{S S}^{1}$ , while $E_{S}$ is disjoint with $⋃_{V \in S} P_{V S}^{I}$ . We will also show in Proposition 3.12 that, even if $E_{S}$ is nonempty, it is possible to redistribute its elements (i.e., to find a family $T = {T_{S} \supseteq S \ E_{S} : S \in S}$ with $⋃ T = ⋃ S$ ) in such a way that $T \subseteq P_{T T}^{1}$ for every $T \in T$ . Moreover, we can ensure that $P_{S S}^{I} \subseteq P_{T_{S} T}^{I}$ for every $S \in S$ .

The second fundamental property of our segmentation method is its stability with respect to different choices of seeds initializing the segmentation process. This will be discussed in the next subsection.

2.4 Robustness of IRFC segmentation

The most natural impulse for a formulation of a robustness theorem in our setting is to state it in the compact format of Proposition 2.2: “For a family $S = {S_{1}, \dots, S_{m}}$ of seeds and nonempty sets $T_{i} \subseteq P_{S_{i} S}^{I}$ , $P_{T_{i} T} = P_{S_{i} S}$ , where $T = {T_{1}, \dots, T_{m}}$ .” However, in a multiple seed setting, there is no hope for such a result even in the case of RFC or AFC. To verify this, consider a scene $C = 〈 C, f 〉$ that contains three uniform circles C₁, C₂, and C₃ which are pairwise completely separated. (This means that for any c ∈ C_i and d ∈ C_j we have μ^C(c, d) = 1 for i = j and μ^C(c, d) = 0 for i ≠ j.) If we choose S₁ = T₁ = C₁, S₂ = C₂ ∪ C₃ and T₂ = C₂, then $P_{S_{2} S}^{I} = P_{S_{2} S} = C_{2} \cup C_{3}$ , while $P_{T_{2} T}^{I} = P_{T_{2} T} = C_{2}$ is smaller. The difficulty outlined in this example comes from the fact that an object $P_{S_{i} S}^{I}$ may have more than one connected component, while T_i may intersect only one of them. Thus, to ensure that this will not happen, we will assume that S_i ⊆ T_i, leading to the following result.
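Before turning to the theorem, the three-circle example can be reproduced on a tiny stand-in scene. In the sketch below each "circle" is shrunk to two spels joined with affinity 1, with no links between circles; the spel names and the helpers `connectivity` and `irfc_final` are hypothetical, and the iteration step is assumed to be $P_{S S}^{j + 1} = P_{S S}^{j} \cup Q_{S S}^{j}$ with $Q_{S S}^{j}$ as written out in Subsection 3.2.

```python
import heapq
from math import inf

def connectivity(adj, sources, allowed):
    """Return {c: mu^A(c, sources)} for every c in A = allowed (max-min path strength)."""
    live = [s for s in sources if s in allowed]
    best = {c: (0.0 if live else -inf) for c in allowed}
    heap = [(-1.0, s) for s in live]
    for s in live:
        best[s] = 1.0
    heapq.heapify(heap)
    while heap:
        neg, c = heapq.heappop(heap)
        if -neg < best[c]:
            continue  # stale heap entry
        for d, w in adj[c].items():
            if d in allowed and min(-neg, w) > best[d]:
                best[d] = min(-neg, w)
                heapq.heappush(heap, (-best[d], d))
    return best

def irfc_final(adj, seeds):
    """Terminal segmentation {label: P^I} for seeds = {label: set of spels}."""
    C = set(adj)
    own = {k: connectivity(adj, S, C) for k, S in seeds.items()}
    P = {k: set() for k in seeds}
    while True:
        nxt = {}
        for k in seeds:
            allowed = C - P[k]
            rivals = [connectivity(adj, T, allowed) for t, T in seeds.items() if t != k]
            nxt[k] = P[k] | {c for c in allowed if all(own[k][c] > r[c] for r in rivals)}
        if nxt == P:
            return P
        P = nxt

# Three completely separated "circles", each reduced to two spels (an assumption
# made only to keep the example small).
adj = {"a1": {"a2": 1.0}, "a2": {"a1": 1.0},
       "b1": {"b2": 1.0}, "b2": {"b1": 1.0},
       "c1": {"c2": 1.0}, "c2": {"c1": 1.0}}
C1, C2, C3 = {"a1", "a2"}, {"b1", "b2"}, {"c1", "c2"}

seg_S = irfc_final(adj, {"1": C1, "2": C2 | C3})  # S_1 = C_1, S_2 = C_2 u C_3
seg_T = irfc_final(adj, {"1": C1, "2": C2})       # T_1 = C_1, T_2 = C_2
```

With the seeds T, the spels of the third circle have connectivity 0 to both seed sets, so the strict inequality never holds for them and they remain unassigned.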

Theorem 2.5

Let $S = {S_{1}, \dots, S_{m}}$ be a family of subsets of C, fix k ∈ {1, 2, 3, . . .}, and let $S_{i} \subseteq T_{i} \subseteq S_{i} \cup P_{S_{i} S}^{k}$ for every i ∈ {1, . . ., m}. If $T = {T_{1}, \dots, T_{m}}$ , then $P_{S_{i} S}^{I} = P_{T_{i} T}^{I}$ for every i ∈ {1, . . . , m}. Moreover, if k = 1, then $P_{S_{i} S}^{j} = P_{T_{i} T}^{j}$ for every i ∈ {1, . . . , m} and j ∈ {0, 1, 2, . . .}.

This theorem shows that there is considerable flexibility in the choice of seeds used to iteratively generate an object $P = P_{S S}^{I}$ : as long as we choose the seeds inside P and ensure that they contain some minimal set of generators, the final result will always be the same. The version of the theorem with k = 1 has an even nicer conclusion. However, the assumption that $T_{i} \subset S_{i} \cup P_{S_{i} S}^{1}$ may be somewhat restrictive: it may be difficult to guess which spels will be in the “core part” $P_{S_{i} S}^{1}$ of the object, even in the case when the entire object, $P_{S_{i} S}^{I}$ , can be guessed with a good approximation. Note also that, in fact, Theorem 2.5 remains true if we assume that each T_i contains only a subset T_{S_i} of S_i described in Proposition 3.10.
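Both conclusions of Theorem 2.5 can be observed on a small example. The following sketch is illustrative only: the four-spel scene, the seed choices, and the helper names `connectivity` and `irfc_history` are assumptions, and the iteration step is taken to be $P_{S S}^{j + 1} = P_{S S}^{j} \cup Q_{S S}^{j}$ with $Q_{S S}^{j}$ as written out in Subsection 3.2.

```python
import heapq
from math import inf

def connectivity(adj, sources, allowed):
    """Return {c: mu^A(c, sources)} for every c in A = allowed (max-min path strength)."""
    live = [s for s in sources if s in allowed]
    best = {c: (0.0 if live else -inf) for c in allowed}
    heap = [(-1.0, s) for s in live]
    for s in live:
        best[s] = 1.0
    heapq.heapify(heap)
    while heap:
        neg, c = heapq.heappop(heap)
        if -neg < best[c]:
            continue  # stale heap entry
        for d, w in adj[c].items():
            if d in allowed and min(-neg, w) > best[d]:
                best[d] = min(-neg, w)
                heapq.heappush(heap, (-best[d], d))
    return best

def irfc_history(adj, seeds):
    """Returns [P^0, P^1, ..., P^k] for seeds = {label: set of spels}."""
    C = set(adj)
    own = {k: connectivity(adj, S, C) for k, S in seeds.items()}
    P = {k: set() for k in seeds}
    history = [P]
    while True:
        nxt = {}
        for k in seeds:
            allowed = C - P[k]
            rivals = [connectivity(adj, T, allowed) for t, T in seeds.items() if t != k]
            nxt[k] = P[k] | {c for c in allowed if all(own[k][c] > r[c] for r in rivals)}
        if nxt == P:
            return history
        history.append(nxt)
        P = nxt

# Hypothetical scene: s -0.9- x, x -0.5- y, x -0.5- t.
adj = {"s": {"x": 0.9}, "x": {"s": 0.9, "y": 0.5, "t": 0.5},
       "y": {"x": 0.5}, "t": {"x": 0.5}}

base = irfc_history(adj, {"1": {"s"}, "2": {"t"}})       # S_1 = {s}, S_2 = {t}
core = irfc_history(adj, {"1": {"s", "x"}, "2": {"t"}})  # extra seed x lies in P^1
late = irfc_history(adj, {"1": {"s", "y"}, "2": {"t"}})  # extra seed y lies in P^2 only
```

Here `core` adds a seed from the first-iteration object (the case k = 1), so all intermediate segmentations coincide with those of `base`. In `late` the added seed enters only at the second iteration (k = 2): the terminal segmentation is unchanged, but the first iteration differs.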

The only version of Theorem 2.5 that was previously proved in the literature (see [15]) was done only for two components, in a single seed format, and in the version with k = 1 (i.e., Corollary 2.7 below for m = 2).

Theorem 2.5 directly leads to the following corollary.

Corollary 2.6

Let $S = {S_{1}, \dots, S_{m}}$ and $T = {T_{1}, \dots, T_{m}}$ be the families of subsets of C, fix k ∈ {1, 2, 3, . . .}, and assume that for every i ∈{1,...,m} we have $T_{i} \subseteq S_{i} \cup P_{S_{i} S}^{k}$ and $S_{i} \subseteq T_{i} \cup P_{T_{i} T}^{k}$ . Then $P_{S_{i} S}^{I} = P_{T_{i} T}^{I}$ for every i ∈ {1, . . . , m}. Moreover, if k = 1, then $P_{S_{i} S}^{j} = P_{T_{i} T}^{j}$ for every i ∈ {1 , . . . , m} and j ∈ {0, 1, 2, . . .}.

Proof

For i ∈ {1, . . . , m}, put U_i = S_i ∪ T_i and let $U = {U_{1}, \dots, U_{m}}$ . Then, the pairs $〈 S, U 〉$ and $〈 T, U 〉$ satisfy the assumptions of Theorem 2.5, so, $P_{S_{i} S}^{I} = P_{U_{i} U}^{I} = P_{T_{i} T}^{I}$ for every i ∈ {1, . . . , m}. If k = 1, we also have $P_{S_{i} S}^{j} = P_{U_{i} U}^{j} = P_{T_{i} T}^{j}$ for every j ∈ {0, 1, 2, . . .}.

When we restrict our attention to the segmentation generated with only singleton seeds, one of the inclusions in the assumptions of Corollary 2.6 can be dropped and we obtain an analog of Proposition 2.2.

Corollary 2.7

Let S = {s₁, . . . ,s_m} and T = {t₁, . . . , t_m} be some m-element subsets of C and assume that for every i ∈ {1, . . . , m} we have $t_{i} \in P_{s_{i} S}^{1}$ . Then $P_{t_{i} T}^{j} = P_{s_{i} S}^{j}$ for every i ∈ {1, . . . , m} and j ∈ {0, 1, . . .}.

It is not accidental that in Corollary 2.7 we assume that each t_i belongs to a smaller set $P_{s_{i} S}^{1}$ rather than to a bigger set $P_{s_{i} S}^{I}$ , as in our other robustness results—the version of Corollary 2.7 with assumption $t_{i} \in P_{s_{i} S}^{I}$ is false, even if we weaken the conclusion to $P_{t_{i} T}^{I} = P_{s_{i} S}^{I}$ . A simple example of such a situation is given in Example 3.15, where $t_{i} \in P_{s_{i} S}^{I}$ for all i while $P_{t_{1} T}^{I} ⊊ P_{s_{1} S}^{I}$ . This is yet another reason why in Theorem 2.5 we need the assumption S_i ⊆ T_i.

3 The proofs and the examples

This section is designed mainly to prove the results announced in the previous section. This will require an introduction of some new concepts and proving several auxiliary results, some of which are of independent interest and are fundamental to the FC phenomenon. Unless otherwise explained, in what follows, $C = 〈 C, f 〉$ will always stand for a digital scene with fixed adjacency and affinity relations, and $S$ for a nonempty family of subsets of C.

3.1 Disjointness of the segments

The following simple fact will be used (often implicitly) many times in this section.

Fact 3.1

If c, d ∈ A ⊆ B ⊆ C and p is a path in A from c to d such that μ(p) = μ^B(c, d), then μ^A(c, d) = μ^B(c, d).

Proof

This follows immediately from μ^A(c, d) ≤ μ^B(c, d) = μ(p) ≤ μ^A(c, d), where the first inequality is justified by (3) and the last is a consequence of the definition of μ^A.

The next lemma describes precisely what portion of S must belong to $P_{S S}^{1}$ .

Lemma 3.2

For $S \in S$ let $E = {s \in S : μ^{C} (s, T) = 1 for some T \in S \ {S}}$ . Then $S \ E \subseteq P_{S S}^{1}$ and E is disjoint with $⋃_{V \in S} P_{V S}^{I}$ .

Proof

Clearly $S \ E \subseteq P_{S S}^{1}$ as μ^C(s, S) = 1 > μ^C(s, T) for any s ∈ S\ E and $T \in S \ {S}$ .

We will prove $P_{S S}^{j} \subseteq C \ E$ by induction on j ∈ {0, 1, 2, . . .}. For j = 0 it is obvious, as $P_{S S}^{0} = \emptyset$ . So, assume that for some j we have $P_{S S}^{j} \subseteq C \ E$ . We need to show that $P_{S S}^{j + 1} \subseteq C \ E$ . For this, choose a $c \in P_{S S}^{j + 1}$ and, by way of contradiction, assume that c ∈ E. Then there is a $T \in S \ {S}$ for which μ^C(c, T) = 1. Moreover, any strongest path from c to T is in E, so, by Fact 3.1, we have μ^E(c, T) = 1. Also, $E \subseteq C \ P_{S S}^{j}$ , which follows from the inductive assumption, and (3) imply that $μ^{C \ P_{S S}^{j}} (c, T) \geq μ^{E} (c, T)$ . So, $μ^{C \ P_{S S}^{j}} (c, T) = 1$ . However, this contradicts $μ^{C} (c, S) > μ^{C \ P_{S S}^{j}} (c, T)$ , which is a consequence of $c \in P_{S S}^{j + 1}$ . So, indeed c ∈ C\ E.

Now, if $V \in S \ {S}$ , then the inclusion $P_{V S}^{j} \subseteq C \ S \subseteq C \ E$ is proved by even easier induction. Indeed, if $P_{V S}^{j} \subseteq C \ S$ is true for some j, then $μ^{C} (s, V) \leq 1 = μ^{C} (s, s) = μ^{C \ P_{V S}^{j}} (s, s) = μ^{C \ P_{V S}^{j}} (s, S)$ for every s ∈ S; that is, no s ∈ S is in $P_{V S}^{j + 1}$ .
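Lemma 3.2 can be illustrated on a small scene in which a seed of one object is connected with strength 1 to a seed of another. The sketch below is a hedged illustration: the five-spel scene and the helper names `connectivity` and `irfc_history` are hypothetical, and the iteration step is assumed to be $P_{S S}^{j + 1} = P_{S S}^{j} \cup Q_{S S}^{j}$ with $Q_{S S}^{j}$ as written out in Subsection 3.2.

```python
import heapq
from math import inf

def connectivity(adj, sources, allowed):
    """Return {c: mu^A(c, sources)} for every c in A = allowed (max-min path strength)."""
    live = [s for s in sources if s in allowed]
    best = {c: (0.0 if live else -inf) for c in allowed}
    heap = [(-1.0, s) for s in live]
    for s in live:
        best[s] = 1.0
    heapq.heapify(heap)
    while heap:
        neg, c = heapq.heappop(heap)
        if -neg < best[c]:
            continue  # stale heap entry
        for d, w in adj[c].items():
            if d in allowed and min(-neg, w) > best[d]:
                best[d] = min(-neg, w)
                heapq.heappush(heap, (-best[d], d))
    return best

def irfc_history(adj, seeds):
    """Returns [P^0, P^1, ..., P^k] for seeds = {label: set of spels}."""
    C = set(adj)
    own = {k: connectivity(adj, S, C) for k, S in seeds.items()}
    P = {k: set() for k in seeds}
    history = [P]
    while True:
        nxt = {}
        for k in seeds:
            allowed = C - P[k]
            rivals = [connectivity(adj, T, allowed) for t, T in seeds.items() if t != k]
            nxt[k] = P[k] | {c for c in allowed if all(own[k][c] > r[c] for r in rivals)}
        if nxt == P:
            return history
        history.append(nxt)
        P = nxt

# Hypothetical scene: s2 -0.9- u -0.3- s -1.0- t -0.6- v; the seeds s and t are
# joined with affinity 1, so each is "claimed" equally by both objects.
adj = {"s2": {"u": 0.9}, "u": {"s2": 0.9, "s": 0.3}, "s": {"u": 0.3, "t": 1.0},
       "t": {"s": 1.0, "v": 0.6}, "v": {"t": 0.6}}
seeds = {"S": {"s", "s2"}, "T": {"t"}}

full = {k: connectivity(adj, S, set(adj)) for k, S in seeds.items()}  # mu^C(., S)
# E for each seed set: the seeds with connectivity 1 to some other seed set.
E = {k: {s for s in seeds[k] if any(full[t][s] == 1.0 for t in seeds if t != k)}
     for k in seeds}
history = irfc_history(adj, seeds)
covered = set().union(*history[-1].values())
```

In this scene the seed `s` and the seed `t` end up in no object at all, while the remaining seed `s2` is already captured by the first iteration, as the lemma predicts.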

Let c ∈ A ⊂ C and S ⊂ A. We say that a path $p = 〈 c_{1}, \dots, c_{l} 〉 \in P^{A}$ is a nice path (in A) from c to S provided c₁ = c, c_l ∈ S, and for every k ∈ {1, . . . , l}, we have μ(〈c_k, . . . , c_l〉) = μ^A(c_k, S), that is, 〈c_k, . . . , c_l〉 is a strongest possible path in A from c_k to S. If S = {s}, then we will say that p is a nice path (in A) from c to s, rather than to S.

Lemma 3.3

For every c ∈ A ⊂ C and S ⊂ A, there exists a nice path in A from c to S.

Proof

We will start with the following simple remark. In its statement, by a one-to-one path we understand any path in which no spel appears more than once.

(I) For every d ∈ A ⊂ C and S ⊂ A there exists a one-to-one path $p \in P^{A}$ from d to an s ∈ S with μ(p) = μ^A(d, S).

Indeed, let $p = 〈 c_{1}, \dots, c_{l} 〉 \in P^{A}$ be a shortest path in A from d to an s ∈ S with μ(p) = μ^A(d, S). Then p must be one-to-one. Otherwise, there would exist 1 ≤ i < j ≤ l for which c_i = c_j. But then the path 〈c₁, . . . , c_i, c_{j+1}, . . . , c_l〉 would be a strongest path in A from d to an s ∈ S of shorter length than p, which contradicts the choice of p.

Next we will prove, by induction on n = 1, 2, 3, . . ., the following statement.

I_n: For every c ∈ A ⊂ C and S ⊂ A there exists a one-to-one path $p = 〈 c_{1}, \dots, c_{l} 〉 \in P^{A}$ from c to S such that for every i ∈ {1, . . . , n}

if i \leq l, then μ (〈 c_{i}, \dots, c_{l} 〉) = μ^{A} (c_{i}, S) .

(*)

For n = 1 the statement is true: it is just the condition (I) we proved above. So, assume that I_n holds. We need to prove I_{n+1}.

So, pick c ∈ A ⊂ C and S ⊂ A. Let p = 〈c₁, . . . , c_l〉 be a path satisfying I_n. If l ≤ n, then p satisfies also I_{n+1} and we are done. So, assume that l ≥ n + 1. Let x = μ(〈c_n, . . . , c_l〉) = μ^A(c_n, S), y = μ(〈c_{n+1}, . . . , c_l〉), and z = μ^A(c_{n+1}, S). Then x ≤ y ≤ z. If y = z, then p satisfies also I_{n+1} and, again, we are done. So, assume that x ≤ y < z. Let $q = 〈 d_{1}, \dots, d_{m} 〉 \in P^{A}$ be a path from c_{n+1} to S with μ(q) = μ^A(d₁, S). By (I) we can assume that q is one-to-one. Let $p^{'} = 〈 c_{1}, \dots, c_{n}, d_{1}, \dots, d_{m} 〉 \in P^{A}$ . We will show that p’ satisfies I_{n+1}.

Indeed, clearly p’ is a path in A from c to S. To see that p’ is one-to-one assume, by way of contradiction, that this is not the case. Then there exist 1 ≤ i ≤ n and 1 ≤ j ≤ m such that c_i = d_j. But then

μ^{A} (c_{i}, S) = μ^{A} (d_{j}, S) \geq μ (〈 d_{j}, \dots, d_{m} 〉) \geq μ (〈 d_{1}, \dots, d_{m} 〉) = z .

Thus, z ≤ μ^A(c_i, S) = μ(〈c_i, . . . , c_l〉) ≤ μ(〈c_n, . . . , c_l〉) = x, contradicting x ≤ y < z. So, p’ is one-to-one.

To see (*) take an i ≤ n + 1. If i = n + 1, then condition (*) becomes μ(〈d₁, . . . , d_m〉) = μ^A(d₁, S) and it is ensured by μ(q) = μ^A(d₁, S). So, assume i ≤ n. Then μ(〈c_i, . . . , c_n, d₁, . . . , d_m〉) ≥ μ(〈c_i, . . . , c_l〉) = μ^A(c_i, S), since μ(〈d₁, . . . , d_m〉) = μ^A(c_{n+1}, S) ≥ μ(〈c_{n+1}, . . . , c_l〉). As no path in A from c_i to S can be stronger than μ^A(c_i, S), the first inequality is in fact an equality. Thus, (*) holds. This finishes the inductive proof of I_n.

Finally, note that if N is the size of A, then I_N implies the lemma, since for any one-to-one path $p = 〈 c_{1}, \dots, c_{l} 〉 \in P^{A}$ we have l ≤ N, so a path satisfying I_N must be nice.
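A nice path can also be obtained constructively. The sketch below does not follow the inductive proof above; it swaps in a different, standard technique: a max-min ("widest path") variant of Dijkstra's algorithm run from the seed set S with predecessor pointers. Every path read off the resulting tree has all of its suffixes on the tree as well, which is exactly the defining property of a nice path. The scene and the function names `nice_path` and `strength` are hypothetical, and the sketch assumes c ∈ A and a nonempty S ⊂ A.

```python
import heapq

def strength(adj, p):
    """mu(p): weakest affinity along consecutive spels (1 for a one-spel path)."""
    return min((adj[a].get(b, 0.0) for a, b in zip(p, p[1:])), default=1.0)

def nice_path(adj, c, S, A):
    """Return (path, mu) where path runs from c to S inside A and mu[v] = mu^A(v, S)."""
    best = {v: 0.0 for v in A}
    pred = {}
    heap = [(-1.0, s) for s in S]
    for s in S:
        best[s] = 1.0
    heapq.heapify(heap)
    while heap:
        neg, v = heapq.heappop(heap)
        if -neg < best[v]:
            continue  # stale heap entry
        for d, w in adj[v].items():
            if d in A and min(-neg, w) > best[d]:
                best[d] = min(-neg, w)
                pred[d] = v  # d is reached most strongly through v
                heapq.heappush(heap, (-best[d], d))
    path = [c]
    while path[-1] in pred:
        path.append(pred[path[-1]])
    if path[-1] not in S:
        # c has connectivity 0 to S; a direct jump of affinity 0 is then strongest.
        path.append(next(iter(S)))
    return path, best

def is_nice(adj, p, mu):
    """Check that every suffix <c_k, ..., c_l> has strength mu^A(c_k, S)."""
    return all(strength(adj, p[k:]) == mu[p[k]] for k in range(len(p)))

# Hypothetical scene: s -0.9- x, x -0.5- y, x -0.5- t, y -0.7- z.
adj = {"s": {"x": 0.9}, "x": {"s": 0.9, "y": 0.5, "t": 0.5},
       "y": {"x": 0.5, "z": 0.7}, "t": {"x": 0.5}, "z": {"y": 0.7}}
path, mu = nice_path(adj, "z", {"s"}, set(adj))

# A strongest path that is not nice: the detour through t weakens the suffix at x.
detour = ["z", "y", "x", "t", "x", "s"]
```

The path `detour` has the same strength as `path`, so it is a strongest path from `z` to `s`, but its suffix starting at the first occurrence of `x` has strength 0.5 rather than μ(x, s) = 0.9. This is the distinction between strongest and nice paths that Fact 3.4 relies on.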

The following fact is the iterative version of Fact 2.3.

Fact 3.4

If $S \in S$ and p = 〈c₁, . . . , c_l〉 is a nice path from $c \in P_{S S}^{j}$ to S, then $c_{i} \in P_{S S}^{j}$ for every i ∈ {1, . . . , l}, that is, $p \in P^{P_{S S}^{j}}$ .

Proof

The proof goes by induction on j. For j = 1 it follows from Fact 2.3 and (5). So, assume that it is true for some j ≥ 1. We need to prove it for j + 1.

So, fix an $S \in S$ and a nice path p = 〈c₁, . . . , c_l〉 from $c \in P_{S S}^{j + 1}$ to S. First notice that there is an i ∈ {1, . . . , l} for which $c_{i} \in P_{S S}^{j}$ .

Indeed otherwise p is in $C \ P_{S S}^{j}$ and c_l ∈ E, where E is as in Lemma 3.2. Pick a $T \in S \ {S}$ for which μ^C(c_l, T) = 1 and let q be a path from c_l to T with μ(q) = 1. Then q is in $E \subseteq C \ P_{S S}^{j}$ . Thus, p + q is a path in $C \ P_{S S}^{j}$ from c to T and $μ^{C \ P_{S S}^{j}} (c, T) \geq μ (p + q) = μ (p) = μ^{C} (c, S)$ , contradicting $c \in P_{S S}^{j + 1}$ .

Let k ∈ {1, . . . , l} be the smallest number such that $c_{k} \in P_{S S}^{j}$ . Since 〈c_k, . . . , c_l〉 is a nice path from $c_{k} \in P_{S S}^{j}$ to S, by the inductive assumption we have that $c_{i} \in P_{S S}^{j} \subset P_{S S}^{j + 1}$ for every i ∈ {k, . . . , l}. Thus, we just need to prove that, for each i ∈ {1, . . . , k - 1}, the spel c_i belongs to $Q_{S S}^{j}$ .

If k = 1 there is nothing to prove. So, assume that k > 1. Then the proof is almost identical to that for Fact 2.3.

Fix an i ∈ {1, . . . , k - 1}, a $T \in S \ {S}$ , and a t ∈ T. Since $c_{1} \in Q_{S S}^{j}$ , we know that $μ^{C} (c_{1}, S) > μ^{C \ P_{S S}^{j}} (c_{1}, T) \geq μ^{C \ P_{S S}^{j}} (c_{1}, t)$ . We need to show that $μ^{C} (c_{i}, c_{l}) > μ^{C \ P_{S S}^{j}} (c_{i}, t)$ , as $μ^{C} (c_{i}, c_{l}) = μ^{C} (c_{i}, S)$ . Since

μ^{C \ P_{S S}^{j}} (c_{1}, c_{i}) \geq μ (〈 c_{1}, \dots, c_{i} 〉) \geq μ (〈 c_{1}, \dots, c_{l} 〉) = μ^{C} (c_{1}, S) > μ^{C \ P_{S S}^{j}} (c_{1}, t),

by (4) we have $μ^{C \ P_{S S}^{j}} (c_{1}, t) = μ^{C \ P_{S S}^{j}} (c_{i}, t)$ . Thus

μ^{C} (c_{i}, c_{l}) \geq μ (〈 c_{i}, \dots, c_{l} 〉) \geq μ (p) = μ^{C} (c_{1}, S) > μ^{C \ P_{S S}^{j}} (c_{1}, t) = μ^{C \ P_{S S}^{j}} (c_{i}, t),

completing the proof.

It would be nice if the conclusion of Fact 3.4 were true for any strongest path from c to S, rather than just for nice paths. This, however, is not the case. In the above proof, the place we used the stronger assumption is where we claimed that $c_{i} \in P_{S S}^{j}$ for every i ∈ {k, . . . , l}. If p is just any strongest path from c to S, then 〈c_k, . . . , c_l〉 does not need to be a strongest path from $c_{k} \in P_{S S}^{j}$ to S and it might happen that $c_{k + 1} \notin P_{S S}^{j + 1}$ . A specific example of such a situation is given in Example 3.13.

Fact 3.4 says, in particular, that if $S \in S$ is a singleton, say S = {s}, then for every $c \in P_{S S}^{j}$ there is a strongest path in $P_{S S}^{j}$ from c to s. The following remark gives a stronger version of this fact.

Remark 3.5

If $S \in S$ is a singleton, then for every $c, d \in P_{S S}^{j}$ , there is a strongest path r in $P_{S S}^{j}$ from c to d, that is, $μ^{C} (c, d) = μ^{P_{S S}^{j}} (c, d)$ .

Proof

Let S = {s} and let p = 〈c₁, . . . , c_l〉 be a nice path from c to s and q = 〈d₁, . . . , d_m〉 be a nice path from d to s. Then, by Fact 3.4, $p, q \in P^{P_{S S}^{j}}$ . If μ^C(c, d) = min{μ^C(c, s), μ^C(d, s)} = min{μ(p), μ(q)}, then r = 〈c₁, . . . , c_l, d_m, . . . , d₁〉 is as desired. So, assume that μ^C(c, d) is greater than min{μ^C(c, s), μ^C(d, s)}. Then, in particular, μ^C(c, d) > μ^C(d, s) = μ(q). Let 〈b₁, . . . , b_n〉 be a nice path from c to d and put r = 〈b₁, . . . , b_n, d₁, . . . , d_m〉. We claim that r is a nice path from c to s. Indeed, clearly for any index i ∈ {1, . . . , m}, the path 〈d_i, . . . , d_m〉 is a strongest path from d_i to d_m = s, since q was nice. Next, fix an i ∈ {1, . . . , n}. Since

μ^{C} (b_{i}, d) \geq μ (〈 b_{i}, \dots, b_{n} 〉) \geq μ (〈 b_{1}, \dots, b_{n} 〉) = μ^{C} (c, d) > μ^{C} (d, s) = μ (q)

we have, by (4), that μ^C(b_i, s) = μ^C(d, s) and

\begin{aligned} μ (〈 b_{i}, \dots, b_{n}, d_{1}, \dots, d_{m} 〉) & = \min \{ μ (〈 b_{i}, \dots, b_{n} 〉), μ (〈 d_{1}, \dots, d_{m} 〉) \} \\ & = \min \{ μ (〈 b_{i}, \dots, b_{n} 〉), μ (q) \} \\ & = μ (q) = μ^{C} (d, s) = μ^{C} (b_{i}, s) . \end{aligned}

Thus, r is a nice path from c to s and as such, by Fact 3.4, it is in $P_{S S}^{j}$ . In particular, 〈b₁, . . . , b_n〉 is in $P_{S S}^{j}$ .

Assume that $U, S, T \in S$ are distinct and that there exists a spel c ∈ C for which μ^C(c, U) < μ^C(c, S) = μ^C(c, T). Then $c \notin ⋃_{V \in S} P_{V S}^{1}$ . Is it possible that $c \in P_{U S}^{j}$ for some j > 1? This certainly would be counterintuitive. The next fact assures us that this is impossible.

Fact 3.6

If $S, U \in S$ and $c \in P_{S S}^{I}$ , then μ^C(c, S) ≥ μ^C(c, U).

Proof

By way of contradiction assume that μ^C(c, S) < μ^C(c, U). Choose a nice path p = 〈c₁, . . . , c_l〉 from c to U and let k ∈ {1, . . . , l} be the largest index with $c_{k} \in P_{S S}^{I}$ . Let s ∈ S. Since

μ^{C} (c, c_{k}) \geq μ (〈 c_{1}, \dots, c_{k} 〉) \geq μ (p) = μ^{C} (c, U) > μ^{C} (c, S) \geq μ^{C} (c, s),

(4) implies that μ^C(c_k, s) = μ^C(c, s). Therefore, for every s ∈ S,

μ^{C} (c_{k}, s) = μ^{C} (c, s) < μ^{C} (c, U) = μ (p) \leq μ (〈 c_{k}, \dots, c_{l} 〉) \leq μ^{C} (c_{k}, U) .

Thus, μ^C(c_k, S) < μ^C(c_k, U). Let i ∈ {0, 1, 2, . . .} be the smallest index with the property that $c_{k} \in P_{S S}^{i}$ . Note that i > 1 since μ^C(c_k, S) < μ^C(c_k, U). But, by the maximality of k, we have that $〈 c_{k}, \dots, c_{l} 〉 \in P^{C \ P_{S S}^{i - 1}}$ . Therefore, $μ^{C} (c_{k}, S) < μ^{C} (c_{k}, U) = μ (〈 c_{k}, \dots, c_{l} 〉) \leq μ^{C \ P_{S S}^{i - 1}} (c_{k}, U)$ implying $c_{k} \notin Q_{S S}^{i - 1}$ . Since the minimality of i implies also that $c_{k} \notin P_{S S}^{i - 1}$ , we conclude $c_{k} \notin P_{S S}^{i}$ , contradicting the choice of i.

Proof of Theorem 2.4

We prove that $P_{S S}^{j} \cap P_{U S}^{j} = \emptyset$ by induction on j = 0, 1, 2, . . ..

Clearly the result is true for j = 0 since the sets $P_{S S}^{0}$ are empty. Also, the definition of $P_{S S}^{1} = P_{S S}$ clearly ensures that the result is true for j = 1. So, assume that the result is true for some j. We need to show that the sets $P_{S S}^{j + 1} = P_{S S}^{j} \cup Q_{S S}^{j}$ , with $S \in S$ , are pairwise disjoint.

For this first notice that $P = ⋃_{S \in S} P_{S S}^{j}$ is disjoint with $Q = ⋃_{S \in S} Q_{S S}^{j}$ . Indeed, take a c∈ P and let $S \in S$ be such that $c \in P_{S S}^{j}$ . By Lemma 3.3 there exists a nice path p (in C) from c to S. Fact 3.4 then shows that $p \in P^{P_{S S}^{j}}$ . Now, take a $U \in S$ . We need to show that $c \notin Q_{U S}^{j}$ .

This is obvious if U = S, since $Q_{S S}^{j} \subset C \ P_{S S}^{j}$ . So, assume that U ≠ S. Then, by the inductive assumption, $P_{S S}^{j} \subset C \ P_{U S}^{j}$ , so $p \in P^{C \ P_{U S}^{j}}$ . In particular, $μ^{C \ P_{U S}^{j}} (c, S) = μ (p) = μ^{C} (c, S)$ . Now, by way of contradiction, assume that $c \in Q_{U S}^{j} \subset P_{U S}^{j + 1}$ . Then, in particular, $μ^{C} (c, U) > μ^{C \ P_{U S}^{j}} (c, S)$ . Therefore, μ^C(c, U) > μ^C(c, S). But this, together with $c \in P_{S S}^{j}$ , contradicts Fact 3.6. So, indeed P ∩ Q = ∅.

Let B^j = C \ P. To finish the proof of the theorem it is enough to show that every c ∈ B^j belongs to at most one of $Q_{S S}^{j}$ with $S \in S$ . So, fix a c ∈ B^j and let $U \in S$ be such that $μ^{C} (c, U) = \max_{T \in S} μ^{C} (c, T)$ . Let p = 〈c₁, . . . , c_l〉 be a nice path from c to U. If $p \in P^{B^{j}}$ , then for every $S \in S$ we have $μ^{C} (c, S) \leq μ^{C} (c, U) = μ (p) \leq μ^{B^{j}} (c, U) \leq μ^{C \ P_{S S}^{j}} (c, U)$ ensuring that $c \notin Q_{S S}^{j}$ for every $S \in S \ {U}$ . So, we can assume that $p \notin P^{B^{j}}$ , that is, that there is an i ≤ l with c_i ∈ P. Let k ∈ {1, . . . , l} be the smallest index such that c_k ∈ P. Let $S \in S$ be such that $c_{k} \in P_{S S}^{j}$ . We claim that

there is a path r \in P^{B^{j} \cup P_{S S}^{j}} from c to S such that μ (r) = μ^{C} (c, U) .

(6)

To see this notice first that μ(〈c_k, . . . , c_l〉) = μ^C(c_k, U), since 〈c_k, . . . , c_l〉 is a nice path from c_k to U. Note also that μ^C(c_k, S) ≥ μ^C(c_k, U). This is obvious if S = U. Otherwise, this follows from Fact 3.6, as $c_{k} \in P_{S S}^{j}$ . Let q = 〈d₁, . . . , d_m〉 be a nice path from c_k to S. We claim that the path r= 〈c₁, . . . , c_k-1, d₁, . . . , d_m〉 satisfies (6).

Clearly r is a path from c to S and $r \in P^{B^{j} \cup P_{S S}^{j}}$ , since {c₁, . . . , c_k-1} ⊂ B^j, while $q \in P^{P_{S S}^{j}}$ follows from Fact 3.4. Also, μ(q) = μ^C(c_k, S) ≥ μ^C(c_k, U) = μ(〈c_k, . . . , c_l〉) implies that

μ (r) = \min {μ (〈 c_{1}, \dots, c_{k} 〉), μ (q)} \geq \min {μ (〈 c_{1}, \dots, c_{k} 〉), μ (〈 c_{k}, \dots, c_{l} 〉)} = μ (p) .

Combining this with μ^C(c, U)≥ μ^C(c, S), which follows from the maximality of μ^C(c, U), we get

μ^{C} (c, S) \geq μ (r) \geq μ (p) = μ^{C} (c, U) \geq μ^{C} (c, S) .

Thus, μ(r) = μ^C(c, U), completing the proof of (6).

To finish the proof of the theorem, notice that, by (6), for every $T \in S \ {S}$ we have $μ^{C} (c, T) \leq μ^{C} (c, U) = μ (r) \leq μ^{B^{j} \cup P_{S S}^{j}} (c, S) \leq μ^{C \ P_{T S}^{j}} (c, S)$ ensuring that $c \notin Q_{T S}^{j}$ .

3.2 Alternative definitions of $P_{S S}^{j}$

Theorem 3.7

Let j ∈ {0, 1, 2, . . .}, $B^{j} = C \ ⋃_{S \in S} P_{S S}^{j}$ , and $S \in S$ . If

R = {c \in B^{j} : μ^{C} (c, S) > μ^{C \ P_{S S}^{j}} (c, T) \ for every \ T \in S \ {S}},

W = {c \in B^{j} : μ^{B^{j} \cup P_{S S}^{j}} (c, S) > μ^{C \ P_{S S}^{j}} (c, T) for every T \in S \ {S}},

Z = {c \in B^{j} : μ^{B^{j} \cup P_{S S}^{j}} (c, S) > μ^{B^{j} \cup P_{T S}^{j}} (c, T) for every T \in S \ {S}},

then $Q_{S S}^{j} = R = W = Z$ .

Proof

Since

Q_{S S}^{j} = {c \in C \ P_{S S}^{j} : μ^{C} (c, S) > μ^{C \ P_{S S}^{j}} (c, T) for every T \in S \ {S}},

by Theorem 2.4 we have $Q_{S S}^{j} \subset B^{j} \subset C \ P_{S S}^{j}$ . So, $Q_{S S}^{j} = Q_{S S}^{j} \cap B^{j} = R$ .

Clearly W ⊂ R, since $μ^{C} (c, S) \geq μ^{B^{j} \cup P_{S S}^{j}} (c, S)$ . To see that R ⊂ W take a $c \in R = Q_{S S}^{j}$ . Let p be a nice path from c to S and notice that, by Fact 3.4,

μ^{C} (c, S) = μ (p) \leq μ^{B^{j} \cup P_{S S}^{j}} (c, S) \leq μ^{C} (c, S) .

This implies that c ∈ W. So $W = R = Q_{S S}^{j}$ . Now, in order to prove the theorem it is enough to show that W = Z.

By Theorem 2.4, we have $B^{j} \cup P_{T S}^{j} \subset C \ P_{S S}^{j}$ for every $T \in S \ {S}$ . Thus, $μ^{B^{j} \cup P_{T S}^{j}} (c, T) \leq μ^{C \ P_{S S}^{j}} (c, T)$ for every $T \in S \ {S}$ and c ∈ C. So, W ⊂ Z.

To see that Z ⊂ W take a c ∈ Z and by way of contradiction assume c ∉ W. Then, there is a $T \in S \ {S}$ such that $μ^{B^{j} \cup P_{S S}^{j}} (c, S) \leq μ^{C \ P_{S S}^{j}} (c, T)$ . Also, $μ^{B^{j} \cup P_{S S}^{j}} (c, S) > μ^{B^{j} \cup P_{T S}^{j}} (c, T)$ since c ∈ Z. So, $μ^{C \ P_{S S}^{j}} (c, T) > μ^{B^{j} \cup P_{T S}^{j}} (c, T)$ .

Let p = 〈c₁, . . . , c_l〉 be a nice path in $C \ P_{S S}^{j}$ from c to T. Notice that p cannot be a path contained in $B^{j} = C \ ⋃_{S \in S} P_{S S}^{j}$ , since this would imply $μ^{B^{j} \cup P_{T S}^{j}} (c, T) \geq μ^{B^{j}} (c, T) \geq μ (p) = μ^{C \ P_{S S}^{j}} (c, T)$ which contradicts the inequality $μ^{C \ P_{S S}^{j}} (c, T) > μ^{B^{j} \cup P_{T S}^{j}} (c, T)$ . Thus, p intersects $⋃_{S \in S} P_{S S}^{j}$ . Let k ∈ {1, . . . , l} be the smallest index such that $c_{k} \in ⋃_{S \in S} P_{S S}^{j}$ . Let $U \in S$ be such that $c_{k} \in P_{U S}^{j}$ . Then U ≠ S, since $p \in P^{C \ P_{S S}^{j}}$ . If U = T, then $p \in P^{B^{j} \cup P_{T S}^{j}}$ since 〈c_k, . . . , c_l〉 is a nice path from $c_{k} \in P_{U S}^{j} = P_{T S}^{j}$ to T. Thus, $μ^{B^{j} \cup P_{T S}^{j}} (c, T) \geq μ (p) = μ^{C \ P_{S S}^{j}} (c, T)$ which contradicts the inequality $μ^{C \ P_{S S}^{j}} (c, T) > μ^{B^{j} \cup P_{T S}^{j}} (c, T)$ . Thus, we can assume that U ≠ T.

Let q be a nice path from $c_{k} \in P_{U S}^{j}$ to U. Then, by Fact 3.4, $q \in P^{P_{U S}^{j}}$ . Also, by Fact 3.6, μ^C(c_k, U) ≥ μ^C(c_k, T). Then

μ (q) = μ^{C} (c_{k}, U) \geq μ^{C} (c_{k}, T) \geq μ (〈 c_{k}, \dots, c_{l} 〉) \geq μ (p) .

Thus, if r = 〈c₁, . . . , c_{k-1}〉 + q, then μ(r) ≥ μ(p) and $r \in P^{B^{j} \cup P_{U S}^{j}}$ . So

μ^{B^{j} \cup P_{U S}^{j}} (c, U) \geq μ (r) \geq μ (p) = μ^{C \ P_{S S}^{j}} (c, T) \geq μ^{B^{j} \cup P_{S S}^{j}} (c, S),

contradicting c ∈ Z.

Theorem 3.7 justifies our earlier claim that the iterative definition of $P_{A S}^{j + 1}$ can be obtained by using an approach as in the formula for $P_{s S}^{'}$ instead of the one in the formula for $P_{s S}^{+}$ . More precisely, we have $P_{A S}^{j + 1} = P_{A S}^{j} \cup Z_{A S}^{j}$ , where

Z_{A S}^{j} = {c \in B^{j} : μ^{B^{j} \cup P_{A S}^{j}} (c, A) > μ^{B^{j} \cup P_{T S}^{j}} (c, T) for every T \in S \ {A}} .

Recall also that in (2) we defined $μ^{A} (c, d) = \max {μ (p) : p \in P_{c, d}^{A}}$ only for spels c, d ∈ A, since in any other case the sets $P_{c, d}^{A}$ and ${μ (p) : p \in P_{c, d}^{A}}$ are empty. However, it is standard to define max ∅ to equal -∞. With this agreement in hand, we can consider μ^A given by (2) as a function from C × C into [-∞, ∞]. Then the definition of $P_{A S}^{j + 1}$ can be written in a slightly more compact form:

P_{A S}^{j + 1} = {c \in C : μ^{C} (c, A) > μ^{C \ P_{A S}^{j}} (c, T) for every T \in S \ {A}} .

(7)

The formula is valid since $c \in P_{A S}^{j}$ if and only if $μ^{C \ P_{A S}^{j}} (c, T) = - \infty$ for every $T \in S \ {A}$ .

For A, B, D ⊂ C let

\begin{aligned} P_{A B}^{D} & = \{ c \in C : μ^{C} (c, A) > μ^{C \setminus D} (c, b) \text{ for every } b \in B \} \\ & = \{ c \in C : μ^{C} (c, A) > μ^{C \setminus D} (c, B) \} . \end{aligned}

We are introducing this notation since it is easier to work with it (see Fact 3.8) than with the other definitions of $P_{A S}^{j + 1}$ , including (7). At the same time $P_{A S}^{j + 1}$ can be easily expressed in this language:

P_{A S}^{j + 1} = P_{A B}^{D},

where $D = P_{A S}^{j}$ and $B = ⋃ (S \ {A})$ .

3.3 The robustness results

We start here with a list of the properties of $P_{A B}^{D}$ .

Fact 3.8

Let A, B, D, V ⊂ C. Then,

(a) $P_{A B}^{D} = ⋂_{b \in B} P_{A {b}}^{D}$ ,
(b) $P_{A B^{'}}^{D} \supseteq P_{A B}^{D}$ for every B^’ ⊂ B,
(c) $P_{A^{'} B}^{D} \subseteq P_{A B}^{D}$ for every A^’ ⊂ A,
(d) $P_{A B}^{D^{'}} \subseteq P_{A B}^{D}$ for every D^’ ⊂ D,
(e) If $D = P_{A {B}}^{k}$ for some k ∈ {0, 1, 2, . . .} and $R \subseteq A \cup P_{A B}^{D}$ , then $P_{R B}^{D} \subseteq P_{A B}^{D}$ .
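The monotonicity properties of $P_{A B}^{D}$ in B, A, and D can be checked exhaustively on a toy scene. The sketch below is a sanity check under stated assumptions, not a proof: the four-spel scene and the names `ADJ`, `mu`, and `P` are hypothetical, $P_{A B}^{D}$ is implemented through its "for every b ∈ B" form with the convention max ∅ = -∞, and only the removal of a single element is tested, which suffices because each inclusion is transitive.

```python
import heapq
from functools import lru_cache
from itertools import combinations
from math import inf

# Hypothetical scene: s -0.9- x, x -0.5- y, x -0.5- t (symmetric affinities).
ADJ = {"s": {"x": 0.9}, "x": {"s": 0.9, "y": 0.5, "t": 0.5},
       "y": {"x": 0.5}, "t": {"x": 0.5}}
NODES = frozenset(ADJ)

@lru_cache(maxsize=None)
def mu(sources, allowed):
    """{c: mu^A(c, sources)} for c in A = allowed; both arguments are frozensets."""
    live = [s for s in sources if s in allowed]
    best = {c: (0.0 if live else -inf) for c in allowed}  # max of empty set is -inf
    heap = [(-1.0, s) for s in live]
    for s in live:
        best[s] = 1.0
    heapq.heapify(heap)
    while heap:
        neg, c = heapq.heappop(heap)
        if -neg < best[c]:
            continue  # stale heap entry
        for d, w in ADJ[c].items():
            if d in allowed and min(-neg, w) > best[d]:
                best[d] = min(-neg, w)
                heapq.heappush(heap, (-best[d], d))
    return best

def P(A, B, D):
    """P_{AB}^D: spels c with mu^C(c, A) > mu^(C minus D)(c, b) for every b in B."""
    own, allowed = mu(A, NODES), NODES - D
    return {c for c in NODES
            if all(own[c] > mu(frozenset({b}), allowed).get(c, -inf) for b in B)}

subsets = [frozenset(k) for r in range(len(NODES) + 1)
           for k in combinations(sorted(NODES), r)]
ok_b = ok_c = ok_d = True
for A in subsets:
    for B in subsets:
        for D in subsets:
            base = P(A, B, D)
            ok_b &= all(P(A, B - {x}, D) >= base for x in B)  # smaller B: larger set
            ok_c &= all(P(A - {x}, B, D) <= base for x in A)  # smaller A: smaller set
            ok_d &= all(P(A, B, D - {x}) <= base for x in D)  # smaller D: smaller set

# The iteration step written in this notation, with A = {s} and B = {t}.
step1 = P(frozenset({"s"}), frozenset({"t"}), frozenset())
step2 = P(frozenset({"s"}), frozenset({"t"}), frozenset(step1))
```

The last two lines express two consecutive iteration steps as $P_{A B}^{D}$ with D equal to the previous iterate, which is the reformulation the robustness proofs work with.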

Proof

(a) is obvious from the definition of $P_{A B}^{D}$ . (b) follows immediately from (a). (c) holds, since μ^C(c, A) ≥ μ^C(c, A^’). To see that (d) holds notice that D^’⊂ D implies C\D^’⊃ C\D. Thus, by (3), μ^C\D^’(c, b) ≥ μ^C\D(c, b), implying (d).

(e) Fix a $c \in P_{R B}^{D}$ and a b ∈ B. We need to show that

μ^{C} (c, A) > μ^{C \ D} (c, b) .

(8)

Notice that μ^C(c, R) > μ^C\D(c, b), since $c \in P_{R B}^{D}$ . Let p = 〈c₁, . . . , c_l〉 be a strongest path from c to R and let m ∈ {1, . . . , l} be minimal such that r = c_m ∈ R. Then μ^C(c, r) ≥ μ(〈c₁, . . . , c_m〉) ≥ μ(p) = μ^C(c, R) ≥ μ^C(c, r). Thus, we have μ(〈c₁, . . . , c_m〉) = μ^C(c, r) = μ^C(c, R) > μ^C\D(c, b). If r ∈ A, then μ^C(c, A) ≥ μ^C(c, r) > μ^C\D(c, b), proving (8). So, we can assume that $r \in P_{A B}^{D} = P_{A {B}}^{k + 1} = ⋃_{n \leq k} Q_{A {B}}^{n}$ . Thus, there exists an n ≤ k with the property that $r \in Q_{A {B}}^{n} = {c \in C \ P_{A {B}}^{n} : μ^{C} (c, A) > μ^{C \ P_{A {B}}^{n}} (c, b) for every b \in B}$ . In particular,

μ^{C} (r, A) > μ^{C \ P_{A {B}}^{n}} (r, b) .

(9)

Also, since $r \in C \ P_{A {B}}^{n}$ , path 〈c₁, . . . , c_m〉 is in $C \ P_{A {B}}^{n}$ . So, by Fact 3.1,

μ^{C \ P_{A {B}}^{n}} (c, r) = μ^{C} (c, r) > μ^{C \ D} (c, b) .

(10)

Next we will prove that

μ^{C} (r, A) > μ^{C \ D} (c, b) .

(11)

If $μ^{C \ P_{A {B}}^{n}} (c, r) > μ^{C \ P_{A {B}}^{n}} (r, b)$ , then, by (4), $μ^{C \ P_{A {B}}^{n}} (r, b) = μ^{C \ P_{A {B}}^{n}} (c, b)$ . So, by (9), $μ^{C} (r, A) > μ^{C \ P_{A {B}}^{n}} (c, b) \geq μ^{C \ D} (c, b)$ , where the last inequality is justified by (3) and an inclusion $C \ P_{A {B}}^{n} \supseteq C \ P_{A {B}}^{k} = C \ D$ . Thus, in this case, (11) holds. So, assume that $μ^{C \ P_{A {B}}^{n}} (c, r) \leq μ^{C \ P_{A {B}}^{n}} (r, b)$ . Then, by (9) and (10), we get $μ^{C} (r, A) > μ^{C \ P_{A {B}}^{n}} (r, b) \geq μ^{C \ P_{A {B}}^{n}} (c, r) > μ^{C \ D} (c, b)$ , finishing the proof of (11).

Now, by (10) and (11), μ^C(c, r) > μ^C\D(c, b) and μ^C(r, A) > μ^C\D(c, b). Let p₁ be a strongest path from c to r and p₂ be a strongest path from r to A. Then μ(p₁ + p₂) = min{μ(p₁), μ(p₂)} = min{μ^C(c, r), μ^C(r, A)} and so

μ^{C} (c, A) \geq μ (p_{1} + p_{2}) = \min {μ^{C} (c, r), μ^{C} (r, A)} > μ^{C \ D} (c, b),

finishing the proof of (8) and (e).

Lemma 3.9

Let $S_{0} = S \ {A}$ , where $A \in S$ is fixed. If j, k∈ {0, 1, 2, . . .}, $A \subseteq R \subseteq A \cup P_{A S}^{k + 1}$ , and $T = S_{0} \cup {R}$ , then the following holds.

(a) $P_{A S}^{j} \subset P_{R T}^{j}$ .
(b) $P_{R T}^{j} \subseteq P_{A S}^{k + j}$ .
(c) If $V \in S_{0}$ , then $P_{V T}^{j} \subseteq P_{V S}^{j}$ .
(d) If $V \in S_{0}$ , then $P_{V S}^{j} \subseteq P_{V T}^{j}$ .
(e) If either k = 0 or $P_{A S}^{k} = P_{A S}^{k + 1}$ , then $P_{A S}^{j} = P_{R T}^{j}$ and $P_{V S}^{j} = P_{V T}^{j}$ for every j ≥ k and $V \in S_{0}$ . In particular, ${P_{S S}^{I} : S \in S} = {P_{T T}^{I} : T \in T}$ . Moreover, if k = 0, then also all intermediate segmentations are equal: ${P_{S S}^{j} : S \in S} = {P_{T T}^{j} : T \in T}$ for all j ≥ 0.

Proof

All properties (a)-(d) are proved by induction on j and they are obvious for j = 0.

(a) To make an inductive step, assume that $D^{'} = P_{A S}^{j}$ is a subset of $D = P_{R T}^{j}$ and put $B = ⋃ S_{0}$ . Since A ⊂ R, conditions (c) and (d) from Fact 3.8 give $P_{A S}^{j + 1} = P_{A B}^{D^{'}} \subseteq P_{R B}^{D^{'}} \subseteq P_{R B}^{D} = P_{R T}^{j + 1}$ .

(b) To make an inductive step, assume that $D^{'} = P_{R T}^{j}$ is a subset of $D = P_{A S}^{k + j}$ . First note that $T \ {R} = S_{0}$ . To see this, it is enough to show that $R \notin S_{0}$ . But if there is an $S \in S_{0}$ such that S = R, then $S \ A \subseteq P_{A S}^{k + 1} \cap (E \cup P_{S S}^{1})$ , where E is as in Lemma 3.2. Since, by Lemma 3.2 and Theorem 2.4, this last set is empty, we get S ⊆ A. But we have also A ⊆ R = S, so $A = S \in S_{0}$ , contradicting the definition of $S_{0}$ .

Let $B = ⋃ S_{0} = ⋃ (S \ {A}) = ⋃ (T \ {R})$ , put $D^{″} = P_{A S}^{k} \subseteq D$ , and notice that $R \subseteq A \cup P_{A S}^{k + 1} = A \cup P_{A B}^{D^{″}} \subseteq A \cup P_{A B}^{D}$ follows from Fact 3.8(d). Then conditions (d) and (e) from Fact 3.8 give $P_{R T}^{j + 1} = P_{R B}^{D^{'}} \subseteq P_{R B}^{D} \subseteq P_{A B}^{D} = P_{A S}^{k + j + 1}$ , completing the proof of (b).

(c) To make an inductive step, assume that it is true for some j, that is, that $D = P_{V S}^{j}$ contains $D^{'} = P_{V T}^{j}$ . Since $B^{'} = ⋃ (S \ {V}) = A \cup ⋃ (S_{0} \ {V})$ is a subset of $B = ⋃ (T \ {V}) = R \cup ⋃ (S_{0} \ {V})$ , conditions (b) and (d) from Fact 3.8 give $P_{V T}^{j + 1} = P_{V B}^{D^{'}} \subseteq P_{V B^{'}}^{D^{'}} \subseteq P_{V B^{'}}^{D} = P_{V S}^{j + 1}$ .

(d) To make an inductive step, assume that $D^{'} = P_{V S}^{j}$ is a subset of $D = P_{V T}^{j}$ . Let $B_{0} = ⋃ (S_{0} \ {V})$ . Then $B^{'} = ⋃ (S \ {V}) = B_{0} \cup A$ is a subset of $B = ⋃ (T \ {V}) = B_{0} \cup R$ . Notice that it is enough to prove that $P_{V B^{'}}^{D^{'}} \subseteq P_{V B}^{D^{'}}$ since this and Fact 3.8(d) imply $P_{V S}^{j + 1} = P_{V B^{'}}^{D^{'}} \subseteq P_{V B}^{D^{'}} \subseteq P_{V B}^{D} = P_{V T}^{j + 1}$ .

To show $P_{V B^{'}}^{D^{'}} \subseteq P_{V B}^{D^{'}}$ take a $c \in P_{V S}^{j + 1} = P_{V B^{'}}^{D^{'}}$ . Then μ^C(c, V) > μ^C\D’(c, B’). We need to prove that

μ^{C} (c, V) > μ^{C \ D^{'}} (c, B) .

(12)

If μ^C\D’(c, B’) ≥ μ^C\D’(c, B), then μ^C(c, V) > μ^C\D’(c, B’) ≥ μ^C\D’(c, B) proving inequality (12). Thus, by way of contradiction, we can assume that μ^C\D’ (c, B’) < μ^C\D’ (c, B). We will find v ∈ V , r ∈ B, a ∈ A, and D₀ ⊆ C such that

μ^{C \ D^{'}} (c, a) = μ^{C \ D^{'}} (r, a) = μ^{C} (r, a) > μ^{C \ D_{0}} (r, v) = μ^{C} (c, V) .

(13)

First notice that (13) gives us a desired contradiction, since then a ∈ B’ implies μ^C\D’(c, B’) ≥ μ^C\D’(c, a) > μ^C(c, V) contradicting $c \in P_{V B^{'}}^{D^{'}}$ . Thus to finish the proof it is enough to show (13).

First, we will choose an appropriate r. Let p₀ = 〈c₁, . . . , c_l〉 be a strongest path in C \ D’ from c to B and let m ∈ {1, . . . , l} be minimal such that r = c_m ∈ B. Then μ^C\D’(c, r) ≥ μ(p) ≥ μ(p₀) = μ^C\D’(c, B) ≥ μ^C\D’(c, r), where p = 〈c₁, . . . , c_m〉. In particular, μ(p) = μ^C\D’(c, r) = μ^C\D’(c, B).

Let a ∈ A be such that there is a path q from r to a which is a nice path from r to A. Since μ^C\D’(c, r) = μ^C\D’(c, B) > μ^C\D’(c, B’) ≥ μ^C\D’(c, a) the equation μ^C\D’(c, a) = μ^C\D’(r, a) follows from (4).

To show μ^C\D’(r, a) = μ^C(r, a) note that $r \in B \ B^{'} = R \ A \subseteq P_{A S}^{k + 1} \subseteq P_{A S}^{I}$ , since μ^C\D’(c, r) = μ^C\D’(c, B) > μ^C\D’(c, B’). In particular, since q is a nice path from r to A, then, by Fact 3.4, q is in $P_{A S}^{I} \subseteq C \ P_{V S}^{I} \subseteq C \ P_{V S}^{j} = C \ D^{'}$ . As μ^C(r, a) = μ(q), Fact 3.1 implies μ^C\D’(r, a) = μ^C (r, a).

Next, we need to choose D₀ and v ∈ V. Let q be a path from c to v which is a nice path from c to V . Then μ^C(c, v) = μ^C(c, V). Since $r \in P_{A S}^{k + 1} = ⋃_{n \leq k} Q_{A S}^{n}$ , there is an n ≤ k with $r \in Q_{A S}^{n} = {c \in C \ P_{A S}^{n} : μ^{C} (c, A) > μ^{C \ P_{A S}^{n}} (c, ⋃ S_{0})}$ . We put $D_{0} = P_{A S}^{n}$ . Then $μ^{C} (r, a) = μ^{C} (r, A) > μ^{C \ D_{0}} (r, ⋃ S_{0}) \geq μ^{C \ D_{0}} (r, v)$ .

To prove μ^C\D₀(r, v) = μ^C(c, V) it is enough to show μ^C\D₀(r, v) = μ^C\D₀(c, v) and μ^C\D₀(c, v) = μ^C(c, V). Recall that μ(p) = μ^C\D’(c, r) = μ^C\D’(c, B), where p is in $C \ D_{0} = C \ P_{A S}^{n}$ since {c₁, . . . , c_{m-1}} is disjoint with $B \supseteq P_{A S}^{n}$ , while $c_{m} = r \in Q_{A S}^{n} \subset C \ P_{A S}^{n}$ . By this and a part of (13) proved so far, μ^C\D₀(c, r) ≥ μ(p) = μ^C\D’(c, B) > μ^C\D’(c, B’) ≥ μ^C\D’(c, a) > μ^C\D₀(r, v). So, by (4), we get μ^C\D₀(r, v) = μ^C\D₀(c, v).

The equation μ^C\D₀(c, v) = μ^C(c, V) follows from Fact 3.1, since q, as a nice path from $c \in P_{V S}^{j + 1}$ to V , is in $P_{V S}^{j + 1} \subseteq C \ P_{A S}^{n} = C \ D_{0}$ . This finishes the proof of (d).

(e) Parts (c) and (d) imply that $P_{V S}^{j} = P_{V T}^{j}$ for every $V \in S_{0}$ and j ≥ 0.

If j ≥ k, then $P_{A S}^{j} = P_{R T}^{j}$ follows from $P_{R T}^{j} \subseteq P_{A S}^{k + j} = P_{A S}^{j} \subseteq P_{R T}^{j}$ . Here $P_{R T}^{j} \subseteq P_{A S}^{k + j}$ follows from (b); equation $P_{A S}^{k + j} = P_{A S}^{j}$ is obvious when k = 0 and is proved by an easy induction when $P_{A S}^{k} = P_{A S}^{k + 1}$ ; inclusion $P_{A S}^{j} \subseteq P_{R T}^{j}$ is a restatement of (a).

Proof of Theorem 2.5

First notice that Lemma 3.9(e) implies that (*) the theorem is true if T_i = S_i for every i ≥ 2.

Now, the general form of the theorem follows from (*) by induction on m. Indeed, for 0 ≤ l ≤ m and i ∈ {1, . . . , m} put $T_{i}^{l} = T_{i}$ for i ≤ l and $T_{i}^{l} = S_{i}$ otherwise. Let $T_{l} = {T_{1}^{l}, \dots, T_{m}^{l}}$ . Then $T_{0} = S$ , $T_{m} = T$ , and to every pair $〈 T_{k}, T_{k + 1} 〉$ we can apply (*). Thus, applying it m times, we get that $P_{S_{i} S}^{j} = P_{T_{i}^{0} T_{0}}^{j} = P_{T_{i}^{1} T_{1}}^{j} = \dots = P_{T_{i}^{m} T_{m}}^{j} = P_{T_{i} T}^{j}$ for every i ∈ {1, . . . , m} and an appropriate j.

Proof of Corollary 2.7

Let U_i = {s_i, t_i} and put $U = {U_{1}, \dots, U_{m}}$ , $S = {{s_{1}}, \dots, {s_{m}}}$ , and $T = {{t_{1}}, \dots, {t_{m}}}$ . Then, by Theorem 2.5 (version with k = 1), for every i ∈ {1, . . . , m} we have $P_{s_{i} S}^{j} = P_{{s_{i}} S}^{j} = P_{U_{i} U}^{j}$ . To finish the proof it is enough to show that

s_{i} \in P_{t_{i} T}^{1} for every i \in {1, \dots, m},

(†)

since then, again by Theorem 2.5, $P_{t_{i} T}^{j} = P_{{t_{i}} T}^{j} = P_{U_{i} U}^{j} = P_{s_{i} S}^{j}$ for every i ∈ {1, . . . , m}.

First notice that for every distinct i, k ∈ {1, . . . , m}

μ^{C} (s_{i}, t_{k}) = μ^{C} (t_{i}, s_{k}) .

(14)

Indeed, since $t_{k} \in P_{s_{k} S}^{1}$ we have μ^C (t_k, s_k) > μ^C (t_k, s_i). Therefore, by (4), μ^C(t_k, s_i) = μ^C(s_i, s_k). Similarly, $t_{i} \in P_{s_{i} S}^{1}$ implies μ^C(t_i, s_i) > μ^C (t_i, s_k) so, by (4), μ^C(t_i, s_k) = μ^C(s_i, s_k). This proves (14).

Now, to prove (†) take distinct i, k ∈ {1, . . . , m}. We need to show that μ^C(s_i, t_i) > μ^C(s_i, t_k). But $t_{i} \in P_{s_{i} S}^{1}$ implies μ^C(t_i, s_i) > μ^C(t_i, s_k). Combining this with (14) gives μ^C(s_i, t_i) = μ^C(t_i, s_i) > μ^C(t_i, s_k) = μ^C(s_i, t_k).

3.4 How to choose seed generating families S?

In a general setting, the title question is well beyond the scope of this paper. What we will discuss here is only its very restricted version: Given $S$ , how to modify it to get either the same or a better segmentation?

The first of the results presented here estimates the size of minimal subsets T_S of $P_{S S}^{1}$ for which the segmentations ${P_{S S}^{I} : S \in S}$ and ${P_{T_{S} T}^{I} : S \in S}$ are equal, where $T = {T_{S} : S \in S}$ .

Proposition 3.10

For every $A \in S$ let $U_{A} = {P_{{s} S_{A}}^{1} : s \in A} \ {\emptyset}$ , where $S_{A} = S \ {A}$ . Then

(a) Sets in $U_{A}$ are pairwise disjoint.
(b) If $T \subseteq P_{A S}^{1} = P_{A S_{A}}^{1}$ , then $P_{T S_{A}}^{1} = P_{A S_{A}}^{1}$ if and only if T intersects every $P \in U_{A}$ .

In particular, if for every $A \in S$ we choose a $T_{A} \subseteq P_{A S}^{1}$ which intersects every $P \in U_{A}$ and put $T = {T_{A} : A \in S}$ , then ${P_{S S}^{j} : S \in S} = {P_{T_{S} T}^{j} : S \in S}$ for every j ≥ 0.

Proof

(a) If $u \in P_{{s} S_{A}}^{1} \cap P_{{t} S_{A}}^{1}$ for some s, t ∈ A, then, by Corollary 2.7, $P_{{s} S_{A}}^{1} = P_{{u} S_{A}}^{1} = P_{{t} S_{A}}^{1}$ .

(b) Let A₀ ⊂ A be minimal such that ${P_{{s} S_{A}}^{1} : s \in A_{0}} = U_{A}$ . By (5) we have $T \subseteq P_{A S}^{1} = ⋃ U_{A}$ . Thus, for every t ∈ T there is a unique a_t ∈ A₀ such that $t \in P_{{a_{t}} S_{A}}^{1}$ . Note that $P_{{t} S_{A}}^{1} = P_{{a_{t}} S_{A}}^{1}$ follows from Corollary 2.7. Let A₁ = {a_t: t ∈ T }. Then, by (5), $P_{T S_{A}}^{1} = ⋃_{a \in A_{1}} P_{{a} S_{A}}^{1} \subseteq ⋃_{a \in A_{0}} P_{{a} S_{A}}^{1} = P_{A S_{A}}^{1}$ and the equation holds precisely when A₁ = A₀, that is, when T intersects every $P \in U_{A}$ .

The value of Proposition 3.10 comes from the fact that, usually, the size of $U_{A}$ is quite small, even if the set A is quite big. Note also that the equation ${P_{S S}^{I} : S \in S} = {P_{T_{S} T}^{I} : S \in S}$ may hold for sets $T_{A} \subseteq P_{A S}^{1}$ which do not intersect every $P \in U_{A}$ . Such a situation is described in Example 3.16.
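As an illustration of how Proposition 3.10 could be used to thin out a large seed set A, the following minimal Python sketch keeps one seed of A for each distinct non-empty RFC object $P_{{s} S_{A}}^{1}$. It is not taken from the paper: the function names `connectivity`, `rfc_object`, and `reduced_seed_set`, the dictionary representation of the affinity, and the toy scene are all assumptions made for the example, and the connectivity routine is a generic Dijkstra-style max-min path search rather than the κFOEMS algorithm of [18].

```python
import heapq
from itertools import count

def connectivity(kappa, spels, seeds):
    """Strength of a strongest (max-min) path from each spel to the seed set."""
    strength = {c: 0.0 for c in spels}
    tie = count()  # tie-breaker so the heap never compares spels directly
    heap = []
    for s in seeds:
        strength[s] = 1.0
        heapq.heappush(heap, (-1.0, next(tie), s))
    while heap:
        neg, _, c = heapq.heappop(heap)
        if -neg < strength[c]:
            continue  # stale heap entry
        for d, w in kappa.get(c, {}).items():
            cand = min(-neg, w)
            if cand > strength[d]:
                strength[d] = cand
                heapq.heappush(heap, (-cand, next(tie), d))
    return strength

def rfc_object(kappa, spels, seed, background):
    """RFC object of a single seed relative to the background seeds."""
    f_s = connectivity(kappa, spels, {seed})
    f_w = connectivity(kappa, spels, background)
    return frozenset(c for c in spels if f_s[c] > f_w[c])

def reduced_seed_set(kappa, spels, A, background):
    """Keep one seed of A per distinct non-empty RFC object (hypothetical helper)."""
    seen, kept = set(), set()
    for s in sorted(A):
        obj = rfc_object(kappa, spels, s, background)
        if obj and obj not in seen:
            seen.add(obj)
            kept.add(s)
    return kept

# Hypothetical toy scene: affinities are symmetric, missing links mean affinity 0.
kappa = {
    "s1": {"s2": 0.9},
    "s2": {"s1": 0.9, "u": 0.2},
    "s3": {"u": 0.3},
    "u": {"s2": 0.2, "s3": 0.3},
}
spels = ["s1", "s2", "s3", "u"]
kept = reduced_seed_set(kappa, spels, {"s1", "s2", "s3"}, {"u"})
```

In this toy scene the seeds s1 and s2 generate the same RFC object, so only one of them is kept together with s3.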

Lemma 3.11

Let $A \in S$ and $E = {s \in A : μ^{C} (s, T) = 1 for some T \in S_{0}}$ , where $S_{0} = S \ {A}$ . If A₀ = A\ E, then $P_{A_{0} S_{0}}^{j} = P_{A S_{0}}^{j}$ for every j ≥ 0.

Proof

Inclusion $P_{A_{0} S_{0}}^{j} \subseteq P_{A S_{0}}^{j}$ follows from Lemma 3.9(a). We just need to show that $P_{A S_{0}}^{j} \subseteq P_{A_{0} S_{0}}^{j}$ . This will be proved by induction on j ≥ 0.

For j = 0 it is obvious, as both sets are empty. So, assume that for some j we have $P_{A S_{0}}^{j} \subseteq P_{A_{0} S_{0}}^{j}$ . We need to prove that $P_{A S_{0}}^{j + 1} \subseteq P_{A_{0} S_{0}}^{j + 1}$ . For this, choose a $c \in P_{A S_{0}}^{j + 1}$ . We need to show that $c \in P_{A_{0} S_{0}}^{j + 1}$ .

So, fix a $T \in S_{0}$ . We need to prove $μ^{C} (c, A_{0}) > μ^{C \ P_{A_{0} S_{0}}^{j}} (c, T) = μ^{C \ P_{A S_{0}}^{j}} (c, T)$ , where the equation follows from our inductive assumption that $P_{A S_{0}}^{j} = P_{A_{0} S_{0}}^{j}$ . However, since $c \in P_{A S_{0}}^{j + 1}$ , we have $μ^{C} (c, A) > μ^{C \ P_{A S_{0}}^{j}} (c, T)$ . Thus, to finish the proof, it is enough to show that

μ^{C} (c, A_{0}) \geq μ^{C} (c, A) .

(15)

By way of contradiction, assume that (15) is false. Then μ^C(c, A) > μ^C(c, A₀). Let a ∈ A\ A₀ ⊆ E be such that μ^C(c, a) = μ^C(c, A). Let $T \in S_{0}$ be such that μ^E(a, T) = μ^C(a, T) = 1 and let q be a path in E from a to T with μ(q) = 1. Also, let p = 〈c₁, . . . , c_l〉 be a strongest path from c to a. Thus, μ(p) = μ^C(c, a) = μ^C(c, A) > μ^C(c, A₀). If p is disjoint with $P_{A S_{0}}^{j}$ then so is p + q and $μ^{C \ P_{A S_{0}}^{j}} (c, T) \geq μ (p + q) = μ (p) = μ^{C} (c, A)$ contradicting $c \in P_{A S_{0}}^{j + 1}$ . So, assume that p intersects $P_{A S_{0}}^{j} = P_{A_{0} S_{0}}^{j} = ⋃_{k < j} Q_{A_{0} S_{0}}^{k}$ . Let k < j be minimal such that p intersects $Q_{A_{0} S_{0}}^{k}$ and let n ∈ {1, . . . , l} be such that $c_{n} \in Q_{A_{0} S_{0}}^{k}$ . Then $μ^{C} (c_{n}, A_{0}) > μ^{C \ P_{A S_{0}}^{k}} (c_{n}, T) \geq μ (〈 c_{n}, \dots, c_{l} 〉 + q) = μ (〈 c_{n}, \dots, c_{l} 〉) \geq μ (p) = μ^{C} (c, A)$ . Also, μ^C(c, c_n) ≥ μ(〈c₁, . . . , c_n〉) ≥ μ(p) = μ^C(c, A). So, μ^C(c, A₀) ≥ min{μ^C(c, c_n), μ^C(c_n, A₀)} ≥ μ^C(c, A), which contradicts the assumption μ^C(c, A) > μ^C(c, A₀) and finishes the proof.

Recall that $E_{S} = ⋃_{A \in S} {s \in S : μ^{C} (s, t) = 1 for some t \in T \in S \ {A}}$ .

Proposition 3.12

For every $S \in S$ , there exists a T_S containing $S \ E_{S}$ such that if $T = {T_{S} : S \in S}$ , then $⋃ T = ⋃ S$ , $T \subseteq P_{T T}^{1}$ for every $T \in T$ , and $P_{S S}^{j} \subseteq P_{T_{S} T}^{j}$ for every $S \in S$ and j ≥ 0.

Proof

For s ∈ C let [s] = {t ∈ C : μ^C(s, t) = 1}. Thus, each [s] is an equivalence class of an equivalence relation ∼ on C defined by s ∼ t if and only if μ^C(s, t) = 1. In particular, the sets in $F = {[s] \cap ⋃ S : s \in ⋃ S}$ are nonempty and pairwise disjoint. Let $W \subseteq ⋃ S$ be a selector of $F$ , that is, such that W intersects each $[s] \cap ⋃ S$ at precisely one element. Define $T_{S} = ⋃ {[s] \cap ⋃ S : s \in S \cap W}$ . We will just sketch the proof that these sets are as desired.

Clearly $⋃ T = ⋃ S$ , as for every $s \in ⋃ S$ there are $S \in S$ and w ∈ W ∩ S such that s ∈ [w], so $s \in [w] \cap ⋃ S \subseteq T_{S} \subseteq ⋃ T$ .

Next, fix an $S \in S$ . To see that $P_{S S}^{j} \subseteq P_{T_{S} T}^{j}$ put $S_{0} = S \ {S}$ and notice that $S \ E_{S} \subseteq T_{S}$ and that $Z = C \ (S \ E_{S})$ contains the union of $T_{0} = T \ {T_{S}}$ . Thus

P_{S S}^{j} = P_{S S_{0}}^{j} = P_{S \ E_{S} S_{0}}^{j} = P_{S \ E_{S} {Z}}^{j} \subseteq P_{T_{S} {Z}}^{j} \subseteq P_{T_{S} T_{0}}^{j} = P_{T_{S} T}^{j} .

Here, the second equation follows from Lemma 3.11, the first inclusion from Fact 3.8(c), while the second inclusion is a consequence of Fact 3.8(b). The proof of the third equation is very similar to that of Lemma 3.11 and uses the fact that any [c] intersecting Z intersects also $⋃ S_{0}$ . (This proof relies also on the fact that every strongest path p between spels in [c] is in $P^{[c]}$ and that $[c] \cap P_{S S}^{j} \neq \emptyset$ implies $[c] \subseteq P_{S S}^{j}$ .)

The inclusion $T \subseteq P_{T T}^{1}$ follows from Lemma 3.2 and the fact that $E_{T} = \emptyset$ .

3.5 Examples

In this subsection, we will present the examples announced earlier in this paper, which show different limitations for our results. The examples are presented in a graphical form, where vertices represent spels from a given scene while a number next to an edge of a graph represents the affinity between the connected vertices. Lack of an edge between vertices means that the affinity between the spels they represent is equal to 0.

Our first example shows that, unlike a nice path, a strongest path from an $a \in P_{d S}^{j}$ to d need not be contained in $P_{d S}^{j}$ .

Example 3.13

Assume that a scene $C$ contains spels a, b, c, d, and s, connected as in Figure 6(a). Let S = {d, s}. Then $P_{s S}^{1} = {c, s}$ and $P_{d S}^{1} = {b, d}$ . Also, $a \in P_{d S}^{2}$ , since $μ^{C} (a, d) = .5 > 0 = μ^{C \ P_{d S}^{1}} (a, s)$ . However, the path p = 〈a, b, c, d〉 is strongest between $a \in P_{d S}^{2}$ and d, but it is not inside $P_{d S}^{2}$ .

The following example shows that the iterative analog of formula (5) is false.

Example 3.14

Assume that a scene $C$ contains spels s, t, u, and c, connected as in Figure 6(b). Let S = {s, t}, U = {u}, and $S = {S, U}$ . Then $P_{U S}^{I} = {u}$ , $P_{S S}^{1} = {s, t}$ , and $P_{S S}^{I} = P_{S S}^{2} = {s, t, c}$ . However, $P_{s {s, u}}^{I} = P_{s {s, u}}^{1} = {s}$ and $P_{t {t, u}}^{I} = P_{t {t, u}}^{1} = {t}$ , showing that $⋃_{s \in S} P_{s {s, u}}^{I} = {s, t} \neq {s, t, c} = P_{S S}^{I}$ .

The following example shows that, in Corollary 2.7, we cannot weaken the assumptions to $t_{i} \in P_{s_{i} S}^{I}$ , even if we also weaken the conclusion to $P_{t_{i} T}^{I} = P_{s_{i} S}^{I}$ .

Example 3.15

Assume that a scene $C$ contains spels a, s, and t, connected as in Figure 7(a). Let S = {s, t}. Then for j > 1 we have $P_{s S}^{1} = {s} ⊊ P_{s S}^{j} = {a, s}$ and $P_{t S}^{1} = P_{t S}^{j} = {t}$ . However, if we replace a seed s with $a \in P_{s S}^{2}$ and put T = {a, t}, then for every i > 0 and j > 1, we have $P_{t T}^{i} = P_{t S}^{i} = {t}$ , and $P_{a T}^{i} = {a} \neq P_{s S}^{j}$ .

The next example shows the limitations of the result from Proposition 3.10.

Example 3.16

Assume that a scene $C$ contains spels s, t, u, and c, connected as in Figure 7(b). Let S = {s, t}, U = {u}, and $S = {S, U}$ . Then $P_{s U}^{1} = {s, c}$ is disjoint with $P_{t U}^{1} = {t}$ . However, although T_S = {s} does not intersect $P_{t U}^{1}$ , we still have $P_{T_{S} U}^{I} = P_{t U}^{2} = {s, c, t} = P_{S S}^{I}$ .

4 The algorithm

In this section, we present an algorithm, called κIRMOFC (an abbreviation for iterative relative multi object fuzzy connectedness), which allows a set of seeds for each object. Within this algorithm, the algorithm κFOEMS, described in [18] for multi seeded AFC, is called. κFOEMS takes as input a given scene $C = 〈 C, f 〉$ , an affinity function κ, and a set S ⊂ C of seeds. Its output is a connectivity scene $C_{κ, S} = 〈 C, f_{κ, S} 〉$ , where f_κ,S(c) represents the strength of a κ-strongest path from c to S. Aspects related to the computational efficiency of algorithm κFOEMS have been addressed in [20,21]. For A ⊂ C, by the restriction of κ to A we will understand an affinity κ’ on C such that, for every distinct c, d ∈ C, we have κ’(c, d) = κ(c, d) for c, d ∈ A, and κ’(c, d) = 0 otherwise. In the algorithm κIRMOFC, we will use the fact that, for distinct c, d ∈ C, the number μ^A(c, d) is equal to μ^C(c, d) calculated with respect to the restriction of κ to A.
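The two ingredients described above, a connectivity scene for a seed set and the restriction of an affinity to a subset, can be sketched in Python as follows. This is only an illustrative stand-in: it is a generic Dijkstra-style max-min path search, not the κFOEMS algorithm of [18], and the names `restrict_affinity` and `connectivity_scene` as well as the nested-dictionary representation of κ are assumptions made for the example.

```python
import heapq
from itertools import count

def restrict_affinity(kappa, allowed):
    """Restriction of kappa to `allowed`: links leaving the set get affinity 0,
    which is represented here by simply dropping them."""
    return {c: {d: w for d, w in nbrs.items() if d in allowed}
            for c, nbrs in kappa.items() if c in allowed}

def connectivity_scene(kappa, spels, seeds):
    """For every spel c, the strength of a strongest path from c to the seeds,
    where the strength of a path is its weakest link."""
    strength = {c: 0.0 for c in spels}
    tie = count()  # tie-breaker so the heap never compares spels directly
    heap = []
    for s in seeds:
        strength[s] = 1.0  # affinity is reflexive, so a seed reaches itself with 1
        heapq.heappush(heap, (-1.0, next(tie), s))
    while heap:
        neg, _, c = heapq.heappop(heap)
        if -neg < strength[c]:
            continue  # stale heap entry
        for d, w in kappa.get(c, {}).items():
            cand = min(-neg, w)
            if cand > strength[d]:
                strength[d] = cand
                heapq.heappush(heap, (-cand, next(tie), d))
    return strength

# Hypothetical three-spel scene a - b - c with affinities 0.8 and 0.3.
kappa = {"a": {"b": 0.8}, "b": {"a": 0.8, "c": 0.3}, "c": {"b": 0.3}}
spels = ["a", "b", "c"]
full = connectivity_scene(kappa, spels, {"a"})
cut = connectivity_scene(restrict_affinity(kappa, {"a", "c"}), spels, {"a"})
```

With the full affinity, c is connected to the seed a with strength 0.3; after restricting κ to {a, c} the only path through b is cut, so the strength drops to 0.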

Algorithm κIRMOFC

Input

$C = 〈 C, f 〉$ , κ as defined in Section 2, a family $S = {S_{1}, S_{2}, \dots, S_{m}}$ of pairwise disjoint sets of seed spels such that κ(s, t) < 1 for any s and t from distinct sets from $S$ .

Output

For each S in $S$ , iteratively defined fuzzy κ-object $P_{S S}^{I}$ containing S and relative to a background containing $W = ⋃ (S \ {S})$ .

Auxiliary Data Structures

For each $S \in S$ , the κ-connectivity scene $C_{κ, S} = 〈 C, f_{κ, S} 〉$ , the κ_S-connectivity scenes $C_{κ_{S}, W} = 〈 C, f_{κ_{S}, W} 〉$ , where κ_S is the restriction of κ to $C \ P_{S S}^{j}$ , and the temporary scenes $C_{S} = 〈 C, f_{S} 〉$ such that f_S corresponds to the characteristic function of $P_{S S}^{j}$ . Index j refers to the iteration level; that is, the number of completed while loops, in Steps 5-16, for each fixed S.

begin

1. for each $S \in S$ do

2. compute $C_{κ, S}$ by using κFOEMS;

3. set all elements of $C_{S}$ to 0 (this corresponds to setting $P_{S S}^{0} = \emptyset$ );

4. set κ_S = κ and flag = true;

5. while flag = true do

6. set flag = false;

7. compute $C_{κ_{S}, W}$ by using κFOEMS;

8. for all c ∈ C do

9. if f_S(c) = 0 and f_κ,S(c) > f_{κ_S,W}(c) then

10. set f_S(c) = 1;

11. set flag = true;

12. for all d ∈ C, d ≠ c, do

13. set κ_S(c, d) = 0;

14. endfor;

15. endif;

16. endfor;

17. endwhile;

18. output $P_{S S}^{I} = {c \in C : f_{S} (c) = 1}$ ;

19. endfor;

end

In the above algorithm each run of the loop of Steps 2-18 is independent of the other runs and can be considered as a subroutine (similar to algorithm κIFROE from [15]) which, for seeds S and W, returns an IRFC object containing S and relative to a background containing W. The value of flag records whether, in the previous run of the loop in Steps 6-16, at least one spel was added to the object $P_{S S}^{I}$ (i.e., the value of f_S(c) changed from 0 to 1). Since the number of spels c ∈ C is finite, eventually no change is made and the loop terminates. Each time the algorithm enters the loop in Steps 6-16, f_S is the characteristic function of the approximation $P_{S S}^{j}$ of $P_{S S}^{I}$ obtained at the previous, say jth, stage, while κ_S is the restriction of κ to $C \ P_{S S}^{j}$ . Notice that this situation remains true when Steps 6-16 of the next stage are completed. Indeed, the loop of Steps 9-15 is entered for each c, and the body of the if statement is executed only if c is not yet in $P_{S S}^{j}$ and the inequality $μ^{C} (c, S) = f_{κ, S} (c) > f_{κ_{S}, W} (c) = μ^{C \ P_{S S}^{j}} (c, W)$ indicates that c is to be added to $P_{S S}^{j + 1}$ . This is done at Step 10, while the loop in Steps 12-14 restricts the current κ_S to C \ {c}. Thus, when Steps 9-15 are finished, all spels c from $C \ P_{S S}^{j}$ for which $μ^{C} (c, S) > μ^{C \ P_{S S}^{j}} (c, W)$ are added to $P_{S S}^{j + 1}$ , and the new κ_S is the restriction of the old κ_S to $⋂_{c \in P_{S S}^{j + 1} \ P_{S S}^{j}} (C \ {c})$ , so it is the restriction of κ to the set $(C \ P_{S S}^{j}) \cap ⋂_{c \in P_{S S}^{j + 1} \ P_{S S}^{j}} (C \ {c}) = C \ P_{S S}^{j + 1}$ . The argument from this paragraph justifies the following result.

Proposition 4.1

For any scene $C = 〈 C, f 〉$ over $〈 Z^{n}, α 〉$ , for any fuzzy affinity relation κ in $C$ , and for any non-empty family $S$ of non-empty pairwise disjoint subsets of C such that κ(s, t) < 1 for any s and t from distinct sets from $S$ , algorithm κIRMOFC terminates, $S \subset P_{S S}^{I}$ for every $S \in S$ , and the family ${P_{S S}^{I} : S \in S}$ is the IRFC segmentation of $C$ .
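The structure of algorithm κIRMOFC can be illustrated by the following minimal Python sketch. It follows the steps of the pseudocode above under several assumptions that are not part of the paper: the affinity is stored as a nested dictionary, the connectivity routine is a generic Dijkstra-style max-min search standing in for κFOEMS, the restriction of κ to $C \ P_{S S}^{j}$ is realized by skipping removed spels instead of zeroing entries of κ_S, and the function names and the toy scene are hypothetical.

```python
import heapq
from itertools import count

def connectivity(kappa, spels, seeds, removed=frozenset()):
    """Max-min path strength from each spel to `seeds`, using only spels
    outside `removed` (this plays the role of restricting kappa)."""
    strength = {c: 0.0 for c in spels}
    tie = count()
    heap = []
    for s in seeds:
        strength[s] = 1.0
        heapq.heappush(heap, (-1.0, next(tie), s))
    while heap:
        neg, _, c = heapq.heappop(heap)
        if -neg < strength[c]:
            continue  # stale heap entry
        for d, w in kappa.get(c, {}).items():
            if d in removed:
                continue  # links into the current object have affinity 0
            cand = min(-neg, w)
            if cand > strength[d]:
                strength[d] = cand
                heapq.heappush(heap, (-cand, next(tie), d))
    return strength

def irfc_object(kappa, spels, S, W):
    """IRFC object for seeds S relative to background seeds W."""
    f_S = connectivity(kappa, spels, S)            # Step 2
    obj = set()                                    # Step 3: P^0 is empty
    changed = True                                 # Step 4
    while changed:                                 # Step 5
        changed = False                            # Step 6
        f_W = connectivity(kappa, spels, W, removed=obj)   # Step 7
        new = {c for c in spels                    # Steps 8-16
               if c not in obj and f_S[c] > f_W[c]}
        if new:
            obj |= new
            changed = True
    return obj                                     # Step 18

def irmofc(kappa, spels, seed_sets):
    """One IRFC object per seed set; the other seed sets form the background."""
    objects = []
    for i, S in enumerate(seed_sets):
        W = set().union(*(T for j, T in enumerate(seed_sets) if j != i))
        objects.append(irfc_object(kappa, spels, S, W))
    return objects

# Hypothetical scene: c ties between s and t in the first (RFC) iteration,
# but its only route to t runs through x, which already belongs to s's object.
kappa = {
    "s": {"x": 0.9},
    "x": {"s": 0.9, "t": 0.5, "c": 0.5},
    "t": {"x": 0.5},
    "c": {"x": 0.5},
}
spels = ["s", "x", "t", "c"]
objects = irmofc(kappa, spels, [{"s"}, {"t"}])
```

In this toy scene the first iteration assigns only s and x to the first object; the second iteration adds c, because once x is removed no path from c to t remains. The object of t stays equal to {t}.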

5 Results and evaluation

5.1 Qualitative Evaluation

In this section, we present the results of application of the IRFC method and compare them with the results obtained by using RFC. Specifically, we present qualitative results of the following three experiments: (1) segmentation of individual vertebrae from a 3D CT scene of a human cervical spine; (2) artery/vein separation in contrast-enhanced MR angiograms; (3) segmentation of white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF) in simulated MR scenes obtained from the BrainWebMR simulator [22].

Figure 8. (a) An axial slice from the CT scene of a patient’s cervical spine. The contact area between the two cervical vertebrae C1 and C2 is shown by an arrow. (b) A surface rendition of the vertebral column consisting of three vertebrae segmented by using AFC. (c) A Maximal Intensity Projection (MIP) rendition of a 3D contrast enhanced MR angiography scene of the body region from belly to knee. (d) A surface rendition of the entire vascular tree segmented by AFC from this scene.

The aim of our first experiment is to compare the performances of RFC and IRFC in segmenting the individual vertebrae. Figure 8(a) displays a region of interest from a slice in the 3D CT data (size: 512 × 512 × 77, voxel size 0.23×0.23×1.0 mm³). In CT scenes, bones appear bright, and it is not difficult to segment them from the rest of the body region. Figure 8(b) displays a surface rendition of the cervical spine column after segmenting it from other bones and soft tissues by using AFC. Here, AFC is used instead of simple thresholding since the former simultaneously removes other non-vertebral bone regions which otherwise would have to be segmented by using a subsequent connectivity analysis. Also, AFC outperforms simple thresholding and connectivity analysis for spels with partial bone occupancy. Our aim in this experiment is to segment the three vertebrae (C1-C3) from the spinal section shown in Figure 8(b). The major challenges in separating the individual vertebrae are: (1) complex shape and geometry of the contact regions between two successive vertebrae; (2) the fuzzy fusion at these junctions (see Figure 8(a)); (3) porous interior of the vertebrae due to the existence of cancellous trabecular bone. It is difficult to separate these vertebrae by using intensity-based features. Therefore, we applied a morphology-based separation through the use of RFC and IRFC methods. The following preprocessing steps were applied first. The cavities created by the trabecular bone network were separately filled in each slice to generate the bone region R_B. We used R_B to define an affinity relation κ utilized in the RFC and IRFC separations of the vertebrae as follows.

First, for a given scene 〈C, f〉, a separate bone volume fraction scene 〈C, f_B〉 was computed by setting

f_{B} (c) = \begin{cases} 1 & \text{for } c \in R_{B} \text{ and } f (c) \geq {Bone}_{\max}, \\ \frac{f (c) - {Bone}_{\min}}{{Bone}_{\max} - {Bone}_{\min}} & \text{for } c \in R_{B} \text{ and } {Bone}_{\min} < f (c) < {Bone}_{\max}, \\ 0 & \text{otherwise,} \end{cases}

where Bone_max and Bone_min represent maximal and minimal intensities of spels in R_B, respectively.
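The piecewise definition of f_B can be written as a small Python function. This is a sketch only; the function name `bone_fraction`, its argument names, and the intensity values used in the example are hypothetical and are not taken from the experiment.

```python
def bone_fraction(intensity, in_bone_region, bone_min, bone_max):
    """Bone volume fraction f_B(c) of a spel with the given intensity f(c).

    in_bone_region tells whether the spel belongs to the filled bone region R_B.
    """
    if not in_bone_region or intensity <= bone_min:
        return 0.0                      # the "otherwise" branch
    if intensity >= bone_max:
        return 1.0                      # fully bone
    # linear ramp between the two intensity limits
    return (intensity - bone_min) / (bone_max - bone_min)

# Hypothetical intensity limits, chosen only for illustration.
partial = bone_fraction(150.0, True, 100.0, 200.0)
```

With these illustrative limits, a spel of intensity 150 inside R_B receives the fraction 0.5, while the same intensity outside R_B receives 0.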

For a path p = 〈c₁, c₂, . . . , c_l〉 in C, wherein the consecutive spels are 26-adjacent, we define its fuzzy length as

π_{B} (p) = \sum_{i = 1}^{l - 1} \frac{1}{2} (f_{B} (c_{i}) + f_{B} (c_{i + 1})) \cdot distance (c_{i}, c_{i + 1}) .

(If $\frac{1}{2} (f_{B} (c_{i}) + f_{B} (c_{i + 1})) \cdot distance (c_{i}, c_{i + 1})$ is interpreted as an average bone density of the link 〈c_i, c_{i+1}〉, then π_B(p) is approximately the total bone mass of p.) The fuzzy distance transform [23] is derived from f_B as follows:

Ω_{B} (c) = \min_{d \notin R_{B}} {π_{B} (p) : p is a path with adjacent consecutive spels from c to d} .

(Under the interpretation as above, Ω_B(c) is the smallest mass of a path connecting c with the complement of R_B.) Now, affinity between spels c and d is defined as given below, where N = max_{c ∈ C}Ω_B(c):

μ_{κ} (c, d) = \begin{cases} \max \{Ω_{B} (c), Ω_{B} (d)\} ∕ N & \text{for adjacent } c \neq d, \\ 1 & \text{for } c = d, \\ 0 & \text{otherwise.} \end{cases}

(16)
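The fuzzy distance transform Ω_B and the affinity of Equation (16) can be sketched in Python as below. Several simplifications are assumptions made for brevity and are not part of the paper: the scene is a 2D grid with 8-adjacency instead of a 3D scene with 26-adjacency, spels have unit spacing, every spel of R_B can reach a spel outside R_B within the grid, and the transform is computed by a plain Dijkstra search rather than by the method of [23]. The function names are hypothetical.

```python
import heapq
import math

def fuzzy_distance_transform(f_B, region):
    """Omega_B(c): smallest fuzzy length of a path from c to a spel outside R_B.

    f_B maps grid spels (x, y) to values in [0, 1]; region is the set R_B.
    """
    omega = {c: (math.inf if c in region else 0.0) for c in f_B}
    heap = [(0.0, c) for c in f_B if c not in region]  # sources: complement of R_B
    heapq.heapify(heap)
    while heap:
        dist, c = heapq.heappop(heap)
        if dist > omega[c]:
            continue  # stale heap entry
        for dx in (-1, 0, 1):
            for dy in (-1, 0, 1):
                d = (c[0] + dx, c[1] + dy)
                if d == c or d not in f_B:
                    continue
                # link length: average membership times Euclidean distance
                step = 0.5 * (f_B[c] + f_B[d]) * math.hypot(dx, dy)
                if dist + step < omega[d]:
                    omega[d] = dist + step
                    heapq.heappush(heap, (dist + step, d))
    return omega

def affinity(omega, n_max, c, d):
    """Affinity of Equation (16); n_max is N, the maximum of Omega_B."""
    if c == d:
        return 1.0
    if max(abs(c[0] - d[0]), abs(c[1] - d[1])) == 1:   # 8-adjacent spels
        return max(omega[c], omega[d]) / n_max
    return 0.0

# Hypothetical 5 x 1 strip: three bone spels between two non-bone spels.
f_B = {(0, 0): 0.0, (1, 0): 1.0, (2, 0): 1.0, (3, 0): 1.0, (4, 0): 0.0}
region = {(1, 0), (2, 0), (3, 0)}
omega = fuzzy_distance_transform(f_B, region)
n_max = max(omega.values())
```

On this strip the central spel has the largest transform value, so the affinity between it and either of its bone neighbours equals 1, while links touching the non-bone end spels are much weaker.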

Next, RFC and IRFC algorithms were applied to 〈C, f〉 by using the affinity relation defined above on 〈C, f_B〉. The same set of seeds, selected manually, was used for both methods. The results of vertebral separation obtained by using RFC and IRFC are illustrated in Figures 9(a)-(d), (a) and (c) showing the results on a slice, and (b) and (d) depicting the result via 3D surface rendering. In both figures, voxels segmented as part of a specific vertebra are assigned the same color. In the slice display, spels shown white indicate that they were not assigned to any specific bone. Although RFC has succeeded in capturing the skeletal core of each vertebra after segmentation, it has lost most of the regions of each bone (too many white spels in the slice display) and the results are obviously not acceptable. Despite fuzzy fusion at contact regions between the vertebrae, IRFC has successfully separated them. IRFC stopped after 8, 14, and 15 iterations, respectively, for the first, second, and third vertebra. For the particular affinity function defined above, the results of RFC-based vertebral separation are similar to the results that may be obtained by using morphological erosion with a ball of appropriate size. The beauty of RFC is that, effectively, the radius of the eroding ball is automatically computed by the RFC method. The results obtained by IRFC cannot be produced by using a simple morphological operation.

The aim of our second experiment is to demonstrate how IRFC can be employed to separate arteries and veins in contrast-enhanced MR angiography scenes. MR imaging approaches [25] exist which attempt to elicit different types of signals from the arteries and veins through carefully designed imaging protocols and thereby to distinguish arteries from veins. Here, we use RFC and IRFC to separate artery/vein trees from MR scenes that are acquired by using long resident blood-pool contrast agents [26] which do not produce different signals from the arteries and veins, but which provide a better overall definition of the vessels themselves. Figure 8(c) shows a maximum intensity projection (MIP) rendition from a patient MRA scene (size: 512 × 512 × 60; resolution: 0.94 × 0.94 × 1.8 mm³) of the body region from belly to knee. Figure 8(d) shows a surface rendition of the whole fuzzy vascular structure that was segmented by using AFC from the original MRA data set. Figures 9(e) and (f) show renditions of the fuzzy arterial and venous trees separated via RFC and IRFC, respectively. Note that, in this experiment, RFC (or IRFC) was applied between arteries and veins so that when the arterial tree was segmented the venous tree served as the background and vice versa.

For this experiment, a morphology-based affinity was computed in a manner similar to that of the first experiment (Equation (16)), except that no 2D cavity filling was necessary. In this case, the algorithm stopped after nine iterations. Clearly, IRFC has captured more thin branches in the segmented arterial and venous trees than RFC. Also, the RFC segmentation of the main arterial branch on the right appears largely broken and the same is true for the main venous branch on the left. On the other hand, the main branches in the IRFC segmentation of both arterial and venous trees appear complete, continuous, and smooth.

The results of segmentation, by using RFC and IRFC, of WM, GM, CSF in a simulated MR scene produced by the BrainWebMR simulator [22] are presented in Figure 10. Figures 10(a)-(c) show corresponding slices from the simulated proton density, T1-, and T2-weighted MR data sets. Affinity was computed from the three MR data sets after combining them into one vectorial scene [27]. A set of seeds was manually specified for each of the three regions, and the regions were segmented by using RFC and IRFC. These results are shown in Figures 10(d) and (e). It may be noted that there is not much difference between the segmentation results for RFC and IRFC. As in this example, when one object wraps around the entire boundary of the other object, the scope of refinement of segmentation by using IRFC is reduced. Generally, IRFC outperforms RFC when a relatively large part of one object comes close to a large part of another object, forming a fuzzy interface between them, but otherwise the remaining smaller aspects of the objects have a clean association with the two objects, as in our second example above. This situation can also occur in a multi object setting, as in our first example.

Figure 10. Results of WM, GM, and CSF segmentation on simulated MR scenes produced by BrainWebMR simulator. (a)-(c) Matching slices from simulated PD, T1-, and T2-weighted MR data sets. (d) Segmentation of WM (dark), GM (intermediate brightness), and CSF (bright) regions obtained by using RFC. (e) Same as (d) but for IRFC.

5.2 A Quantitative Evaluation

The purpose of this experiment is to quantitatively evaluate the performance of IRFC and compare it with the performance of RFC under various levels of noise, blurring, and intensity inhomogeneity in the scene. Toward this goal, five 2D scenes $C^{T} = 〈 C, f^{T} 〉$ , T ∈ {1, 2, 3, 4, 5}, were created by using the drawing tools supported by 3DVIEWNIX [24]. Each of these scenes contained four separate objects and a background. The object regions and the background were assigned different constant intensities. One such scene is shown in Figure 11(a). Next, each scene $C^{T}$ was modified by: blurring it (via a 2D Gaussian kernel) at one of three fixed blur levels B₁ > B₂ > B₃; adding noise at one of three fixed levels N₁ > N₂ > N₃; and introducing to it intensity inhomogeneity from one of three fixed levels I₁, I₂, I₃. A scene $C^{T}$ with added blur B ∈ {B₁, B₂, B₃}, noise N ∈ {N₁, N₂, N₃}, and intensity inhomogeneity I ∈ {I₁, I₂, I₃}, is denoted as $C_{BNI}^{T} = 〈 C, f_{BNI}^{T} 〉$ . Thus, from each of the five scenes $C^{T}$ , we generated 27 modified phantom scenes $C_{BNI}^{T}$ . Three of these 135 phantom scenes, generated from the scene $C^{T}$ of Figure 11(a), are illustrated in Figures 11(c)-(e).

In each scene $C^{T} = 〈 C, f^{T} 〉$ , each spel c ∈ C is assigned to a unique object. Let L^T : C → {0, 1, 2, 3, 4} denote the true object labeling function; that is, the set {c ∈ C : L^T (c) = i} is the i-th object for i ∈ {1, 2, 3, 4} and the background, when i = 0. Figure 11(b), used as a reference, presents the true object labeling for Figure 11(a). We will denote by O^T the set of all spels with non-zero label in $C^{T}$ .

Object labeling of the phantom scenes is accomplished in two steps—separation of the foreground from background, and separation among the four objects. This is because the nature of the segmentation task between background and foreground is entirely different from segmentation among objects within the foreground. In the former case, there is a clear intensity difference, and a simpler approach like AFC works fine. On the other hand, among the different foreground objects there is no clear intensity difference and intensity-based approaches will not work. After segmenting the foreground from the background by using AFC, a fuzzy membership scene was created as follows. Let $O_{BNI}^{T}$ denote the set of spels in the foreground region and let ρ and σ denote the mean and standard deviation of spel intensity values over $O_{BNI}^{T}$ .

A foreground fuzzy membership value $φ_{BNI}^{T} (c)$ at a spel $c \in O_{BNI}^{T}$ was then created, defined by

φ_{BNI}^{T} (c) = \begin{cases} e^{- \frac{{(f_{BNI}^{T} (c) - ρ)}^{2}}{2 σ^{2}}} & \text{if } f_{BNI}^{T} (c) < ρ, \\ 1 & \text{otherwise.} \end{cases}
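The foreground membership defined above can be computed as in the following Python sketch. The function name `foreground_membership` and the dictionary representation of the foreground intensities are hypothetical, and the use of the population standard deviation (division by the number of spels) is an assumption, since the text does not say which estimator was used.

```python
import math

def foreground_membership(intensities):
    """Membership phi(c) for each foreground spel.

    intensities maps each spel of the segmented foreground to its intensity.
    """
    values = list(intensities.values())
    rho = sum(values) / len(values)                       # mean intensity
    sigma = math.sqrt(sum((v - rho) ** 2 for v in values) / len(values))
    membership = {}
    for c, v in intensities.items():
        if v < rho:
            # one-sided Gaussian fall-off below the mean
            membership[c] = math.exp(-((v - rho) ** 2) / (2.0 * sigma ** 2))
        else:
            membership[c] = 1.0
    return membership

# Hypothetical two-spel foreground: mean 2, standard deviation 1.
phi = foreground_membership({"a": 1.0, "b": 3.0})
```

Spels at or above the mean intensity get full membership, so only darker spels are penalized; in the example the darker spel receives the value exp(-1/2).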

A fuzzy distance transformation map was then computed from $〈 C, φ_{BNI}^{T} 〉$ , which was utilized to define affinity as described previously (Equation (16)). Finally, RFC and IRFC methods were applied to obtain multi-object segmentations within the foreground region. Segmentations resulting from RFC and IRFC for scenes in Figures 11(c)-(e) are shown, respectively, in Figures 11(f)-(h) and (i)-(k). In these displays, white colored spels represent foreground spels that are not assigned to any specific region. (Those were referred to as “boundary spels” in our theoretical discussion.) Clearly, IRFC has successfully separated the objects while preserving the thin branches, whereas RFC has captured only the core of the objects, with results similar to those that can be obtained via morphological erosion.

Let $RFC - L_{BNI}^{T} (c)$ and $IRFC - L_{BNI}^{T} (c)$ denote the object labels estimated at a spel c from a phantom scene $C_{BNI}^{T}$ by using RFC and IRFC, respectively. We use here the label value 5 for the foreground spels which are not assigned to any of the four objects. A similarity measure between L^T (c) and $RFC - L_{BNI}^{T} (c)$ (or $IRFC - L_{BNI}^{T} (c)$ ) is necessary to assess the performance of the two methods. Unlike the one object case, establishing agreement with truth in the case of multiple objects simultaneously is tricky. Here, we have used a figure-of-merit (FOM) that gives a full score only when the label of a spel in the segmentation matches with the true label at that spel; otherwise the score is 0. Specifically, the figure of merit $X - {FOM}_{BNI}^{T}$ , with X ∈ {RFC, IRFC}, for the phantom scene $C_{BNI}^{T}$ is defined as

X - {FOM}_{BNI}^{T} = \frac{Σ_{c \in C} F (X - L_{BNI}^{T} (c), L^{T} (c))}{∥ O^{T} \cup O_{BNI}^{T} ∥} \times 100,

where the symbol $∥ O^{T} \cup O_{BNI}^{T} ∥$ denotes the number of spels in $O^{T} \cup O_{BNI}^{T}$ , and F(a, b) = 1 for a = b and F(a, b) = 0 for a ≠ b . Finally, at any given blur, noise, and inhomogeneity level BNI, the mean and the standard deviation of $X - {FOM}_{BNI}^{T}$ , for T ∈ {1, 2, 3, 4, 5}, are computed. Tables 1 and 2 list the mean and standard deviation of these FOM values for the RFC and IRFC methods, respectively. It is clear from these tables that IRFC outperforms RFC under every condition: the mean FOM ranges from 78.90 to 98.93 for IRFC, but only from 15.78 to 31.29 for RFC.
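A minimal Python sketch of the FOM computation follows. The flat label lists, the background label 0, and the function names are assumptions made for illustration. The sketch also assumes that matches are counted only over spels that are foreground in the truth or in the segmentation (the set in the denominator), and that the sample standard deviation is used; the text specifies neither.

```python
from statistics import mean, stdev


def figure_of_merit(estimated, truth, background=0):
    """Percentage of foreground spels whose estimated label equals the truth."""
    if len(estimated) != len(truth):
        raise ValueError("label lists must have the same length")
    # Assumption: only spels in the union of the two foregrounds are scored.
    union = [(e, t) for e, t in zip(estimated, truth)
             if e != background or t != background]
    if not union:
        raise ValueError("no foreground spels to score")
    matches = sum(1 for e, t in union if e == t)  # F(a, b) = 1 iff a == b
    return 100.0 * matches / len(union)


def summarise(foms):
    """Mean and sample standard deviation of FOM values over phantoms."""
    return mean(foms), stdev(foms)


# Hypothetical 6-spel scene: 0 = background, 1-2 = objects, 5 = unassigned.
truth = [0, 1, 1, 2, 2, 0]
rfc_labels = [0, 1, 5, 5, 2, 0]   # two foreground spels left unassigned
irfc_labels = [0, 1, 1, 2, 2, 0]  # every foreground spel labelled correctly

rfc_fom = figure_of_merit(rfc_labels, truth)
irfc_fom = figure_of_merit(irfc_labels, truth)
avg, sd = summarise([rfc_fom, irfc_fom])
```

Spels labelled 5 never match a true object label, so a method that leaves many foreground spels unassigned is penalised in the same way as one that mislabels them.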

Table 1.

The mean and standard deviation (in parentheses) of the similarity measure ${RFC - FOM}_{BNI}^{T}$ , T ∈ {1, 2, 3, 4, 5}, are shown for each blur, noise, and inhomogeneity condition.

Condition	Mean (SD)	Condition	Mean (SD)	Condition	Mean (SD)
B₁N₁I₁	31.29(4.36)	B₂N₁I₁	26.23(4.90)	B₃N₁I₁	25.56(5.36)
B₁N₁I₂	26.67(3.78)	B₂N₁I₂	24.83(5.12)	B₃N₁I₂	21.72(5.78)
B₁N₁I₃	26.51(4.68)	B₂N₁I₃	21.92(5.23)	B₃N₁I₃	20.31(6.09)
B₁N₂I₁	26.69(3.89)	B₂N₂I₁	24.42(4.89)	B₃N₂I₁	21.73(5.45)
B₁N₂I₂	24.28(5.11)	B₂N₂I₂	19.89(5.13)	B₃N₂I₂	17.93(5.85)
B₁N₂I₃	22.47(3.96)	B₂N₂I₃	18.29(5.24)	B₃N₂I₃	15.92(6.12)
B₁N₃I₁	25.49(5.21)	B₂N₃I₁	21.92(4.99)	B₃N₃I₁	18.00(5.54)
B₁N₃I₂	22.58(4.76)	B₂N₃I₂	18.18(5.33)	B₃N₃I₂	15.78(6.02)
B₁N₃I₃	20.09(4.59)	B₂N₃I₃	16.03(5.02)	B₃N₃I₃	15.93(6.11)


Table 2.

The mean and standard deviation (in parentheses) of the similarity measure ${IRFC - FOM}_{BNI}^{T}$ , T ∈ {1, 2, 3, 4, 5}, are shown for each blur, noise, and inhomogeneity condition.

Condition	Mean (SD)	Condition	Mean (SD)	Condition	Mean (SD)
B₁N₁I₁	98.93(0.31)	B₂N₁I₁	98.38(0.37)	B₃N₁I₁	97.30(0.46)
B₁N₁I₂	98.08(0.39)	B₂N₁I₂	96.78(0.40)	B₃N₁I₂	93.12(0.42)
B₁N₁I₃	97.91(0.42)	B₂N₁I₃	94.49(0.34)	B₃N₁I₃	90.42(0.50)
B₁N₂I₁	97.90(0.38)	B₂N₂I₁	95.91(0.29)	B₃N₂I₁	91.73(0.49)
B₁N₂I₂	96.65(0.45)	B₂N₂I₂	90.60(0.40)	B₃N₂I₂	85.89(0.53)
B₁N₂I₃	94.40(0.42)	B₂N₂I₃	87.41(0.38)	B₃N₂I₃	82.81(0.50)
B₁N₃I₁	97.34(0.40)	B₂N₃I₁	92.62(0.35)	B₃N₃I₁	87.69(0.56)
B₁N₃I₂	93.80(0.46)	B₂N₃I₂	86.70(0.42)	B₃N₃I₂	82.09(0.48)
B₁N₃I₃	90.50(0.49)	B₂N₃I₃	83.30(0.46)	B₃N₃I₃	78.90(0.55)


6 Concluding remarks

The theory of IRFC segmentation presented in this paper consolidates all earlier versions of FC segmentation theories in a unified framework. This is especially the case for the RFC theory, since any segmentation obtained with the RFC algorithm is just the first iteration step of the IRFC-based algorithm. Because our exposition of the IRFC theory carries the iteration number as a parameter, the RFC results (viewed as first-iteration IRFC results) can be read off directly.

It should also be stressed that the IRFC theory presented here is self-contained. We were not able to use the theoretical results from earlier papers in this connection, because of the intricacy of the arguments needed for the IRFC theory. Thus, from a theoretical point of view, this paper supplants previous papers on FC theory.

Note also that, once the IRFC algorithm is implemented, there is no reason to implement an RFC-based algorithm separately. There are two reasons in support of this statement. First, it is easy to implement an IRFC algorithm that asks the operator whether to impose a maximal number N of iterations. Run with no bound on N, such an algorithm is just our standard IRFC algorithm; run with N = 1, it becomes a standard RFC algorithm. Second, although this makes RFC available as a restricted version of IRFC, we do not believe that there is much benefit in running RFC segmentation once an IRFC program is at hand. It is true that, in principle, the RFC algorithm is simpler than IRFC, and in some cases (as demonstrated in Figures 10(d) and (e)) the RFC program works just as well as IRFC. However, in such cases, the first iteration of IRFC already gives the “good enough” RFC segmentation; that is, the IRFC algorithm stops after just one iteration. Since running the IRFC algorithm when it stops after just one iteration costs only slightly more than running the RFC algorithm, there is little to gain from having an operator choose between IRFC and RFC, even when IRFC does not outperform RFC.
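The optional iteration bound can be sketched as a generic driver loop. This is only an illustration of the control flow, not the κIRMOFC algorithm of Section 4: the function `run_with_iteration_cap` and the toy `grow` step are hypothetical, and the stopping test (no change between successive results) is an assumption.

```python
def run_with_iteration_cap(initial, refine, max_iterations=None):
    """Apply `refine` until the result stops changing or the cap is reached.

    With max_iterations=None the loop runs to a fixed point; with
    max_iterations=1 only the first refinement step is performed.
    """
    if max_iterations is not None and max_iterations < 1:
        raise ValueError("max_iterations must be at least 1 or None")
    current = initial
    calls = 0
    while max_iterations is None or calls < max_iterations:
        updated = refine(current)
        calls += 1
        if updated == current:  # fixed point: nothing left to refine
            break
        current = updated
    return current, calls


def grow(region):
    """Toy stand-in for one refinement step: extend a region by one, up to 3."""
    return region | frozenset(x + 1 for x in region if x + 1 <= 3)


seed = frozenset({0})
first_step, _ = run_with_iteration_cap(seed, grow, max_iterations=1)
converged, _ = run_with_iteration_cap(seed, grow)
```

With the cap set to 1 the driver returns the result of the first step only, mirroring the N = 1 case; with no cap it continues until the result is stable.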

Apart from its generality, IRFC is a more powerful technique than RFC. Our experiments indicate that there are potentially many situations wherein IRFC would perform better than RFC, especially when multiple objects come close to each other without one completely surrounding the other.

One area that requires careful scrutiny, and that can affect the practical utility of FC methods in segmentation, is the proper design of affinity. In this paper, we have utilized mostly image-based strategies for defining affinity, as described in previous publications. We have also shown (see Equation (16)) that morphology-based strategies can be employed to devise effective affinities. It is conceivable as well that affinities can be constructed by utilizing information available in statistical shape models [28]. A question then naturally arises as to whether these three strategies can be combined in an FC-driven segmentation task to construct affinities. We are currently studying some of these issues in the context of specific imaging applications.

Acknowledgments

The first author was partially supported by NSF grant DMS-0623906, while working on this project.

The second author was partially supported by DHHS grant NS 37172, while working on this project.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

[1] Kass M, Witkin A, Terzopoulos D. Snakes: Active contour models. Int. J. Comput. Vision. 1987;1:321–331.
[2] Falcão AX, Udupa JK, Samarasekera S, Sharma S. User-steered image segmentation paradigms: live wire and live lane. Graph. Models Image Process. 1998;60(4):233–260.
[3] Cootes T, Taylor C, Cooper D. Active shape models-their training and application. Computer Vision and Image Understanding. 1995;61:38–59.
[4] McInerney T, Terzopoulos D. Deformable models in medical image analysis: A survey. Medical Image Analysis. 1996;1(2):91–108. doi: 10.1016/s1361-8415(96)80007-7.
[5] Cootes T, Edwards G, Taylor C. Active appearance models. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2001;23(6):681–685.
[6] Trivedi M, Bezdek J. Low-level segmentation of aerial images with fuzzy clustering. IEEE Trans. Systems, Man, and Cybernetics. 1986;16(4):589–598.
[7] Beucher S. The watershed transformation applied to image segmentation; 10th Pfefferkorn Conf. Signal and Image Processing in Microscopy and Microanalysis; 1992. pp. 299–314.
[8] Sethian JA. Fast Marching Methods and Level Sets Methods: Evolving Interfaces in Computational Geometry, Fluid Mechanics, Computer Vision, and Materials Science. Cambridge Univ. Press; 1999.
[9] Udupa JK, Samarasekera S. Fuzzy connectedness and object definition: theory, algorithms, and applications in image segmentation. Graphical Models and Image Processing. 1996;58(3):246–261.
[10] Boykov Y, Veksler O, Zabih R. Fast approximate energy minimization via graph cuts. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2001;23(11):1222–1239.
[11] Chakraborty A, Staib L, Duncan J. Deformable boundary finding in medical images by integrating gradient and region information. IEEE Trans. Med. Imag. 1996;15(6):859–870. doi: 10.1109/42.544503.
[12] Imielinska C, Metaxas D, Udupa JK, Jin Y, Chen T. Hybrid segmentation of anatomical data; Proceedings of MICCAI; 2001. pp. 1048–1057.
[13] Udupa JK, Saha PK. Fuzzy connectedness in image segmentation. Proceedings of the IEEE. 2003;91(10):1649–1669.
[14] Saha PK, Udupa JK. Iterative relative fuzzy connectedness and object definition: theory, algorithms, and applications in image segmentation; Proceedings of IEEE Workshop on Mathematical Methods in Biomedical Image Analysis; Hilton Head, South Carolina. 2002. pp. 28–35.
[15] Udupa JK, Saha PK, Lotufo RA. Relative fuzzy connectedness and object definition: Theory, algorithms, and applications in image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2002;24:1485–1500.
[16] Herman GT, De Carvalho BM. Multiseeded segmentation using fuzzy connectedness. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2001;23:460–474.
[17] Kaufmann A. Introduction to the Theory of Fuzzy Subsets 1. Academic Press; New York: 1975.
[18] Saha PK, Udupa JK. Fuzzy connected Object Delineation: Axiomatic Path Strength Definition and the Case of Multiple Seeds. Computer Vision and Image Understanding. 2001;83:275–295.
[19] Saha PK, Udupa JK. Relative fuzzy connectedness among multiple objects: Theory, algorithms, and applications in image segmentation. Computer Vision and Image Understanding. 2001;82(1):42–56.
[20] Carvalho BM, Gau CJ, Herman GT, Kong TY. Algorithms for fuzzy segmentation. Pattern Analysis and Applications. 1999;2:73–81.
[21] Nyúl LG, Falcão AX, Udupa JK. Fuzzy-connected 3D image segmentation at interactive speeds. Graphical Models. 2002;64:259–281.
[22] Kwan RK-S, Evans AC, Pike GB. An extensible MRI simulator for post-processing evaluation. Proceedings of Visualization in Biomedical Computing, Lecture Notes in Computer Science. 1996;1131:135–140.
[23] Saha PK, Wehrli FW, Gomberg BR. Fuzzy distance transform: theory, algorithms, and applications. Computer Vision and Image Understanding. 2002;86:171–190.
[24] Udupa JK, Odhner D, Samarasekera S, Goncalves R, Iyer K, Venugopal K, Furuie S. 3DVIEWNIX: an open, transportable, multidimensional, multimodality, multiparametric imaging software system; Proceedings of SPIE: Medical Imaging; San Diego, CA, 2164. 1994. pp. 58–73.
[25] Bluemke DA, Darrow RD, Gupta R, Tadikonda SK, Dormoulin CL. 3D contrast enhanced phase contrast angiography: utility for artery/vein separation. ISMRM Proc. 1999;2:1237.
[26] Lei T, Udupa JK, Saha PK, Odhner D. Artery-vein separation via MRA-an image processing approach. IEEE Transactions on Medical Imaging. 2001;20(20):689–703. doi: 10.1109/42.938238.
[27] Zhuge Y, Udupa JK, Saha PK. Vectorial scale-based fuzzy connected image segmentation. Computer Vision and Image Understanding. 2006;101(3):177–193.
[28] Cootes TF, Taylor CJ, Cooper DH, Graham J. Active shape models-their training and application. Computer Vision and Image Understanding. 1995;61:38–59.

