A diagnostic reasoning and optimal treatment model for bacterial infections with fuzzy information

Han-Ying Kao; Han-Lin Li

doi:10.1016/j.cmpb.2004.08.003

2004 Nov 17;77(1):23–37. doi: 10.1016/j.cmpb.2004.08.003

PMCID: PMC7125802 PMID: 15639707

Summary

This study proposes an optimization model for the optimal treatment of bacterial infections. Using an influence diagram as the knowledge and decision model, we can conduct two kinds of reasoning simultaneously: diagnostic reasoning and treatment planning. The inputs of the reasoning system are the conditional probability distributions of the network model, the costs of the candidate antibiotic treatments, the expected effects of the treatments, and extra constraints regarding belief propagation. Since the prevalence of the pathogens and infections is determined by many site-by-site factors, which conventional approaches to approximate reasoning do not handle well, we introduce fuzzy information. The outputs of the reasoning model are the likelihood of a bacterial infection, the most likely pathogen(s), the suggested optimal treatment, the gain in life expectancy for the patient under the optimal treatment, the probability of coverage associated with the antibiotic treatment, and the cost-effect analysis of the treatment prescribed.

Keywords: Influence diagrams, Bayesian networks, Diagnostic reasoning, Optimal treatment, Fuzzy parameters, Constraints

1. Introduction

Two generic reasoning tasks are vital in medical reasoning: diagnostic reasoning and treatment planning. Diagnostic reasoning is the process of reconstructing past facts from the observed evidence. Treatment planning is reasoning about the effects of actions taken on patients [1]. Usually, the practice of medicine requires both kinds of reasoning to work simultaneously. However, few current reasoning methods can conduct the two tasks successfully at the same time. Besides, the reasoning systems become more complex when the complexity of the human body and its relationships with environmental factors are considered.

In some clinical cases, various factors may make reasoning more difficult, such as the demographic variances of nosography, incomplete knowledge of the diseases (e.g. severe acute respiratory syndrome (SARS) in early 2003), specific restrictions on estimating relevant parameters of the diseases, etc. In these cases, the clinicians’ experience and judgment may be useful for diagnosis and prescription. Therefore, the site-by-site factors and clinicians’ knowledge, which may be expressed as extra constraints in the reasoning systems, need to be integrated into medical decision support systems. At the same time, owing to the difficulty of estimating the causal effects between possible pathogens and the diseases, the parameters of the knowledge base can be expressed as fuzzy numbers.

Considering the clinical issues mentioned above, the authors are motivated to develop a reasoning model with the following features.

(i)
Complete diagnostic reasoning as well as treatment planning.
(ii)
Combine the formal knowledge base with decision-makers’ judgments, expressed as extra constraints.
(iii)
Remain applicable when fuzzy information is involved.

The following section describes the background of this research and the proposed approach.

2. Background

In medical informatics and other domains, Bayesian networks [1], [2], [3], [4], [5], [6], [7], [8], [9], [10] and influence diagrams [6], [8], [11], [12], [13] are widely used knowledge representation and decision models under uncertainty. However, there are three limitations of utilizing the above approaches for solving medical reasoning problems:

(i)
All associated probabilities are assumed to be crisp.
(ii)
It is difficult to consider constraints on the relationships among the nodes in Bayesian networks or influence diagrams.
(iii)
Treatment planning and diagnostic problems are not considered in one paradigm.

The limitations mentioned above restrict the practical usefulness of medical reasoning on Bayesian networks and influence diagrams in the following ways. First, the conditional probabilities between a node and its parent or child nodes could be fuzzy instead of crisp numbers, owing to the difficulty of accurately learning the cause–effect relationships among the nodes [14]. Second, experts commonly have professional speculations in the form of constraints when reasoning from a Bayesian network or an influence diagram. These constraints could be boundary, dependency, or disjunctive conditions. Third, investigators of influence diagrams have typically maximized the utility functions by node removal processes [11], [12], [13] and ignored diagnostic reasoning tasks. Conversely, Bayesian networks have been used widely in probabilistic reasoning but lack the capability to suggest the optimal decision [2], [3], [8], [9], [10].

This study proposes an optimization model for diagnostic reasoning and treatment planning for bacterial infections, where the cause–effect relationships are expressed with an influence diagram and fuzzy data. The inputs of the reasoning system are the conditional probability distributions of the network model, the costs of the candidate antibiotic treatments, the expected effects of the treatments, and extra constraints regarding belief propagation. Since the prevalence of the pathogens and infections is determined by many site-by-site factors, the decisions involve uncertainty that conventional approaches do not handle well. We therefore allow the decisions to be made under fuzzy contexts, in which some of the parameters could be fuzzy parameters [14], and introduce some constraints regarding diagnosis. When a patient is admitted, this reasoning system can, based on the present symptoms or bacteriological tests, help the clinician make a precise diagnosis at the first decision point and also suggest the optimal treatment for the infection. The outputs of the reasoning model are the likelihood of a bacterial infection, the most likely pathogen(s), the suggested optimal treatment, the gain in life expectancy of the patient under the optimal treatment, the probability of coverage associated with the antibiotic treatment, and the cost-effect analysis of the treatment prescribed. The input–output diagram is depicted in Fig. 1.

Fig. 1 — The input–output diagram of the optimization model in this study.

In the remainder of this article, the design considerations are introduced in Section 3, where an influence diagram is used to represent the relationships among the variables relevant to the infections. Section 4 describes the reasoning model and system thoroughly. In Section 5, we implement the diagnostic reasoning and planning problem as an optimization model and give the illustration and solutions of the numerical example. Section 6 gives some comments and lessons. Finally, we discuss future extensions in Section 7.

3. Design considerations

This section introduces, in turn, an example of urinary tract infection (UTI), the problem and design goals, and the handling of fuzzy information.

3.1. An example of urinary tract infection (UTI)

Consider one example of urinary tract infections simplified from Leibovici et al. [5]. As depicted in Fig. 2 , this example uses an influence diagram as the knowledge and decision model where the conditional probability distributions for the relevant random and decision variables are calculated. For the sake of simplicity and without loss of generality, all random nodes are assumed binary. The conditional probability distributions of the variables are given in Table 1, Table 2, Table 3. The nodes and their states in Fig. 2 are described as follows.

•
Pathogen (Patho_i): A microorganism capable of causing urinary tract infection. For convenience of illustration, only 3 of 12 pathogens are presented: Patho₁ (Klebsiella pneumoniae), Patho₂ (Pseudomonas aeruginosa), Patho₃ (Escherichia coli). The states of these nodes describe severity: severe (Patho_i = 1) and not severe (Patho_i = 0).
•
Urinary tract infection (UTI): The states of this node are infected (UTI = 1) and not infected (UTI = 0).
•
Signs and symptoms of urinary tract infection (Sign_i): The manifestations that might result from UTI. There are six possible signs presented in Fig. 2: Sign₁ (suprapubic pain), Sign₂ (frequent micturition), Sign₃ (flank pain), Sign₄ (urinary symptoms), Sign₅ (serum albumin) and Sign₆ (fever). The states of these nodes are present (Sign_i = 1) and absent (Sign_i = 0).
•
Bacteriological tests (Test_i): Test₁ (growth of microorganisms in the blood), Test₂ (growth of microorganisms in the urine) and Test₃ (nitrite test). The states of these nodes are positive (Test_i = 1) and negative (Test_i = 0).
•
Coverage of UTI (Coverage): The percent of pathogens of UTI susceptible to an antibiotic drug. The states of this node are covered (Coverage = 1) and not covered (Coverage = 0).
•
Resistance to antibiotic drugs (Resist): The states of this node are resistant (Resist = 1) and not resistant (Resist = 0).
•
Antibiotic treatment (Tr): The treatment is appropriate if it matches the in-vitro susceptibility of the pathogens. For simplicity of demonstration, we consider 5 of 26 antibiotic drugs and one additional state for no treatment. Thus, we have six alternatives, that is Tr = {tr₀, tr₁, tr₂, tr₃, tr₄, tr₅}, where tr₀ stands for no treatment and tr_i = 0 or 1. Here tr_i = 1 means that tr_i is prescribed; conversely, tr_i = 0 means that tr_i is not prescribed. For efficiency of computation, we allow only one antibiotic drug at a time, which makes it possible to formulate this decision problem as a mixed 0–1 integer program. If more than one drug is combined in the therapy, the combined treatment is regarded as another treatment. Notably, this node is a decision node that affects the coverage of the urinary tract infection.
•
Underlying (Underlying): The underlying disorder of the patient, which is represented in this illustrative example by an equivalent number of base years of remaining life.
•
Cost (Cost(tr_i)): A utility node associated with antibiotic treatments.
•
Gain (Gain): The gain in life expectancy obtained by prescribing an antibiotic drug, which is a function of the coverage (Coverage) and the underlying disorder of the patient (Underlying).

Table 1.

The probability distributions of the pathogens and UTI

P(+patho₁) = 0.1

P(+patho₂) = 0.09

P(+patho₃) = 0.09

P(+uti | +patho₁, +patho₂, +patho₃) = ${\tilde{x}}_{1}$

P(+uti | +patho₁, −patho₂, +patho₃) = ${\tilde{x}}_{2}$

P(+uti | +patho₁, +patho₂, −patho₃) = ${\tilde{x}}_{3}$

P(+uti | +patho₁, −patho₂, −patho₃) = ${\tilde{x}}_{4}$

P(+uti | −patho₁, +patho₂, +patho₃) = ${\tilde{x}}_{5}$

P(+uti | −patho₁, −patho₂, +patho₃) = ${\tilde{x}}_{6}$

P(+uti | −patho₁, +patho₂, −patho₃) = ${\tilde{x}}_{7}$

P(+uti | −patho₁, −patho₂, −patho₃) = ${\tilde{x}}_{8}$

Table 2.

The conditional probabilities of signs (Sign_i)

P(sign₁ | +uti) = 0.6	P(sign₁ | −uti) = 0.01
P(sign₂ | +uti) = 0.9	P(sign₂ | −uti) = 0.10
P(sign₃ | +uti) = 0.6	P(sign₃ | −uti) = 0.05
P(sign₄ | +uti) = 0.8	P(sign₄ | −uti) = 0.05
P(sign₅ | +uti) = 0.6	P(sign₅ | −uti) = 0.10
P(sign₆ | +uti) = 0.7	P(sign₆ | −uti) = 0.01

Table 3.

The conditional probabilities of coverage given Resist = 1

Treatment^a	Instance of (Patho₁, Patho₂, Patho₃)
	(1, 1, 1)	(1, 0, 1)	(1, 1, 0)	(1, 0, 0)	(0, 1, 1)	(0, 0, 1)	(0, 1, 0)	(0, 0, 0)
tr₀^b	0.3	0.4	0.4	0.5	0.4	0.3	0.3	0.6
tr₁	0.7	0.9	0.99	0.95	0.7	0.8	0.75	0.7
tr₂	0.7	0.7	0.85	0.7	0.85	0.8	0.99	0.8
tr₃	0.8	0.8	0.87	0.8	0.95	0.99	0.8	0.9
tr₄	0.7	0.95	0.8	0.9	0.8	0.7	0.9	0.95
tr₅	0.8	0.9	0.85	0.9	0.8	0.9	0.9	0.9

^a The costs of tr₀, tr₁, tr₂, tr₃, tr₄ and tr₅ are $5,000 (the receiving and processing costs), $20,000, $25,000, $30,000, $32,000 and $50,000, respectively.

^b No treatment.

Each variable above is characterized by crisp or fuzzy probabilities given the state of its parents. For instance, UTI ∈ {0, 1} represents the dichotomy between having urinary tract infection and not having one. Also, +uti stands for the assertion UTI = 1 or “urinary tract infection is present”, and −uti stands for the negation of +uti, i.e., UTI = 0.

Let Y denote the set of random nodes of the influence diagram depicted in Fig. 2. The probability distribution of the random nodes given treatment tr_i can be expressed as (3.1):

$P(y) = \prod_{i=1}^{3} P(patho_{i}) \times P(uti \mid patho_{1}, patho_{2}, patho_{3}) \times \prod_{j=1}^{6} P(sign_{j} \mid uti) \times P(resist) \times \prod_{k=1}^{3} P(test_{k} \mid patho_{1}, patho_{2}, patho_{3}) \times P(coverage \mid patho_{1}, patho_{2}, patho_{3}, resist, tr_{i})$ (3.1)
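
The factorization in (3.1), together with the node descriptions above, can be read as a parent map of the influence diagram. The following is a minimal sketch of that structure, not the authors' implementation. The node spellings are illustrative, and treating Resist, Tr and Underlying as parentless is an assumption made here because Fig. 2 itself is not reproduced in the text.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Parent map inferred from (3.1) and the node list; the arcs are an
# assumption of this sketch, since Fig. 2 is not reproduced here.
PATHOGENS = ["Patho1", "Patho2", "Patho3"]

parents = {p: [] for p in PATHOGENS}
parents["Resist"] = []
parents["Tr"] = []                        # decision node
parents["Underlying"] = []
parents["UTI"] = list(PATHOGENS)
for j in range(1, 7):
    parents[f"Sign{j}"] = ["UTI"]
for k in range(1, 4):
    parents[f"Test{k}"] = list(PATHOGENS)
parents["Coverage"] = PATHOGENS + ["Resist", "Tr"]
parents["Gain"] = ["Coverage", "Underlying"]   # utility node
parents["Cost"] = ["Tr"]                       # utility node

# A valid evaluation order visits every node after all of its parents.
order = list(TopologicalSorter(parents).static_order())
print(order)
```

Any order printed this way lists the pathogens before UTI and UTI before the signs, which mirrors the direction in which the conditional probabilities of Tables 1–3 are specified.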

3.2. Problem and design goals

Consider the conditional probabilities in Table 1 and Table 2, and the evidence that a patient presents frequent micturition (Sign₂ = 1), flank pain (Sign₃ = 1) and urinary symptoms (Sign₄ = 1), but not suprapubic pain (Sign₁ = 0), serum albumin (Sign₅ = 0) or fever (Sign₆ = 0). Denote the evidence set E = {e} = {Sign₁ = 0, Sign₂ = 1, Sign₃ = 1, Sign₄ = 1, Sign₅ = 0, Sign₆ = 0}. We need to solve the following two problems.

(i)
Compute the belief distribution of Patho₁, Patho₂, Patho₃ and UTI.
(ii)
Suggest the optimal treatment based on the information given in Table 3, assuming the patient is resistant to the antibiotic treatments (Resist = 1).

At the first decision point, the clinician tends to make the diagnosis without biological test results; that is, the task is to reason on the subgraph omitting the nodes Test_i, which reduces to computing P(y|e), where e stands for an instance of the evidence set E and Y shrinks to {Patho₁, Patho₂, Patho₃, UTI, Coverage}. This is reasonable because the tests have no effect on the diagnostic results if they provide no extra information. If the treatment prescribed at the first decision point does not work, further biological tests would be required. In addition, the model should suggest the optimal treatment, one that maximizes the gain in life expectancy and minimizes the total costs.

3.3. Handling fuzzy information

Notice that some of the parameters in Table 1 are not crisp but fuzzy numbers. Freeling [14] claimed that fuzzy probability, as an extension of probability theory, is more promising than possibility theory and probability theory as an uncertainty measure. For instance, P(+uti|+patho₁, +patho₂, +patho₃) is not a crisp but a fuzzy number, say ${\tilde{x}}_{1}$, that is P(+uti|+patho₁, +patho₂, +patho₃) = ${\tilde{x}}_{1}$, and is described with a membership function $μ_{{\tilde{x}}_{1}} (x_{1})$ represented as follows (see Fig. 3):

$μ_{{\tilde{x}}_{1}} (x_{1}) = \begin{cases} 5 (x_{1} - 0.6) - 5 (| x_{1} - 0.8 | + x_{1} - 0.8), & 0.6 \leq x_{1} \leq 1.0, \\ 0, & \text{elsewhere}, \end{cases}$ (3.2)

where “|*|” is the absolute value of a term *.

Fig. 3 — The membership function $μ_{{\tilde{x}}_{1}} (x_{1})$ .

The above expression means that the support of ${\tilde{x}}_{1}$ lies between 0.6 and 1.0. For example, if x₁ = 0.7, then $μ_{{\tilde{x}}_{1}} (x_{1}) = 0.5$. If x₁ = 0.8, then $μ_{{\tilde{x}}_{1}} (x_{1}) = 1.0$, which implies that x₁ = 0.8 is the most confident value. If x₁ ≤ 0.6 or x₁ ≥ 1.0, then $μ_{{\tilde{x}}_{1}} (x_{1}) = 0$, meaning these values are the least possible.
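
The values quoted above can be checked numerically. The following minimal sketch evaluates (3.2) directly; the function name `mu_x1` is hypothetical and chosen only for illustration.

```python
def mu_x1(x1: float) -> float:
    """Membership function (3.2) of the fuzzy probability x~1."""
    if not 0.6 <= x1 <= 1.0:
        return 0.0  # outside the support
    return 5 * (x1 - 0.6) - 5 * (abs(x1 - 0.8) + x1 - 0.8)


# Rises linearly from 0.6 to the peak at 0.8, then falls back to 0 at 1.0.
for value in (0.6, 0.7, 0.8, 0.9, 1.0):
    print(value, round(mu_x1(value), 6))
```

The absolute-value term vanishes to the left of 0.8 and doubles the slope change to the right of it, which is what produces the triangular shape.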

For the fuzzy parameters in Table 1, we will formulate the membership functions of ${\tilde{x}}_{i}$ , i = 1, 2, …, 8.

Consider a membership function $μ_{\tilde{x}} (x)$ of a fuzzy parameter $\tilde{x}$ as portrayed in Fig. 4 . This piecewise membership function is usually expressed as

$μ_{\tilde{x}} (x) = \begin{cases} s_{1} (x - a_{1}), & a_{1} < x \leq a_{2}, \\ μ_{\tilde{x}} (a_{2}) + s_{2} (x - a_{2}), & a_{2} < x \leq a_{3}, \\ μ_{\tilde{x}} (a_{3}) + s_{3} (x - a_{3}), & a_{3} < x \leq a_{4}, \\ μ_{\tilde{x}} (a_{4}) + s_{4} (x - a_{4}), & a_{4} < x \leq a_{5}, \\ 0, & \text{elsewhere}, \end{cases}$ (3.3)

where a_j, j = 1, …, 5, are the break points and s_i, i = 1, …, 4, are the slopes of the segments. This expression is not convenient for computation, so we adopt a more efficient way to express a piecewise linear function. Consider the following proposition.

Proposition 1

Let $μ_{\tilde{x}} (x)$ be the membership function of a fuzzy variable $\tilde{x}$, as depicted in Fig. 4, where a_j, j = 1, 2, …, m, are the break points of $μ_{\tilde{x}} (x)$ and s_j, j = 1, 2, …, n, are the slopes of the line segments between a_j and a_{j+1}. Then $μ_{\tilde{x}} (x)$ can be expressed as a sum of absolute terms [15], [16]:

$μ_{\tilde{x}} (x) = μ_{\tilde{x}} (a_{1}) + s_{1} (x - a_{1}) + \sum_{j = 2}^{m} \frac{s_{j} - s_{j - 1}}{2} (| x - a_{j} | + x - a_{j})$ (3.4)
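
To illustrate Proposition 1, the sketch below evaluates a piecewise linear function both in the absolute-value form of (3.4) and segment by segment as in (3.3), and compares the two. It assumes that kinks occur only at the interior break points, so a function with m break points has m − 1 slopes; the five-point example is hypothetical, and the function names are chosen for illustration.

```python
def piecewise_abs(x, breaks, slopes, f_a1=0.0):
    """Evaluate a continuous piecewise-linear function in the form of (3.4).

    breaks = [a_1, ..., a_m]; slopes[j] is the slope of the segment that
    starts at breaks[j], so len(slopes) == len(breaks) - 1. Placing kinks
    only at interior break points is an assumption of this sketch.
    """
    value = f_a1 + slopes[0] * (x - breaks[0])
    for j in range(1, len(slopes)):
        a_j = breaks[j]
        value += (slopes[j] - slopes[j - 1]) / 2 * (abs(x - a_j) + x - a_j)
    return value


def piecewise_direct(x, breaks, slopes, f_a1=0.0):
    """Same function on [a_1, a_m], accumulated segment by segment as in (3.3)."""
    value = f_a1
    for j, s in enumerate(slopes):
        lo, hi = breaks[j], breaks[j + 1]
        value += s * (min(max(x, lo), hi) - lo)  # length of segment j covered by x
    return value


# Hypothetical four-segment membership function (not taken from the paper).
breaks = [0.0, 0.2, 0.5, 0.8, 1.0]
slopes = [2.0, 1.0, -1.0, -2.0]
for x in (0.1, 0.35, 0.6, 0.9):
    print(x, piecewise_abs(x, breaks, slopes), piecewise_direct(x, breaks, slopes))
```

Each absolute term is zero to the left of its break point and contributes the change in slope to the right of it, so the single closed-form expression reproduces every segment.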

Now we are ready to express the membership functions of the fuzzy parameters $μ_{{\tilde{x}}_{i}} (x_{i})$ in Table 4. The reader may notice that all eight fuzzy parameters are triangular fuzzy numbers. However, the membership functions in Table 4 involve absolute terms, which are not convenient for computation. Since $μ_{\tilde{x}} (x)$ in (3.4) is a function to be maximized, we use the following proposition to linearize the membership functions.

Proposition 2

To maximize a membership function $μ_{\tilde{x}} (x)$ in (3.4) is equivalent to solving the following linear program [15], [16]:

$\left\{\begin{array}{l} \text{Max } z = s_{1} (x - a_{1}) + 2 \sum_{j = 2}^{m} \frac{s_{j} - s_{j - 1}}{2} \left(x - a_{j} + \sum_{k = 1}^{j} d_{k}\right) \\ \text{subject to} \\ x + d_{1} \geq a_{2}, \\ x + d_{1} + d_{2} \geq a_{3}, \\ \vdots \\ x + d_{1} + d_{2} + \dots + d_{m - 1} \geq a_{m}, \\ 0 \leq d_{1} \leq a_{2}, \\ 0 \leq d_{k - 1} \leq a_{k} - a_{k - 1}, \text{ for } k = 2, 3, \dots, m, \\ x \in F \text{ (feasible set)}, \end{array}\right.$ (3.5)

where $d_{k-1}$ stands for the lower bound of the distance between $a_{k-1}$ and $a_{k}$. For the detailed proof of Proposition 2, please refer to [15], [16].
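
As a hedged illustration of Proposition 2, consider the single-kink case of ${\tilde{x}}_{1}$ from (3.2), with $a_{1} = 0.6$, $a_{2} = 0.8$, $s_{1} = 5$ and $s_{2} = -5$. The sketch below uses one auxiliary variable $d_{1}$ for the kink at $a_{2}$, which is a simplification assumed here rather than the general form of (3.5):

```latex
\begin{aligned}
\text{Max } z &= 5\,(x_{1} - 0.6) - 10\,(x_{1} - 0.8 + d_{1}) \\
\text{subject to } & x_{1} + d_{1} \geq 0.8, \quad d_{1} \geq 0, \quad 0.6 \leq x_{1} \leq 1.0 .
\end{aligned}
```

Because the coefficient of $d_{1}$ in the objective is negative, a maximizing solution takes the smallest feasible value, $d_{1} = \max(0, 0.8 - x_{1})$. Then $x_{1} - 0.8 + d_{1} = \max(0, x_{1} - 0.8) = \frac{1}{2}(|x_{1} - 0.8| + x_{1} - 0.8)$, and the objective coincides with (3.2). For instance, $x_{1} = 0.7$ gives $d_{1} = 0.1$ and $z = 0.5$, while $x_{1} = 0.8$ gives $d_{1} = 0$ and $z = 1$.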

Now we are ready to formulate the optimization model for diagnosis and treatment planning.

Fig. 4 — A membership function of fuzzy probability $\tilde{x}$ .

Table 4.

The membership functions of fuzzy probabilities

Parameter	$μ_{{\tilde{x}}_{i}} (x_{i})$	Domain of x_i
${\tilde{x}}_{1}$	5(x₁ − 0.6) − 5(|x₁ − 0.8| + x₁ − 0.8)	[0.6, 1]
${\tilde{x}}_{2}$	10(x₂ − 0.7) − 10(|x₂ − 0.8| + x₂ − 0.8)	[0.7, 0.9]
${\tilde{x}}_{3}$	20(x₃ − 0.7) − 20(|x₃ − 0.75| + x₃ − 0.75)	[0.7, 0.8]
${\tilde{x}}_{4}$	10(x₄ − 0.5) − 10(|x₄ − 0.6| + x₄ − 0.6)	[0.5, 0.7]
${\tilde{x}}_{5}$	10(x₅ − 0.7) − 10(|x₅ − 0.8| + x₅ − 0.8)	[0.7, 0.9]
${\tilde{x}}_{6}$	20(x₆ − 0.55) − 20(|x₆ − 0.6| + x₆ − 0.6)	[0.55, 0.65]
${\tilde{x}}_{7}$	10(x₇ − 0.4) − 10(|x₇ − 0.5| + x₇ − 0.5)	[0.4, 0.6]
${\tilde{x}}_{8}$	100(x₈) − 100(|x₈ − 0.01| + x₈ − 0.01)	[0, 0.02]

4. System description

Here we formulate the diagnostic reasoning and treatment planning problems as an optimization model.

4.1. System objectives

The objectives of this model are described below.

(i)
To maximize the sum of all fuzzy membership functions. That is, we will make the suggestions of optimal treatment under the maximal confidence of the fuzzy information [17].
(ii)
To maximize the gain in life expectancy.
(iii)
To minimize the total costs of the treatments.

In this problem, the clinician has six candidate treatments to choose from, including no treatment. We represent each antibiotic treatment as a binary variable tr_i (with tr₀ standing for no treatment) and its cost as Cost(tr_i). The total cost is $\sum_{i = 0}^{5} Cost (tr_{i})$. The objective functions can be expressed as follows:

$\text{Max } z_{1} = \sum_{i = 1}^{8} μ_{{\tilde{x}}_{i}} (x_{i})$ (4.1)

$\text{Max } z_{2} = E (Gain (Coverage, Underlying))$ (4.2)

$\text{Min } z_{3} = \sum_{i = 0}^{5} Cost (tr_{i})$ (4.3)

where “E(*)” stands for the expectation of a term *.

In (4.2), we express the expected gain in life expectancy as a function of Coverage and Underlying. We assume that the underlying disorder and health status can be converted to an equivalent number of base years, in this case 35 years, and that the gain is a multiple of this base. That is, in this clinical case the patient gains the ideal 35 years of life expectancy if the probability of recovering from UTI is 1. Since the literature [5] shows that one year of life gained can be regarded as equivalent to $55,000, we rewrite (4.2) as (4.4) to standardize units:

${z'}_{2} = 55{,}000 \times E (Gain (Coverage)) \times 35$ (4.4)

Because only one treatment can be chosen at a decision point, the total cost function can be formulated as in (4.3). Notably, the probability of coverage is determined by the resistance to antibiotic treatment (given Resist = 1), the pathogens (Patho_j), and the treatment (tr_i); Table 3 gives these relationships. Defining tr_i as a 0–1 variable, the expectation of Coverage, E(Coverage), can be computed as

$E (Coverage \mid Resist = 1) = α \sum_{i} \sum_{patho_{1}} \sum_{patho_{2}} \sum_{patho_{3}} tr_{i} \times P (coverage \mid patho_{1}, patho_{2}, patho_{3}, resist = 1, tr_{i})$ (4.5)

where α is the normalizing constant, which will be explained in the next subsection.
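
To make the roles of Table 3, the treatment costs and the conversion in (4.4) concrete, the following sketch computes an expected coverage and its dollar-equivalent gain for each candidate treatment. As a loudly stated simplification, it weights the pathogen instances by their prior probabilities from Table 1 rather than by the posterior given the evidence, so its numbers are illustrative and are not the paper's results. The helper names are hypothetical.

```python
# Priors from Table 1 and coverage probabilities from Table 3 (Resist = 1).
PRIOR = {1: 0.10, 2: 0.09, 3: 0.09}
INSTANCES = [(1, 1, 1), (1, 0, 1), (1, 1, 0), (1, 0, 0),
             (0, 1, 1), (0, 0, 1), (0, 1, 0), (0, 0, 0)]
COVERAGE = {
    "tr0": [0.3, 0.4, 0.4, 0.5, 0.4, 0.3, 0.3, 0.6],
    "tr1": [0.7, 0.9, 0.99, 0.95, 0.7, 0.8, 0.75, 0.7],
    "tr2": [0.7, 0.7, 0.85, 0.7, 0.85, 0.8, 0.99, 0.8],
    "tr3": [0.8, 0.8, 0.87, 0.8, 0.95, 0.99, 0.8, 0.9],
    "tr4": [0.7, 0.95, 0.8, 0.9, 0.8, 0.7, 0.9, 0.95],
    "tr5": [0.8, 0.9, 0.85, 0.9, 0.8, 0.9, 0.9, 0.9],
}
COST = {"tr0": 5_000, "tr1": 20_000, "tr2": 25_000,
        "tr3": 30_000, "tr4": 32_000, "tr5": 50_000}
DOLLARS_PER_YEAR = 55_000   # value of one life-year, from [5]
BASE_YEARS = 35             # equivalent base years in this example


def instance_weight(instance):
    """Prior probability of one (Patho1, Patho2, Patho3) instance."""
    weight = 1.0
    for i, state in enumerate(instance, start=1):
        weight *= PRIOR[i] if state else 1 - PRIOR[i]
    return weight


def expected_coverage(treatment, weights):
    """Weighted average of the Table 3 row for the chosen treatment."""
    return sum(w * c for w, c in zip(weights, COVERAGE[treatment]))


# ASSUMPTION: prior weights stand in for the posterior P(patho | e).
weights = [instance_weight(inst) for inst in INSTANCES]
for tr in COVERAGE:
    cov = expected_coverage(tr, weights)
    gain = DOLLARS_PER_YEAR * BASE_YEARS * cov   # conversion of (4.4)
    print(tr, round(cov, 4), round(gain), COST[tr])
```

Replacing `weights` with posterior probabilities of the pathogen instances would bring the sketch closer to the model, in which the binary variables tr_i select a single row of Table 3.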

In this optimization program, two categories of constraints must be satisfied: (1) the constraints of probability theory, and (2) the extra constraints regarding belief propagation. This optimization model can be implemented with various exact propagation methods. This study does not discuss the details of reasoning algorithms but focuses on how to formulate the problem as an optimization model. Interested readers may refer to the literature [2], [3], [7], [8], [9], [10].

4.2. Basic constraints

Now we formulate the first category of constraints as

$\sum_{y} P (y) = α \sum_{patho_{1}} \sum_{patho_{2}} \sum_{patho_{3}} \sum_{uti} \sum_{coverage} \left[ \prod_{j = 1}^{3} P (patho_{j}) \times P (uti \mid patho_{1}, patho_{2}, patho_{3}) \times P (sign_{1} = 0 \mid uti) \times P (sign_{2} = 1 \mid uti) \times P (sign_{3} = 1 \mid uti) \times P (sign_{4} = 1 \mid uti) \times P (sign_{5} = 0 \mid uti) \times P (sign_{6} = 0 \mid uti) \times \sum_{i = 0}^{5} P (coverage \mid patho_{1}, patho_{2}, patho_{3}, resist = 1, tr_{i}) \right] = 1$ (4.6)

$\sum_{i = 0}^{5} tr_{i} = 1, \quad tr_{i} = 1 \text{ or } 0$ (4.7)

where α is the normalizing constant, which ensures that the probabilities of all instances of y sum to 1. The constraint in (4.7) restricts the clinician to prescribing only one treatment at the first decision point.

4.3. Extra constraints

At the same time, in addition to a given formal knowledge base, the clinicians may have some professional speculations about the features of some nodes and the relationships among them, in some specific diagnostic context. These features and relationships can be identified as the following types of constraints.

(i)
Boundary constraints

Some posterior beliefs may have upper or lower bounds. For instance, a clinician may speculate that the posterior probability of Patho₃ should be no lower than 0.3 and no higher than 0.5, which can be expressed as
$0.3 \leq P (+patho_{3} \mid e) \leq 0.5$ (4.8)
(ii)
Dependency constraints

The beliefs of some nodes in the belief network may be mutually dependent. For example, a clinician may presume that the posterior probability of Patho₁ is bounded by some multiple of that of Patho₃. Such a relationship is expressed as
$P (+patho_{1} \mid e) \leq 0.5 P (+patho_{3} \mid e)$ (4.9)
(iii)
Disjunctive constraints

Sometimes a disjunctive condition between nodes may exist. For example, a doctor may estimate that either P(+patho₂|e) or P(+patho₁|e) is equal to or less than 0.4, which is expressed as
$\text{either } P (+patho_{2} \mid e) \leq 0.4 \text{ or } P (+patho_{1} \mid e) \leq 0.4$ (4.10)

4.4. The model

Combining constraints (4.8) and (4.10) into this reasoning system, this optimization program becomes

$\left\{\begin{array}{l} \text{Max } z_{1} \\ \text{Max } {z'}_{2} \\ \text{Min } z_{3} \\ \text{s.t. } (4.6)\text{–}(4.8),\ (4.10) \end{array}\right.$ (4.11)

Since the disjunctive constraint (4.10) is a nonlinear constraint, we linearize it with 0–1 variables as follows.

$\left\{\begin{array}{l} M (θ_{1} - 1) \leq P (+patho_{2} \mid e) - 0.4 \leq M θ_{1} + M (1 - θ_{2}), \\ M (θ_{2} - 1) \leq P (+patho_{1} \mid e) - 0.4 \leq M θ_{2} + M (1 - θ_{1}), \\ ε \leq θ_{2} + θ_{1} \leq 1, \end{array}\right.$ (4.12)

where θ₁ and θ₂ are 0–1 variables, M is a relatively large number, and ε is a relatively small positive number.

We can check the four possible combinations of θ₁ and θ₂. (1) θ₁ = 1, θ₂ = 1: (4.12) turns into 0 ≤ P(+patho₂|e) − 0.4 ≤ M and 0 ≤ P(+patho₁|e) − 0.4 ≤ M, which would force both posteriors to be at least 0.4; (2) θ₁ = 0, θ₂ = 1: (4.12) turns into −M ≤ P(+patho₂|e) − 0.4 ≤ 0 and 0 ≤ P(+patho₁|e) − 0.4 ≤ 2M, which means that when P(+patho₁|e) ≥ 0.4, P(+patho₂|e) must be less than or equal to 0.4; (3) θ₁ = 1, θ₂ = 0: (4.12) works as 0 ≤ P(+patho₂|e) − 0.4 ≤ 2M and −M ≤ P(+patho₁|e) − 0.4 ≤ 0, which implies that when P(+patho₂|e) ≥ 0.4, P(+patho₁|e) must be less than or equal to 0.4; (4) θ₁ = 0, θ₂ = 0: (4.12) becomes −M ≤ P(+patho₂|e) − 0.4 ≤ M and −M ≤ P(+patho₁|e) − 0.4 ≤ M, which are inactive constraints. The third inequality in (4.12) excludes the combinations θ₁ = 1, θ₂ = 1 and θ₁ = 0, θ₂ = 0. To summarize, (4.12) implies that either P(+patho₂|e) ≤ 0.4 or P(+patho₁|e) ≤ 0.4 must be satisfied.
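
The case analysis above can be checked by brute force. The sketch below enumerates the four (θ₁, θ₂) assignments and reports which of them satisfy all three rows of (4.12) for given posterior values. The function name and the particular values of M and ε are illustrative choices, not values from the paper.

```python
from itertools import product


def feasible_thetas(p_patho2, p_patho1, big_m=10.0, eps=1e-3):
    """Return the (theta1, theta2) pairs that satisfy all rows of (4.12)."""
    feasible = []
    for t1, t2 in product((0, 1), repeat=2):
        row1 = big_m * (t1 - 1) <= p_patho2 - 0.4 <= big_m * t1 + big_m * (1 - t2)
        row2 = big_m * (t2 - 1) <= p_patho1 - 0.4 <= big_m * t2 + big_m * (1 - t1)
        row3 = eps <= t1 + t2 <= 1
        if row1 and row2 and row3:
            feasible.append((t1, t2))
    return feasible


# Posterior values of the kind the disjunction is meant to admit.
print(feasible_thetas(p_patho2=0.2916, p_patho1=0.4000))  # [(0, 1)]
# Both posteriors above 0.4 violate the disjunction, so no assignment works.
print(feasible_thetas(p_patho2=0.6, p_patho1=0.6))        # []
```

A mixed 0–1 solver performs the same search implicitly: a candidate solution is admissible only if some assignment of θ₁ and θ₂ makes every row hold.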

5. Status report

The model formulated in the previous section is a multiobjective program, so we adopt the fuzzy approach [18], [19] to solve it. Following the steps described below, the model is solved.

Step 1: Get the ideal solutions of every objective.

To obtain the ideal solutions, every objective is optimized independently regardless of other objectives. In (4.11), we maximize $z_{1}$ and ${z'}_{2}$ and minimize $z_{3}$ individually to acquire their ideal solutions $z_{1}^{*}$, ${z'}_{2}^{*}$ and $z_{3}^{*}$, respectively. The ideal values are $z_{1}^{*}$ = 8, ${z'}_{2}^{*}$ = 1,722,198, and $z_{3}^{*}$ = 5000.
Step 2: Get the anti-ideal solution of every objective.

To obtain the anti-ideal solutions, every objective is computed in the opposite way regardless of other objectives. Now, we minimize $z_{1}$ and ${z'}_{2}$ and maximize $z_{3}$ to acquire the anti-ideal solutions $z_{1}^{-}$, ${z'}_{2}^{-}$ and $z_{3}^{-}$, respectively. The anti-ideal values are $z_{1}^{-}$ = 4, ${z'}_{2}^{-}$ = 733,764.5, and $z_{3}^{-}$ = 40,000.
Step 3: Define the membership function of every objective by its ideal and anti-ideal solutions.

With the ideal and anti-ideal solutions of every objective, we can define their membership functions as follows:
$μ_{z_{k}} (z_{k}) = \frac{z_{k} - z_{k}^{-}}{z_{k}^{*} - z_{k}^{-}}$ (5.1)
The membership functions evaluate the degree of fulfillment for every objective.
Step 4: Maximize the minimal membership function of the three objectives.

Using Zimmermann's fuzzy approach for multi-objective programs, the model (4.11) can be converted into (5.2):
$\left\{\begin{array}{l} \text{Max } λ \\ \text{s.t.} \\ λ \leq μ_{z_{1}} (z_{1}) = \frac{z_{1} - z_{1}^{-}}{z_{1}^{*} - z_{1}^{-}}, \\ λ \leq μ_{{z'}_{2}} ({z'}_{2}) = \frac{{z'}_{2} - {z'}_{2}^{-}}{{z'}_{2}^{*} - {z'}_{2}^{-}}, \\ λ \leq μ_{z_{3}} (z_{3}) = \frac{z_{3} - z_{3}^{-}}{z_{3}^{*} - z_{3}^{-}}, \\ (4.6)\text{–}(4.8),\ (4.12), \end{array}\right.$ (5.2)
where $λ$ is defined as $λ = \min (μ_{z_{1}} (z_{1}), μ_{{z'}_{2}} ({z'}_{2}), μ_{z_{3}} (z_{3}))$.

In (5.2), this study intends to search for the maximum of the minimal satisfaction level of all the objective functions. To avoid the poor estimation of the fuzzy parameters and decision quality, we set the strict lower bound for the membership of every fuzzy parameter at 0.5. Applying the ideal and anti-ideal values computed in Step 1 and Step 2, (5.2) is specified as (5.3):

$\left\{\begin{array}{l} \text{Max } λ \\ \text{s.t.} \\ λ \leq \frac{z_{1} - 4}{8 - 4}, \\ λ \leq \frac{{z'}_{2} - 733{,}764.5}{1{,}722{,}198 - 733{,}764.5}, \\ λ \leq \frac{z_{3} - 40{,}000}{5000 - 40{,}000}, \\ (4.6)\text{–}(4.8),\ (4.12), \end{array}\right.$ (5.3)
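
The max-min logic of (5.1)–(5.3) can be illustrated by evaluating the three satisfaction levels at a given solution. The sketch below plugs in the ideal and anti-ideal values from Steps 1 and 2 and the objective values that the paper reports in Table 5; it only evaluates the memberships and does not solve the program. The names are chosen for illustration.

```python
# Ideal and anti-ideal values from Steps 1 and 2.
IDEAL = {"z1": 8.0, "z2": 1_722_198.0, "z3": 5_000.0}
ANTI_IDEAL = {"z1": 4.0, "z2": 733_764.5, "z3": 40_000.0}


def membership(name, value):
    """Degree of fulfillment (5.1); the same formula handles Max and Min objectives."""
    return (value - ANTI_IDEAL[name]) / (IDEAL[name] - ANTI_IDEAL[name])


# Objective values reported in Table 5 for the optimal solution.
reported = {"z1": 6.2857, "z2": 1_616_259.0, "z3": 20_000.0}
levels = {name: membership(name, value) for name, value in reported.items()}
lam = min(levels.values())
print({name: round(level, 4) for name, level in levels.items()})
print(round(lam, 4))  # 0.5714
```

At the reported solution the fuzzy-membership objective and the cost objective are the binding ones, while the gain objective is satisfied to a higher degree.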

This study solves (5.3) with LINGO 8.0, developed by LINDO Systems Inc. [21]. LINGO is a software package designed to build and solve linear, nonlinear and integer optimization models. It provides an integrated package that includes a language for expressing optimization models, a full-featured environment for building and editing problems, and a set of built-in solvers. Part of the LINGO model is listed in Appendix A.

LINGO 8.0 solves (5.3) in 1 s and obtains the optimal treatment tr₁ (tr₁ = 1, tr₀ = tr₂ = tr₃ = tr₄ = tr₅ = 0), the normalizing constant α = 303.9275, the optimal minimal membership of the objectives λ = 0.5714, and the likelihood of each pathogen:

$\begin{array}{ll} P (+patho_{1} \mid e) = 0.4000, & P (+patho_{2} \mid e) = 0.2916, \\ P (+patho_{3} \mid e) = 0.3606, & P (+uti \mid e) = 0.9430. \end{array}$

The suggested optimal treatment yields a coverage probability of 0.8369 for the urinary tract infection, an equivalent gain in life expectancy of $1,616,259, and a total cost of $20,000. Besides, the clinician can make the diagnosis and optimal prescription at the first decision point with an overall confidence in the fuzzy parameters of 0.5978. We also find that ${\tilde{x}}_{4}$, ${\tilde{x}}_{7}$ and ${\tilde{x}}_{8}$ are set significantly away from their most possible values. This makes sense: under this reasoning context, the experts need to make subjective judgments or trade-offs between different, even conflicting, information sources, which moves the fuzzy parameters away from their most confident values. The detailed solutions and part of the LINGO solution report are listed in Table 5 and Appendix B.

Table 5.

The result table

λ	0.5714
$z_{1} = \sum_{i = 1}^{8} μ_{{\tilde{x}}_{i}} (x_{i})$	6.2857
z₂ = E(Gain)	1616259
$z_{3} = \sum_{i = 0}^{5} Cost (tr_{i})$	20000
P(+patho₁ | e)	0.4000
P(+patho₂ | e)	0.2916
P(+patho₃ | e)	0.3606
P(+uti | e)	0.9430
Optimal treatment	tr₁ = 1, tr₀ = tr₂ = tr₃ = tr₄ = tr₅ = 0
P(+coverage | e, tr₁)	0.8369
x₁	0.800
x₂	0.800
x₃	0.750
x₄	0.600
x₅	0.800
x₆	0.595
x₇	0.450
x₈	0.005
$μ_{{\tilde{x}}_{1}} (x_{1})$	1.000
$μ_{{\tilde{x}}_{2}} (x_{2})$	1.000
$μ_{{\tilde{x}}_{3}} (x_{3})$	1.000
$μ_{{\tilde{x}}_{4}} (x_{4})$	0.500
$μ_{{\tilde{x}}_{5}} (x_{5})$	1.000
$μ_{{\tilde{x}}_{6}} (x_{6})$	0.891
$μ_{{\tilde{x}}_{7}} (x_{7})$	0.500
$μ_{{\tilde{x}}_{8}} (x_{8})$	0.500
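
As a plausibility check on the diagnostic part of the solution, the posteriors can be recomputed by plain enumeration over the pathogen and UTI states, using the priors of Table 1, the sign probabilities of Table 2, the evidence of Section 3.2 and the values x₁ to x₈ listed in Table 5. This is a minimal sketch of exact inference on the reduced network, not the authors' LINGO model, and it assumes that α equals the reciprocal of the probability of the evidence. Its results should come out close to the reported posteriors and normalizing constant, with small differences that are plausibly due to the rounding of the tabulated x values.

```python
from itertools import product

PRIOR = (0.10, 0.09, 0.09)  # P(+patho1), P(+patho2), P(+patho3), Table 1
# P(+uti | patho1, patho2, patho3) with x1..x8 taken from Table 5.
X = {(1, 1, 1): 0.800, (1, 0, 1): 0.800, (1, 1, 0): 0.750, (1, 0, 0): 0.600,
     (0, 1, 1): 0.800, (0, 0, 1): 0.595, (0, 1, 0): 0.450, (0, 0, 0): 0.005}
# Table 2: (P(sign = 1 | +uti), P(sign = 1 | -uti)) for Sign1..Sign6.
SIGN = [(0.6, 0.01), (0.9, 0.10), (0.6, 0.05),
        (0.8, 0.05), (0.6, 0.10), (0.7, 0.01)]
EVIDENCE = (0, 1, 1, 1, 0, 0)  # observed states of Sign1..Sign6


def likelihood(uti):
    """P(e | uti): product of the sign probabilities for the observed states."""
    value = 1.0
    for (p_pos, p_neg), observed in zip(SIGN, EVIDENCE):
        p = p_pos if uti else p_neg
        value *= p if observed else 1 - p
    return value


# Unnormalized joint over (patho1, patho2, patho3, uti) and the evidence.
joint = {}
for instance in product((0, 1), repeat=3):
    prior = 1.0
    for p, state in zip(PRIOR, instance):
        prior *= p if state else 1 - p
    for uti in (0, 1):
        p_uti = X[instance] if uti else 1 - X[instance]
        joint[instance + (uti,)] = prior * p_uti * likelihood(uti)

alpha = 1 / sum(joint.values())  # ASSUMPTION: alpha = 1 / P(e)


def posterior(index):
    """Posterior that the variable at this position (0..2 pathogens, 3 UTI) is 1."""
    return alpha * sum(v for key, v in joint.items() if key[index] == 1)


print(round(alpha, 2))
print([round(posterior(i), 4) for i in range(4)])
```

Coverage is omitted from the enumeration because it has no observed descendants here, so summing over its two states does not change the posteriors of the other nodes.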


6. Lesson learned

During the implementation of the reasoning model, the authors found several strengths of the optimization model. First, the reasoning system allows clinicians to add their own judgments or experience as extra constraints, which supplement the incomplete formal knowledge. This is useful for newly discovered diseases or infections, and increases the flexibility and robustness of the model in various clinical settings. Second, the model completes two major tasks in medical informatics simultaneously, diagnostic reasoning and treatment planning, which is an important requirement for clinical decision support systems. Third, LINGO provides an efficient computation tool for solving the optimization model, especially when linearizing techniques are adopted to transform the highly nonlinear program. In the authors’ experience, LINGO performs better in solving linear programs than in solving nonlinear programs.

However, the authors also found several potential challenges in developing the proposed reasoning system. First, as clinical problems grow larger and more complex, formulating the model may become a burden for clinicians. For some diseases, the network may contain tens or hundreds of nodes, and clinicians will have difficulty estimating the parameters or specifying the conditions of their diagnosis and prescription. The system therefore needs experts in knowledge engineering or information management to take part, which increases the cost of implementation. Second, as the network grows larger, belief propagation becomes more complicated and time-consuming. Special techniques for belief propagation may be considered, such as clustering, join tree decomposition, and stochastic simulation [2], [3], [7], [8], [9], [10]. How to integrate these propagation methods with the optimization model will be a critical issue in implementing the reasoning system. Third, as network structures become very large, implementing the optimization model in LINGO will be fairly challenging. LINGO provides interfaces to other applications, such as Visual C++, Visual Java, and Visual Basic. System developers can bundle LINGO's functionality into their applications, or call functions written in an external programming language from within LINGO models [21]. These interfaces make it easier to generate the code for LINGO models and to import input data from other applications.
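
To make the stochastic simulation option concrete, the following is a minimal sketch of likelihood weighting on a toy network with one infection node and two sign nodes. It is not the authors' method or model. The sign likelihoods are the values the paper gives for sign₁ and sign₂ given ±uti; the prior `PRIOR_UTI`, the function names, and the assumption that the signs are conditionally independent given the infection status are hypothetical choices made for illustration.

```python
import random


def likelihood_weighting(prior, likelihoods, n_samples=20000, seed=0):
    """Estimate P(+uti | all listed signs observed present) by likelihood weighting.

    prior: P(+uti).
    likelihoods: list of (P(sign | +uti), P(sign | -uti)) pairs.
    """
    rng = random.Random(seed)
    weight_pos = 0.0
    weight_total = 0.0
    for _ in range(n_samples):
        # Sample the unobserved node from its prior.
        uti = rng.random() < prior
        # Weight the sample by the likelihood of the observed evidence.
        weight = 1.0
        for p_given_pos, p_given_neg in likelihoods:
            weight *= p_given_pos if uti else p_given_neg
        weight_total += weight
        if uti:
            weight_pos += weight
    return weight_pos / weight_total


def exact_posterior(prior, likelihoods):
    """Exact posterior by Bayes' rule, used here only as a reference value."""
    pos, neg = prior, 1.0 - prior
    for p_given_pos, p_given_neg in likelihoods:
        pos *= p_given_pos
        neg *= p_given_neg
    return pos / (pos + neg)


PRIOR_UTI = 0.3                     # hypothetical prior, not from the paper
SIGNS = [(0.6, 0.01), (0.9, 0.10)]  # sign1 and sign2 given +uti / -uti

estimate = likelihood_weighting(PRIOR_UTI, SIGNS)
exact = exact_posterior(PRIOR_UTI, SIGNS)
print(f"likelihood-weighting estimate: {estimate:.4f}")
print(f"exact posterior:               {exact:.4f}")
```

On a network this small the exact posterior is trivial to compute, so the comparison only illustrates the mechanics. Sampling becomes attractive when the network is too large for exact propagation.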

7. Future plans

The authors suggest several future extensions to this research.

1. Global optimization: Most medical diagnostic problems are highly nonlinear, and a global optimum is difficult to reach in most cases. The model solvers need special techniques to search for the global optimum. Such techniques can improve the solution quality and reliability of the reasoning model [20].
2. Integration with other heuristic computation techniques: As the problem and the network structure grow more complex, heuristic methods may be needed for belief propagation. Computational efficiency will improve if the reasoning system integrates heuristic techniques such as stochastic simulation, genetic algorithms, and neural network computing.
3. Integration of various medical knowledge bases: Developers can integrate knowledge bases from different traditions, such as traditional Chinese medicine, western medicine, and Indian medicine, to obtain richer diagnostic references and treatment suggestions.
4. Integration with regional clinical or medical databases: Integrating local or regional medical databases may make the reasoning system more feasible and reliable, because it supports more accurate parameter estimation and a better fit to different regional diagnostic environments. It is also an important stepping stone toward a complete medical decision support system.

Appendix A. Part of the LINGO model

Appendix B. Part of the LINGO solution report

Local optimal solution found at iteration: 1269; objective value: 0.5714286

Variable	Value	Reduced cost
BETA	0.5714286	0.000000
BETA1	0.5978307	0.000000
BETA2	0.8874719	0.000000
BETA3	0.5714286	0.000000
COV	0.8368683	0.000000
tr0	0.000000	0.1428571
tr1	1.000000	0.5714286
tr2	0.000000	0.7142857
tr3	0.000000	0.8571429
tr4	0.000000	0.9142857
tr5	0.000000	1.428571
U1	1.000000	0.000000
U2	1.000000	0.000000
U3	1.000000	0.000000
U4	0.5000000	0.000000
U5	1.000000	0.000000
U6	0.8913229	0.000000
U7	0.5000000	0.000000
U8	0.5000000	0.000000
X1	0.8000000	0.000000
X2	0.8000000	0.000000
X3	0.7500000	0.000000
X4	0.6005808	0.000000
X5	0.8000000	0.000000
X6	0.5945661	0.000000
X7	0.4500000	0.000000
X8	0.5000000E−02	0.000000
D1	0.000000	0.000000
D2	0.000000	0.000000
D3	0.000000	0.000000
D4	0.2470962E−01	0.000000
D5	0.000000	0.000000
D6	0.5433854E−02	0.000000
D7	0.5000000E−01	0.000000
D8	0.5000000E−02	0.000000
ALPHA	303.9275	0.000000
P1	0.4000000	0.000000
P2	0.2915552	0.000000
P3	0.3605508	0.000000
UTI	0.9430058	0.000000
BIG_M	1000.000	0.000000
G1	1.000000	0.000000
G2	0.000000	0.000000
EP	0.1000000E−02	0.000000


References

1. Long W.J. Medical informatics: reasoning methods. Artif. Intell. Med. 2001;23:71–87. doi: 10.1016/s0933-3657(01)00076-8.
2. Castillo E., Gutiérrez J.M., Hadi A.S. A new method for symbolic inference in Bayesian networks. Networks. 1996;28:31–43.
3. Castillo E., Gutiérrez J.M., Hadi A.S. Expert Systems and Probabilistic Network Models. Springer-Verlag; New York: 1997.
4. Kononenko I. Machine learning for medical diagnosis: history, state of the art and perspective. Artif. Intell. Med. 2001;23:89–109. doi: 10.1016/s0933-3657(01)00077-x.
5. Leibovici L., Fishman M., Schonheyder H.C., Riekehr C., Kristensen B., Shraga I., Andreassen S. A causal probabilistic network for optimal treatment of bacterial infections. IEEE Trans. Knowledge Data Eng. 2000;12(4):517–528.
6. Oliver R.M., Smith J.Q. Influence Diagrams, Belief Nets and Decision Analysis. Wiley; 1990.
7. Pearl J. Causality: Models, Reasoning, and Inference. Cambridge University Press; 2000.
8. Pearl J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann; 1997.
9. Poole D. Average-case analysis of a search algorithm for estimating prior and posterior probabilities in Bayesian networks with extreme probabilities. Proceedings of the 13th International Joint Conference on Artificial Intelligence; vol. 13, no. 1; 1993. pp. 606–612.
10. Shachter R.D., D'Ambrosio B., Del Favero B. Symbolic probabilistic inference in belief networks. In: Proceedings of the 10th Conference on Uncertainty in Artificial Intelligence; Morgan Kaufmann, San Francisco. pp. 514–522.
11. Tatman J.A., Shachter R.D. Dynamic programming and influence diagrams. IEEE Trans. Syst. Man Cybernet. 1990;20(2):365–379.
12. Owens D.K., Shachter R.D., Nease R.F. Representation and analysis of medical decision problems with influence diagrams. Med. Decis. Making. 1997;17:251–262. doi: 10.1177/0272989X9701700301.
13. Nease R.F., Owens D.K. Use of influence diagrams to structure medical decisions. Med. Decis. Making. 1997;17:263–275. doi: 10.1177/0272989X9701700302.
14. Freeling A.N.S. Possibility versus fuzzy probabilities: two alternative decision aids, fuzzy sets and decision analysis. TIMS/Stud. Manage. Sci. 1984;20:67–81.
15. Li H.-L., Chang C.-T., Tsai J.-F. Approximately global optimization for assortment problems using piecewise linearization techniques. Eur. J. Oper. Res. 2002;140:584–589.
16. Tsai J.-F., Li H.-L., Hu N.-Z. Global optimization for signomial discrete programming problems in engineering design. Eng. Optim. 2002;34(6).
17. Ecker J.G., Kupferschmid M., Lawrence C.E., Reilly A.A., Scott A.C.H. An application of nonlinear optimization in molecular biology. Eur. J. Oper. Res. 2002;138:452–458.
18. Lee E.S., Li R.J. Fuzzy multiple objective programming and compromise programming with Pareto optimum. Fuzzy Sets Syst. 1993;53:275–288.
19. Zimmermann H.-J. Fuzzy programming and linear programming with several objective functions. Fuzzy Sets Syst. 1978;1:45–55.
20. Floudas C.A. Deterministic Global Optimisation: Theory and Applications. Kluwer Academic Publishers; 2000.
21. LINGO, http://www.lindo.com/cgi/frameset.cgi?leftlingo.html;lingof.html.

P(sign₁ | +uti) = 0.6	P(sign₁ | −uti) = 0.01
P(sign₂ | +uti) = 0.9	P(sign₂ | −uti) = 0.10
P(sign₃ | +uti) = 0.6	P(sign₃ | −uti) = 0.05
P(sign₄ | +uti) = 0.8	P(sign₄ | −uti) = 0.05
P(sign₅ | +uti) = 0.6	P(sign₅ | −uti) = 0.10
P(sign₆ | +uti) = 0.7	P(sign₆ | −uti) = 0.01
