Dynamic Regime Marginal Structural Mean Models for Estimation of Optimal Dynamic Treatment Regimes, Part II: Proofs of Results

Liliana Orellana; Andrea Rotnitzky; James M Robins

doi:10.2202/1557-4679.1242

. 2010 Mar 3;6(2):9. doi: 10.2202/1557-4679.1242

Dynamic Regime Marginal Structural Mean Models for Estimation of Optimal Dynamic Treatment Regimes, Part II: Proofs of Results^*

Liliana Orellana ^*, Andrea Rotnitzky ^†, James M Robins ^‡

PMCID: PMC2854089 PMID: 20405047

Abstract

In this companion article to “Dynamic Regime Marginal Structural Mean Models for Estimation of Optimal Dynamic Treatment Regimes, Part I: Main Content” [Orellana, Rotnitzky and Robins (2010), IJB, Vol. 6, Iss. 2, Art. 7] we present (i) proofs of the claims in that paper, (ii) a proposal for the computation of a confidence set for the optimal index when this lies in a finite set, and (iii) an example to aid the interpretation of the positivity assumption.

Keywords: dynamic treatment regime, double-robust, inverse probability weighted, marginal structural model, optimal treatment regime, causality

1. Introduction

In this companion article to “Dynamic regime marginal structural mean models for estimation of optimal dynamic treatment regimes. Part I: Main Content” (Orellana, Rotnitzky and Robins, 2010) we present (i) proofs of the claims in that paper, (ii) a proposal for the computation of a confidence set for the optimal index when this lies in a finite set, and (iii) an example to aid the interpretation of the positivity assumption.

The notation, definitions and acronyms are the same as in the companion paper. Througout, we refer to the companion article as ORR-I.

2. Proof of Claims in ORR-I

2.1. Proof of Lemma 1

First note that the consistency assumption C implies that the event

{\bar{O}}_{k} = {\bar{o}}_{k}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{o}}_{k - 1})

is the same as the event

{\bar{O}}_{k}^{g} = {\bar{o}}_{k}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{o}}_{k - 1}) .

So, with the definitions

{\underline{V}}_{k, k + l} \equiv (V_{k + 1}, \dots, V_{k + l}), l > 0 and {\underline{V}}_{k, k} \equiv nil

we obtain

= \begin{matrix} E [I_{B} (O, A) {\underline{ω}}_{k - 1, K} ({\bar{O}}_{K}, {\bar{A}}_{K}) | {\bar{O}}_{k}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1})] \\ E [I_{B} (({\bar{O}}_{k}^{g}, {\underline{O}}_{k, K + 1}), ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) \\ \begin{array}{l} {\underline{ω}}_{k - 1, K} (({\bar{O}}_{k}^{g}, {\underline{O}}_{k, K}), ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) | {\bar{O}}_{k}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g})] \\ w .p .1 . \end{array} \end{matrix}

Next, note that the fact that ${\underline{ω}}_{k - 1, K} (({\bar{O}}_{k}^{g}, {\underline{O}}_{k, K}), ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) = 0$ unless $A_{k} = g_{k} ({\bar{O}}_{k}^{g})$ , $A_{k + 1} = g_{k + 1} ({\bar{O}}_{k}^{g}, O_{k + 1}), \dots, A_{K} = g_{K + 1} ({\bar{O}}_{k}^{g}, {\underline{O}}_{k, K})$ implies that

\begin{array}{l} I_{B} (({\bar{O}}_{k}^{g}, {\underline{O}}_{k, K + 1}), ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) \times \\ {\underline{ω}}_{k - 1, K} (({\bar{O}}_{k}^{g}, {\underline{O}}_{k, K}), ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) \\ = I_{B} ({\bar{O}}_{K + 1}^{g}, {\bar{g}}_{K} ({\bar{O}}_{K}^{g})) {\underline{ω}}_{k - 1, K} ({\bar{O}}_{K}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) . \end{array}

Then, it follows from the second to last displayed equality that

= \begin{matrix} E [I_{B} (O, A) {\underline{ω}}_{k - 1, K} ({\bar{O}}_{K}, {\bar{A}}_{K}) | {\bar{O}}_{k}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1})] \\ E [I_{B} ({\bar{O}}_{K + 1}^{g}, {\bar{g}}_{K} ({\bar{O}}_{K}^{g})) {\underline{ω}}_{k - 1, K} ({\bar{O}}_{K}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) | \\ {\bar{O}}_{k}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g})] \end{matrix}

= \begin{matrix} E [E [{\underline{ω}}_{k - 1, K} ({\bar{O}}_{K}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) | {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g})] \times \\ I_{B} ({\bar{O}}_{K + 1}^{g}, {\bar{g}}_{K} ({\bar{O}}_{K}^{g})) | {\bar{O}}_{k}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g})] . \end{matrix}

So, part 1 of the Lemma is proved if we show that

E [{\underline{ω}}_{k - 1, K} ({\bar{O}}_{K}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) | {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g})] = 1.

(1)

Define for any k = 0, ..., K,

{\underline{ω}}_{k, k} ({\bar{O}}_{k}^{g}, ({\bar{g}}_{k} ({\bar{O}}_{k}^{g}), {\underline{A}}_{k, k})) \equiv 1.

To prove equality (1) first note that,

\begin{array}{l} = \begin{matrix} E [{\underline{ω}}_{k - 1, K} ({\bar{O}}_{K}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) | \\ {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1}] \\ {\underline{ω}}_{k - 1, K - 1} ({\bar{O}}_{K - 1}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1})) \times \\ E [{\underline{ω}}_{K - 1, K} ({\bar{O}}_{K}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) | \\ {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1}] \end{matrix} \\ \begin{matrix} = {\underline{ω}}_{k - 1, K - 1} ({\bar{O}}_{K - 1}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1})) \times \\ E [{\underline{ω}}_{K - 1, K} ({\bar{O}}_{K}^{g}, {\bar{A}}_{K}) | {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{K - 1} = {\bar{g}}_{K - 1} ({\bar{O}}_{K - 1}^{g})] \end{matrix} \\ \begin{matrix} = {\underline{ω}}_{k - 1, K - 1} ({\bar{O}}_{K - 1}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1})) \times \\ E [\frac{I_{{g_{K} ({\bar{O}}_{K}^{g})}} (A_{K})}{λ_{K} (g_{K} ({\bar{O}}_{K}^{g}) | {\bar{O}}_{K}^{g}, {\bar{g}}_{K - 1} ({\bar{O}}_{K - 1}^{g}))} | {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{K - 1} = {\bar{g}}_{K - 1} ({\bar{O}}_{K - 1}^{g})] \end{matrix} \\ \begin{matrix} = {\underline{ω}}_{k - 1, K - 1} ({\bar{O}}_{K - 1}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1})) \times \\ \frac{E [I_{{g_{K} ({\bar{O}}_{K}^{g})}} (A_{K}) | {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{K - 1} = {\bar{g}}_{K - 1} ({\bar{O}}_{K - 1}^{g})]}{λ_{K} (g_{K} ({\bar{O}}_{K}^{g}) | {\bar{O}}_{K}^{g}, {\bar{g}}_{K - 1} ({\bar{O}}_{K - 1}^{g}))} \end{matrix} \\ \begin{matrix} = {\underline{ω}}_{k - 1, K - 1} ({\bar{O}}_{K - 1}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1})) \times \\ \frac{ℙ [A_{K} = g_{K} ({\bar{O}}_{K}^{g}) | {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{K - 1} = {\bar{g}}_{K - 1} ({\bar{O}}_{K - 1}^{g})]}{λ_{K} (g_{K} ({\bar{O}}_{K}^{g}) | {\bar{O}}_{K}^{g}, {\bar{g}}_{K - 1} ({\bar{O}}_{K - 1}^{g}))} \end{matrix} \end{array}

\begin{array}{l} \begin{array}{l} = {\underline{ω}}_{k - 1, K - 1} ({\bar{O}}_{K - 1}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1})) \times \\ \frac{ℙ [A_{K} = g_{K} ({\bar{O}}_{K}^{g}) | {\bar{O}}_{K}^{g}, {\bar{A}}_{K - 1} = {\bar{g}}_{K - 1} ({\bar{O}}_{K - 1}^{g})]}{λ_{K} (g_{K} ({\bar{O}}_{K}^{g}) | {\bar{O}}_{K}^{g}, {\bar{g}}_{K - 1} ({\bar{O}}_{K - 1}^{g}))} \end{array} \\ = {\underline{ω}}_{k - 1, K - 1} ({\bar{O}}_{K - 1}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1})) \end{array}

where the second to last equality follows because given ${\bar{O}}_{K}^{g}$ and ${\bar{A}}_{K - 1} = {\bar{g}}_{K - 1} ({\bar{O}}_{K - 1}^{g})$ , $O_{K + 1}^{g}$ is a fixed, i.e. non-random function of $O$ and consequently by the sequential randomization assumption, $O_{K + 1}^{g}$ is conditionally independent of A_K given ${\bar{O}}_{K}^{g}$ and ${\bar{A}}_{K - 1} = {\bar{g}}_{K - 1} ({\bar{O}}_{K - 1}^{g})$ . The last equality follows by the definition of λ_K (·|·, ·).

We thus arrive at

\begin{array}{l} = \begin{array}{l} E [{\underline{ω}}_{k - 1, K} ({\bar{O}}_{K}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) | {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g})] \\ E {E [{\underline{ω}}_{k - 1, K} ({\bar{O}}_{K}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K})) | \\ {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1}] | {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g})} \end{array} \\ = \begin{matrix} E {{\underline{ω}}_{k - 1, K - 1} ({\bar{O}}_{K - 1}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1})) | \\ {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g})} \end{matrix} \end{array}

This proves the result for the case k = K. If k < K – 1, we analyze the conditional expectation of the last equality in a similar fashion. Specifically, following the same steps as in the long sequence of equalities in the second to last display we arrive at

\begin{array}{l} = \begin{array}{l} E {{\underline{ω}}_{k - 1, K - 1} ({\bar{O}}_{K - 1}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 1})) | \\ {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 2}} \\ {\underline{ω}}_{k - 1, K - 2} ({\bar{O}}_{K - 2}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 2})) \times \\ \times \frac{ℙ [A_{K - 1} = g_{K - 1} ({\bar{O}}_{K - 1}^{g}) | {\bar{O}}_{K + 1}^{g}, {\bar{A}}_{K - 2} = {\bar{g}}_{K - 2} ({\bar{O}}_{K - 2}^{g})]}{λ_{K} (g_{K - 1} ({\bar{O}}_{K - 1}^{g}) | {\bar{O}}_{K - 1}^{g}, {\bar{g}}_{K - 2} ({\bar{O}}_{K - 2}^{g}))} \end{array} \\ = {\underline{ω}}_{k - 1, K - 2} ({\bar{O}}_{K - 2}^{g}, ({\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g}), {\underline{A}}_{k - 1, K - 2})) \end{array}

the last equality follows once again from the sequential randomization assumption. This is so because given ${\bar{O}}_{K - 1}^{g}$ and ${\bar{A}}_{K - 2} = {\bar{g}}_{K - 2} ({\bar{O}}_{K - 2}^{g})$ , ${\bar{O}}_{K}^{g}$ and ${\bar{O}}_{K + 1}^{g}$ are fixed, i.e. deterministic, functions of $O$ and the SR assumption ensures then that ${\bar{O}}_{K}^{g}$ and ${\bar{O}}_{K + 1}^{g}$ are conditionally independent of A_K_–1 given ${\bar{O}}_{K - 1}^{g}$ and ${\bar{A}}_{K - 2} = {\bar{g}}_{K - 2} ({\bar{O}}_{K - 2}^{g})$ .

Equality (1) is thus shown by continuing in this fashion recursively for K – 2, K – 3, ..., K – l until l such that K – l = k – 1.

To show Part 2 of the Lemma, note that specializing part 1 to the case k = 0, we obtain

E [I_{B} (O^{g}, A^{g}) | O_{0}] = E [I_{B} (O, A) ω_{K} ({\bar{O}}_{K}, {\bar{A}}_{K}) | O_{0}] .

Thus, taking expectations on both sides of the equality in the last display we obtain

E [I_{B} (O^{g}, A^{g})] = E [I_{B} (O, A) ω_{K} ({\bar{O}}_{K}, {\bar{A}}_{K})] .

This shows part 2 because B is an arbitrary Borel set.

2.2. Proof of the Assertions in Section 3.2, ORR-I

2.2.1. Proof of Item (a)

Lemma 1, part 2 implies that the densities $p_{g}^{marg}$ factors as

p_{g}^{marg} (o, a) = \prod_{j = 0}^{K} I_{{g_{j} ({\bar{o}}_{j})}} (a_{j}) \prod_{j = 1}^{K + 1} p^{marg} (o_{j} | {\bar{o}}_{j - 1}, {\bar{a}}_{j - 1}) p^{marg} (o_{0}) .

In particular, the event ${{\bar{A}}_{k - 1}^{g} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1}^{g})}$ has probability 1. Consequently,

\begin{array}{l} p_{g}^{marg} (o, a | {\bar{o}}_{k}) = \prod_{j = 0}^{K} I_{{g_{j} ({\bar{o}}_{j})}} (a_{j}) p_{g}^{marg} (o, a | {\bar{o}}_{k}, {\bar{a}}_{k} = {\bar{g}}_{k} ({\bar{o}}_{k})) \\ = \prod_{j = 0}^{K} I_{{g_{j} ({\bar{o}}_{j})}} (a_{j}) \prod_{j = k + 1}^{K + 1} p^{marg} (o_{j} | {\bar{o}}_{j - 1}, {\bar{a}}_{j - 1} = {\bar{g}}_{j - 1} ({\bar{o}}_{j - 1})) . \end{array}

Therefore,

\begin{array}{l} E {u (O^{g}, A^{g}) | {\bar{O}}_{k}^{g} = {\bar{o}}_{k}} = \\ = \sum_{\underset{l = 0, \dots, K}{a_{l} \in A_{l}}} \int u (o, a) \prod_{j = 0}^{K} I_{{g_{j} ({\bar{o}}_{j})}} (a_{j}) \prod_{j = k + 1}^{K + 1} d P_{O_{j} | {\bar{O}}_{j - 1}, {\bar{A}}_{j - 1}}^{marg} (o_{j} | {\bar{o}}_{j - 1}, {\bar{g}}_{j - 1} ({\bar{o}}_{j - 1})) \\ = \int u (o, a) [\sum_{\underset{l = 0, \dots, K}{a_{l} \in A_{l}}} \prod_{j = 0}^{K} I_{{g_{j} ({\bar{o}}_{j})}} (a_{j})] \prod_{j = k + 1}^{K + 1} d P_{O_{j} | {\bar{O}}_{j - 1}, {\bar{A}}_{j - 1}}^{marg} (o_{j} | {\bar{o}}_{j - 1}, {\bar{g}}_{j - 1} ({\bar{o}}_{j - 1})) \\ = \int u (o, a) \prod_{j = k + 1}^{K + 1} d P_{O_{j} | {\bar{O}}_{j - 1}, {\bar{A}}_{j - 1}}^{marg} (o_{j} | {\bar{o}}_{j - 1}, {\bar{g}}_{j - 1} ({\bar{o}}_{j - 1})) \\ = \int \prod_{j = k + 1}^{K} d P_{O_{j} | {\bar{O}}_{j - 1}, {\bar{A}}_{j - 1}}^{marg} (o_{j} | {\bar{o}}_{j - 1}, {\bar{g}}_{j - 1} ({\bar{o}}_{j - 1})) \times \\ [\int u (o, a) d P_{O_{K + 1} | {\bar{O}}_{K}, {\bar{A}}_{K}}^{marg} (o_{K + 1} | {\bar{o}}_{K}, {\bar{a}}_{K} = {\bar{g}}_{K} ({\bar{o}}_{K}))] \\ = \int [\prod_{j = k + 1}^{K} d P_{O_{j} | {\bar{O}}_{j - 1}, {\bar{A}}_{j - 1}}^{marg} (o_{j} | {\bar{o}}_{j - 1}, {\bar{g}}_{j - 1} ({\bar{o}}_{j - 1}))] φ_{K + 1} ({\bar{o}}_{K}) \\ = \int [\prod_{j = k + 1}^{K - 1} d P_{O_{j} | {\bar{O}}_{j - 1}, {\bar{A}}_{j - 1}}^{marg} (o_{j} | {\bar{o}}_{j - 1}, {\bar{g}}_{j - 1} ({\bar{o}}_{j - 1}))] \times \\ [\int φ_{K + 1} ({\bar{o}}_{K}) d P_{O_{K} | {\bar{O}}_{K - 1}, {\bar{A}}_{K - 1}}^{marg} (o_{K} | {\bar{o}}_{K - 1}, {\bar{g}}_{K - 1} ({\bar{o}}_{K - 1}))] \\ = \int \prod_{j = k + 1}^{K - 1} d P_{O_{j} | {\bar{O}}_{j - 1}, {\bar{A}}_{j - 1}}^{marg} (o_{j} | {\bar{o}}_{j - 1}, {\bar{g}}_{j - 1} ({\bar{o}}_{j - 1})) φ_{K + 1} ({\bar{o}}_{K}) \\ = \dots = φ_{k + 1} ({\bar{o}}_{k}) . \end{array}

(2)

2.2.2. Proof of Item (b)

Lemma 1, part 1 implies that

\begin{array}{l} E [u (O, A) {\underline{ω}}_{k - 1, K} ({\bar{O}}_{K}, {\bar{A}}_{K}) | {\bar{O}}_{k}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1})] = \\ = E [u (O^{g}, A^{g}) | {\bar{O}}_{k}, {\bar{A}}_{k - 1} = {\bar{g}}_{k - 1} ({\bar{O}}_{k - 1})] . \end{array}

The left hand side of this equality is equal to

\sum_{\underset{k = k, \dots, K}{a_{k} \in A_{k}}} \int u (o, a) {\underline{ω}}_{k - 1, K} ({\bar{o}}_{K}, {\bar{a}}_{K}) \prod_{j = k}^{K} λ_{j} (a_{j} | {\bar{o}}_{j}, {\bar{a}}_{j - 1}) \prod_{j = k + 1}^{K + 1} d P^{marg} (o_{j} | {\bar{o}}_{j - 1}, {\bar{a}}_{j - 1})

and this coincides with the right hand side of (2) which, as we have just argued, is equal to φ_k+₁ (ō_k).

2.3. Proof of Lemma 2 in ORR-I

Let X be the identity random element on $(X, A)$ and let E_{P^marg × P_X} (·) stand for the expectation operation computed under the product law P^marg × P_X for the random vector (O, A, X). Then the restriction stated in 2) is equivalent to

E_{P^{marg} \times P_{X}} [b (X, Z) ω_{K} ({\bar{O}}_{K}, {\bar{A}}_{K}) {u (O, A) - h_{par} (X, Z; β^{*})}] = 0 for all b

(3)

and the restriction stated in 3) is equivalent to

\begin{array}{l} E_{P^{marg} \times P_{X}} [{b (X, Z) - E_{P \times P_{X}} [b (X, Z) | Z]} \times \\ ω_{K} ({\bar{O}}_{K}, {\bar{A}}_{K}) {u (O, A) - h_{sem} (X, Z; β^{*})}] = 0 for all b . \end{array}

(4)

To show 2) let d (O, A, X) ≡ ω_K (Ō_K, Ā_K) {u (O, A) – h_par (X, Z, β*)}.

(ORR-I, (14)) ⇒ (3).

\begin{array}{l} E_{P^{marg} \times P_{X}} [b (X, Z) d (O, A, X)] = E_{P^{marg} \times P_{X}} [b (X, Z) E_{P \times P_{X}} [d (O, A, X) | X, Z]] \\ = 0 \end{array}

where the last equality follows because E_{P^marg × P_X} [d (O, A, X) |X = x, Z] = E_P^marg [d (O, A, x) |Z] by independence of (O, A) with X under the law P^marg × P_X and, by assumption, E_P^marg [d (O, A, x) |Z] = 0 μ-a.e.(x) and hence E_P^marg [d (O, A, x) |Z] because P_X and μ are mutually absolute continuous.

(3) ⇒ (ORR-I, (14)). Define b^* (X; Z) = E_{P^marg × P_X} [d(O, A, X)|X, Z]. Then,

0 = E_{P^{marg} \times P_{X}} [b^{*} (X, Z) d (O, A, X)] = E_{P^{marg} \times P_{X}} [E_{P \times P_{X}} {[d (O, A, X) | X, Z]}^{2}]

consequently, E_{P^marg × P_X} [d (O, A, X) |X, Z] = 0 with P^marg × P_X prob. 1 which is equivalent to (ORR-I, (14)) because P_X is mutually absolutely continuous with μ.

To show 3) redefine d (O, A, X) as ω_K (Ō_K, Ā_K) {u (O, A) − h_sem (X, Z, β*)}.

(ORR-1, (15)) ⇒ (4)

\begin{array}{l} E_{P^{marg} \times P_{X}} [{b (X, Z) - E_{P^{marg} \times P_{X}} [b (X, Z) | Z]} d (O, A, X)] \\ = E_{P^{marg} \times P_{X}} [{b (X, Z) - E_{P \times P_{X}} [b (X, Z) | Z]} E_{P^{marg} \times P_{X}} {d (O, A, X) | X, Z}] \\ = E_{P^{marg} \times P_{X}} [{b (X, Z) - E_{P^{marg} \times P_{X}} [b (X, Z) | Z]} q (Z)] = 0 \end{array}

where the third equality follows because E_{P^marg × P_X} {d (O, A, X) |X = x, Z} = E_P^marg {d (O, A, x) |Z} and E_P^marg {d (O, A, x) |Z}= q (Z) μ-a.e.(x) and hence P_X-a.e.(x) by absolute continuity.

(4) ⇒ (ORR-I, (15)). Define b^* (X; Z) = E_{P × P_X} [ d(O, A, X)|X, Z]. Then,

\begin{array}{l} 0 = E_{P^{marg} \times P_{X}} [{b^{*} (X, Z) - E_{P^{marg} \times P_{X}} [b^{*} (X, Z) | Z]} d (O, A, X)] \\ = E_{P^{marg} \times P_{X}} [{b^{*} (X, Z) - E_{P^{marg} \times P_{X}} [b^{*} (X, Z) | Z]} b^{*} (X, Z)] \\ = E_{P^{marg} \times P_{X}} [{b^{*} (X, Z) - E_{P^{marg} \times P_{X}} [b^{*} (X, Z) | Z]}^{2}] . \end{array}

Consequently, b^* (X, Z) = E_{P^marg × P_X} [b^* (X, Z) |Z] ≡ q (Z) P_X – a.e. (X) and hence μ_X –a.e. (X) by absolute continuity. The result follows because b^* (x, Z) = E_{P^marg × P_X} [d (O, A, X) |X = x, Z] = E_P^marg [d (O, A, X) |Z].

2.4. Derivation of Some Formulas in Section 5.3, ORR-I

2.4.1. Derivation of Formula (26) in ORR-I

Any element

\sum_{k = 0}^{K} {d_{k} ({\bar{O}}_{k}, {\bar{A}}_{k}) - E [d_{k} ({\bar{O}}_{k}, {\bar{A}}_{k}) | {\bar{O}}_{k}, {\bar{A}}_{k - 1}]}

of the set Λ is the sum of K + 1 uncorrelated terms because for any l, j such that 0 ≤ l < l + j ≤ K + 1,

\begin{array}{l} E [{d_{l + j} ({\bar{O}}_{l + j}, {\bar{A}}_{l + j}) - E [d_{l + j} ({\bar{O}}_{l + j}, {\bar{A}}_{l + j}) | {\bar{O}}_{l + j}, {\bar{A}}_{l + j - 1}]} \times \\ {d l ({\bar{O}}_{l}, {\bar{A}}_{l}) - E [d_{l} ({\bar{O}}_{l}, {\bar{A}}_{l}) | {\bar{O}}_{l}, {\bar{A}}_{l - 1}]}] \\ = E [E [{d_{l + j} ({\bar{O}}_{l + j}, {\bar{A}}_{l + j}) - E [d_{l + j} ({\bar{O}}_{l + j}, {\bar{A}}_{l + j}) | {\bar{O}}_{l + j}, {\bar{A}}_{l + j - 1}]} | {\bar{O}}_{l + j}, {\bar{A}}_{l + j - 1}] \times \\ {d_{l} ({\bar{O}}_{l}, {\bar{A}}_{l}) - E [d_{l} ({\bar{O}}_{l}, {\bar{A}}_{l}) | {\bar{O}}_{l}, {\bar{A}}_{l - 1}]}] \\ = E [0 \times {d_{l} ({\bar{O}}_{l}, {\bar{A}}_{l}) - E [d_{l} ({\bar{O}}_{l}, {\bar{A}}_{l}) | {\bar{O}}_{l}, {\bar{A}}_{l - 1}]}] = 0. \end{array}

Thus, Λ is equal to Λ₀ ⊕ Λ₁ ⊕ . . . ⊕ Λ_K where

Λ_{k} \equiv {d_{k} ({\bar{O}}_{k}, {\bar{A}}_{k}) - E [d_{k} ({\bar{O}}_{k}, {\bar{A}}_{k}) | {\bar{O}}_{k}, {\bar{A}}_{k - 1}] : d_{k} arbitrary scalar function}

and ⊕ stands for the direct sum operator. Then,

Π [Q | Λ] = \sum_{k = 0}^{K} Π [Q | Λ_{k}]

and it can be easily checked that Π [Q|Λ_k] = E (Q|Ō_k, Ā_k) – E [Q|Ō_k, Ā_k–₁].

2.4.2. Derivation of Formula (27) in ORR-I

Applying formula (26, in ORR-I) we obtain

Π [S . (β, γ^{*}, b) | Λ] = \sum_{k = 0}^{K} {E [S . (β, γ^{*}, b) | {\bar{O}}_{k}, {\bar{A}}_{k}] - E [S . (β, γ^{*}, b) | {\bar{O}}_{k}, {\bar{A}}_{k - 1}]} .

So, for k = 0, ..., K,

d_{\cdot, o p t, k}^{b} ({\bar{O}}_{k}, {\bar{A}}_{k}) = E [S . (β, γ^{*}, b) | {\bar{O}}_{k}, {\bar{A}}_{k}] .

But,

\begin{array}{l} E [S . (β, γ^{*}, b) | {\bar{O}}_{k}, {\bar{A}}_{k}] \\ = \int_{X_{p o s}} b_{\cdot} (x, z) E [ω_{K}^{x} ({\bar{O}}_{K}, {\bar{A}}_{K}) {u (O, A) - h_{\cdot} (x, Z; β)} | {\bar{O}}_{k}, {\bar{A}}_{k}] d P_{X} (x) \\ = \int_{X_{p o s}} b_{\cdot} (x, z) ω_{k}^{x} ({\bar{O}}_{k}, {\bar{A}}_{k}) \times \\ \times E [{\underline{ω}}_{k, K}^{x} ({\bar{O}}_{K}, {\bar{A}}_{K}) {u (O, A) - h_{\cdot} (x, Z; β)} | {\bar{O}}_{k}, {\bar{A}}_{k}] d P_{X} (x) \\ = \int_{X_{p o s}} b_{\cdot} (x, z) ω_{k}^{x} ({\bar{O}}_{k}, {\bar{A}}_{k}) \times \\ \times E [{\underline{ω}}_{k, K}^{x} ({\bar{O}}_{K}, {\bar{A}}_{K}) {u (O, A) - h_{\cdot} (x, Z; β)} | {\bar{O}}_{k}, {\bar{A}}_{k} = g_{x} ({\bar{O}}_{k})] d P_{X} (x) . \end{array}

So formula ((27), ORR-I) is proved if we show that

\begin{array}{l} E [{\underline{ω}}_{k, K}^{x} ({\bar{O}}_{K}, {\bar{A}}_{K}) {u (O, A) - h_{\cdot} (x, Z; β)} | {\bar{O}}_{k}, {\bar{A}}_{k} = g_{x} ({\bar{O}}_{k})] = \\ {φ_{k + 1}^{x} ({\bar{O}}_{k}) - h_{\cdot} (x, Z; β)} . \end{array}

(5)

This follows immediately from the preceding proof of Result (b) of Section 3.2. Specifically, it was shown there that

E [{\underline{ω}}_{k, K}^{x} ({\bar{O}}_{K}, {\bar{A}}_{K}) {u (O, A) - h_{\cdot} (x, Z; β)} | {\bar{O}}_{k + 1}, {\bar{A}}_{k} = g_{x} ({\bar{O}}_{k})] = φ_{k + 2} ({\bar{O}}_{k + 1}) .

Consequently, the left hand side of (5) is equal to

\begin{array}{l} E [E [{\underline{ω}}_{k, K}^{x} ({\bar{O}}_{K}, {\bar{A}}_{K}) {u (O, A) - h_{\cdot} (x, Z; β)} | {\bar{O}}_{k + 1}, {\bar{A}}_{k} = g_{x} ({\bar{O}}_{k})] | \\ {\bar{O}}_{k}, {\bar{A}}_{k} = g_{x} ({\bar{O}}_{k})] \\ = E [φ_{k + 2}^{x} ({\bar{O}}_{k + 1}) | {\bar{O}}_{k}, {\bar{A}}_{k} = g_{x} ({\bar{O}}_{k})] \\ - h_{\cdot} (x, Z; β) E [{\underline{ω}}_{k, K}^{x} ({\bar{O}}_{K}, {\bar{A}}_{K}) | {\bar{O}}_{k}, {\bar{A}}_{k} = g_{x} ({\bar{O}}_{k})] \\ = φ_{k + 1}^{x} ({\bar{O}}_{k}) - h_{\cdot} (x, Z; β) \end{array}

where the last equality follows by the definition of $φ_{k + 1}^{x} ({\bar{O}}_{k})$ and the fact that $E [{\underline{ω}}_{k, K}^{x} ({\bar{O}}_{K}, {\bar{A}}_{K}) | {\bar{O}}_{k}, {\bar{A}}_{k} = g_{x} ({\bar{O}}_{k})] = 1$ (as this is just the function $φ_{k + 1}^{x} ({\bar{O}}_{k})$ resulting from applying the integration to the utility u (O, A) = 1).

2.4.3. Derivation of Formula (31) in ORR-I

It suffices to show that $S_{a u g} (γ, d_{\cdot, β, γ, τ, o p t}^{b}) = Σ_{k = 0}^{K} \int_{X_{p o s}} b (x, Z) M_{k} (x; β, γ, τ) d P_{X} (x)$ where

M_{k} (x; β, γ, τ) \equiv {ω_{k}^{x} (γ) - ω_{k - 1}^{x} (γ)} {φ_{k + 1}^{x} ({\bar{O}}_{k}; τ) - h_{\cdot} (x, Z; β)} .

But by definition

\begin{array}{l} S_{a u g} (γ, d_{\cdot, β, γ, τ, o p t}^{b}) = \\ = \sum_{k = 0}^{K} {d_{\cdot, β, γ, τ, o p t, k}^{b} ({\bar{O}}_{k}, {\bar{A}}_{k}) - E_{γ} [d_{\cdot, β, γ, τ, o p t, k}^{b} ({\bar{O}}_{k}, {\bar{A}}_{k}) | {\bar{O}}_{k}, {\bar{A}}_{k - 1}]} \\ = \sum_{k = 0}^{K} {\int_{X_{p o s}} b (x, Z) ω_{k}^{x} (γ) {φ_{k + 1}^{x} ({\bar{O}}_{k}; τ) - h_{\cdot} (x, Z; β)} d P_{X} (x) - \\ - E [\int_{X_{p o s}} b (x, Z) ω_{k}^{x} (γ) {φ_{k + 1}^{x} ({\bar{O}}_{k}; τ) - h_{\cdot} (x, Z; β)} d P_{X} (x) | {\bar{O}}_{k}, {\bar{A}}_{k - 1}]} \\ = \sum_{k = 0}^{K} \int_{X_{p o s}} b (x, Z) {ω_{k}^{x} (γ) - E_{γ} [ω_{k}^{x} (γ) | {\bar{O}}_{k}, {\bar{A}}_{k - 1}]} \times \\ {φ_{k + 1}^{x} ({\bar{O}}_{k}; τ) - h_{\cdot} (x, Z; β)} d P_{X} (x) \\ = \sum_{k = 0}^{K} \int_{X_{p o s}} b (x, Z) {ω_{k}^{x} (γ) - ω_{k - 1}^{x} (γ)} {φ_{k + 1}^{x} ({\bar{O}}_{k}; τ) - h_{\cdot} (x, Z; β)} d P_{X} (x) \end{array}

where the last equality follows because

\begin{array}{l} E_{γ} [ω_{k}^{x} (γ) | {\bar{O}}_{k}, {\bar{A}}_{k - 1}] = \\ = ω_{k - 1}^{x} (γ) E_{γ} [\frac{I_{{g_{x, k} ({\bar{O}}_{k})}} (A_{k})}{λ_{k} (A_{k} | {\bar{O}}_{k}, {\bar{A}}_{k - 1})} | {\bar{O}}_{k}, {\bar{A}}_{k - 1}] \\ = ω_{k - 1}^{x} (γ) \frac{E_{γ} [I_{{g_{x, k} ({\bar{O}}_{k})}} (A_{k}) | {\bar{O}}_{k}, {\bar{A}}_{k - 1}]}{λ_{k} (g_{x, k} ({\bar{O}}_{k}) | {\bar{O}}_{k}, {\bar{A}}_{k - 1})} = ω_{k - 1}^{x} (γ) . \end{array}

2.5. Proof that b_{·, opt} is Optimal

Write for short, β̂_· (b) ≡ β̂_· (b, d̂_{·, opt}),

\begin{array}{l} Q_{p a r} (b) \equiv \int_{X_{p o s}} b (x, Z) Q_{p a r} (x; β^{†}, γ^{†}, τ^{†}) d P_{X} (x) and \\ Q_{s e m} (b) \equiv \int_{X_{p o s}} {b (x, Z) - \bar{b} (Z)} Q_{s e m} (x; β^{†}, γ^{†}, τ^{†}) d P_{X} (x) . \end{array}

We will show that J_· (b) = E {Q_· (b) Q_· (b_{·, opt})^′} for · = par and · = sem. When either model (16, ORR-I) or (29, ORR-I) are correct, β^* = β^†. Consequently, for · = par we have that J_par (b) is equal to

\begin{array}{l} - E {{\int_{X_{p o s}} b (x, Z) \frac{\partial}{\partial β} h_{par} (x, Z; β) |}_{β^{†}} d P_{X} (x)} \\ = E [\int_{X_{p o s}} b (x, Z) d P_{X} (x) \times \\ \times {\int_{X_{p o s}} b_{p a r, o p t} (x, Z) E {Q_{p a r} (x; β^{†}, γ^{†}, τ^{†}) Q_{p a r} {(\tilde{x}; β^{†}, γ^{†}, τ^{†})}^{'} | Z} d P_{X} (\tilde{x})}] \\ = E [{\int_{X_{p o s}} b (x, Z) Q_{p a r} (x; β^{†}, γ^{†}, τ^{†}) d P_{X} (x)} \times \\ \times {\int_{X_{p o s}} b_{p a r, o p t} (x, Z) Q_{p a r} {(\tilde{x}; β^{†}, γ^{†}, τ^{†})}^{'} d P_{X} (\tilde{x})}] \\ = E {Q_{p a r} (b) Q_{p a r} {(b_{p a r, o p t})}^{'}} . \end{array}

For · = sem and with the definitions b͂ (x, Z) ≡ b (x, Z) – b̄ (Z) and Q͂_sem (x͂; β^†, γ^†, τ^†) ≡ Q_sem (x͂; β^†, γ^†, τ^†) – Q̄_sem (x͂; β^†, γ^†, τ^†), the same argument yields J_sem (b) equal to

\begin{array}{l} E [{\int_{X_{p o s}} \tilde{b} (x, Z) {\tilde{Q}}_{sem} (\tilde{x}; β^{†}, γ^{†}, τ^{†}) d P_{X} (x)} \\ \times {\int_{X_{p o s}} {\tilde{b}}_{s e m, o p t} (x, Z) {\tilde{Q}}_{s e m} {(\tilde{x}; β^{†}, γ^{†}, τ^{†})}^{'} d P_{X} (\tilde{x})}] \\ = E [{\int_{X_{p o s}} \tilde{b} (x, Z) Q_{s e m} (\tilde{x}; β^{†}, γ^{†}, τ^{†}) d P_{X} (x)} \\ {\int_{X_{p o s}} {\tilde{b}}_{s e m, o p t} (x, Z) Q_{s e m} {(\tilde{x}; β^{†}, γ^{†}, τ^{†})}^{'} d P_{X} (\tilde{x})}] \\ = E {Q_{s e m} (b) Q_{s e m} {(b_{p a r, o p t})}^{'}} . \end{array}

Now, with var_A (β̂_· (b)) denoting the asymptotic variance of β̂_· (b), we have that from expansion ((32) in ORR-I)

v a r_{A} ({\hat{β}}_{\cdot} (b)) = var {E {[Q_{\cdot} (b) Q_{\cdot} {(b_{\cdot, o p t})}^{'}]}^{- 1} Q_{\cdot} (b)}

and consequently

\begin{array}{l} c o v_{A} ({\hat{β}}_{\cdot} (b), {\hat{β}}_{\cdot} (b_{\cdot, o p t})) = \\ = E {[Q_{\cdot} (b) Q_{\cdot} {(b_{\cdot, o p t})}^{'}]}^{- 1} cov (Q_{\cdot} (b), Q_{\cdot} (b_{\cdot, o p t})) E {[Q_{\cdot} {(b_{\cdot, o p t})}^{\otimes 2}]}^{- 1} \\ = E {[Q_{\cdot} {(b_{\cdot, o p t})}^{\otimes 2}]}^{- 1} = v a r_{A} ({\hat{β}}_{\cdot} (b_{\cdot, o p t})) . \end{array}

Thus, 0 ≤ var_A (β̂_· (b) – β̂_· (b_{·, opt})) = var_A (β̂_· (b)) + var_A (β̂_· (b_{·, opt}) – 2cov_A (β̂_· (b), β̂_· (b_{·, opt})) = var_A (β̂_· (b)) – var_A (β̂_· (b_{·, opt})) which concludes the proof.

3. Confidence Set for x_opt (z) when $X$ is Finite and h_· (z, x; β) is Linear in β

We first prove the assertion that the computation of the confidence set B_b entails an algorithm for determining if the intersection of $# (X) - 1$ half spaces in ℜ^p and a ball in ℜ^p centered at the origin is non-empty. To do so, first note that linearity implies that $h_{\cdot} (z, x; β) = Σ_{j = 1}^{p} s_{j} (x, z) β_{j}$ for some fixed functions s_j, j = 1, ..., p. Let $N = # (X)$ and write $X = {x_{1}, \dots, x_{N}}$ . The point x_l is in B_b iff

there exists β in C_{b} : \sum_{j = 1}^{p} [s_{j} (x_{l}, z) - s_{j} (x_{k}, z)] β_{i} \geq 0 for all x_{k} \in X - {x_{l}} .

(6)

Define the p × 1 vector $v_{l}^{k}$ whose j^th entry is equal to s_j (x_l, z) – s_j (x_k, z), j = 1, ..., p. Define also the vectors $v_{l}^{* k} = v_{l}^{k}^{'} {\hat{Γ}}_{\cdot} (b)$ and the constants $a_{l}^{k} = v_{l}^{k}^{'} {\hat{β}}_{\cdot} (b, {\hat{d}}_{\cdot, o p t}^{b})$ . Then $Σ_{j = 1}^{p} [s_{j} (x_{l}, z) - s_{j} (x_{k}, z)] β_{j} > 0$ iff $v_{j}^{* k} {\hat{Γ}}_{\cdot} {(b)}^{- 1 / 2} \times (β - {\hat{β}}_{\cdot} (b, {\hat{d}}_{\cdot, o p t}^{b})) > a_{l}^{k}$ . Noting that β in C_b iff ${\hat{Γ}}_{\cdot} {(b)}^{- 1 / 2} (β - {\hat{β}}_{\cdot} (b, {\hat{d}}_{\cdot, o p t}^{b}))$ is in the ball

U \equiv {u \in ℝ^{p} : u^{'} u \leq χ_{p, 1 - α}^{2}}

we conclude that the condition in the display (6) is equivalent to

there exists u in U such that v_{l}^{* k}^{'} u > a_{l}^{k} for k = 1, \dots, N, k \neq l .

The set ${u \in ℝ^{p} : v_{j}^{* k}^{'} u = a_{l}^{k}}$ is a hyper-plane in ℜ^p which divides the Euclidean space ℜ^p into two half-spaces, one of which is ${u \in ℝ^{p} : v_{j}^{* k}^{'} u > a_{l}^{k}}$ . Thus, the condition in the last display imposes that the intersection of N – 1 half-spaces (each one defined by the condition $v_{l}^{* k}^{'} u > a_{l}^{k}$ for each k) and the ball $U$ is non-empty.

Turn now to the construction of a confidence set $B_{b}^{*}$ that includes B_b. Our construction relies on the following Lemma.

Lemma. Let

D = {u \in R^{p} : {(u - u_{0})}^{'} Σ^{- 1} (u - u_{0}) \leq c_{0}}

where u₀ is a fixed p × 1 real valued vector and Σ is a fixed non-singular p × p matrix.

Let α be a fixed, non-null, p×1 real valued vector. Let τ₀ ≡ α′ u₀ and α^* = Σ^1/2α. Assume that α₁ ≠ 0. Let, $v_{1}^{*}$ be the p×1 vector ${(- α_{1}^{* - 1} τ_{0}, 0, \dots, 0)}^{'}$ . Let ϒ be the linear space generated by the p×1 vectors $v_{2}^{*} = {(α_{1}^{* - 1} α_{2}^{*}, 1, 0, 0, \dots, 0)}^{'}$ , $v_{3}^{*} = {(α_{1}^{* - 1} α_{3}^{*}, 0, 1, 0, \dots, 0)}^{'}, \dots, v_{p}^{*} = {(α_{1}^{* - 1} α_{p}^{*}, 0, 0, 0, \dots 1)}^{'}$ and define

\begin{array}{l} v_{1, p r o j}^{*} = v_{1}^{*} - Π [v_{1}^{*} | ϒ] \\ = v_{1}^{*} - V^{*} {(V^{*}^{'} V^{*})}^{- 1} V^{*}^{'} v_{1}^{*} \end{array}

where

V^{*} = (v_{2}^{*}, \dots, v_{p}^{*}) .

Then there exists $u \in D$ satisfying

α^{'} u = 0

if and only if

c_{0} - {‖ v_{1, p r o j}^{*} ‖}^{2} \geq 0.

Proof

α^{'} u = 0 \Leftrightarrow α^{'} Σ^{1 / 2} Σ^{- 1 / 2} (u - u_{0}) = - α^{'} u_{0} .

Then, with τ₀ ≡ −α′ u₀ and α* = Σ^1/2α, we conclude that there exists $u \in D$ satisfying α′ u = 0 if and only if there exists u* ∈ R^p such that

u^{*}^{'} u^{*} \leq c_{0} and - α^{*}^{'} u^{*} = τ_{0} .

Now, by the assumption $α_{1}^{*} \neq 0$ we have −α*^′ u* = τ₀ iff $u_{1} = - α_{1}^{* - 1} \times [τ_{0} + Σ_{j = 2}^{p} α_{j}^{*} u_{j}^{*}]$ . Thus, the collection of all vectors u* satisfying −α*^′ u* = τ₀ is the linear variety

v_{1}^{*} + ϒ = v_{1, p r o j}^{*} + ϒ

where $v_{j}^{*}^{'} s$ and ϒ are defined in the statement of the lemma. The vector $v_{1, p r o j}^{*}$ is the residual from the (Euclidean) projection of $v_{1}^{*}$ into the space ϒ.

Thus, −α*^′ u* τ₀ iff $u^{*} = v_{1, p r o j}^{*} + v_{ϒ}^{*}$ for some $v_{ϒ}^{*} \in ϒ$ . Consequently, by the orthogonality of $v_{1, p r o j}^{*}$ with ϒ we have that for u* satisfying −α*^′ u* = τ₀ it holds that

\begin{array}{l} u^{*}^{'} u^{*} = {‖ u^{*} ‖}^{2} \\ = {‖ v_{1, p r o j}^{*} ‖}^{2} + {‖ v_{ϒ}^{*} ‖}^{2} . \end{array}

Therefore, since ${‖ v_{ϒ}^{*} ‖}^{2}$ is unrestricted,

u^{*}^{'} u^{*} \leq c_{0} for some u^{*} satisfying - α^{*}^{'} u^{*} = τ_{0}

if and only if

c_{0} - {‖ v_{1, p r o j}^{*} ‖}^{2} \geq 0.

(7)

This concludes the proof of the Lemma.

To construct the set $B_{b}^{*}$ we note that the condition in the display (6) implies the negation, for every subset $X_{(- l)}$ of $X - {x_{l}}$ , of the statement

\sum_{j = 1}^{p} \sum_{k \in X_{(- 1)}} [s_{j} (x_{l}, z) - s_{j} (x_{k}, z)] β_{j} < 0 for all β \in C_{b} .

(8)

Thus, suppose that for a given x_l we find that (8) holds for some subset $X_{(- 1)}$ of $X - {x_{l}}$ , then we know that x_l cannot be in B_b. The proposed confidence set $B_{b}^{*}$ is comprised by the points in $X$ for which condition (8) cannot be negated for all subsets $X_{(- 1)}$ . The set $B_{b}^{*}$ is conservative (i.e. it includes B_b but is not necessarily equal to B_b) because the simultaneous negation of the statement (8) for all $X_{(- l)}$ does not imply the statement (6). To check if condition (8) holds for any given subset $X_{(- l)}$ and x_l, we apply the result of Lemma as follows. We define the vector α ∈ ℝ^p whose j^th component is equal to $Σ_{k \in X_{(- l)}} [s_{j} (x_{l}, z) - s_{j} (x_{k}, z)]$ , j = 1,..., p and the vector $u_{0} = {\hat{β}}_{\cdot} (b, {\hat{d}}_{\cdot, o p t}^{b}) \in ℝ^{p}$ . We also define the constant $c_{0} = χ_{p, 1 - α}^{2}$ , and the matrix Σ = Γ̂_· (b). We compute the vectors $α^{*} = Σ^{1 / 2} α$ , $v_{1}^{*}, \dots, v_{p}^{*}$ and the matrix V* as defined in Lemma. We then check if the condition (7) holds. If it holds then this implies that the hyperplane comprised by the set of β’s that satisfy the condition in display (8) with the < sign replaced by the = sign, intersects the confidence ellipsoid C_b, in which case we know that (8) is false. If it does not hold, then we check if condition

\sum_{j = 1}^{p} \sum_{k \in X_{(- 1)}} [s_{j} (x_{l}, z) - s_{j} (x_{k}, z)] {\hat{β}}_{\cdot} {(b, {\hat{d}}_{\cdot, o p t}^{b})}_{j} < 0

(9)

holds. If (9) does not hold, then we conclude that (8) is false for this choice of $X_{(- 1)}$ . If (9) holds, then we conclude that (8) is true and we then exclude x_l from the set $B_{b}^{*}$ .

4. Positivity Assumption: Example

Suppose that K = 1 and that $R_{k} = R_{k}^{g} = 1$ with probability 1 for k = 0, 1, so that no subject dies in neither the actual world nor in the hypothetical world in which g is enforced in the population. Thus, for k = 0, 1, O_k = L_k since both T_k and R_k are deterministic and hence can be ignored. Suppose that L_k and A_k are binary variables (and so are therefore $A_{k}^{g}$ and $L_{k}^{g}$ ) and that the treatment regime g specifies that

g_{0} (l_{0}) = 1 - l_{0} and g_{1} (l_{0}, l_{1}) = l_{0} (1 - l_{1}) .

Assume that

0 < ℙ (L_{0}^{g} = l_{0}, L_{1}^{g} = l_{1}) < 1, l_{0} = 0, 1; l_{1} = 0, 1.

(10)

Assumption PO imposes two requirements,

ℙ [λ_{0} (A_{0}^{g} | L_{0}^{g}) > 0] = 1 and

(11)

ℙ [λ_{1} (A_{1}^{g} | L_{0}^{g}, L_{1}^{g}, A_{0}^{g}) > 0] = 1.

(12)

Because by definition of regime g, $A_{0}^{g} = 1 - L_{0}^{g}$ , then requirement (11) can be re-expressed as

1 = ℙ (L_{0}^{g} = 0) I_{(0, 1]} (λ_{0} (1 | 0)) + ℙ (L_{0}^{g} = 1) I_{(0, 1]} (λ_{0} (0 | 1)) .

Since indicators can only take the values 0 or 1 and $ℙ (L_{0}^{g} = l_{0}) < 1$ , l₀ = 0, 1 (by assumption (10)), the preceding equality is equivalent to

I_{(0, 1]} (λ_{0} (1 | 0)) = 1 and I_{(0, 1]} (λ_{0} (0 | 1)) = 1,

that is to say,

λ_{0} (1 | 0) > 0 and λ_{0} (0 | 1) > 0.

By the definition of λ₀ (·|·) (see (3) in ORR-I), the last display is equivalent to

ℙ (A_{0} = 1 | L_{0} = 0) > 0 and ℙ (A_{0} = 0 | L_{0} = 1) > 0.

(13)

Likewise, because $A_{1}^{g} = L_{0}^{g} (1 - L_{1}^{g})$ , and because $ℙ (L_{0}^{g} = l_{0}, L_{1}^{g} = l_{1}, A_{0}^{g} = l_{0}) = 0$ by the fact that $A_{0}^{g} = 1 - L_{0}$ , requirement (12) can be re-expressed as

\begin{array}{l} 1 = ℙ (L_{0}^{g} = 0, L_{1}^{g} = 0, A_{0}^{g} = 1) I_{(0, 1]} (λ_{1} (0 | 0, 0, 1)) \\ + ℙ (L_{0}^{g} = 0, L_{1}^{g} = 1, A_{0}^{g} = 1) I_{(0, 1]} (λ_{1} (0 | 0, 1, 1)) \\ + ℙ (L_{0}^{g} = 1, L_{1}^{g} = 0, A_{0}^{g} = 0) I_{(0, 1]} (λ_{1} (1 | 1, 0, 0)) \\ + ℙ (L_{0}^{g} = 1, L_{1}^{g} = 1, A_{0}^{g} = 0) I_{(0, 1]} (λ_{1} (0 | 1, 1, 0)) \end{array}

or equivalently, (again because the events $(L_{0}^{g} = l_{0}, L_{1}^{g} = l_{1}, A_{0}^{g} = 1 - l_{0})$ and $(L_{0}^{g} - l_{0}, L_{1}^{g} - l_{1})$ have the same probability by $ℙ (L_{0}^{g} = l_{0}, L_{1}^{g} = l_{1}, A_{0}^{g} = l_{0}) = 0$ ,

\begin{array}{l} 1 = ℙ (L_{0}^{g} = 0, L_{1}^{g} = 0) I_{(0, 1]} (λ_{1} (0 | 0, 0, 1)) + ℙ (L_{0}^{g} = 0, L_{1}^{g} = 1) \\ \times I_{(0, 1]} (λ_{1} (0 | 0, 1, 1)) + ℙ (L_{0}^{g} = 1, L_{1}^{g} = 0) I_{(0, 1]} (λ_{1} (1 | 1, 0, 0)) \\ + ℙ (L_{0}^{g} = 1, L_{1}^{g} = 1) I_{(0, 1]} (λ_{1} (0 | 1, 1, 0)) . \end{array}

Under the assumption (10), the last display is equivalent to

\begin{array}{l} λ_{1} (0 | 0, 0, 1) > 0, λ_{1} (0 | 0, 1, 1) > 0, \\ λ_{1} (1 | 1, 0, 0) > 0 and λ_{1} (0 | 1, 1, 0) > 0 \end{array}

which, by the definition of λ₀ (·|·, ·, ·) in ((3), ORR-I), is, in turn, the same as

\begin{array}{l} ℙ (A_{1} = 0 | L_{0} = 0, L_{1} = 0, A_{0} = 1) > 0, ℙ (A_{1} = 0 | L_{0} = 0, L_{1} = 1, A_{0} = 1) > 0 \\ ℙ (A_{1} = 1 | L_{0} = 1, L_{1} = 0, A_{0} = 0) > 0, ℙ (A_{1} = 0 | L_{0} = 1, L_{1} = 1, A_{0} = 0) > 0. \end{array}

(14)

We conclude that in this example, the assumption PO is equivalent to the conditions (13) and (14). We will now analyze what these conditions encode.

Condition (13) encodes two requirements:

i) the requirement that in the actual world there exist subjects with L₀ = 1 and L₀ = 0 (i.e. that the conditioning events L₀ = 1 and L₀ = 0 have positive probabilities), for otherwise at least one of the conditional probabilities in (13) would not be defined, and
ii) the requirement that in the actual world there be subjects with L₀ = 0 that take treatment A₀ = 1 and subjects with L₀ = 1 that take treatment A₀ = 0, for otherwise at least one of the conditional probabilities in (13) would be 0.

Condition i) is automatically satisfied, i.e. it does not impose a restriction on the law of L₀, by the fact that $L_{0}^{g} = L_{0}$ (since baseline covariates cannot be affected by interventions taking place after baseline) and the fact that we have assumed that $ℙ (L_{0}^{g} = l_{0}) > 0$ , l₀ = 0, 1.

Condition ii) is indeed a non-trivial requirement and coincides with the interpretation of the PO assumption given in section 3.1 for the case k = 0. Specifically, in the world in which g were to be implemented there would exist subjects with L₀ = 0. In such world the subjects with L₀ = 0 would take treatment $A_{0}^{g} = 1$ , then the PO assumption for k = 0 requires that in the actual world there also be subjects with L₀ = 0 that at time 0 take treatment A₀ = 1. Likewise the PO condition also requires that for k = 0 the same be true with 0 and 1 reversed in the right hand side of each of the equalities of the preceding sentence. A key point is that (11) does not require that in the observational world there be subjects with L₀ = 0 that take A₀ = 0, nor subjects with L₀ = 1 that take A₁ = 1. The intuition is clear. If we want to learn from data collected in the actual (observational) world what would happen in the hypothetical world in which everybody obeyed regime g, we must observe people in the study that obeyed the treatment at every level of L₀ for otherwise if, say, nobody in the actual world with L₀ = 0 obeyed regime g there would be no way to learn what the distribution of the outcomes for subjects in that stratum would be if g were enforced. However, we don t care that there be subjects with L₀ = 0 that do not obey g, i.e. that take A₀ = 0, because data from those subjects are not informative about the distribution of outcomes when g is enforced.

Condition (14) encodes two requirements:

iii) the requirement that in the actual world there be subjects in the four strata (L₀ = 0, L₁ = 0, A₀ = 1), (L₀ = 0, L₁ = 1, A₀ = 1), (L₀ = 1, L₁ = 0, A₀ = 0) and (L₀ = 1, L₁ = 1, A₀ = 0) (i.e. that the conditioning events in the display (14) have positive probabilities), for otherwise at least one of the conditional probabilities would not be defined, and
iv) the requirement that in the actual world there be subjects in every one of the strata (L₀ = 0, L₁ = 0, A₀ = 1), (L₀ = 0, L₁ = 1, A₀ = 1), (L₀ = 1, L₁ = 1, A₀ = 0) that have A₁ = 0 at time 1 and the requirement that there be subjects in stratum (L₀ = 1, L₁ = 0, A₀ = 0) that have A₁ = 1 at time 1, for otherwise at least one of the conditional probabilities in (14) would be 0.

Given condition ii) and the sequential randomization (SR) and consistency (C) assumptions, condition iii) is automatically satisfied, i.e. it does not impose a further restriction on the joint distribution of (L₀, L₁, A₀). To see this, first note that by condition (ii) the strata (L₀ = 0, A₀ = 1) and (L₀ = 1, A₀ = 0) are non-empty. So condition (iii) is satisfied provided

ℙ (L_{1} = l_{1} | L_{0} = 0, A_{0} = 1) > 0 and ℙ (L_{1} = l_{1} | L_{0} = 1, A_{0} = 0) > 0 for l_{1} = 0, 1.

But

\begin{array}{l} ℙ (L_{1} = l_{1} | L_{0} = 0, A_{0} = 1) = ℙ (L_{1}^{g} = l_{1} | L_{0} = 0, A_{0} = 1) by assumption (C) \\ = ℙ (L_{1}^{g} = l_{1} | L_{0} = 0) by assumption (SR) \\ = ℙ (L_{1}^{g} = l_{1} | L_{0}^{g} = 0) by assumption (C) \end{array}

and $ℙ (L_{1}^{g} = l_{1} | L_{0}^{g} = 0) > 0$ by (10). An analogous argument shows that $ℙ (L_{1} = l_{1} | L_{0} = 1, A_{0} = 0) > 0$ . Finally, condition (iv) is a formalization our interpretation of assumption PO in section 3.1 for k = 1. In the world in which g was implemented there would exist subjects that would have all four combination of values for $(L_{0}^{g}, L_{1}^{g})$ . However, subjects with $L_{0}^{g} = l_{0}$ will only have $A_{0}^{g} = 1 - l_{0}$ , so in this hypothetical world we will see at time 1 only four possible recorded histories, $(L_{0}^{g} = 0, L_{1}^{g} = 0, A_{0}^{g} = 1)$ , $(L_{0}^{g} = 0, L_{1}^{g} = 1, A_{0}^{g} = 1)$ , $(L_{0}^{g} = 1, L_{1}^{g} - 0, A_{0}^{g} = 0)$ and $(L_{0}^{g} = 1, L_{1}^{g} = 1, A_{0}^{g} = 0)$ . In this hypothetical world subjects with any of the first three possible recorded histories will take $A_{1}^{g} = 0$ and subjects with the last one will take $A_{1}^{g} = 1$ . Thus, in the actual world we must require that there be subjects in each of the first three strata (L₀ = 0, L₁ = 0, A₀ = 1), (L₀ = 0, L₁ = 1, A₀ = 1), (L₀ = 1, L₁ = 0, A₀ = 0) that take A₁ = 0 and subjects in the stratum (L₀ = 1, L₁ = 1, A₀ = 0) that take A₁ = 1. A point of note is that we don t make any requirement about the existence of subjects in strata other than the four mentioned in (iii) or about the treatment that subjects in these remaining strata take. The reason is that subjects that are not in the four strata of condition (iii) have already violated regime g at time 0 so they are uninformative about the outcome distribution under regime g.

Footnotes

This work was supported by NIH grant R01 GM48704.

References

Orellana L, Rotnitzky A, Robins JM.2010Dynamic regime marginal structural mean models for estimation of optimal dynamic treatment regimes, Part I: Main content The International Journal of Biostatistics 62Article 7. [PubMed] [Google Scholar]

[b1-ijb1242] Orellana L, Rotnitzky A, Robins JM.2010Dynamic regime marginal structural mean models for estimation of optimal dynamic treatment regimes, Part I: Main content The International Journal of Biostatistics 62Article 7. [PubMed] [Google Scholar]

PERMALINK

Dynamic Regime Marginal Structural Mean Models for Estimation of Optimal Dynamic Treatment Regimes, Part II: Proofs of Results^*

Liliana Orellana

Andrea Rotnitzky

James M Robins

Abstract

1. Introduction

2. Proof of Claims in ORR-I

2.1. Proof of Lemma 1

2.2. Proof of the Assertions in Section 3.2, ORR-I

2.2.1. Proof of Item (a)

2.2.2. Proof of Item (b)

2.3. Proof of Lemma 2 in ORR-I

2.4. Derivation of Some Formulas in Section 5.3, ORR-I

2.4.1. Derivation of Formula (26) in ORR-I

2.4.2. Derivation of Formula (27) in ORR-I

2.4.3. Derivation of Formula (31) in ORR-I

2.5. Proof that b_{·, opt} is Optimal

3. Confidence Set for x_opt (z) when $X$ is Finite and h_· (z, x; β) is Linear in β

4. Positivity Assumption: Example

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Dynamic Regime Marginal Structural Mean Models for Estimation of Optimal Dynamic Treatment Regimes, Part II: Proofs of Results*

Liliana Orellana

Andrea Rotnitzky

James M Robins

Abstract

1. Introduction

2. Proof of Claims in ORR-I

2.1. Proof of Lemma 1

2.2. Proof of the Assertions in Section 3.2, ORR-I

2.2.1. Proof of Item (a)

2.2.2. Proof of Item (b)

2.3. Proof of Lemma 2 in ORR-I

2.4. Derivation of Some Formulas in Section 5.3, ORR-I

2.4.1. Derivation of Formula (26) in ORR-I

2.4.2. Derivation of Formula (27) in ORR-I

2.4.3. Derivation of Formula (31) in ORR-I

2.5. Proof that b·, opt is Optimal

3. Confidence Set for xopt (z) when X is Finite and h· (z, x; β) is Linear in β

4. Positivity Assumption: Example

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Dynamic Regime Marginal Structural Mean Models for Estimation of Optimal Dynamic Treatment Regimes, Part II: Proofs of Results^*

2.5. Proof that b_{·, opt} is Optimal

3. Confidence Set for x_opt (z) when $X$ is Finite and h_· (z, x; β) is Linear in β