SEMIPARAMETRIC DOSE FINDING METHODS FOR PARTIALLY ORDERED DRUG COMBINATIONS

Matthieu Clertant; Nolan A Wages; John O’Quigley

doi:10.5705/ss.202020.0248

. Author manuscript; available in PMC: 2023 Jan 12.

Published in final edited form as: Stat Sin. 2022;32:1983–2005. doi: 10.5705/ss.202020.0248

SEMIPARAMETRIC DOSE FINDING METHODS FOR PARTIALLY ORDERED DRUG COMBINATIONS

Matthieu Clertant ¹, Nolan A Wages ², John O’Quigley ³

PMCID: PMC9835145 NIHMSID: NIHMS1839647 PMID: 36643072

Abstract

We investigate a statistical framework for Phase I clinical trials that test the safety of two or more agents in combination. For such studies, the traditional assumption of a simple monotonic relation between dose and the probability of an adverse event no longer holds. Nonetheless, the dose toxicity (adverse event) relationship will obey an assumption of partial ordering in that there will be pairs of combinations for which the ordering of the toxicity probabilities is known. Some authors have considered how to best estimate the maximum tolerated dose (a dose providing a rate of toxicity as close as possible to some target rate) in this setting. A related, and equally interesting, problem is to partition the 2-dimensional dose space into two sub-regions: doses with probabilities of toxicity lower and greater than the target. We carry out a detailed investigation of this problem. The theoretical framework for this is the recently presented semiparametric dose finding method. This results in a number of proposals one of which can be viewed as an extension of the Product of Independent beta Priors Escalation method (PIPE). We derive useful asymptotic properties which also apply to the PIPE method when seen as a special case of the more general method given here. Simulation studies provide added confidence concerning the good behaviour of the operating characteristics.

Keywords: Bayesian method, Dose-finding design, Partial ordering, Phase I clinical trials, semiparametric method

1. Introduction

The importance of multi-agent Phase I trials in drug development has grown in recent years. The practical benefits of drug combinations are numerous: several modes of action can be combined or the negative side effects of one drug can potentially be attenuated by the presence of a second compound. The aim of Phase I oncology trials is to find one or more maximum tolerated dose (MTD) combinations having probability of toxicity as close as possible to some threshold α, specified in advance by clinicians (common values are 20%, 25% and 33%). Algorithmic designs remain a popular approach to the identification of the MTD within a discrete set of levels, both for single-agent trials (Storer, 1989), as well as for dual-agent trials (Huang, 2007). These designs make no appeal to any model, and the escalation/de-escalation rules are determined as a function of the most recent set of observations. The simplicity of these methods leads to their frequent implementation in practice. Algorithmic designs have a Markov property, sometimes referred to in this context as a lack-of-memory property. These methods are not efficient and lack desirable statistical properties, such as almost sure convergence. Many model-based designs have been suggested, but their widespread adoption in practice remains an unaccomplished goal. This is even more the case with dual-agent trials because of the complexity and lack of interpretability of some of the proposed models, as well as operational difficulties that can discourage clinical investigators. Model based designs come under two headings: parametrics and non-parametric.

The design and analysis of dual-agent trials can be carried out using Wages, Conaway and O’Quigley (2011), the partial ordering continual reassessment method (CRM) and Wang and Ivanova (2005) with extensions of the CRM; Thall (2003) with a six-parameter logistic-type model and Braun and Wang (2010) with the use of beta distributions and log-linear models. The second one is the class of non-parametric models. For dual-agent trials, we have Mander and Sweeting (2015) with the product of independent beta probabilities escalation design (PIPE) and the Bayesian Optimal Interval (2d-BOIN) method of Lin and Yin (2016). The primary focus of many methods for dual-agent trials has been to find a single MTD for recommendation in phase II studies. However, an arguably more relevant task is centered around the idea that, for combination studies, multiple, essentially, equivalent MTDs may exist. The collection of these MTDs form a maximum tolerated contour (MTC) in two-dimensional space. In this case, the primary objective of the study may be to identify multiple MTD’s for further testing, and investigators may desire a trial designed to meet this objective. This is all the more so as it is becoming common to study more than a single dose in the subsequent dose expansion cohorts (Iasonos and O’Quigley, 2016, DEC).

Our motivation here is to describe a journey that starts with estimation of the MTD and ends with estimation of the MTC. Our signposts are built from illustrations and our ultimate purpose is to allow deeper study of the problem of early phase dose finding for combinations of treatments. We set out on our journey (Section 2) via a look at the landscape, a look at the context in which these problems arise and, specifically, an actual trial carried out at the University of Virginia School of Medicine, and a particular hypothesized set of rates of DLT. Next stop (Section 3) outlines the tools that we will be using in the more usual setting of an MTD target, while Section 4 - our destination is in sight - spells out, step by step, the way in which to solve MTC estimation. Dose finding in two dimensions is necessarily more complex and Section 4.2 examines how actual experimentation can be implemented. Although typically sample sizes will not be large, large sample theory is helpful in providing some confidence in how well the designs should work in practice (Section 5). In particular, Theorem 1 shows that, for all the methods, the almost sure convergence to the MTD is equivalent to the almost sure convergence to the MTC. We have arrived at our destination and Section 6 makes use of what we have learned to carry out a simulated comparison with an available method (PIPE) while Section 7 returns to our original illustration (Section 2) and studies it more deeply.

In addition, the Appendix contains several documents to enable deeper understanding of this work. We include a table that summarizes all of the notation, we look closely at the dose allocation strategie, how to calibrate the methods, numerical experiments in the MTC setting, as well as proofs of theorems and other properties.

2. Context and motivation

We describe a Phase 1 clinical trial on drug combinations that was carried out at the University of Virginia School of Medicine. The trial involved C6 Ceramide NanoLiposome (CNL) and Vinblastine in patients with relapsed/refractory acute myeloid leukemia and patients with untreated acute myeloid leukemia. A grid of 5 by 4 dose-combinations is considered: 5 levels for CNL (from 54 to 215 mg/m²) and 4 levels for Vinblastine (from 0.375 to 2.25 mg/m²). The target rate α was taken to be 20% and the maximum sample size was fixed at 60 patients. This particular real application provides us with the opportunity to illustrate the workings of such a Phase I drug combinations design as well as providing a means to introduce some of the essential ideas developed in this article. In Table 1, a hypothesized dose toxicity function, called Scenario T, is described.

Table 1:

Scenario T corresponds to a hypothetical pattern of toxicity probability in a study of Vinblastine and C6 Ceramide NanoLiposome (CNL). The 5×4 grid of dose combinations is fixed and chosen by the investigator. Dose combinations from the minimal set are shown in bold (see Definition 2).

Toxicity probabilities (%) of dose combinations		Scenario T Drug A1: CNL
Drug A2: Vinblastine	Level (mg/m²)	54	81	122	183	215
	2.25	24	29	34	59	70
	1.5	13	18	22	49	53
	0.75	7	9	16	35	42
	0.375	2	5	9	21	30

Open in a new tab

Let the dose d = (i, j) represents the combination of the i-th dose of an agent A1 and the j-th dose of an agent A2, with i ∈ {1, …, I} and j ∈ {1, …, J}. In Scenario T, I and J are respectively equal to 5 and 4 and dose (2, 3) corresponds to 81 mg/m² of CNL and 1.5 mg/m² of Vinblastine. Let D be the set of all dose combinations: D = {1, …, I} × {1, …, J}. The sequence ${(X_{n}, Y_{n})}_{n \in ℕ}$ is a sample of observations. At step n, corresponding to the n-th patient enrolled in the trial, the variable X_n is the dose selected among the I × J combinations; and the variable Y_n is the observed binary response at this dose taking values {0, 1}; 1 for a dose limiting toxicity (DLT) and 0 otherwise. Each combination of drug d is associated with a toxicity probability noted P_d.

Assumption 1. $\forall n \in ℕ, ℙ (Y_{n} = 1 ∣ X_{n} = d) = P_{d}$ .

In Scenario T, P_(2,3) is equal to 0.18. Note that the ranges {1, …, I} and {1, …, J} are ordered in terms of the probability of toxic response. Such an assumption is common in Phase I trials for cytotoxic agents. The ordering of the marginal doses induces a partial ordering on the full range of doses D. Sign < or ≤ will be used for the total ordering on $ℝ$ and the partial ordering on the set of doses D: (i, j) ≤ (r, s) if and only if i ≤ r and j ≤ s.

Assumption 2. $(i, j) < (r, s) \Rightarrow P_{(i, j)} < P_{(r, s)}$ .

In Scenario T, dose (2, 3) is ordered with dose (3, 3), as are their respective toxicity probabilities: 18% and 22%. The ordering assumption is fundamental to extract information on a dose d from observations on neighboring doses. In particular, given any dose d, the partial ordering allows a decomposition of the set D into four subsets: D = {d} ∪ 𝒜_d ∪ ℬ_d ∪ 𝒞_d, where 𝒜_d = {d′ ∈ D : d′ > d}, ℬ_d = {d′ ∈ D : d′ < d} and 𝒞_d = D \ (𝒜_d ∪ ℬ_d). Those combinations where d belongs to the set 𝒜_d, are associated with toxicity probabilities higher than P_d. The set ℬ_d contains those combinations of doses below d which are associated with toxicity probabilities lower than P_d. The dose combinations in 𝒞_d are not ordered with dose d which means that no prior assumption of order exists between their respective probabilities of toxicity. In Scenario T, the set 𝒞_(2,3) contains doses associated to toxicity probabilities below or above P_(2,3) = 0.18 : e.g. P_(3,1) = 0.09 and P_(1,4) = 0.24. Not unlike the single-agent setting, the primary objective of a Phase I combination study is often to identify a single MTD for further testing in the Phase II setting.

Numerous existing designs try to determine which combination d* among those available in D has a probability of toxicity closest to the target toxicity rate α so that d* = arginf_d∈D |P_d − α|. For Scenario T, the MTD, dose (4, 1), is associated with a toxicity probability 0.21, just above the threshold, α = 0.2, fixed by the investigators. However, in the context of dual-agent dose-finding, the notion of a single MTD is less clear cut. Indeed, aiming for the MTD is justified by an assumption of an ordering on the probabilities of efficacy, corresponding to an analogous ordering of the doses. For a single agent (full ordering), choosing the MTD amounts to maximizing the efficacy probability (probability of observing a sufficient therapeutic effect) under the constraint of being close to some chosen threshold of toxicity. This is no longer true in the dual-agent setting. Indeed, there exist situations in which a particular combination, less toxic than the MTD, may be more efficacious overall (Harrington et al., 2012). In Scenario T, let the efficacy probabilities of doses (4, 1) and (2, 3) be equal to 0.10 and 0.90, respectively. Such a situation is coherent with the partial ordering on D, as dose (2, 3) belongs to 𝒞_(4,1). In this case, the MTD (4, 1) is not particularly advantageous. As a result, a dual-agent trial may aim to locate more than a single MTD, forming an MTC (a cut or a contour) across the dose surface comprised of multiple combinations with similar acceptable toxicity profiles. The objective of the trial then becomes one of determining an MTD curve c* consisting of a set of combinations with toxicity probabilities close to α. In Scenario T, the MTD contour is represented by the line splitting the set of doses in two parts: the dose combinations associated to toxicity probabilities below the target α and those associated to toxicity probabilities above α. The minimal set related to this specific contour contains the closest doses from the line according to the partial ordering (see doses with bold toxicity probabilities in Table 1). This set is defined more formally in Section 4.

However, a design aiming for a single MTD will be of interest in some situations. This is the case if we have too small a number of patients available in order to determine a complete contour of the MTD (20 or 30 patients in a grid of 5 by 4), or if the investigators have some strong prior assumptions or requirements on the set of potential outcomes. We present an example of such an assumption. In the context of CNL and Vinblastine combinations, the investigators may want to reach a dose combination with a level of Vinblastine at least equal to 1.5mg/m², because this agent has previously shown positive results in term of efficacy at this level. The general structure of the semiparametric class of dose finding methods is described by a hierarchical model where the first level deals with the goal of the study - the doses themselves - and the second level is devoted to the dose-response curve. Here, the goal of the study is to find a single MTD. The first level of our model is then a distribution Π on the MTD itself. This distribution is updated in the Bayesian setting after each patient (see Section 3 for further details). In Figure 1, the posterior distribution on the MTD at the end of the trial is summarized by a histogram. Throughout the trial, the next dose combination is selected as the one maximizing the posterior Π according to the observations. The prior distribution on the range of doses D is a matrix of positive weights summing up to 1. It can easily be tuned to allow a progressive exploration in the range of doses or to correspond to expert knowledge: here, if possible, the investigators prefer to increase the level of vinblastine until 1.5 mg/m² (see Section E.2 for the exact prior distribution). This is what is done in the trial. At the beginning, the method recommends increasing the dose levels of vinblastine while keeping the CNL level at 54 mg/m². After 15 patients, the method stays at dose (2, 3). This dose is the one recommended at the end of the trial: 9 patients among 48 treated at this dose experienced a DLT (rate= 18.75%) and the posterior distribution puts its mass at 33.5% on it. According to the posterior distribution, the two other most probable dose combinations are (1, 4) and (3, 3) which, respectively, account for 13.4% and 13.7% of the posterior mass. Note that semiparametric methods can be easily adjusted either to meet a specific criteria or to work under general conditions with uninformative priors. For readers interested in having a rapid overview, the analysis based on semiparametric methods for scenario T continues at the end of the article (Section 7).

The illustrated trial of Figure 1 shows classical behaviour for a model-based design, as we tend to converge to a single MTD (Bayesian model in Section 3). However, a more useful goal may be to explore the MTD curve in order to determine multiple safe dose combinations associated with potentially different probabilities of efficacy (Section 4).

3. Semiparametric methods and a single MTD target

We briefly recall the semiparametric dose finding method introduced in (Clertant and O’Quigley, 2017, SPM) and then indicate how we can exploit these same ideas to tackle the problem of partial ordering. We refer to this as poSPM. The way in which we achieve an extension of the simpler to the more complex situation is to build a model around the non-ordered dose combinations. This extension corresponds to a non-informative prior which is structured in such a way that no distinction can be made between two doses d₁ and d₂ on the basis of observations on a dose d₃ that is not ordered with respect to d₁ and $d_{2} : d_{3} \in 𝒞_{d_{1}}$ and $d_{3} \in 𝒞_{d_{2}}$ .

Consider the set S ⊂ [0, 1]^D, having elements $P = {(P_{d})}_{d \in D}$ . Then, P_d provides the probability of toxicity for scenario P at dose d. The set S is partitioned according to the different values (dose levels) that θ can assume.

S = \underset{θ \in D}{\cup} S_{θ}, where S_{θ} \subset {P \in S | \forall d \in D, | P_{θ} - α | \leq | P_{d} - α ∣} .

S breaks down into I × J classes indexed by θ, where every scenario from the same class has the same MTD. Any P belonging to S_θ associates the MTD at dose θ. For more discussion on this topic, we refer the reader to (Clertant and O’Quigley, 2017, section 4). The definition of S_θ used in practice is expressed via Assumption 3 below. For a fixed P ∈ S, the likelihood that we can associate with the history $(X_{1}^{n}, Y_{1}^{n}) = {(X_{k}, Y_{k})}_{1 \leq k \leq n}$ is:

ℒ (P, X_{1}^{n}, Y_{1}^{n}) = \prod_{k = 1}^{n} {(P_{X_{k}})}^{Y_{k}} {(1 - P_{X_{k}})}^{1 - Y_{k}} = \prod_{d \in D} P_{d}^{n_{d}^{1}} {(1 - P_{d})}^{n_{d}^{0}},

(3.1)

where $n_{d}^{1}$ represents the number of DLTs occurring at dose d, and $n_{d} = n_{d}^{1} + n_{d}^{0}$ the number of patients that are treated at dose d. Let Π be the prior for the parameter θ. Then Π is a probability measure on the discrete space D, i.e. a matrix of positive numbers with I rows and J columns summing up to 1. Each class S_θ is endowed with a prior noted Λ_θ = Λ(.|θ). The family of distributions (Λ_θ)_θ∈D and Π defines a prior on the whole set of scenarios. This prior can be updated sequentially as observations are made. The posterior Π_n on the parameter of the MTD, θ, is obtained by integration over the support S_θ of the scenarios associated with this dose.

Π_{n} (θ) = Π (θ ∣ X_{1}^{n}, Y_{1}^{n}) \propto \int_{S_{θ}} ℒ (P, X_{1}^{n}, Y_{1}^{n}) Λ_{θ} (d P) Π (θ) .

(3.2)

The family of distributions (Λ_θ)_θ∈D plays a predictive role and provides the basis that underlies the practical implementation of the adaptive process. We call this the prior model. In the sequential study, the next patient or cohort of patients is allocated to the dose having the highest probability of being the MTD according to the posterior Π_n. The MTD and the probability of toxicity at any dose d, P_d, are estimated sequentially by:

{\hat{θ}}_{n} = \underset{θ \in D}{argmax} Π_{n} (θ) and {\hat{P}}_{d}^{(n)} = E_{{(Λ \otimes Π)}_{n}} [P_{d}] = \sum_{θ \in D} [\int P_{d} Λ_{θ, n} (d P_{d})] Π_{n} (θ) .

(3.3)

The prior model is a family of I × J distributions on the set of scenarios. Moreover, each scenario can be viewed as a matrix with I rows and J columns having elements P_d, the toxicity probability at dose d. The following assumption expresses a simple structure for the prior Λ_θ. For this purpose the Bernoulli parameter space is broken down into three sets. The high and low toxicity probabilities belong to A = [α + ϵ, 1] and B = [0, α − ϵ], respectively. The probabilities of toxicity in the neighborhood of the target α lie in the centered Interval I = [α − ϵ, α + ϵ]. This interval results from our choice of and can be made arbitrarily small. Indeed it can be reduced to a single point, α by choosing ϵ equal to 0. From the point of view of the MTD parameter θ, the marginal $Λ_{θ}^{d}$ distributes its probability mass according to the relative position of the doses d. The prior Λ_θ weights only those scenarios in which the MTD is θ. Given θ, $Λ_{θ}^{d}$ is the prior of toxicity probability at dose d itself. The position of d relative to θ, summarized in the following assumption, indicates that: (i) if d = θ, as θ is the parameter indicating the MTD, the marginal weights are on the centered interval I, (ii) if d > θ (i.e. d ∈ 𝒜_θ), then $Λ_{θ}^{d}$ puts its mass on the highest probability interval A, (iii) if d < θ (i.e d ∈ ℬ_θ), then $Λ_{θ}^{d}$ puts its mass on the lowest probability interval B, (iv) if the dose d is non-ordered with θ (i.e. d ∈ 𝒞_θ), the marginal $Λ_{θ}^{d}$ is uninformative and the weights are on the whole interval [0,1].

Assumption 3. Λ_θ is a product of unidimensionnal distribution, i.e.

Λ_{θ} (d P) = \prod_{d \in D} Λ_{θ}^{d} (d P_{d}),

and the support $S_{θ}^{d}$ of the distribution $Λ_{θ}^{d} (d P_{d})$ satisfies: $d \in 𝒜_{θ} \Rightarrow S_{θ}^{d} = A$ ; $d \in ℬ_{θ} \Rightarrow S_{θ}^{d} = B$ ; $d \in 𝒞_{θ} \Rightarrow S_{θ}^{d} = [0, 1]$ ; $d = θ \Rightarrow S_{θ}^{d} = I$ .

The probabilities of toxicity in each set 𝒜_θ, ℬ_θ and 𝒞_θ are conditionally independent once we have fixed S_θ. Marginally, of course, they are not independent so that any independency assumption fails to be valid when considering the whole model {(Λ_θ)_θ∈D, Π}. This is intuitively clear when a stochastic order is chosen for the marginal ${(Λ_{θ}^{d})}_{θ \in D}$ , at a certain dose $r : \forall r, d < d^{'} \Rightarrow Λ_{d^{'}, n}^{r} ⪯ Λ_{d^{'}, n}^{r}$ . The coherence principle introduced by Cheung (2005) can be extended to the case of partial ordering.

Definition 1 (Partial ordering coherence). A method ℳ is coherent if the sequence of selected doses satisfies: (X_n, Y_n) = (d, 0) ⇒ X_n+1 ∈ D\ℬ_d and (X_n, Y_n) = (d, 1) ⇒ X_n+1 ∈ D \ 𝒜_d.

This definition matches the one of Cheung (2005) in the case of a single-agent trial, and can be seen to be a logical extension of this criteria in the case of partial ordering. Note that the stochastic ordering implies that the poSPM is coherent.

Proposition 1. Under Assumptions 3 and 6 (appendix), the poSPM is coherent.

Working with a simple product of the marginals allows us to choose conjugate priors for the likelihood, in particular Beta distributions of which the uniform is a special case.

4. Identifying the Maximum Tolerated Contour (MTC)

4.1. Semiparametric Model

In the setting of partially ordered doses, the semiparametric class of methods on the contour (poSPMc) takes its inspiration from the PIPE model (Mander and Sweeting, 2015) and a method for single agents (Clertant and O’Quigley, 2018).

The set of dose combinations D can be naturally partitioned into two subsets defined by the toxicity target α. The partition includes one set of dose combinations that are associated with toxicity probabilities below α : D⁻ = {d ∈ D : P_d < α}, and a set of dose combinations associated with toxicity probabilities above α : D⁺ = {d ∈ D : P_d > α}. We have: D = D⁻ ∪ D⁺. The maximum tolerated contour, also called the MTC curve, is a line separating the set D⁻ and D⁺ (see Figure 2). The goal of poSPMc is to allocate patients at acceptable doses around the contour in order to better study dose combinations close to the toxicity threshold. If these dose combinations are not ordered, then they do not share information in terms of the probability of efficacy. In general a contour c is a line which separates the set of dose combinations into two ordered sets of doses: one set of dose combinations above the contour, 𝒜_c, and another set of dose combination below the contour, ℬ_c. For any given contour c, we note that:

d \in 𝒜_{c} and d^{'} \in ℬ_{c} \Rightarrow d > d^{'} or d and d^{'} non-ordered

In Figure 2, we can describe any contour as a polygonal chain tracing out a path from (0.5, J + 0.5) to (I + 0.5, 0.5) with steps of size 1 along the abscissa and of size −1 on the ordinate axis. Only rightward and downward steps are permitted. As a result, for the range D containing I × J dose combinations, there exist $(\begin{matrix} I + J \\ I \end{matrix})$ possible contours c. Let C be the set of these contours and c^⋆ the maximum tolerated contour (MTC) defined by the true, unknown, toxicity probabilities at each dose combination. Estimating the MTC may appear to be a considerable challenge as the number of contours increases exponentially with the number of doses: 70 possible contours for a 4 × 4 grid and 962 for a 6 × 6 grid. Fortunately, each contour c is associated with a minimal set ℳ_c of dose combinations on which we need to have observations in order, with probability one, to confirm or disprove that this contour can be identified as the MTC. The dose combinations belonging to ℳ_c are the closest doses of the contour in our partially ordered set of doses D (see Figure 2). In the following definition, the minimum and maximum operators are applied to partially ordered sets.

Definition 2. The minimal set of dose combinations, ℳ_c, associated with the contour c is: ℳ_c = min 𝒜_c ∪ max ℬ_c.

For a grid with I doses for agent A₁ and J doses for agent A₂, the number of dose combinations belonging to a minimal set varies between 1 and 2 × min(I, J), except in the case I = J where the maximum is equal to 2I − 1. The average number of combinations inside a minimal set is equal to 2IJ/(I+J). If a minimal set contains only one dose combination, all of the toxicity probabilities (P_d)_d∈D are on the same side of the target, either below or above. If a minimal set contains only two doses, it corresponds to a vertical or a horizontal contour.

Let $ℳ_{c^{⋆}}$ be the minimal set associated with the MTC. Dose combinations in $ℳ_{c^{⋆}}$ are not necessarly the ones closest to α in term of toxicity probabilities. However, $ℳ_{c^{⋆}}$ satisfies the following properties which justify that a model for a dose combination study focuses on the contour or the associated minimal set.

Proposition 2. 1) For all dose combinations $d \notin ℳ_{c^{⋆}}$ , there exists a dose $d^{'} \in ℳ_{c^{⋆}}$ ordered with d such that: |P_d − α| > |P_d′ − α|.

2) The MTD belongs to the minimal set associated with the $M T C : d^{⋆} \in ℳ_{c^{⋆}}$ .

The proof is immediate by using Definition 2. The proposed model takes place in the setting of SPM; γ is the parameter of interest, the MTC itself. As in Section 3, S is a broad set of scenarios and P = (P_d)_d∈D is an element of S with P_d the toxicity probability of the scenario P at dose d. The set S is partitioned according to $γ : S = \cup_{γ \in C} S_{γ}$ , where S_γ = {P ∈ [0, 1]^D : d ∈ 𝒜_γ ⇒ P_d > α and d ∈ ℬ_γ ⇒ P_d < α}. S is partitioned into $(\begin{matrix} I + J \\ I \end{matrix})$ classes indexed by γ, where every scenario of the same class has the same MTC.

Let Π be the prior for the parameter γ, which means that Π is a probability measure on the discrete space C. Each class S_γ is endowed with a prior noted Λ_γ = Λ(.|γ), and the family (Λ_γ)_γ∈C is called the prior model. This family can be easily shaped by the following assumption and the use of truncated beta distributions for each marginal.

Assumption 4. Λ_γ is a product of one-dimensional distributions, i.e.

Λ_{γ} = \prod_{d \in D} Λ_{γ}^{d},

where $Λ_{γ}^{d}$ is a prior for P_d, the toxicity probability at dose d. The support $S_{γ}^{d}$ of the distribution $Λ_{γ}^{d}$ satisfies: $d \in 𝒜_{γ} \Rightarrow S_{γ}^{d} = [α, 1]$ ; $d \in ℬ_{γ} \Rightarrow S_{γ}^{d} = [0, α]$ .

In Figure 3, the distribution Λ_γ is illustrated when γ represents the two contours in Figure 2. The parameters of these two distributions are the ones used in the numerical experiments. Note that, in Figure 3 (d) marginal prior model Λ_(a) is steepest than Λ_(b); the dose (2,3) and (3,3) are in the minimal set for contour (a) but not for contour (b).

The family of distributions (Λ_γ)_γ∈C and Π define a prior on the whole set of scenarios S. The posterior Π_n on the parameter of the MTC given the data $(X_{1}^{n}, Y_{1}^{n})$ is:

Π_{n} (γ) \propto \int_{S_{γ}} ℒ (P, X_{1}^{n}, Y_{1}^{n}) Λ_{γ} (d P) Π (γ) .

(4.4)

The current estimate of the contour is.

{\hat{γ}}_{n} = \underset{γ \in D}{argmax} Π_{n} (γ) .

(4.5)

The algorithm for allocating the patient (or the next cohort of patients) during the trial is based on the following two steps: (i) the prior Π_n is updated and the most probable contour ${\hat{γ}}_{n}$ is selected; (ii) the dose where the next patient will be treated is selected by an allocation strategy in the most probable minimal set according to our posterior: $X_{n + 1} \in ℳ_{c^{⋆}}$ . Such an allocation strategy spreading the observations inside the estimated minimal set is developped in Section 4.2.

The PIPE method (Mander and Sweeting, 2015) might be seen as the first example using a semiparametric method on the contour. However, it has some strong limitations in term of calibration as it does not clearly specify priors on the MTC and the space family (S_γ)_γ∈C. For this method, the toxicity probability at each dose d is modelled by a weak beta prior with parameter a_d and b_d. For each contour, a posterior probability is obtained by multiplying the posterior of the interval [0, α] for every dose below the contour and the posterior of the interval [α, 1] for those doses above the contour. Let B_I(a, b) be the truncated beta distribution with parameter a and b on the interval I and B(x, a, b) be the incomplete beta function. In the setting of SPM, the PIPE model is defined by:

{\begin{array}{l} Λ_{c} & ~ \prod_{d \in ℬ_{c}} B_{[0, α]} (a_{d}, b_{d}) \times \prod_{d \in 𝒜_{c}} B_{[α, 1]} (a_{d}, b_{d}) \\ Π (c) & \propto \prod_{d \in ℬ_{c}} B (α; a_{d}, b_{d}) \times \prod_{d \in 𝒜_{c}} (1 - B (α; a_{d}, b_{d})) . \end{array}

Note that, for the PIPE method, the prior model (Λ_c)_c∈C and the prior on the contour Π are defined by the I × J prior of the toxicity probabilities at each dose combination. The indirect calibration of the prior model and the prior on the set of contours can undermine the performance of this method. In particular, the prior Π defined through the PIPE model is not chosen for allowing a progressive exploration of the range of doses but is dependent on the target toxicity probability α.

4.2. Allocation strategy

In a minimal set, many of the doses are not ordered and the question of allocation within this set needs to be addressed. This is not straightforward and we do not exhaust all possibilities here. Several allocation strategies can be made available and any particular choice will result in some particular trial behaviour. The allocation strategy introduced hereafter is a compromise between one that spreads experimentation ‘equally’ against one that tests those doses that will, in expectation, bring more information into the study. We also need restrict experimentation to an area of the minimal set considered to be safe. After n observations, the next dose X_n+1 is:

X_{n + 1} = min_{d \in ℳ_{{\hat{γ}}_{n}} \ T_{n}} \frac{1}{k_{d}} {(\sum_{j \in ℬ_{d} \cup {d}} n_{j}^{0}) \times H (α ∣ 0) + (\sum_{j \in 𝒜_{d} \cup {d}} n_{j}^{1}) \times H (α ∣ 1)},

(4.6)

where H(p|q) is the entropy of p relative to q : H(p|q) = −q × log(p) − (1 − q) log(1 − p), with the convention log(0) = −∞ and 0 × (−∞) = 0, and k_d = #𝒜_d+#ℬ_d+1. In Equation 4.6, the relative entropy, which corresponds to a quantity of information, is counter-balanced by k_d, the number of dose combinations ordered with d, in other words, the number of dose combinations on which we aim to learn something subsequent to an observation on d.

The next dose X_n+1 is selected among the safe dose combinations in the estimated minimal set: $ℳ_{{\hat{γ}}_{n}} \ T_{n}$ . the set of dose combinations on which we already have some evidence of overly toxicity is denoted T_n. It is estimated by using local Bayesian tests at each of the doses and the assumption of partial ordering. A dose d₀ is excluded from the study if:

d_{0} \in T_{n} = {d \in D : \exists d^{'} \in H_{n}, d \geq d^{'}},

with $H_{n} = {d \in D : ℙ_{𝒰} [P_{d} > θ_{T} ∣ (n_{d}, n_{d}^{1})] > δ_{T}}$ , and 𝒰 is the uniform prior on the parameter space [0, 1].

This allocation strategy spreads the observations along the estimated contour and tests slightly more often the doses in the middle of the grid. Since a DLT corresponds to a greater quantity of information, those doses with a smaller number of toxicities are tried more often (for α = 0.25 : H(α|1)/H(α|0) = log(α)/log(1 − α) ≈ 4.82). A longer description of this allocation strategy can be found in Section C

5. Large sample theory

Two distinct and complementary asymptotic behaviors are described here. The first kind of large sample behavior is that of ϵ-sensitivity, introduced by Cheung (2011). This corresponds to the almost sure convergence to a dose combination for which the true probability of toxicity lies inside the interval [α − ϵ; α + ϵ].

Definition 3. Let ε ≥ 0 and I = [α−ε; α+ε]. We consider the set ℰ(I, P) of the collection of dose combinations associated with a toxicity probability belonging to I, i.e. ℰ(I, P) = {d ∈ D : P_d ∈ I}. A method is ε-sensitive if for all scenarios P = (P_d)_d∈D such that ℰ(I, P) ≠ ∅, we have:

ℙ [\exists N, \forall n > N : X_{n} \in ℰ (I, P)] = 1.

The ε-sensitivity corresponds to strong consistency for a single dose associated with a toxicity rate close to the threshold α. This dose is not necessarily the MTD if there exist two or more doses in the interval I. The almost sure convergence to the MTD is obtained if the MTD is a unique dose in ℰ(I, P). Such an assumption on the scenario is necessary in the light of the impossibility theorem of Azriel (2011). The particularity of our semiparametric method for a single MTD (poSPM) is that the interval I is a parameter which can be directly calibrated. This requires us to choose a small interval I which could potentially fail to contain any of the toxicity probabilities associated with the given doses. For this reason, the complementary behavior of the ϵ-sensitivity introduced by Clertant and O’Quigley (2017) is extended to the case of partial ordering. We define as balanced behavior the almost sure convergence to a set of doses. More S precisely, a sequence ${(X_{n})}_{n \in ℕ}$ converges to a set B, denoted by $X_{n} \overset{S}{\to} B$ , when:

max_{x \in B} (lim_{n \to} inf_{+ \infty} δ (X_{n}, x)) = 0,

and where δ(., .) is Euclidean distance. In the case of a complete ordering, the sequence ${(X_{n})}_{n \in ℕ}$ converges to the set consisting of the two consecutive doses on either side of the target α. This set of two doses corresponds to the definition of the minimal set for the maximum tolerated contour (see definition 2): $ℳ_{c^{⋆}} = max D^{-} \cup min D^{+}$ .

Definition 4. Let D be a range of doses. A method is balanced, if for all scenarios, we have:

X_{n} \overset{S}{\to} ℳ_{c^{⋆}}, a . s .

The doses recommended infinitely often by the current estimator of a balanced method are all the doses in $M_{c^{⋆}}$ and only these doses. The following theorem completes in a natural way the impossibility theorem formulated by Azriel (2011).

Theorem 1. For any allocation method, the three following statements are equivalent :

For all scenarios, there exists a statistic F on the sample $(X_{1}^{n}, Y_{1}^{n})$ such that $F (X_{1}^{n}, Y_{1}^{n}) \to d^{⋆}, a . s .$
For all scenarios, there exists a statistic F on the sample $(X_{1}^{n}, Y_{1}^{n})$ such that $F (X_{1}^{n}, Y_{1}^{n}) \to c^{⋆}, a . s .$ .
For all scenarios, there exists a set of dose combinations D′ with $M_{c^{⋆}} \subset D^{'}$ such that the sequence of recommended doses satisfies:
$X_{n} \overset{S}{\to} D^{'}, a . s .$

In particular, this basic theorem expresses a necessary condition for any method: the minimal set $M_{c^{⋆}}$ is the smallest set of dose combinations on which we need to have observations in order to find almost surely the MTD and the MTC under general circumstances. Some assumptions are needed on our models in order to obtain the desired asymptotic behavior. The regularity assumptions (7, Appendix) on the prior model of the poSPM and the poSPMc are readily met. An other assumption concerns the allocation strategy.

Assumption 5. (a) The next dose combination X_n+1 is selected from $ℳ_{{\hat{γ}}_{n}}$ .

(b) All the doses in the limit supremum of ${(ℳ_{{\hat{γ}}_{n}})}_{n \in ℕ}$ are selected infinitely often:

\forall d \in \underset{n \geq 1}{\cap} \underset{k \geq n}{\cup} ℳ_{{\hat{γ}}_{k}}, {\sum_{i = 1}^{n} 1_{{X_{n} = d}} \underset{n \to \infty}{\to} \infty}, a . s .

Assumption 5 (b) means that if a contour is selected infinitely often by ${\hat{γ}}_{n}$ , all the doses in the minimal set associated to this contour would be explored infinitely often (a.s). This is a necessary conditions to obtain a balanced behavior (simple consequence of Theorem 1’s proof). This constructive assumption is satisfied by the allocation strategy defined in Section 4.2.

Theorem 2. (a) Suppose that P^T is the true scenario. Under assumptions 3 and 7, we have: if ϵ > 0 and ℰ(I, P^T) contains no couple of ordered doses then the poSPM is ε-sensitive; when ϵ = 0 then poSPM is balanced.

(b) Under assumptions 4, 5 and 7, poSPMc is balanced.

When ϵ = 0, the poSPM is balanced which means that the method explores all the dose combinations included in $ℳ_{c^{⋆}}$ when the sample size becomes large enough. However, this only describes asymptotic behavior and, at finite sample sizes, the simulations will often converge to a single dose identified as the MTD (see Figure 1). Conversely, the balanced behavior of poSPMc at a finite sample size can be observed in Figure 4.

Figure 4: — A simulated Trial under scenario T using poSPMc and the final posterior ${\tilde{Π}}_{60}$ .on the dose combinations. Red cross: Dose Limiting Toxicity; black circle: no toxicity response.

6. Numerical experiments: MTC setting

In the MTC setting, we compared the poSPM to the PIPE design in terms of each method’s ability to identify and treat patients along a target MTC. Each trial targets a toxicity level of α = 0.20, with a total sample size of 50 and 1 patient per cohort. The calibration of the two methods is fully described in appendix E.1 and E.3. Table 2 shows the operating characteristics of the 2 methods under the 4 scenarios shown in Table 5. As in the MTD setting, for each scenario, we report the percentage of patient allocation (experimentation %) and percentage of MTD selection (recommendation %) for doses contained within five different ranges of true toxicity probabilities. Since α = 0.20, the target interval containing the true MTD is [0.15, 0.25]. In Scenarios 1 and 4, the poSPM and PIPE method have very similar operating characteristics with regards to recommending and experimenting at combinations in the target interval. The difference in the methods in Scenarios 1 and 4 can be observed in the intervals outside the target range. In both scenarios, when outside the target interval, the poSPM allocates to combinations below the target range, whereas the PIPE method tends to be more aggressive. This has an impact on the accuracy index (see Equation D.10) for experimentation in Scenario 4, with poSPM yielding a larger value: 0.67 vs. 0.55. For both of these scenarios, PIPE induces a higher overall percentage of DLT’s, on average.

Table 2:

Experimentation and recommendation percentages for the poSPM and the PIPE design. Column containing the MTD in bold. Scenarios 1 to 4 correspond to scenarios A, C, E, G in Mander and Sweeting (2015). they can be found in Appendix A.

Scenario	Method	Experimentation %					Acc	%DLTs
Scenario	Method	[0, 0.10)	[0.10, 0.15)	[0.15, 0.25]	(0.25, 0.30]	(0.30, 1]	Acc	%DLTs
1	poSPM	9.0	25.8	56.9	7.7	0.4	0.35	16.9
	PIPE	2.8	7.5	59.4	26.1	4.2	0.39	22.0
2	poSPM	0.0	26.4	45.0	9.4	19.1	0.88	23.5
	PIPE	0.0	6.2	21.2	11.5	61.0	0.63	36.0
3	poSPM	12.2	27.0	36.1	22.8	1.9	0.38	18.0
	PIPE	0.0	6.6	21.8	11.5	60.2	0.33	35.9
4	poSPM	31.2	19.4	32.4	13.7	3.2	0.67	14.2
	PIPE	12.3	14.0	34.0	26.0	13.4	0.55	22.1
		Recommendation %					Acc	${\bar{N}}_{M T D}$
		[0, 0.10)	[0.10, 0.15)	[0.15, 0.25]	(0.25, 0.30]	(0.30, 1]	Acc	${\bar{N}}_{M T D}$
1	poSPM	2.0	19.9	71.6	6.3	0.0	0.63	3.73
	PIPE	0.3	5.2	70.3	23.7	0.4	0.64	2.23
2	poSPM	0.0	17.8	54.3	12.8	15.0	0.92	2.60
	PIPE	0.0	8.8	38.4	17.4	35.3	0.84	1.44
3	poSPM	3.8	29.1	36.6	29.5	0.9	0.44	3.18
	PIPE	0.0	9.8	37.7	17.5	35.0	0.28	1.44
4	poSPM	20.1	22.6	39.1	16.8	1.2	0.77	4.2
	PIPE	17.4	23.5	38.0	19.1	2.0	0.77	2.77

Open in a new tab

In Scenario 2, poSPM is the best performing method by all metrics. It demonstrates better performance from the viewpoints of experimentation (45.0% vs. 21.2%) and recommendation (54.3% vs. 38.4%) percentages in the target interval. PIPE again tends to be more agressive, allocating 61% of patients to combinations with toxicity probabilities in the interval (0.30, 1.00]. The improvement of poSPM over the PIPE is also reflected in the accuracy indices in Scenario 2. In Scenario 3, the methods have similar operating characteristics in terms of recommendation percentage in the target interval (36.6% vs. 37.7%), although the poSPM maintains an advantage when considering experimentation and accuracy. Similarly to Scenario 2, PIPE allocates 60.2% of patients to combinations with toxicity probabilities in the interval (0.30, 1.00]. The more aggressive behavior of PIPE is further indicated in the overall percentage of DLT’s observed, as this value is both higher than the corresponding value for poSPM, as well as larger than the target α = 0.20 in all scenarios. Considering the average performance over the four scenarios the poSPMc is the best method by all the metrics. In particular the poSPMc recommends a larger number of doses (3.43 doses vs. 1.97) yet maintaining greater accuracy than the PIPE method (0.69 of accuracy recommendation vs. 0.63).

7. CNL and vinblastine combination, a study of Scenario T

Three versions of our semiparametric class of methods are compared on Scenario T introduced in Table 1 in the context of CNL and vinblastine combination:

poSPMc is the semiparametric method targeting the MTD contour (or the associated minimal set); the calibration is the same as in Section 6,
poSPM1 is the semiparametric method targeting a single MTD, without including expert knowledge (uninformative prior); the calibration is the same as in Appendix D,
poSPM2 is the semiparametric method targeting a single MTD which is calibrated in line with expert knowledge as described in Section 2 (‘if possible, the investigators prefer to increase the level of vinblastine until 1.5 mg/m²’), the calibration is the same as in Appendix D except that the mass of the prior Π on doses (i, j) ∈ {2, …, 5} × {1, 2} is divided by two.

In Table 3, the three methods are compared on Scenario T. The target α is 20% and 60 patients are to be enrolled in the study. 10 000 trials are simulated for each method. Cheung’s accuracy metric is defined in Section D. The metric ${\bar{N}}_{M T D}$ is the average number of dose combinations recommended at the end of the trial: 4.31 for poSPMc (less than the number of doses in the minimal set of Scenario T), 1 for the other methods. Regarding the final recommendation percentages in interval [0.15, 0.25], poSPM1 obtains better performance than poSPM2 and poSPMc. For poSPM2, this is due to the steep toxicity profile around the MTDs in the middle of the grid rather than around the dose (1, 4), which is then more difficult to locate. The difference between poSPMc and poSPM1 in terms of allocation and recommendation percentage in interval [0.15, 0.25] is due both to the exploration behavior of poSPMc and the multiple recommendation at the end of the trial (It is more difficult to recommend 4 dose combinations in interval [0.15, 0.25] than 1). However, the percentage of dose combinations recommended by poSPMc which are inside the interval [0.15, 0.25] is still satisfactory (60.7%).

Table 3:

Experimentation and recommendation percentages for poSPMc, poSPM1 and poSPM2 on Scenario T. Column containing the MTD in bold.

Method	Experimentation %					Acc	%DLTs
Method	[0, 0.10)	[0.10, 0.15)	[0.15, 0.25]	(0.25, 0.30]	(0.30, 1]	Acc	%DLTs
poSPMc	27.5	8.8	47.3	8.9	7.4	0.52	18.1
poSPM1	14.7	19.4	55.7	8.7	18.7	0.48	22.2
poSPM2	10.7	20.1	42.9	14.1	11.9	0.48	21.6
	Recommendation %					Acc	${\bar{N}}_{M T D}$
	[0, 0.10)	[0.10, 0.15)	[0.15, 0.25]	(0.25, 0.30]	(0.30,1]	Acc	${\bar{N}}_{M T D}$
poSPMc	13.9	8.4	60.7	11.5	5.4	0.63	4.31
poSPM1	5.4	2.2	73.0	9.1	10.1	0.67	1
poSPM2	1.7	18.2	62.9	11.5	5.6	0.65	1

Open in a new tab

In order to show the benefits of targeting the MTD curve (poSPMc), four hypotheses concerning the efficacy profile of CNL and vinblastine are considered. The minimal set of Scenario T contains dose combinations (1,4), (2,3), (3,2), (3,3) and (4,1). Note that the true MTD always belongs to the minimal set; for Scenario T, this is (4, 1). The hypotheses lean on the set of doses with sufficient efficacy probabilities in the minimal set. In the four hypotheses, the only dose combinations which are sufficiently efficacious and not associated with overly toxicity probabilities belong to set H₁ = {(2, 3), (3, 3)}, H₂ = {(3, 2), (3, 3)}, H₃ = {(4, 1)} and H₄ = {(1, 4)}, respectively. These four hypotheses are coherent with the partial order applied to the efficacy probabilities. They represent 4 possible profiles of efficacy. Note that a dose combination of CNL and Vinblastine is considered to have a sufficient efficacy profile if its efficacy rate exceeds that associated with competitive therapies. The aim of the Phase I design is to then recommend such a dose combination for further exploration in Phase II. In Table 4, the percentage of trials (among the 10 000) which recommend a dose in the set H1, H2, H3 and H4 are detailed for each of the three methods. The advantage of the method targeting the MTD curve over methods aiming for a single MTD is a significant one. For poSPMc, the percentage of trials in which at least one dose with a sufficient efficacy profile is recommended is between 60 and 75% for all the hypotheses. The same metric goes from 2 to 45% for poSPM1 and poSPM2. This does not mean that the two methods are not accurate, but they are not equipped to deal with this situation.

Table 4:

Experimentation and percentage of trials recommending at least a dose in sets H₁ = {(2, 3), (3, 3)}, H₂ = {(3, 2),(3, 3)}, H₃ = {(4, 1)} and H₄ = {(1, 4)} for poSPMc, poSPM1 and poSPM2. Each of this set may be the only one containing dose combitions sufficiently efficacious and not associated with overly toxicity probabilities.

		Experimentation %
Scenario T	Method	H ₁	H ₂	H ₃	H ₄
	poSPMc	18.8	16.7	10.6	8.0
	poSPM1	32.6	37.2	4.3	2.9
	poSPM2	12.6	24.5	0.1	17.8
		% of trials Recommendation
Scenario T	Method	H ₁	H ₂	H ₃	H ₄
	poSPMc	65.2	74.3	64.8	62.8
	poSPM1	44.9	43.6	6.5	3.75
	poSPM2	21.3	41.3	0.2	20.2

Open in a new tab

Balanced behavior of poSPMc is outlined via a trial presented in Figure 4. At the beginning of the trial, the method explores each of the dose combinations of the progressive inverted diagonals (e.g. (1,2) and (2,1)) until a first DLT is observed. Following the twentieth patient, almost all the recommended dose combinations are in the minimal set of Scenario T.

Note that poSPMc updates a distribution Π on the possible maximum tolerated contour. The method does not immediately lead to a posterior distribution on the MTD. However, as the model (Λ, Π) is a distribution on the range of scenarios, one can still calculate a distribution $\tilde{Π}$ on the MTD parameter by integrating out the toxicity probabilities conditional upon each possible value that could be assumed by the MTD. Instead of doing this complex calculation, we can use the following formula which holds for all n :

\forall c \in C, \forall d \in D, {\tilde{Π}}_{n} \propto \frac{1}{\bar{𝒱_{d}}} \sum_{c \in 𝒱_{d}} Π_{n} (c),

(7.7)

where 𝒱_d is a neighborhood in terms of contours for dose combination d : 𝒱_d = {c ∈ C: d ∈ ℳ_c}, and $\bar{𝒱_{d}}$ its cardinality. The following proposition shows that ${\tilde{Π}}_{n}$ is the desired result by considering a poSPM model $(\tilde{Λ}, \tilde{Π})$ for which the updating process is conjugate with the one of the poSPMc model.

Proposition 3. For all $n \in ℕ$ , the probability ${\tilde{Π}}_{n}$ defined by Equation 7.7 is the posterior on the set of dose combinations from the poSPM model $(\tilde{Λ}, \tilde{Π})$ where:

\tilde{Π} (d) \propto \frac{1}{{\bar{𝒱}}_{d}} \sum_{c \in 𝒱_{d}} Π (c) and {\tilde{Λ}}_{d} = \sum_{c \in 𝒱_{d}} r_{c}^{d} Λ_{c} with r_{c}^{d} = \frac{Π (c)}{\sum_{c \in 𝒱_{d}} Π (c)} .

(7.8)

Thus, there always exists a model $(\tilde{Λ}, \tilde{Π})$ of poSPM such that, for all $n \in ℕ$ , the distributions ${\tilde{Π}}_{n}$ can be seen to correspond to the posterior of $\tilde{Π}$ . This conjugacy property associates, with every poSPMc model on the MTD curve, a poSPM model for a single MTD. However, this association is not trivial and appealing to an allocation strategy for poSPMc allows the design to explore the full MTD curve. At the end of a trial with poSPMc, the final recommendation can rely on the posterior ${\tilde{Π}}_{n}$ . In Figure 4, more than 60% of the posterior mass of $\tilde{Π}$ is on the minimal set of Scenario T.

Ackowledgements

Dr. Clertant is supported by Investissements d’Avenir programme ANR-11-IDEX-0005-02 and 10-LABX-0017, Laboratoire dexcellence INFLAMEX. Dr. Wages is supported by National Institutes of Health Award R01CA247932. Additional philanthropic support was provided by Elizabeth Cronly to the Patients & Friends Research Fund of University of Virginia Cancer Center. Thanks to Dr. Roxane Duroux for her participation in the early phase of this project.

Appendix

A. Scenarios for the MTC setting

Table 5:

Toxicity scenarios 1–4 for the two-drug combinations. Dose combinations in the minimal set are in bold. Doses associated to a toxicity probability equal to 0.20 belong to D⁻ and D⁺ (see Section 4); they are always in the minimal set.

	Dose	Drug A2
	Level	1	2	3	4	1	2	3	4
	Scenario 1					Scenario 2
	4	22	26	30	34	55	65	75	85
	3	16	20	24	28	40	50	60	70
	2	10	14	18	22	25	35	45	55
	1	4	8	12	16	10	20	30	40
Drug
A1	Scenario 3					Scenario 4
	4	11	21	31	41	10	30	50	80
	3	10	20	30	31	6	15	30	45
	2	9	19	29	30	4	10	15	20
	1	8	18	28	29	1	2	3	4

Open in a new tab

B. Notations

Table 6:

Important notations

Notation	Description
d or X	a dose combination
Y	response in term of toxicity
D	set of dose combinations
I and J	number of doses for agent 1 and 2
𝒜_d, ℬ_d and 𝒞_d	set of doses above, below and non-ordered with d
P	a scenario, i.e a I × J matrix of bernoulli parameters
c and ℳ_c	a contour and a minimal set associated to this contour
𝒜_c and ℬ_c	set of dose combinations above and below c
C	set of possible contour in the range of dose combinations
d*	true Maximum Tolerated Dose (MTD)
c*	true Maximum Tolerated Contour (MTC)
Π	distribution on the MTD or the MTC parameter
θ	MTD parameter
γ	MTC parameter
Λ = (Λ_θ)_θ∈D	distributions on the scenarios conditioned by the MTD parameter
Λ = (Λ_γ)_γ∈C	distributions on the scenarios conditioned by the MTC parameter
S, S_θ, S_γ	support of Λ, Λ_θ and Λ_γ
$Λ_{θ}^{d}$ (or $Λ_{γ}^{d}$ )	marginal at dose combination d

Open in a new tab

C. Allocation strategy

In a minimal set, many of the doses are not ordered and the question of allocation within this set needs to be addressed. This is not straightforward and we do not exhaust all possibilities here. Several allocation strategies can be made available and any particular choice will result in some particular trial behaviour. The allocation strategy introduced hereafter is a compromise between one that spreads experimentation ‘equally’ against one that tests those doses that will, in expectation, bring more information into the study. We also need for experimentation to be restricted to an area of the minimal set considered to be safe.

While allocating patients within the estimated minimal set, we want to do our best to avoid the selection of dose combinations which are already associated with having an unacceptably high toxicity probability. We can address this issue by making use of a Bayesian test based on a uniform prior 𝒰 on the space of Bernoulli parameters. The set H_n describes those dose combinations indicated as being too toxic following these local Bayesian tests at each of the doses.

H_{n} = {d \in D : ℙ_{𝒰} [P_{d} > θ_{T} ∣ (n_{d}, n_{d}^{1})] > δ_{T}} .

We are then in a position to use the partial ordering in D to extend these exclusions to other doses that are unfavourably ordered with the doses belonging to this set, e.g. a dose d′ in 𝒜_d with d belonging to H_n.

Exclusion rule:

A dose d₀ is excluded from the study if:

d_{0} \in T_{n} = {d \in D : \exists d^{'} \in H_{n}, d \geq d^{'}},

where T_n is the set of dose combinations considered as overly toxic. A dose combination of the estimated minimal set $ℳ_{{\hat{γ}}_{n}}$ will not be tested if it belongs to T_n. Note that this exclusion rule corresponds also to a stopping rule when the evidence of unacceptably high toxicity is associated with the lowest dose. This rule is an extension in the context of partial ordering to rules introduced by Ji et al (2010).

The approach based on a compromise between spreading the observations along the contour as opposed to making observations on those doses that we estimate would bring more information into the study leans on two quantities: (i) the number of doses which will potentially benefit from the next observation through the partial ordering structure and (ii) the amount of information already collected. Each dose d, can be associated with a value k_d corresponding to the number of dose combinations ordered with d plus one: k_d = # 𝒜_d + # ℬ_d + 1, where the cardinality of a set E is noted $\bar{E}$ . The value k_d corresponds to the number of dose combinations on which we aim to learn something subsequent to an observation on d. Observing a DLT at dose d implies that the patient would have experienced a DLT for all the doses in 𝒜_d, and conversely, all the doses in ℬ_d would have been safe for a patient who does not experience a DLT at dose d. This structure is naturally taken into account by our model. When the objective is to optimize the quantity of information obtained during the trial, those doses associated with the highest value k_d are, in expectation, those that provide the most information. Note that these doses are located along the diagonal going from the combination (1, 1) to (I, J). The notion of entropy is our second tool. It corresponds to a quantity of information per unit of data, in our case the observation related to a single patient. As the goal of the study is to determine doses that are close to α in term of toxicity probability, we will use the concept of relative entropy between each observation and the target α. If p and q denotes two Bernoulli distributions and their parameters, the entropy of p relative to q is: H(p|q) = −q × log(p)−(1−q)log(1−p), with the convention log(0) = −∞ and 0×(−∞) = 0. Thus, n×H(α|1) is the quantity of information relative to the target alpha which is brought by n DLT (H(α|0) for the non toxic observations). It corresponds to the log-likelihood in α. After n observations, the next dose X_n+1 is:

X_{n + 1} = min_{d \in ℳ_{{\hat{γ}}_{n}} ∖ T_{n}} \frac{1}{k_{d}} {(\sum_{j \in ℬ_{d} \cup {d}} n_{j}^{0}) \times H (α ∣ 0) + (\sum_{j \in 𝒜_{d} \cup {d}} n_{j}^{1}) \times H (α ∣ 1)} .

(C.9)

This allocation strategy spreads the observations along the estimated contour and tests slightly more often the doses in the middle of the grid. Since a DLT corresponds to a greater quantity of information those doses with a smaller number of toxicities are tried more often (for α = 0.25: H(α|1)/H(α|0) = log(α)/log(1 − α) ≈ 4.82).

D. Numerical experiments: MTD setting

We evaluate the operating characteristics of poSPM and compared them to the 2d-BOIN method (Lin et al., 2016) and the poCRM (Wages, Conaway and O’Quigley, 2011). Performance evaluation was based on several metrics under 4 toxicity scenarios. Our goal is to evaluate: (1) how well each method provides a recommendation of sets of doses at and around the target rate (i.e. acceptable MTD’s), and (2) how well each method allocates patients to acceptable MTD’s. While traditional evaluation measures, such as the percentage of recommendation and allocation to the true MTD’s are useful in assessing performance, it is also beneficial to consider the entire distribution of selected dose combination, as it provides more detailed information as to what combinations are being recommended. For evaluating recommendation, Cheung (2011) proposes to use the accuracy index so that, after n patients, he defines,

A_{n} = 1 - I \times J \times \frac{\sum_{i = 1}^{I} \sum_{j = 1}^{J} {(P_{d} - α)}^{2} \times ρ_{d}}{\sum_{i = 1}^{I} \sum_{j = 1}^{J} {(P_{d} - α)}^{2}},

(D.10)

where P_d is the true toxicity probability at dose combination d = (i, j), ρ_d is the percentage of trials in which combination d = (i, j) was selected as the MTD, and n is the total sample size. For experimentation, the same formula can be used with ρ_d representing the percentage of patients treated at combination d = (i, j). The maximum value of A_n is 1 with larger values (close to 1) indicating that the method has high accuracy. For each method, we simulated 10,000 trials under the 4 different sets of assumed DLT probabilities in a 6×6 grid of combinations with varying positions and number of true MTD’s, as shown in Table 7. The target toxicity rate is α = 0.25 and the total sample size for each simulated study is 40 patients. For each method, a cohort of size 1 is used. The calibration of the three methods is fully described in Appendix E.1 and E.2.

Table 8 shows the operating characteristics of the 3 methods under 10 000 trials for the 4 scenarios considered. For each scenario, we report the percentage of patient allocation (experimentation %) and percentage of MTD selection (recommendation %) for doses contained within five different ranges of the true toxicity probabilities. Since α = 0.25, the target interval containing the true MTD is [0.20, 0.30]. In Scenarios 1 and 2, the poSPM and poCRM have very comparable performance, with each method outperforming 2d-BOIN in terms of experimentation %, recommendation %, as well as accuracy of experimentation and recommendation. For both of these scenarios, 2d-BOIN induces a higher percentage of DLT’s, on average. In Scenario 3, the 2d-BOIN exhibits the best performance when considering experimentation and recommendation percentages in the target interval. The accuracy of recommendation of the poSPM is very close to that of the 2d-BOIN due to the higher recommendation percentage for 2d-BOIN on the toxicity probability interval [0, 0.10). In the final scenario, poSPM is the best performing method according to all metrics.

Table 7:

True toxicity probabilities for the four scenarios of a 6 × 6 grid, with maximum tolerated doses shown in bold

	Dose	Drug A1
	Level	1	2	3	4	5	6	1	2	3	4	5	6
	Scenario 1							Scenario 2
	6	20	29	31	43	47	50	37	45	51	54	55	58
	5	18	20	29	34	41	48	30	38	41	43	46	47
	4	16	19	21	32	36	42	23	25	35	36	40	42
	3	10	15	20	25	30	37	19	20	26	33	35	39
	2	3	9	16	19	21	32	15	17	21	24	31	33
	1	2	5	10	17	21	30	5	10	16	20	25	28
Drug
A2	Scenario 3							Scenario 4
	6	11	13	15	17	25	33	65	70	76	80	84	90
	5	9	11	13	15	16	25	55	63	69	77	80	85
	4	7	9	11	14	15	17	45	56	60	72	76	79
	3	5	8	10	11	13	15	35	42	52	65	70	73
	2	3	6	7	9	12	13	25	34	46	54	60	64
	1	1	3	5	7	9	11	15	25	36	43	49	55

Open in a new tab

We anticipate that there will always be certain scenarios in which some methods perform better than others. A useful tool in this regard for comparing dose-finding designs can be average performance over a range of scenarios. Across the 4 scenarios, the poSPM method, the 2d-BOIN design and poCRM methods demonstrated averages of 46.7%, 42.5%, and 43.3% recommendation percentages for combinations in the true target interval [0.20, 0.30], respectively. The overall percentage of observed toxicities of the poSPM method, the 2d-BOIN design, and poCRM methods were on average; 24.1%, 27.3%, and 23.1%, respectively. It is desirable for the value to be as close as possible to the target rate α. This is achieved by the poSPM. The average percentage of patients allocated to a dose combination in the target interval of the poSPM method, the 2d-BOIN design, and poCRM methods were 34.13%, 30.75%, and 31.75%, respectively. Based on the accuracy index for recommendation, the poSPM yielded an average value of 0.77, the 2d-BOIN method produced an average value of 0.73, and the poCRM design resulted in an average value of 0.71. For accuracy of experimentation, the average index values over the four scenarios were 0.54, 0.48, and 0.52 for the poSPM, 2d-BOIN, and poCRM, respectively.

Table 8:

Experimentation and recommendation percentages for the poSPM, the BOIN design and the poCRM in four scenarios of a 6 × 6 grid of true toxicity probabilities. Column containing the MTD in bold.

Scenario	Method	Experimentation %					A ₄₀	%DLTs
Scenario	Method	[0, 0.10)	[0.10, 0.20)	[0.20, 0.30]	(0.30, 0.40]	(0.40, 1]	A ₄₀	%DLTs
1	poSPM	11.1	18.7	42.3	22.3	6.6	0.35	23.9
	BOIN	8.6	16.2	32.0	23.9	19.3	0.34	27.5
	poCRM	11.0	26.0	43.0	17.0	4.0	0.37	22.2
2	poSPM	4.6	18.5	44.0	26.6	6.4	0.54	25.8
	BOIN	4.3	15.3	40.0	31.2	18.2	0.26	29.3
	poCRM	4.0	22.0	47.0	24.0	4.0	0.64	25.1
3	poSPM	14.6	52.6	17.7	15.1	0.0	0.38	17.5
	BOIN	14.3	38.7	25.1	21.8	0.0	0.48	20.0
	poCRM	25.0	57.0	10.0	9.0	0.0	0.20	14.6
4	poSPM	0.0	24.2	32.5	25.5	17.8	0.90	29.2
	BOIN	0.0	18.4	25.9	24.4	28.9	0.83	32.3
	poCRM	0.0	18.0	27.0	29.0	26.0	0.86	32.3
		Recommendation %					A ₄₀
		[0, 0.10)	[0.10, 0.20)	[0.20, 0.30]	(0.30, 0.40]	(0.40, 1]	A ₄₀
1	poSPM	0.4	16.0	54.2	25.1	4.3	0.68
	BOIN	1.2	16.9	46.6	27.4	7.45	0.65
	poCRM	1.0	24.0	56.0	16.0	3.0	0.68
2	poSPM	0.1	10.1	56.5	29.6	3.7	0.74
	BOIN	0.1	13.69	44.5	33.5	8.1	0.63
	poCRM	0.0	15.0	59.0	25.0	3.0	0.76
3	poSPM	0.6	47.0	30.6	21.8	0.0	0.71
	BOIN	2.5	44.3	37.4	15.8	0.0	0.72
	poCRM	12.0	63.0	18.0	8.0	0.0	0.46
4	poSPM	0.0	17.5	45.5	28.7	8.2	0.95
	BOIN	0.0	11.5	41.4	30.7	13.3	0.93
	poCRM	0.0	12.0	40.0	35.0	11.0	0.94

Open in a new tab

E. Calibration

E.1. poCRM, 2d-BOIN and PIPE methods

For poCRM, we utilized six possible orderings in all scenarios, arranging the combinations across rows, up columns, and up or down any diagonal as suggested by Wages and Conaway (2013). A uniform prior was placed on the orderings. The skeleton values for poCRM were generated according to the algorithm of Lee and Cheung (2009) using the getprior function in R package dfcrm. Specifically, we used getprior(0.035,0.25,12,36), and all simulation results were generated using the functions of the R package poCRM.

For the BOIN method, we used the default cutoff values α₁ = 0.6α and α₂ = 1.4α and a Beta(0.5, 0.5) prior for the toxicity probability at each combination. The boundaries for the optimal interval for the BOIN method are (0.197, 0.298).

The prior distributions for PIPE are set to be the true DLT probabilities taken from Scenario 1 in Table 5. PIPE uses a neighbourhood dose-skipping constraint, as well as the closest doses chosen from the admissible dose set. The dose escalation algorithm employs the smallest sample size strategy, and a weak prior distribution (1/16) is specified.

E.2. poSPM

The marginals of the prior model are beta distributions truncated according to their support (Assumption 3). Let B_I(a, b) be the truncated beta distribution with parameters a and b and let B_I(m, T) be an alternative parametrization of this distribution such that: B_I(m, T) = B(m × T + 1,(1 − m) × T + 1). Then, m is the mode of the beta distribution and T is a dispersion parameter.

The family of I × J matrix (m_θ)_θ∈D is associated with the prior model (Λ_θ)_θ∈D. The element of the matrix m_θ in position d, noted $m_{θ}^{d}$ , is the mode of the marginal $Λ_{θ}^{d}$ . The value T₁ and T₂ are the dispersion parameters used on the different marginals. In the Bayesian setting, they can be equated to a number of pseudo observations providing toxicities and non-toxicities. The rank of a dose combination θ = (i, j), noted $\bar{θ}$ , is equal to i+j. All the doses on the same diagonal have the same rank. Let r₁ and r₂ be two positive values used for the calibration of the prior Π. The poSPM model used in the simulations can be summarized by:

{\begin{array}{l} Λ_{θ} & ~ \prod_{d \in ℬ_{θ}} B_{B} (m_{θ}^{d}, T_{1}) \times \prod_{d \in 𝒜_{θ}} B_{A} (m_{θ}^{d}, T_{1}) \times \prod_{d \in 𝒞_{θ}} B_{[0, 1]} (m_{θ}^{d}, T_{2}) \times B_{I} (m_{θ}^{θ}, T_{1}) \\ Π (θ) & \propto r_{1}^{\bar{θ} - 2} \times r_{2}^{\bar{θ} - 3}, \end{array}

We set: $𝒜_{θ}^{'} = {d \in 𝒜 : \bar{θ} + 1 < \bar{d}}$ and $ℬ_{θ}^{'} = {d \in ℬ : \bar{θ} > \bar{d} + 1}$ . The parameters of the prior model are: $m_{θ}^{d} = α (1 + {0.4 1}_{𝒜_{θ}} (d) + {0.2 1}_{𝒜_{θ}^{'}} (d) - {0.4 1}_{ℬ_{θ}} (d) - {0.2 1}_{ℬ_{θ}^{'}} (d))$ , T₁ = 40 and T₂ = 10. The parameters r₁ and r₂ are chosen to produce a progressive allocation in the range of doses. The goal is to obtain a prior Π that is non-informative with the purpose of compensating the weight given to the highest dose combinations by the first non-toxic observations. For α = 0.25, we choose: r₁ = 0.942724 and r₂ = 0.95566. In order to explore the diagonal going from (1, 1) to (I, J) before a DLT is observed, a very small weight (10⁻⁵) is added to these doses. Thus, the poSPM follows the sequence of dose combinations (1, 1), (1, 2), (2, 2), (2, 3), (3, 3), … until a first DLT is observed. After N observations, the final recommendation is made by using our current estimator ${\hat{θ}}_{N}$ , which corresponds to the most probable MTD according to the posterior Π_N.

E.3. poSPMc

We use the same notation as for the poSPM calibration. The poSPMc model fulfills Assumption 4. The allocation strategy defined by Equation (4.6) is used. The family of I × J matrix (m_γ)_γ∈C is associated with the prior model (Λ_γ)_γ∈C. The rank of a contour γ, noted $\bar{γ}$ , is the number of dose combination in ℬ_γ. T is the dispersion parameter of the truncated beta distributions, r₁ and r₂ are two positive values used for the calibration of the prior Π.

{\begin{array}{l} Λ_{γ} & ~ \prod_{d \in ℬ_{γ}} B_{[0, α]} (m_{γ}^{d}, T) \times \prod_{d \in 𝒜_{γ}} B_{[α, 1]} (m_{γ}^{d}, T_{1}), \\ Π (γ) & \propto r_{1}^{\bar{γ} - 2} \times r_{2}^{\bar{γ} - 3}, \end{array}

The parameters of the prior model are: $m_{γ}^{d} = α (1 + {0.5 1}_{𝒜_{γ}} (d) + {0.25 1}_{𝒜_{γ} ∖ ℳ_{γ}} (d) - {0.4 1}_{ℬ_{γ}} (d) - {0.2 1}_{ℬ_{γ} ∖ ℳ_{γ}} (d))$ and T = 25. The parameters r₁ and r₂ are chosen to produce a progressive allocation in the range of dose combinations. The goal is to obtain a prior Π that is non informative with the idea of compensating the weight given to the highest contour in term of rank by the the first non-toxic observations. For α = 0.2, we choose r₁ = 0.8739592 and r₂ = 0.9749345 and for α = 0.3, we choose r₁ = 0.8117365 and r₂ = 0.950334. For the exclusion rules (End of section 4.2), we choose to exclude a dose when the probability of our posterior on the upper interval attains 95%: δ = 0.95. An ϵ = 10⁻⁵ is added to the part multiplying 1/k_d such that the value k_d drives the allocation between dose combinations for which we do not yet posses any observations. After N observations, we recommend all the dose combinations in the estimated minimal set $ℳ_{{\hat{γ}}_{N}}$ , except the doses for which we have less than 2 observations and the doses in $H_{n}^{'}$ which are considered as toxic:

H_{n}^{'} = {d \in D : ℙ_{𝒰} [P_{d} > α + 0.05 ∣ (n_{d}, n_{d}^{1})] > 0.9} .

F. Computation of minimal set

In the two dimensional case, I and J are the number of doses for each of the agents. The set of contours could be described as all the polygonal chains tracing out a path from (0.5, J + 0.5) to (I + 0.5, 0.5) with steps of size 1 along the abscissae and of size −1 on the ordinate axis. Only rightward and downward steps are permitted. This set is also equivalent to all the combinations of I among I + J which could be generated with the R function combn of package combinat. From there, it is easy to obtain matrices indicating which dose combinations are below or above the contour c that correponds to the sets 𝒜_c and ℬ_c.

Computing the minimal set is equivalent to computing min 𝒜_c and max ℬ_c. For the general problem “computing the maximum of a partially ordered set” the greatest calculation burden is 𝒪(n²): in a set where no element are a priori ordered, all the elements have to be compared to solve the problem. In the case of two drugs, the structure of $ℕ^{2}$ leads to a time complexity 𝒪(n). Let ${mat}_{ℬ_{c}}$ be a I × J matrix of 0 and 1 indicating the dose combinations in ℬ_c. From row N to 1:

Select the last element equal to 1 of the row in ${mat}_{ℬ_{c}}$ , if it exists. Otherwise, go to next row.
Compare the selected element to the previous one. Keep it, if they are notordered. Otherwise, delete it.

Finally, the selected elements are the maximum set of ℬ_c.

Note that in order to save time, all the arrays containing the minimal set associated to each contour can be saved for use in simulations. If we do so, the 𝒪(n²) solution (exhaustive comparison two by two) could be used as it is only applied one time.

G. Proofs of two properties and a general theorem

G.1. Coherence

When there is no confusion, we write Λ_θ(dP_r) for $Λ_{θ}^{r} (d P_{r})$ , with r ∈ D We then state a stochastic partial ordering assumption on the prior-model.

Assumption 6. Let d and d^′ be two doses such that d < d′. For all marginal r ∈ D, the posterior $Λ_{d, n}^{r}$ is stochastically greater than $Λ_{d^{'}, n}^{r}$ :

Λ_{d, n}^{r} ([0, x]) \leq Λ_{d^{'}, n}^{r} ([0, x]) \forall x \in [0, 1] .

This assumption is satisfied by the calibration of poSPM presented in Section E.2.

Proposition 1. Under Assumptions 3 and 6 (appendix), the poSPM is coherent.

Proof. Suppose that Y_n+1 = 1. The case Y_n+1 = 0 can be solved in the same way. By construction, we have

Π_{n + 1} (θ) \propto [\int P Λ_{θ, n} (d P)] Π_{n} (θ) .

Furthermore,

\int P_{r} Λ_{θ, n} (d P) = \int [\int 1_{{0 \leq x \leq P_{r}}} μ (d x)] Λ_{θ, n} (d P_{r}) = \int Λ_{θ, n}^{r} (] x, 1]) μ (d x) .

If ${\hat{θ}}_{n} = r$ , then for all θ ∈ D, Π_n(θ) ≤ Π_n(r). Let t > r. According to Assumption 6, we know that $Λ_{r, n}^{r} (.)$ is stochastically greater than $Λ_{t, n}^{t} (.)$ , i.e.

\int Λ_{r, n}^{r} (] x, 1]) μ (d x) \geq \int Λ_{t, n}^{r} (] x, 1]) μ (d x),

that is

\int P_{r} Λ_{r, n} (d P) \geq \int P_{r} Λ_{t, n} (d P) .

Finally Π_n+1(r) ≥ Π_n+1(t), which ends the proof. □

G.2. Conjugacy property

\tilde{Π} (d) \propto \frac{1}{\bar{𝒱_{d}}} \sum_{c \in 𝒱_{d}} Π (c) a n d {\tilde{Λ}}_{d} = \sum_{c \in 𝒱_{d}} r_{c}^{d} Λ_{c} with r_{c}^{d} = \frac{Π (c)}{\sum_{c \in 𝒱_{d}} Π (c)} .

Proof. We have the following proportionnality relations:

{\tilde{Π}}_{n} (d) \propto \frac{1}{\bar{𝒱_{d}}} \sum_{c \in 𝒱_{d}} Π_{n} (c) \propto \frac{1}{\bar{𝒱_{d}}} \sum_{c \in 𝒱_{d}} [\int L_{n} (P) Λ_{c} (d P)] \times \frac{Π (c)}{\sum_{c \in 𝒱_{d}} Π (c)} \times \sum_{c \in 𝒱_{d}} Π (c) \propto [\int L_{n} (P) \sum_{c \in 𝒱_{d}} r_{c}^{d} \times Λ_{c} (d P)] \times (\frac{1}{{\bar{𝒱}}_{d}} \sum_{c \in 𝒱_{d}} Π (c)) \propto [\int L_{n} (P) {\tilde{Λ}}_{d} (d P)] \times \tilde{Π} (d) .

□

G.3. Proof of Theorem 1

Proof. The proof is made by showing that the two first statements are equivalent to the third one. As the proofs of these two equivalences are very similar, we focus on: (i) ⇔ (iii). The implication (i) ⇐ (iii) is immediate by using the Law of Large Number for the frequentist estimators $n_{d}^{1} / n_{d}$ at each dose of the minimal set. We show (i) ⇒ (iii) by using a reductio ad absurdum argument.

T is a scenario whose the maximum tolerated dose is MTD_T and the maximum tolerated contour is MTC_T. We choose this scenario T and a dose d₀ ∈ MTC_T such that there exists a set of sequences of our sample, $A = {a = (x_{1}^{\infty}, y_{1}^{\infty})}$ satisfying: (1) $\forall a = (x_{1}^{\infty}, y_{1}^{\infty}) \in A$ , $F (x_{1}^{n}, y_{1}^{n}) \to M T D_{T}$ , (2) $ℙ_{T} (A) > 0$ , (3) $\forall a \in A, d_{0} \notin \cap_{N = 1}^{\infty} \cup_{n = N}^{\infty} {x_{n}}$ . This is possible as the absence of (3) for any dose contradicts the negation of (iii) and the absence of (1) and (2) together contradicts (i). The point (3) implies that there exists $N_{0} \in ℕ$ such that:

ℙ_{T} (B = {s \in A : ∄ n > N_{0}, x_{n} = d_{0}}) > 0.

We introduce the two following sets: $B^{'} = {ω \in Ω : \exists a = (x_{1}^{\infty}, y_{1}^{\infty}) \in A$ , $(X_{N_{0}}^{\infty}, Y_{N_{0}}^{\infty}) (ω) = (x_{N_{0}}^{\infty}, x_{N_{0}}^{\infty})}$ and $B^{″} = {ω \in Ω : \exists a = (x_{1}^{\infty}, y_{1}^{\infty}) \in A$ , $(X_{1}^{N_{0}}, Y_{1}^{N_{0}}) (ω) = (x_{1}^{N_{0}}, x_{1}^{N_{0}})}$ . We have: $ℙ_{T} (B) = ℙ_{T} (B^{'} ∣ B^{″}) \times ℙ_{T} (B^{″})$ and then $ℙ_{T} (B^{'} ∣ B^{″}) > 0$ . Let T′ be a scenario such that, for all d ∈ D \ {d₀}, P_T_′(Y = 1|X = d) = P_T(Y = 1|X = d) and MTD_T_′ = d₀. We then have: $ℙ_{T^{'}} (B^{'} ∣ B^{″}) > 0$ , as the same method chooses the next dose being used for the two scenarios and the dose is not tried after N₀ for any sequence of B. Moreover, $B^{″} = \underset{(x_{1}^{\infty}, y_{1}^{\infty}) \in A}{\cup} {ω \in Ω : (X_{1}^{N_{0}}, Y_{1}^{N_{0}}) (ω) = (x_{1}^{N_{0}}, x_{1}^{N_{0}})}$ with:

ℙ_{T^{'}} ({ω \in Ω : (X_{1}^{N_{0}}, Y_{1}^{N_{0}}) (ω) = (x_{1}^{N_{0}}, x_{1}^{N_{0}})}),

as the constraints are on the first N₀ terms. Thus, $ℙ_{T^{'}} (B) > 0$ and on this set of sequences of the sample the statistic F converges to MTD_T ≠ d₀ = MTD_T_′.□

H. Proof of asymptotic results for poSPM

H.1. Proof of Theorem 2 (a)

This assumption leans upon the regularity of the prior model.

Assumption 7. The following conditions are valid except when $Λ_{θ}^{θ}$ is a Dirac measure.

For all d ∈ D, the marginal distribution $Λ_{θ}^{d}$ is absolutely continuous with respect to the Lebesgue measure and $λ_{θ}^{d}$ denotes its density function.
There exist two numbers s and S in $ℝ_{+}^{*}$ , such that, for all θ and d in D, we have:
$\forall P_{d} \in S_{θ}^{d}, s < λ_{θ}^{d} (P_{d}) < S .$

Note that this assumption can be used for the poSPMc by replacing θ by γ, the parameter of the contour.

The second point is only useful for the sake of the demonstration when some β_(i,j) are equal to 0 or 1. In order to establish asymptotic properties for the poSPM, we introduce the set $\tilde{D}$ to indicate those doses that are observed infinitely often:

d \in \tilde{D} \Leftrightarrow n_{d} \underset{n \to \infty}{\to} \infty .

Proof. Let us start with the proof of ε-sensitivity. In this proof, we are interested in the asymptotic behavior of poSPM, that is why we are able to ignore those doses tested only a finite number of times and we can reason as though they had never been used. Specifically, the doses in this proof are always considered to be in $\tilde{D}$ . We assume that ℰ(I, P^T) is not empty (P^T denotes the true scenario). Let $r \in \tilde{D} ∖ ℰ (I, P^{T})$ . We can distinguish two cases. The first case is the existence of a dose d ∈ ℰ(I, P^T) such that d is ordered with r. We are then reduced to the SPM in the case of total ordering and the proof can be found in Clertant and O’Quigley (2017).

The second case is where there exists no dose in ℰ(I, P^T) ordered with r. So there exists a dose d ∈ ℰ(I, P^T) not ordered with r, because we assume that ℰ(I, P^T) is not empty. We want now to compare the integrals I_n,r and I_n,d, where g(p, n, m) = pⁿ(1 − p)^m.

\frac{I_{n, r}}{I_{n, d}} = \prod_{k \in \tilde{D}} \frac{M_{n, r}^{k}}{M_{n, d}^{k}} = \prod_{k \in \tilde{D}} \frac{\int_{S_{r}^{k}} g (P_{k}, n_{k}^{1}, n_{k}^{0}) Λ_{r} (d P_{k})}{\int_{S_{d}^{k}} g (P_{k}, n_{k}^{1}, n_{k}^{0}) Λ_{d} (d P_{k})}

(H.11)

= \prod_{k \in \tilde{D}} \frac{\int_{S_{r}^{k}} g (P_{k}, n_{k}^{1}, n_{k}^{0}) λ_{r} (P_{k}) d P_{k}}{\int_{S_{d}^{k}} g (P_{k}, n_{k}^{1}, n_{k}^{0}) λ_{d} (P_{k}) d P_{k}}

(H.12)

\leq \frac{S}{s} \prod_{k \in \tilde{D}} \frac{\int_{S_{r}^{k}} g (P_{k}, n_{k}^{1}, n_{k}^{0}) d P_{k}}{\int_{S_{d}^{k}} g (P_{k}, n_{k}^{1}, n_{k}^{0}) d P_{k}},

(H.13)

where Equation (H.11) follows from Assumption 3, Equation (H.12) from Assumption 7 (a) and Inequation (H.13) from Assumption 7 (b). We are able to state the following property. For all functions f that are continuous on [0, 1], we have

\int f (P_{k}) \frac{g (P_{k}, n_{k}^{1}, n_{k}^{0})}{Beta (n_{k}^{1} + 1, n_{k}^{0} + 1)} d P_{k} \underset{n_{k} \to \infty}{\to} \int f (P_{k}) 1_{{P_{k}}} (P_{k}) γ (d P_{k}) = f (P_{k}),

(H.14)

where Beta(.) denotes the Beta function and γ the counting measure. Let then $k \in \tilde{D}$ . We are looking for the behavior, when n_k increases without bound, of

R_{k} = \frac{\int_{S_{r}^{k}} g (P_{k}, n_{k}^{1}, n_{k}^{0}) d P_{k}}{\int_{S_{d}^{k}} g (P_{k}, n_{k}^{1}, n_{k}^{0}) d P_{k}} .

The case $S_{r}^{k} = S_{d}^{k}$ is straightforward. For the other cases, we use the convergence expressed in Equation (H.14). As d is not ordered with r, with Assumption 3, we have six cases left to deal with.

$S_{r}^{k} = A$ or B or I and $S_{d}^{k} = [0, 1]$ . Then $R_{k} \underset{n_{k} \to \infty}{\to} 1_{A} (P_{k}^{T})$ or $1_{B} (P_{k}^{T})$ or 0. This last result is due to the fact that r ∉ ℰ(I, P^T).
$S_{r}^{k} = [0, 1]$ and $S_{d}^{k} = I$ . Then $R_{k} \underset{n_{k} \to \infty}{\to} 1$ , because d ∈ ℰ(I, P^T).
$S_{r}^{k} = [0, 1]$ and $S_{d}^{k} = A$ . Then $R_{k} \underset{n_{k} \to \infty}{\to} 1 / 1_{A} (P_{k}^{T}) = 1$ . As there is no couple of ordered dose in ℰ(I, P^T), $P_{k}^{T}$ belongs to A.
$S_{r}^{k} = [0, 1]$ and $S_{d}^{k} = B$ . With the same argument as the previous case, we have $R_{k} \underset{n_{k} \to \infty}{\to} 1$ .

Finally, going back to Inequality (H.13), we conclude that I_n,r/I_n,d tends to 0 when n increases without bound. This leads to a contradiction because, as $r \in \tilde{D}$ , this ratio is greater than 1 infinitely often. So $\tilde{D} \subset ℰ (I, P)$ , that is, the poSPM is ε-sensitive.

We prove now that poSPM is also balanced. Assumption 3 allows us to focus on the marginal ratio

\frac{M_{n, r}^{k}}{M_{n, t}^{k}} = \frac{\int g (P_{j}, n_{k}^{1}, n_{k}^{0}) Λ_{r}^{k} (d P_{k})}{\int g (P_{k}, n_{k}^{1}, n_{k}^{0}) Λ_{t}^{k} (d P_{k})} and \frac{I_{n, r}}{I_{n, t}} = \prod_{k \in D} \frac{M_{n, r}^{k}}{M_{n, t^{k}}} .

Assumption 7 involves:

d (P_{k}^{T}, S_{r}^{k}) = d (P_{k}^{T}, S_{t}^{k}) \Rightarrow 0 < \underset{n \to \infty}{lim inf} \frac{M_{n, r}^{k}}{M_{n, t}^{k}} \leq \underset{n \to \infty}{lim sup} \frac{M_{n, r}^{k}}{M_{n, t}^{k}} < + \infty, a . s .

(H.15)

The ε-balanced behavior corresponds to the case where, forall k ∈ D, $P_{k}^{T} \notin I$ . We show that

r \notin ℳ_{c^{⋆}} \Rightarrow ℙ ({n_{r} \to \infty}) = 0

(H.16)

By symmetry we can choose r ∈ D⁻. The set $ℳ_{c^{⋆}} \cap D^{-} \cap 𝒜_{r}$ is not empty. Let t be a dose in $ℳ_{c^{⋆}} \cap D^{-} \cap 𝒜_{r}$ . By using Equation (H.15), as 𝒜_t ⊂ 𝒜_r and $S_{t}^{k} = [0, 1]$ when k ∈ 𝒞_t, we have

\forall k \in A_{t} \cup C_{t}, ℙ (\underset{n \to \infty}{lim sup} \frac{M_{n, r}^{k}}{M_{n, t}^{k}} < \infty ∣ n_{k} \to \infty) = 1.

If k ∈ (ℬ_r ∪ 𝒞_r) ∩ ℬ_t then $P_{k}^{T} \in B$ and we obtain the same result

ℙ (\underset{n \to \infty}{lim sup} \frac{M_{n, r}^{k}}{M_{n, t}^{k}} < \infty ∣ n_{k} \to \infty) = 1.

We consider now the ratios $M_{n, r}^{k} / M_{n, t}^{k}$ when k ∈ (ℬ_t ∩ 𝒜_r)∪{t, r}. In that case, we have $P_{k}^{T} \in B$ , $S_{t}^{k} = B$ and $S_{r}^{k}$ equals to I or A. By using Proposition 4, we have

\forall k \in (ℬ_{t} \cap 𝒜_{r}) \cup {t, r}, ℙ (lim_{n \to \infty} \frac{M_{n, r}^{k}}{M_{n, t}^{k}} = 0 ∣ n_{k} \to \infty) = 1.

We then have

ℙ (lim_{n \to \infty} \frac{I_{n, r}}{I_{n, t}} = 0 ∣ n_{r} \to \infty) = 1 and ℙ ({lim_{n \to \infty} \frac{I_{n, r}}{I_{n, t}} = 0} \cap {n_{r} \to \infty}) = 0,

which proves Equation (H.16). We achieve the proof of the ε-balanced property by showing that

t \in ℳ_{c^{⋆}} \Rightarrow ℙ (n_{t} \to \infty) = 1

(H.17)

By using Equation (H.16), we have, for all r and t in $ℳ_{c^{⋆}}$

\forall k \in D, k \neq r and k \neq t, ℙ (\underset{n \to \infty}{lim sup} \frac{M_{n, r}^{k}}{M_{n, t}^{k}} < \infty) = 1.

Moreover, for all $r \in ℳ_{c^{⋆}}$ , $P_{r}^{T}$ is included in $S_{t}^{r}$ . As $P_{r}^{T} \notin I = S_{r}^{r}$ , we have

\forall r \in ℳ_{c^{⋆}}, ℙ (lim_{n \to \infty} \frac{M_{n, r}^{r}}{M_{n, t}^{r}} = 0 ∣ n_{r} \to \infty) = 1.

Let E_r be the event {n_r → ∞} ∩ {n_t → ∞}^c. Then,

ℙ (lim_{n \to \infty} \frac{I_{n, r}}{I_{n, t}} = 0 ∣ E_{r}) = 1 and ℙ ({lim_{n \to \infty} \frac{I_{n, r}}{I_{n, t}} = 0} \cap E_{r}) = 0,

As $ℙ (\sum_{ℳ_{c^{⋆}}} n_{k} \to + \infty) = 1$ , we have $ℙ (\cup_{ℳ_{c^{⋆}}} E_{r}) = 1$ , which proves Equation (H.17) and ends the demonstration of the poSPM ε-balanced behavior. □

H.2. Proof of Theorem 2 (b)

Proof. We set:

I_{n, c} = \prod_{k \in \tilde{D}} M_{n, c}^{k} = \prod_{k \in \tilde{D}} \int_{S_{c}^{k}} g (P_{k}, n_{k}^{1}, n_{k}^{0}) Λ_{c} (d P_{k})

We note that the regularity assumption 7 involves:

δ (P_{k}^{T}, S_{r}^{k}) = δ (P_{k}^{T}, S_{t}^{k}) \Rightarrow 0 < \underset{n \to \infty}{lim inf} \frac{M_{n, r}^{k}}{M_{n, t}^{k}} \leq \underset{n \to \infty}{lim sup} \frac{M_{n, r}^{k}}{M_{n, t}^{k}} < \infty, a . s .

We will show the following assertion

c \in C \ c^{*} \Rightarrow ℙ (n_{c} \to \infty) = 0

(H.18)

As c ≠ c*, we have $ℳ_{c} \neq ℳ_{c^{⋆}}$ and there exists k ∈ ℳ_c such that $P_{k}^{T}$ is included in $S_{c^{*}}^{k}$ and not in $S_{c}^{k}$ . By using Proposition 4, we have

ℙ (lim_{n \to \infty} \frac{M_{n, c}^{k}}{M_{n, c^{*}}^{k}} = 0 ∣ n_{c} \to \infty) = ℙ (lim_{n \to \infty} \frac{M_{n, c}^{k}}{M_{n, c^{*}}^{k}} = 0 ∣ n_{k} \to \infty) = 1,

where the first equality arises from Assumption 5(b). For all doses d ∈ D, the distribution Λ_c_* models correctly the probability of toxicity, in other words $P_{d}^{T} \in S_{𝒞^{*}}$ . We then have

ℙ (lim_{n \to \infty} \frac{I_{n, c}}{I_{n, c^{*}}} = 0 ∣ n_{c} \to \infty) = 1 and ℙ ({lim_{n \to \infty} \frac{I_{n, c}}{I_{n, c^{*}}} = 0} \cap {n_{c} \to \infty}) = 0,

which ends the proof. (H.18). □

I. A general bayesian property

The following property is used in the proof of asymptotic results. It has already been proved in (Clertant and O’Quigley, 2018). Here, for the convenience of the reader, we reproduce this property without its proof.

The couple (Ω, 𝒜) denotes an abstract space endowed with its σ-field. We denote by I a finite set and by ${(X_{k})}_{k \in ℕ}$ a sequence of independent random variables taking their values in I. Let F be the set of functions from I to the segment [0, 1]. For any element q ∈ F, q_i denotes its value at i ∈ I. Let $S = {q \in F : \sum_{i \in I} q_{i} = 1}$ be the probability space on which we want to work. We say that a random variable X follows the distribution q if $ℙ {X = i} = q_{i}$ , for all i ∈ I. Let Λ₁ and Λ₂ be two probabilities on the Borel σ-field ℬ of S. Let S₁ and S₂ be the topological supports of Λ₁ and Λ₂ respectively. We would like to know the asymptotic behavior of the ratio of the expected likelihood under Λ₁ on the one under Λ₂. We define the operator r as follows

r (Λ_{1}, Λ_{2}, n) = \frac{\int \prod_{k = 1}^{n} q_{X_{k}} Λ_{1} (d q)}{\int \prod_{k = 1}^{n} q_{X_{k}} Λ_{2} (d q)} = \frac{\int \prod_{i \in I} q_{i}^{n_{i}} Λ_{1} (d q)}{\int \prod_{i \in I} q_{i}^{n_{i}} Λ_{2} (d q)},

Where $n_{i} = \sum_{k = 1}^{n} 1_{{X_{k} = i}}$ , for i ∈ I We assume that, under the true probability β, the random variables X_k, $k \in ℕ$ are identically distributed. The convergence of r(Λ₁, Λ₂, n) depends mainly on the localization of β compared to the supports S₁ and S₂. To deal with this problem, we make use of the usual concept of entropy. The entropy of q relative to p is $H (q ∣ p) = - \sum_{i \in I} p_{i} \log q_{i}$ , with the conventions log 0 = −∞ and 0 × −∞ = 0. We suppose that β is closer to S₁ than S₂ in terms of entropy.

Assumption 8. Let V be a subspace of S₂ satisfying Λ₂(V) > 0. There exists δ > 0 such that

inf_{q \in S_{1}} H (q ∣ β) - \sup_{q \in V} H (q ∣ β) > 4 δ .

This leads to a simple characterisation of the behavior of r(Λ₁, Λ₂, n).

Proposition 4. Under Assumption 8, we have $r (Λ_{1}, Λ_{2}, n) \underset{n \to \infty}{\to} 0$ .

Contributor Information

Matthieu Clertant, LAGA, LabEx Inflamex, Université Sorbonne Paris Nord, France.

Nolan A. Wages, Translational Research and Applied Statistics, University of Virginia, U.S.A.

John O’Quigley, Department of Statistical Science, University College London, U.K..

References

Azriel D, Mandel M, and Rinott Y (2011). The treatment versus experimentation dilemma in dose finding studies. Journal of Statistical Planning and Inference, 141(8):2759–2768. [DOI] [PMC free article] [PubMed] [Google Scholar]
Braun TM and Wang S (2010). A hierarchical bayesian design for phase i trials of novel combinations of cancer therapeutic agents. Biometrics, 66(3):805–812. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cheung K (2005). Coherence principles in dose-finding studies. Biometrika, 92(4):863–873. [Google Scholar]
Cheung YK (2011). Dose finding by the continual reassessment method. CRC Press. [Google Scholar]
Clertant M and O’Quigley J (2017). Semiparametric dose finding methods. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 79(5):1487–1508. [Google Scholar]
Clertant M and O’Quigley J (2018). The role of minimal sets in dose finding studies. Journal of Nonparametric Statistics, 30(4):1016–1031. [Google Scholar]
Clertant M and O’Quigley J (2018). Semiparametric dose finding methods: special cases. Journal of the Royal Statistical Society: Series C (Statistical Methodology), 68(2):271–288. [Google Scholar]
Huang X, Biswas S, Oki Y, Issa J, and Berry DA (2007). A parallel phase i/ii clinical trial design for combination therapies. Biometrics, 63(2):429–436. [DOI] [PubMed] [Google Scholar]
Iasonos Alexia and OQuigley John (2016). Dose expansion cohorts in phase i trials. Statistics in biopharmaceutical research, 8(2):161–170. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ji Yuan, Liu Ping, Li Yisheng, and Bekele B Nebiyou (2010). A modified toxicity probability interval method for dose-finding trials. Clinical Trials, pages 653–663. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mander AP and Sweeting MJ (2015). A product of independent beta probabilities dose escalation design for dual-agent phase i trials. Statistics in Medicine, 34(8) :1261–1276. [DOI] [PMC free article] [PubMed] [Google Scholar]
Storer BE (1989). Design and analysis of phase i clinical trials. Biometrics, pages 925–937. [PubMed] [Google Scholar]
Thall PF, Millikan RE, Mueller P, and Lee S (2003). Dose-finding with two agents in phase i oncology trials. Biometrics, 59(3):487–496. [DOI] [PubMed] [Google Scholar]
Wages NA, Conaway MR, and O’Quigley J (2011). Continual reassessment method for partial ordering. Biometrics, 67(4):1555–1563. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang K and Ivanova A (2005). Two-dimensional dose finding in discrete dose space. Biometrics, 61(1):217–222. [DOI] [PubMed] [Google Scholar]

[R1] Azriel D, Mandel M, and Rinott Y (2011). The treatment versus experimentation dilemma in dose finding studies. Journal of Statistical Planning and Inference, 141(8):2759–2768. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] Braun TM and Wang S (2010). A hierarchical bayesian design for phase i trials of novel combinations of cancer therapeutic agents. Biometrics, 66(3):805–812. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] Cheung K (2005). Coherence principles in dose-finding studies. Biometrika, 92(4):863–873. [Google Scholar]

[R4] Cheung YK (2011). Dose finding by the continual reassessment method. CRC Press. [Google Scholar]

[R5] Clertant M and O’Quigley J (2017). Semiparametric dose finding methods. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 79(5):1487–1508. [Google Scholar]

[R6] Clertant M and O’Quigley J (2018). The role of minimal sets in dose finding studies. Journal of Nonparametric Statistics, 30(4):1016–1031. [Google Scholar]

[R7] Clertant M and O’Quigley J (2018). Semiparametric dose finding methods: special cases. Journal of the Royal Statistical Society: Series C (Statistical Methodology), 68(2):271–288. [Google Scholar]

[R8] Huang X, Biswas S, Oki Y, Issa J, and Berry DA (2007). A parallel phase i/ii clinical trial design for combination therapies. Biometrics, 63(2):429–436. [DOI] [PubMed] [Google Scholar]

[R9] Iasonos Alexia and OQuigley John (2016). Dose expansion cohorts in phase i trials. Statistics in biopharmaceutical research, 8(2):161–170. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] Ji Yuan, Liu Ping, Li Yisheng, and Bekele B Nebiyou (2010). A modified toxicity probability interval method for dose-finding trials. Clinical Trials, pages 653–663. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] Mander AP and Sweeting MJ (2015). A product of independent beta probabilities dose escalation design for dual-agent phase i trials. Statistics in Medicine, 34(8) :1261–1276. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] Storer BE (1989). Design and analysis of phase i clinical trials. Biometrics, pages 925–937. [PubMed] [Google Scholar]

[R13] Thall PF, Millikan RE, Mueller P, and Lee S (2003). Dose-finding with two agents in phase i oncology trials. Biometrics, 59(3):487–496. [DOI] [PubMed] [Google Scholar]

[R14] Wages NA, Conaway MR, and O’Quigley J (2011). Continual reassessment method for partial ordering. Biometrics, 67(4):1555–1563. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] Wang K and Ivanova A (2005). Two-dimensional dose finding in discrete dose space. Biometrics, 61(1):217–222. [DOI] [PubMed] [Google Scholar]

PERMALINK

SEMIPARAMETRIC DOSE FINDING METHODS FOR PARTIALLY ORDERED DRUG COMBINATIONS

Matthieu Clertant

Nolan A Wages

John O’Quigley

Abstract

1. Introduction

2. Context and motivation

Table 1:

Figure 1:

3. Semiparametric methods and a single MTD target

4. Identifying the Maximum Tolerated Contour (MTC)

4.1. Semiparametric Model

Figure 2:

Figure 3:

4.2. Allocation strategy

5. Large sample theory

Figure 4:

6. Numerical experiments: MTC setting

Table 2:

7. CNL and vinblastine combination, a study of Scenario T

Table 3:

Table 4:

Ackowledgements

Appendix

A. Scenarios for the MTC setting

Table 5:

B. Notations

Table 6:

C. Allocation strategy

Exclusion rule:

D. Numerical experiments: MTD setting

Table 7:

Table 8:

E. Calibration

E.1. poCRM, 2d-BOIN and PIPE methods

E.2. poSPM

E.3. poSPMc

F. Computation of minimal set

G. Proofs of two properties and a general theorem

G.1. Coherence

G.2. Conjugacy property

G.3. Proof of Theorem 1

H. Proof of asymptotic results for poSPM

H.1. Proof of Theorem 2 (a)

H.2. Proof of Theorem 2 (b)

I. A general bayesian property

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases