Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Nov 1.
Published in final edited form as: J Stat Plan Inference. 2016 Jul 15;178:128–139. doi: 10.1016/j.jspi.2016.07.002

Optimal designs based on the maximum quasi-likelihood estimator

Gang Shen a, Seung Won Hyun a,*, Weng Kee Wong b
PMCID: PMC5287418  NIHMSID: NIHMS807222  PMID: 28163359

Abstract

We use optimal design theory and construct locally optimal designs based on the maximum quasi-likelihood estimator (MqLE), which is derived under less stringent conditions than those required for the MLE method. We show that the proposed locally optimal designs are asymptotically as efficient as those based on the MLE when the error distribution is from an exponential family, and they perform just as well or better than optimal designs based on any other asymptotically linear unbiased estimators such as the least square estimator (LSE). In addition, we show current algorithms for finding optimal designs can be directly used to find optimal designs based on the MqLE. As an illustrative application, we construct a variety of locally optimal designs based on the MqLE for the 4-parameter logistic (4PL) model and study their robustness properties to misspecifications in the model using asymptotic relative efficiency. The results suggest that optimal designs based on the MqLE can be easily generated and they are quite robust to mis-specification in the probability distribution of the responses.

Keywords: Approximate Design, Design Efficiency, Dose-finding Study, Equivalence Theorem, Heteroscedasticity, Maximum likelihood Estimator

1. Introduction

Model assumptions are frequently made for drawing statistical inference but they may not be tenable in practice. The optimal design constructed under wrong model assumptions can be very inefficient. When the model is nonlinear, a further complication is that nominal values for the model parameters are required before the optimal designs can be implemented. It is thus important to find a design that is robust to various forms of mis-specification in the statistical model.

Different approaches have been proposed to find designs robust to various departures from the model assumptions. They typically concern mis-specifications in the error distribution, the response mean function, and the form of heteroscedasticity in the model. For instance, Burridge and Sebastiani (1994) and Dette and Wong (1999) proposed optimal designs when the variance is a function of the mean response, and Chen et al. (2008) proposed optimal minimax designs for a heteroscedastic polynomial model. However the bulk of the work focused on a single violation from the model assumptions. Some examples are Läuter (1974); Stigler (1971); Lee (1987, 1988); Studden (1982); Song and Wong (1998a,b); Dette and Wong (1996, 1999); the first six considered model mis-specification in mean function and the last three concerned mis-specification in the heteroscedasity structure.

Much of the research in optimal design assumes that the design criterion is formulated in terms of the Fisher Information matrix. This matrix can only be computed under the full specification of the probability distribution for the response. In contrast, the quasi-likelihood method introduced by Weddernburn (1974) assumes only a functional relationship between the mean and the variance of the response. This is a tenable assumption in practical situations, see for example, Box and Hill (1974), Bickel (1978) and, Jobson and Fuller (1980). This suggests that designs based on the quasi-likelihood method are likely to be more robust to model assumptions than methods that require a full parametric probability model.

There is very little work on optimal designs based on the MqLE for nonlinear regression models despite numerous successful applications of the quasi-likelihood method for generalized linear models (McCullagh and Nelder, 1989). A clear exception is work by Niaparast (2009), and Niaparast and Schwabe (2013), who derived optimal designs based on the MqLE for Poisson regression models. The first paper focused on the theoretical construction of D-optimal designs for the Poisson models with random intercepts, and the second paper extended the work to a general mixed effects Poission model with random coefficients and so the paper deals with random slopes rather than random intercepts. However, their work concerns only finding locally D-optimal designs for the case when the response follows a Poisson distribution.

We consider a more general situation and develop theory to find various types of optimal designs based on the MqLE where the response may be a discrete or a continuous random variable. We also show such optimal designs can be generated from algorithms currently used to find optimal designs based on the Fisher Information matrix. We list advantages of such designs compared to other designs based on other efficient estimators such as the MLE and the LSE when the sample size is large and study the robustness properties of D-optimal designs to mis-specified nominal parameter values. We focus on locally optimal designs (Chernoff, 1953) where nominal values for the model parameters are required before they can be implemented. Bayesian optimal designs can also be directly constructed from our methodology but will not be discussed here. One of the key results is that we show here that asymptotically, locally optimal designs based on the MqLE also have the same property as those constructed based on the MLEs but without the normality distributional assumption. When responses are from a member of the class of exponential family distributions, we prove that the optimal designs based on the MqLE is asymptotically just as efficient as the optimal designs based on the MLE. Such designs are also asymptotically more efficient than the designs based on asymptotic best linear unbiased estimators, such as LSE. When responses are not from a member of the exponential family of distributions, we show optimal designs based on the MqLE also perform well based on an asymptotic relative efficiency measure.

Section 2 reviews preliminaries and discusses the asymptotic optimality of MqLEs. In Section 3, we review approximate designs and equivalence theorems for checking optimality of an approximate design. We then develop the theory and an algorithm to find a variety of optimal designs based on the MqLE. As an illustrative application, we construct locally optimal designs for dose response studies using the widely used 4-parameter logistic model (4PL-model). We construct optimal designs for estimating models and meaningful function of the model parameters in the 4PL-model and compare its performance with those based on MLEs and a uniform design with 10 points when there is mis-specification in (i) the nominal values of the model parameters, (ii) the heteroscedasticity structure, and (iii) the error distributions. Section 5 concludes with a summary of our work and recommendations.

2. Preliminaries

Suppose we have a dose response study and we have resources to take n observations at d distinct doses x1, …, xd with ni replicates at xi, i = 1, …, d and i=1dni=n. Let y be the response at x with E(y) = μ and Var(y) = v(μ) where μ = μ(θ; x) and μ is a known function of an unknown r-dimensional parameter vector θ and dose x. For simplicity, we write μ(θ; x) as μ(θ) or simply μ if there is no confusion. Our stipulation Var(y) = v(μ) covers the situation of Niaparast (2009) but not the situation of Niaparast and Schwabe (2013) since the latter considered Var(y) = μ+μ2c(x) and c(x) is not a function of μ. Another difference is that the two papers assumed the ni replications at xi are not independent and we do here as in most dose response studies. Assuming that infμ v(μ) > 0, the log quasi-likelihood of y (with dispersion ϕ) is defined as

q(μ;y)-μy-tϕv(t)dt.

Let q.θq(μ;y) (quasi-score) and q¨2θθq(μ;y). Weddernburn (1974) showed that q(μ; y) behaves like a likelihood function in the sense that

E(q.)=0,E(q.q.)=-E(q¨)

and -E(q¨)=1ϕv(μ)μ.(θ)μ.(θ) (quasi-information), where μ.(θ)μθ. Given y1, y2, …, yn, the maximum quasi-likelihood estimate (MqLE) of θ maximizes the joint quasi-likelihood i=1nq(μi;yi).

More generally, let y = (y1, …, yn)′ be the vector of observations at the n doses and let μ = (μ1, …, μn)′ be its mean vector. Without loss of generality, let the first d elements of y correspond to observations from the d distinct doses x1, …, xd and let μ = (μ(θ; x1), …, μ(θ; xd))′ be the mean vector of the first d observations from the d distinct doses. Clearly μ = Cμ, where C is the first d rows of the n-dimensional identity matrix In.

Let θ̂n be the maximum quasi-likelihood estimator (MqLE) of θ, let Dn(θ)=(μ.1,,μ.n) be the full rank r×n matrix and let Vn-1(μ) be the inverse of the n×n diagonal matrix Vn(μ) = diag[v(μ1), …, v(μn)]. By definition, θ̂n satisfies

i=1nq.(μi;yi)=i=1nyi-μiϕv(μi)μiθ=1ϕDn(θ)Vn-1(μ)(y-μ)=0,

assuming that i=1nq¨(μi;yi) is positive definite. The MqLE of θ is then obtained iteratively via the quasi Newton-Raphson method, i.e., at the (k + 1)th iteration with μ^i(k)=μ(θ^n(k);xi),

θ^n(k+1)=θ^n(k)+{Dn(θ^n(k))Vn-1(μ^(k))Dn(θ^n(k))}-1Dn(θ^n(k))Vn-1(μ^(k))(y-μ^(k)).

Observe that the MqLE is a Z-estimator with properties analogous to a MLE. Under similar regularity assumptions required for the MLE of θ, cf., Lehmann (1999), §7.1 & §7.3, (C1) – (C7), the MqLE θ̂n as an M-estimator of θ is also asymptotically linear and unbiased satisfying

θ^n=θ+(DnVn-1Dn)-1DnVn-1(y-μ)+op(n-1/2). (1)

Here and elsewhere, we have suppressed the arguments in Dn(θ) and Vn-1(μ) and write them as Dn and Vn-1, for simplicity, when there is no confusion. The symbol op(n−1/2) denotes a random vector whose L norm is o(n−1/2) in probability. We continue to use this notation for vectors and matrices for their L norm with the specified order when there is no confusion.

Eq(1) implies that n(θ^n-θ)dN(0,ϕΔ-1) if Δ=limnn-1(DnVn-1Dn) exists, viz. Weddernburn (1974) or McCullagh (1983). Consequently, the MqLE of μ is μ̂ = μ(θ̂n). If we expand μ(θ) using a Taylor expansion about a neighborhood of θ0, assuming supθ ||μ̇(θ)|| < ∞ and μ(θ) is twice differentiable, we have

μ(θ)=μ(θ0)+μ.(θ0)(θ-θ0)+O(θ-θ02) (2)

and μθ=D, where D=(μ.1,,μ.d) is a r×d matrix. By the delta method, it follows that

n(μ^-μ)dN(0,ϕDΔ-1D).

Here we state an asymptotic optimality of the MqLE and in A.1 of the appendix, we provide its justifications with technical details beyond that sketched out in McCullagh (1983).

Theorem 1

Let θ̃n be an asymptotically linear unbiased estimator of θ0 satisfying

θn=θ0+(LnLn)-1LnVn-1/2[y-μ(θ0)]+op(n-1/2), (3)

where Ln=(l1,,ln) is an arbitrary r×n full rank matrix. If n-1i=1nlili tends to a positive definite matrix Λ (r×r and max1≤i≤n ||li|| = o(n1/2)), then μ̂ is asymptotically optimal in the sense that n{Var(μ̃)−Var(μ̂)} is asymptotically a non-negative definite matrix.

The implication of Theorem 1 is that under mild conditions, the MqLE μ̂ is the asymptotically best linear unbiased estimator of μ. This result may not be surprising because McCullagh (1983) had noted that if y have a distribution from the exponential family with pdf f(y; η) = exp{(b(η))/aϕc(y; ϕ)} satisfying η = η(μ), E(y) = μ = μ(θ; x) and Var(y) = v = a(ϕ)v(μ), the MLE of θ, θ̌n, has the approximation

θˇn=θ+(DnVn-1Dn)-1DnVn-1(y-μ)+op(n-1/2). (4)

It follows that the MLE θ̌n has the same linear approximation as the MqLE θ̂n when the observations come from an exponential family with canonical parameter η(μ) and variance function v(μ), see Eq(1). By the delta method, this implies that the MqLE μ̂ of μ is asymptotically as efficient as the MLE μ̌ = μ(θ̌) of μ. Consequently, asymptotically linear unbiased estimators of a function of θ are not more accurate than MqLE for estimating the same function of θ.

Corollary 1

MqLE θ̂ is asymptotically superior to LSE θ̌LSE for any designs under the regularity conditions.

Proof

Under the regularity conditions, LSE θ̌LSE of θ satisfies

θˇnLSE=θ+(DnDn)-1Dn(y-μ)+op(n-1/2),

so θ̌LSE is a member of the class of asymptotic best linear unbiased estimators with (LnLn)-1Ln=(DnDn)-1DnVn1/2 in Eq(4). By Theorem 1, n{Var(θ̌LSE) − Var(θ̂)} is non-negative definite.

This implies that n{Var(μˇLSE)-Var(μ^)} is non-negative definite by virtue of the delta method, and so the MqLE μ̂ of μ is superior to the LSE μˇLSE of μ.

Therefore the optimal designs based on the MqLE and the MLE have the same efficiency when the distribution of y is a member of exponential family distributions, and the optimal design based on the MqLE asymptotically performs just as well or better than optimal designs based on any other asymptotically linear unbiased estimators such as the least square estimator (LSE).

3. Locally optimal designs based on the MqLE

Optimal designs for nonlinear models are often constructed using strong parametric model assumptions and it is well known that they can be sensitive to the assumptions. Optimal designs based on the MqLE require less stringent assumptions and they can be found once the variance function Var(y) is formulated as a function of E(y). In this section, we first discuss optimality design criteria and approximate designs. We then construct locally optimal designs based on the MqLE and compare their performances with locally optimal design based on the MLE and a uniform design; the latter is popular because they are intuitive appealing and easy to construct by taking equal number of observations at equally spaced doses. As an illustrative application, we construct and compare optimal designs based on the MLE and MqLE for the 4-parameter logistic model, which is widely used in biometry.

The setup assumes that we have resources to take n observations from a model with

E(yij)=μi=μ(θ,xi),Var(yij)=vi=ϕv(μi);j=1,2,,ni,i=1,2,,d.

Here θ is a r-dimensional vector of parameters and we have ni replicates at each doses (or log doses) xi with i=1dni=n. The observation from the jth replicate at the ith dose is yij and its distribution is unknown except for the relationship between its mean and variance. All the observations are assumed to be independent and observations at the same dose are identically distributed. Given the design criterion and model assumptions, the design questions are how many doses are required, where these doses are and how many replications are required at each dose. In general, such questions are very difficult to answer, where the optimal designs can depend sensitively on the value of n, the regression model and the optimality criterion. To facilitate practitioners to use such designs, we would require an endless table of optimal designs! For nonlinear models, a longer list is required because for each set of nominal values of the model parameters, the locally optimal designs also depend on the set.

3.1. Approximate designs and design criteria

An increasingly common and effective approach to solve design questions is to use approximate designs, which are essentially probability measures defined on the given dose interval. For our problem, we denote such a design with d doses by ξ={(xi,wi)}1d, where xi is the ith dose selected from a given dose interval and wi = ni/n is the proportion of the total observations allocated to xi, i = 1, …, d. Given an optimality criterion, the design problem requires us to determine the optimal value of d and optimal choices for (xi, wi), i = 1, …, d. The implemented design takes roughly nwi observations at xi, subject to the constraint that each nwi is rounded to a positive integer and they sum to n.

The advantages of working with approximate designs are well documented in the literature, see for example, the voluminous work of Kiefer (1985). The main ones are that the approach provides a unified framework for finding and verifying optimal designs under any concave (or convex) optimality criterion, and when a design is not optimal, the approach provides an easy evaluation of the proximity of the design to the optimum without knowing the optimum (Pazman, 1986). In addition, algorithms for finding a broad class of optimal designs are available. A typical application of this approach is to first calculate the Fisher information matrix for a given design and formulate the design criterion as a concave (or convex) function of this matrix.

For our setup, we recall that the quasi-information matrix is obtained from the negative of the expectation of the second derivatives of the log quasi-likelihood function with respect to the model parameters. Specifically, the normalized quasi-information matrix for θ is Δ(θ;ξ)=n-1ϕ-1(DnVn-1Dn), which can be written as Δ(θ;ξ)=1ϕi=1dwig(xi)g(xi) with ϕ as the dispersion parameter and g(xi)=v(μi)-1/2(μiθ1,μiθ2,,μiθr) evaluated at the nominal values for θ. This quasi-information matrix is derived based only on the variance-mean relationship of the response distribution but has the same form as the Fisher information matrix. Both matrices become equal when responses come from a normal distribution with constant variance. As we show shortly, if the design criterion is a concave (convex) function of the quasi-information matrix, an equivalence theorem can be similarly developed for checking whether a design based on the MqLE is optimal among all designs on the given design interval. Equivalence theorems for verifying whether a design based on MLEs is optimal or not are available in design monographs such as Fedorov (1972), Pazman (1986), Pukelsheim (2006) or Fedorov and Leonov (2013).

A common objective is to find an efficient design to estimate the vector of parameters θ or some functions of model parameters. D-optimality seeks to minimize the volume of the confidence ellipsoid for θ by choice of a good design and so a D-optimal design provides the most accurate estimates for the model parameters. When there are several functions of the model parameters to estimate, C-optimality seeks to minimize the asymptotic average variance of the estimated functions. Consequently, the C-optimal design provides the most accurate inference for estimating the user-selected set of functions of the model parameters. If there is only a single function of the model parameters to estimate, C-optimality becomes c-optimality and a c-optimal design minimizes the asymptotic variance of the estimated function of interest.

We also consider designs for estimating s(sr) linear combinations of the parameters Aθ, where A is a r × s matrix of rank s. Here we are only interested in approximate designs ξ for which Aθ is estimable. The quasi-information matrix of a design ξ for estimating Aθ is M = (A′[Δ(θ; ξ)]A)−1, where Δ(θ; ξ) is a generalized inverse of Δ(θ, ξ).

A useful class of concave design criteria, indexed by a scalar t, for models with r parameters and formulated in terms of information matrix M is:

ϕt(M)={λmin(M)t=-;(det[M])1/rt=0;(1rTr[Mt])1/tt<0,

where λmin(M) denotes the minimum eigenvalue of M. Because the criteria are concave, standard arguments using the directional derivatives provide us with a tool to verify if a design is optimal among all approximate designs on the given design space. For example, it can be shown that for any t > −∞, if A is user-selected matrix, the design ξt is optimal among all designs on the given dose interval for estimating Aθ if and only if there is G = [Δ(θ; ξt)] such that

g(x)GA(AGA)-(t+1)AGg(x)Tr[(AGA)-t], (5)

with equality at the dose levels of the design ξt (Pukelsheim, 2006). The locally optimal design becomes a locally D-optimal design when t = 0 and becomes a locally C-optimal design for estimating functions of the model parameters when t = −1. Locally c-optimal designs for estimating a function of the model parameters is a special case of the C-optimal design when s = 1. The function on the left hand side of (5) is sometimes called the sensitivity function and by examining whether its graph satisfies the conditions of the equivalence theorem, one can confirm the optimality of the design. All our optimal designs reported in this paper have been verified by an appropriate equivalence theorem.

D-optimal design maximizes ϕ0(M). If we set A = Ir and t = 0 in (5), the equivalence theorem for D-optimality is obtained and the design ξD is D-optimal design if and only if for any dose x in the dose interval,

d(x,ξD)=g(x)[Δ(θ;ξD)]-1g(x)r,

with equality at the dose levels of the design ξD. Alternatively, we may want to have an efficient design for estimating several functions of the model parameters at the same time. In this case, we use the C-optimality criterion to find a design to minimize Tr[M−1], which is equivalent to maximize ϕ−1(M). If the known functions of interest are C1(θ), C2(θ),…, Cs(θ), we set A = (Ċ1(θ), Ċ2(θ), …, Ċs(θ)), C.i(θ)=Ci(θ)θ and t = −1. From (5), we have the design ξC is C-optimal design among all designs on the dose interval if and only if for any dose x in the dose interval,

d(x,ξC)=g(x)[Δ(θ;ξC)]-A2Tr[A[Δ(θ;ξC)]-A],

with equality at the dose levels of the design ξC.

Let the vector of weights w = (w1, …, wd−1)′ because wd=1-i=1d-1wi. The quasi-information matrix has the same form as the Fisher information matrix and we show in A.2 and A.3 of the appendix that the vector of weights w of the D and C-optimal designs can be obtained as follows:

Theorem 2

The vector of weights w of the D-optimal design are the non-negative roots of the equation det[Δ(θ;ξ)]w=0. If the number of distinct dose levels in the locally D-optimal design is the same as the dimension of θ in μ(θ, x) (i.e., d = r), the D-optimal is equally weighted.

Theorem 3

The vector of weights w of the C-optimal design are the non-negative roots of the equation Tr(A[Δ(θ;ξ)]-1A)w=0.

Knowing a good upper bound on the number of support points of the optimal design can frequently simplify the search for the optimal design considerably. Ideas in Hyun et al. (2013) and Yang (2010) may be applicable to identify an upper bound on the number of points for the optimal design based on the MqLE. In our study, we provide an algorithm for finding optimal designs based on the MqLE without requiring knowledge of a useful upper bound.

3.2. Design efficiency and algorithms

Following convention, the efficiencies of a design ξ for estimating the r− dimensional vector θ and a set of interesting functions C(θ) based on the MqLE are, respectively, measured by

eD(ξ)=(det[Δ(θ;ξ)]det[Δ(θ;ξD)])1r

and

eC(ξ)=Tr(A[Δ(θ;ξC)]-1A)Tr(A[Δ(θ;ξ)]-1A).

If this ratio is one-half, the interpretation is that the design ξ needs to be replicated twice for it to do as well as the optimal design. In practice, we want designs with high efficiencies.

Standard design algorithms such as those proposed by Fedorov (1972) and their many modifications thereof, can be used to search for a D-optimal or a C-optimal design based on the MLE. An improved version is the Yang-Biedermann-Tang algorithm (Yang et al., 2013), where they showed their algorithm can more effectively generate several types of optimal designs for nonlinear models with a few independent variables. Hyun et al. (2015) extended their algorithm to find multiple-objective optimal designs and this is the algorithm we used to find optimal designs in this work, after specializing our algorithm to find single-objective optimal designs. A distinguishing feature of our algorithm is that it first selects the starting design by optimizing dose levels via the Fedorov’s algorithm and obtains the optimal weights for the selected optimal doses using the Newton Raphson’s method based on Theorem 2 and 3. Hyun’s algorithm to search multiple-objective optimal designs is available in a R-package called VNM (Hyun et al., 2015) and the package can be freely downloaded from the R-archive. Interested readers may also write to the second author for the codes.

We now give examples using a popular model where heteroscedasticity is simply modeled via v(μ) = e, and h is some known constant; other functional forms for modeling the variance in terms of the mean, such as v(μ) = 2, can also be directly incorporated in our algorithm. We note that when h = 0, the quasi-information matrix is identical to the Fisher information matrix when errors are independent and normally distributed with a constant variance. More generally, since the Fisher information matrix for θ based on the MLE under normal distribution is F(θ;ξ)=i=1dwif(xi)f(xi) with f(xi)=[vi-1+12{v.ivi}2]1/2(μiθ1,μiθ2,,μiθr), where v.i=viμi.

Let ξD0 and ξC0, respectively, be the locally D- and C-optimal designs based on the MLE when we have normally distributed responses with homoscedastic errors. Let ξDh and ξCh, respectively, be the locally D- and C-optimal designs based on the MqLE for a heteroscedastic model. When errors are normally distributed with constant variance, we note that the latter two optimal designs based on the MqLE are identical to the optimal designs based on the LSE since MLE and LSE are the same.

3.3. Four-parameter logistic(4PL) model

The 4PL model is variously called the Hill model or the Emax model and is widely used across several disciplines. They include education testing as in item response theory (Hambleton and Swaminathan, 1985; Harris, 1989), enzyme kinetic studies (Dette et al., 2005), pharmaceutical studies (Bretz et al., 2010; Reeve and Turner, 2013) and in toxicology (Dinse, 2011; Gadakar and Call, 2015), to name a few. It is also used for curve-fitting analysis in bioassays or immunoassays such as ELISAs or dose-response curves. When some parameters of the model are set equal to certain values, the model reduces to the usual 2 parameter logistic model or the 3-parameter logistic model, which are also frequently used in the biomedical sciences. Because of the flexibility of the model, we select it as an exmeplary nonlinear model in biometrics to study the performance of optimal designs based on the MqLE.

The 4-parameter logistic model assumes symmetry around the inflection point and the mean of its continuous response is modeled by

μ(θ,x)=θ11+eθ2x+θ3+θ4,θ1>0,θ20. (6)

There are different parametrization of the models depending on the field and application at hand. Sometimes, the variable x may influence the choice of the parametrization. For example, the 4PL model (6) seems to be more widely used when the dose is measured in logarithmic units; otherwise, the Hill model seems to be the preferred choice. We choose the formulation in (6) because (i) its form provides more stable parameter estimates than other forms (Reeve and Turner, 2013) and (ii) it facilitates the numerical search of the optimal designs over a shorter dose interval.

An interesting dose to estimate is the ED50, which is the dose expected to result in having a response halfway between the upper limit and the lower limit of the mean response. In (6), x is the log dose, θ1 is the difference between the upper limit and the lower limit of the expected response, θ2 is the negative value of the Hill’s slope that controls the steepness of the curve, θ3 is −θ2log(ED50), and θ4 is the lower limit of the response. Another interesting dose to estimate is MED(δ), the dose that is expected to result in a change of δ units in the mean response from the minimum dose and δ is user-specified. For example, when working with an inhibitory drug, it may be desirable to estimate the dose that would shrink the tumor size by some user-specified δ units. It is straightforward to show that for the 4PL model, ED50 = −θ32 and MED(δ)={log(-δθ1+δ)-θ3}/θ2 if θ2 > 0 or MED(δ)={log(θ1-δδ)-θ3}/θ2 if θ2 < 0, where 0 < |δ| < θ1 and δ must have an opposite sign from θ2 (Hyun and Wong, 2015).

In what is to follow, we find locally D-optimal designs for estimating θ and locally C-optimal designs for estimating both ED50 and MED(δ) simultaneously for the 4PL model. For illustrative purpose, we set δ = −1 for MED for the rest of the paper. A direct calculation shows that the quasi-information matrix for the 4PL model is Δ(θ;ξ)=i=1dwig(xi)g(xi)/ϕ, where

g(x)=[ehμ(x,θ)]-1/2(11+eθ2x+θ3,-θ1xeθ2x+θ3(1+eθ2x+θ3)2,-θ1eθ2x+θ3(1+eθ2x+θ3)2,1).

The 4PL model has a continuous response and we assume heteroscedasticity is modeled by v(μ) = e after noting that this assumption is not appropriate for the classical logistic model with binary responses. The locally optimal design does not depend on the of ϕ but on the value of h for the given nominal values of θ. The parameter ϕ enters the information matrix as a multiplicative constant and so our optimal designs do not depend on its nominal value. The nominal values for the remaining parameters were taken from Khinkis et al. (2003) who found locally D-optimal designs for studying characteristics of seven anti-cancer drugs using the above model. The dose interval used in their study was [log(.001), log(1000)]. Equivalence theorems for the D-and C-optimality criteria can be obtained by setting A′ = I4 and A′ = (EḊ50, MĖD)′, respectively in (5). Here ED.50=(0,θ3θ22,-1θ2,0) and

ME.D={(-1(θ1+δ)θ2,θ3-log(-δθ1+δ)θ22,-1θ2,0),ifθ2>0(1(θ1-δ)θ2,θ3-log(θ1-δδ)θ22,-1θ2,0),ifθ2<0.

Table 1 shows the locally D-and C-optimal designs based on the MqLE for various values of h in the variance function when the nominal value for the model parameters is θ = θ1 = (1.563, 1.790, 8.442, 0.137)′. All locally D-optimal designs, as expected, are equally weighted with 4 doses but locally C-optimal designs may require 3 or 4 doses, depending on the values of heteroscedasticity parameter h. When h = 0, the locally optimal designs based on the MqLE ξDh and ξCh coincide with the corresponding optimal designs for the homoscedastic model. All locally optimal designs from our algorithm can be verified by examining the plot of the sensitivity function. For example, Figure 1 is the sensitivity plot d(x,ξCh) of the locally C-optimal design for estimating both the ED50 and MED(δ = −1) in the 4PL model when θ = θ1 and h = 1. The graph satisfies the conditions of the equivalence theorem and so confirms the optimality of the design based on the MqLE for estimating the ED50 or MED in the 4PL model.

Table 1.

Locally D-optimal designs ( ξDh) for estimating θ and locally C-optimal designs ( ξCh) for estimating both ED50 and MED(δ = −1) based on the MqLE when θ = θ1 and ED50 = −4.716 for the 4PL model with selected values of h in v(μ) = e. The optimal designs are obtained using the dose interval [log(.001), log(1000)] = [−6.91, 6.91]. The last two columns display the D-efficiencies (top rows) and C-efficiencies (bottom rows) of the locally homoscedastic D- and C-optimal designs ( ξD0,ξC0) and the uniform design at 10 points for different values of h.

h
ξDh
eD(ξD0)
eD(ξU10)
h = −2
(-6.91-5.55-4.506.910.2500.2500.2500.250)
0.900 0.547
h = −1
(-6.91-5.39-4.286.910.2500.2500.2500.250)
0.974 0.596
h = 0
(-6.91-5.21-4.086.910.2500.2500.2500.250)
1.000 0.617
h = 1
(-6.91-5.01-3.886.910.2500.2500.2500.250)
0.973 0.607
h = 2
(-6.91-4.80-3.716.910.2500.2500.2500.250)
0.899 0.896
h = 5
(-5.43-4.22-3.236.910.2500.2500.2500.250)
0.532 0.530
h
ξCh
ec(ξC0)
ec(ξU10)
h = −2
(-6.91-5.62-4.716.910.1790.0580.4420.321)
0.763 0.226
h = −1
(-6.91-4.90-4.49-2.020.2750.2410.2550.229)
0.962 0.241
h = 0
(-6.91-4.67-2.700.3780.4450.177)
1.000 0.202
h = 1
(-6.91-4.63-2.910.4960.3910.113)
0.937 0.158
h = 2
(-6.91-4.65-2.850.6110.3220.067)
0.792 0.126
h = 5
(-6.91-4.29-3.202.110.8210.1190.0450.016)
0.457 0.222

Figure 1.

Figure 1

Plot of d(x,ξC1) for the 4PL model when θ = (1.563, 1.790, 8.442, 0.137)′ and v(μ) = e with h = 1 confirms optimality of the design ξC1.

Table 1 also displays the D-and C-efficiencies of optimal designs based on MqLEs and the uniform design ξU10 with 10 points evenly spread out across the dose range [log(.001), log(1000)]. A uniform design is a popular choice among practitioners because it is simple to implement and intuitively appealing. An issue in the choice of a uniform design is the number of points it has. Larger number of points have been shown to result in lower efficiencies (Wong and Lachenbruch, 1996) and so an informed decision on the number of points it has is required. We choose 10 doses on a log scale here because this design allows us to conduct a lack of fit test to assess model adequacy. Another reason is that there are real studies that uses 10 doses to fit toxicology data using the 4PL model. The take home message from Table 1 is that the homoscedastic optimal designs ξD0 and ξC0 have high efficiencies when there is only small mis-specification in the heteroscedastic assumption, via the values of h, and they become inefficient when the amount of heteroscedasticity is increased. In addition, the ED50 is always located between the middle two points when there are 4 optimal design points or close to the middle points when there are 3 optimal design points. It seems that the middle two design points or the middle points move toward to the right of the ED50 dose as the value of h increases. The 10-point uniform design ξU10 performs much worse under the same situation. For the range of values of h considered for the 10-point uniform design, the C-efficiencies for estimating both ED50 and MED(δ = −1) average about 19%, with some as low as 12%. For D-efficiencies, they average 59% with some as low as 31.5%. This reaffirms that the choice of an appropriate uniform design is important; in this case, a uniform design with fewer points such as 7 or 8 points is likely to perform better than one with 10 points (Wong and Lachenbruch, 1996).

We recall that MLE provides the best performance when the error distribution is correctly specified. The MqLE is asymptotically as efficient as the MLE and optimal designs based on MLE or MqLE are the same if responses are from a member of the exponential family distribution. We now investigate the robustness properties of optimal designs based on the MqLE when responses do not come from a member of the exponential class of family distributions. We do this by using an asymptotic efficiency measure and consider two distributions with heteroscedasticity. Let μi = μ(θ, xi) and vi = ϕei. The two distributions are

  • D1= a normal distribution with E(yij) = μi and V ar(yij) = vi;

  • D2= a gamma distribution with the shape parameter α=μi2/vi and scale parameter β = μi/vi.

The asymptotic efficiency compares the covariance matrix of the MqLE θ̂ with the one based on MLE θ̌, and so it is measured by

ARE(θ^,θˇ)=(det[Δ(θ;ξ)]/det[M(θ;ξ)])1/4,

where M*(θ; ξ) is the Fisher information matrix based on the MLE θ̌ under the given distribution. For D1, M*(θ; ξ) = F(θ; ξ) and for D2, M*(θ; ξ) is given in the appendix A.4. The given Fisher information matrices for D1 and D2 are generic versions for any functional forms of vi. We first compare the asymptotic efficiencies of the MqLE and MLE using the same design, i.e. the uniform design ξU10. Then the locally D-optimal design ξDh is used to construct the MqLE and its asymptotic efficiency is evaluated again to see how much gain was realized using the optimal design compared to efficiency gains from the MLE using ξU10.

Table 2 shows the asymptotic efficiencies of the MqLE for various ϕ and h values under D1 and D2 with θ = θ1. The asymptotic efficiency of the MqLE depends on the heteroscedasticity of the responses since it is changed by both ϕ and h. When the same design ξU10 is used for both estimators, the MLE performs better than the MqLE as we expected, but for D1, the MqLE performs close to the MLE when ϕ ≤ 0.2 and h ≤ 1, and as discussed earlier, the MqLE becomes identical to the MLE when h = 0. The asymptotic efficiency of the MqLE is increased almost double fold when its optimal design ξDh is used. It is also observed that the MqLE obtained from ξDh is more efficient than the MLE found from ξU10 for selected values of h and ϕ under D1. For example, when ϕ = 0.1, h = −2, and ξDh is used to obtain the MqLE, ARE(θ̂, θ̌) = 1.718 and this tells that the MLE needs about 72% more subjects to do as well as the MqLE. For D2, we did not observe the same outperformance of MqLE obtained from ξDh, but the optimal designs produce the MqLE that performs about as well as the MLE when ϕ < 0.2 and h < 1. The overall results suggest that when the distribution of the response is not a member of exponential family distributions, the MqLE still performs reasonably well compared with the MLE for some cases. In addition, the performance of the MqLE can be dramatically increased by using the optimal designs based on the MqLE.

Table 2.

Asymptotic efficiencies of the MqLE when ξU10 or ξDh is used under D1 and D2 with h = −2,−1, 0, 1, 2, ϕ = 0.1, 0.2, 0.5, 1.0, and θ = θ1 = (1.563, 1.790, 8.442, 0.137)′.

Distr. ϕ ξU10
ξDh

h= −2 h = −1 h = 0 h = 1 h = 2 h = −2 h = −1 h = 0 h = 1 h = 2
D1 0.1 0.939 0.975 1 0.875 0.388 1.718 1.637 1.621 1.441 0.676

0.2 0.889 0.953 1 0.781 0.259 1.625 1.599 1.621 1.286 0.451

0.5 0.772 0.891 1 0.596 0.134 1.410 1.495 1.621 0.981 0.233

1.0 0.634 0.806 1 0.432 0.075 1.177 1.352 1.621 0.711 0.131

D2 0.1 0.529 0.493 0.446 0.373 0.249 0.967 0.867 0.734 0.614 0.434

0.2 0.409 0.371 0.321 0.244 0.138 0.747 0.622 0.520 0.402 0.240

0.5 0.272 0.237 0.189 0.122 0.057 0.497 0.397 0.306 0.201 0.098

1.0 0.192 0.162 0.118 0.066 0.028 0.351 0.271 0.191 0.109 0.049

Table 3 compares robustness properties of locally D-optimal designs based on the MqLE and MLE when there is mis-specifications in the nominal values for θ. They displays D-efficiencies of ξDh with h = −2, 0, 2,5 in the variance function assuming that the true values of the nominal values for the 4PL model are θ = θ1. We recall that ξDh with h = 0 is the optimal design based on the MLE. Both types of optimal designs seem to be sensitive to nominal values and seem more so when there is large heteroscedasticity in the model, as in the case when h = 5. As expected, they perform poorly when there is a large deviation from the true values, especially for θ2 and θ3. The optimal designs seem to be more sensitive to θ2 than θ3. Since θ2 is the slope controlling parameter with a very narrow range of possible values, locally optimal designs become much less efficient for a small departure from the true value of θ2 compared to the case for θ3. For this model, it is known that ξD0 does not depend on θ1 and θ4 (Li and Majumdar, 2008). However, we observe that when h ≠ 0, ξDh depends on θ1 and θ4 and ξDh has quite high D-efficiencies for a broad of values for these parameters. The performance of the design ξDh becomes less efficient as the value of h increases. In addition, Table 4 shows the robustness properties of the same designs when there is a mis-specification in the ED50 under the assumption that θ = θ1. The same patterns are observed here like Table 3. The optimal designs work very poorly when there is a large departure from the true value of the ED50.

Table 3.

D-efficiencies of locally D-optimal designs ( ξDh) based on the MqLE for h = −2, 0, 2, 5 when their nominal values for the 4-PL Model are mis-specified to be θ2,, θ9 and the vector of true value is θ1 = (1.563, 1.790, 8.442, 0.137)′.

θ = (θ1, θ2, θ3, θ4)
eD(ξD0)
eD(ξD-2)
eD(ξD2)
eD(ξD5)
θ1 = (1.563, 1.790, 8.442, 0.137) 1.000 1.000 1.000 1.000
θ2 = (1.563, 2, 10, 0.137) 0.933 0.945 0.909 0.867
θ3 = (1.563, 3, 8, 0.137) 0.017 0.010 0.022 0.020
θ4 = (1.563, 1.5, 6, 0.137) 0.769 0.786 0.718 0.595
θ5 = (1.563, 1.5, 4, 0.137) 0.192 0.195 0.154 0.072
θ6 = (1.563, 1, 8, 0.137) 0.593 0.686 0.483 0.108
θ7 = (1.563, 0.5, 1, 0.137) 0.306 0.381 0.212 0.069
θ8 = (2, 1.790, 8.442, 1) 1.000 0.993 0.993 0.918
θ9 = (3, 1.790, 8.442, 0.137) 1.000 0.930 0.960 0.637

Table 4.

D-efficiencies of locally D-optimal designs ( ξDh) based on the MqLE for h = −2, 0, 2, 5 when there is a mis-specification in ED50 and the true value is ED50 = −4.716 under θ1 = (1.563, 1.790, 8.442, 0.137)′.

ED50
eD(ξD0)
eD(ξD-2)
eD(ξD2)
eD(ξD5)
−5.5 0.732 0.731 0.695 0.600
−5 0.955 0.955 0.949 0.932
−4 0.744 0.743 0.716 0.661
−3 0.221 0.209 0.196 0.085
1 < 0.001 < 0.001 < 0.001 < 0.001

4. Conclusions

In this work, we showed how to construct optimal designs based on the MqLE, which are derived under less stringent model assumptions than those for the MLE. Using only the functional relationship between the mean and variance of the response variable, we showed that the MqLE is asymptotically best linear unbiased estimator and when the response has a distribution belonging to the exponential family, MqLE is as efficient as MLE. Consequently, the proposed locally optimal designs based on the MqLE have asymptotic optimality properties comparable to those based on the MLE. Often, optimal designs are obtained based on the LSE which is the same as the MLE under normal distribution with homoscedasticity. We proved that the optimal design based on the MqLE is asymptotically more efficient than the optimal designs based on the LSE.

Although the MqLE has good asymptotic optimality, it is rarely used to construct optimal designs. We provided some theoretical results and show these optimal designs based the MqLE are quite generally robust to distributional assumptions in a dose-response study settings. In particular, we found two types of locally optimal designs based on the MqLE for the 4PL model: D-optimal design for estimating the model parameters and C-optimal design for estimating both the ED50 and MED at the same time. They were compared with the corresponding optimal designs based on the MLE for a homoscedastic model and with a uniform design with 10 doses. Our results show that optimal designs based on the MqLE can produce estimators that perform well under various model mis-specifications. We focused on the 4PL model but the methodology can be easily applied to find other types of optimal designs for different nonlinear models, as long as the optimality criterion is a concave or convex function of the information matrix. We also showed such designs can be constructed using current algorithms.

In conclusion, the robustness properties of a design to model assumptions should always be assessed before implementation. Ideally the implemented design has acceptable efficiencies under a variety of violations in the model assumptions and a change in the design criterion. While broad conclusions are usually difficult to establish, especially when there are multiple objectives in the study, it is important to carry out such an investigation for specific violations in the model assumptions to better understand sensitivities of the design to model perturbations before implementation. Our proposed designs here provide an alternative to optimal designs based on MLE when model assumptions are questionable.

Highlights.

  • Propose optimal designs based on the MqLE that require less stringent model assumptions.

  • Discuss asymptotic optimality of the MqLE.

  • Develop the theory and an algorithm to find various optimal designs based on the MqLE.

  • The proposed designs here provide an alternative to optimal designs based on the MLE when model assumptions are questionable.

Acknowledgments

The research of Wong reported in this paper was partially supported by the National Institute Of General Medical Sciences of the National Institutes of Health under Award Number R01GM107639.

5. Appendix

A.1. Proof of Theorem 1

Proof

Clearly θ̂n in Eq(1) belongs to the class of asymptotically linear unbiased estimators. Each component of the n-dimensional random vector Vn-1/2[yn-μ(θ0)] has a univariate distribution with mean 0 and variance ϕ, so by the Lindeberg-Feller CLT,

n(θn-θ0)=n(LnLn)-1LnVn-1/2[yn-μ(θ0)]+op(1)dN(0,ϕΛ-1).

By the delta method, n[μ(θn)-μ(θ0)]dN(0,DϕΛ-1D) and along with Eq(2), we have μ(θ̃n) = μ(θ0)+D(θ0)(θ̃nθ0)+ε with ||ε|| = Op(||θ̃nθ0||2), which implies that

(LnLn)-1LnVn-1/2[μ(θn)-μ(θ0)]=(LnLn)-1LnVn-1/2D(θn-θ0)+(LnLn)-1LnVn-1/2ε.

Since n-1i=1nliliΛ, max1≤in ||li|| = o(n1/2) and infμ v(μ) > 0, we have

(LnLn)-1LnVn-1/2ε{1nmax1inΛ-1li}{n(θn-θ0)}{θn-θ0}=op(θn-θ0).

We observe that Eq(2) and Eq(3) imply (LnLn)-1LVn-1/2[μ(θn)-μ(θ0)]=θn-θ0+o(1), so (LnLn)-1LnVn-1/2Dn=Ip+o(1), where Ip is the p×p identity matrix. To complete the proof, let Mn=(Vn-1/2Dn)[(Vn-1/2Dn)(Vn-1/2Dn)]-1(Vn-1/2Dn), which is an idempotent and symmetric matrix. It follows that {InMn} is also symmetric and idempotent, and hence non-negative definite. Further,

limnn(Var(μ)-Var(μ^))=ϕlimnn{D(LL)-1L}{In-Mn}{D(LL)-1L},

and the desired conclusion follows.

The justifications for the next two results rely on a result from Hyun et al. (2011) who showed that if M(θ; ξ) is the Fisher information matrix, then for any matrix B, the matrix with (i, j)th element equal to

2det[B[M(θ;ξ)]-1B]wiwj

is a non-negative definite matrix.

A.2. Proof of Theorem 2

Proof

Maximizing the D-optimality criterion det[Δ(θ; ξ)] is equivalent to minimizing det[[Δ(θ; ξ)]−1]. The optimal design points are non-negative roots of the equation

det[[Δ(θ;ξ)]-1]wi=0

provided that they minimize the inverse of the determinant of the quasi-information matrix, which is true if the matrix with (i, j)th element equal to

2det[[Δ(θ;ξ)]-1]wiwj

is non-negative definite.

The key step is to recognize that the quasi-information matrix Δ(θ; ξ) has the same form as the Fisher information matrix, and it is also a full rank non-negative definite matrix. By setting B = Ir, the above result from Hyun et al. (2011) implies the determinant of the quasi-information matrix is maximized.

A.3. Proof of Theorem 3

Proof

The C-optimality criterion is Tr(A′[Δ(θ; ξ)]−1A) and we want to minimize it by choice of a design ξ. The proof is similar to Theorem 2 after we first note that the weights of the C-optimal design are non-negative roots of the equation Tr(A[Δ(θ;ξ)]-1A)wi=0. Then we use the invariance property of the Tr function under cyclic permutation and the same result from Hyun et al. (2011) to show the matrix with (i, j)th element equal

2Tr(A[Δ(θ;ξ)]-1A)wiwj

is non-negative definite.

A.4. The Fisher information matrix for the MLE θ̌ under Gamma distribution

Let the shape parameter and scale parameter of the Gamma distribution be α=μi2/vi and β = μi/vi, respectively. If y has such a distribution, then E(y) = α/β and Var(y) = α/β2. For the gamma distribution, the log likelihood function is the following:

i=1nl(μi;yi)=i=1n{μi2vilog(μivi)-log(Γ(μi2vi))+(μi2vi-1)log(yi)-μi2yivi}.

Assuming we have a random sample of size n at d distinct log doses, a direct calculation shows the Fisher information matrix for θ̌ is

M(θ;ξ)=i=1dwif(xi)f(xi),

where f(xi)=[P1+P2-P3]1/2(μiθ1,μiθ2,,μiθr) with

P1=2vi2-μiviv¨i-4μiviv.i+2(μiv.i)2vi3(log(vi)+ψ0-2log(μi));P2=2μivi-μi2v.ivi2(v.ivi+ψ12μivi-μi2v.ivi2-1μi);

and

P3=1vi(1-μiv.ivi).

Here v.i=viμi;v¨i=2viμi2;ψ0=μilog(Γ(μi2vi)) and ψ1=μiψ0.

Footnotes

The contents in this paper are solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Bickel PJ. Using residuals robustly. i: Tests for heteroscedasticity, non-linearity. Annals of Statistics. 1978;6:266–291. [Google Scholar]
  2. Box GEP, Hill WJ. Correcting inhomogeneity of variance with power transformation weighting. Technometrics. 1974;16:385–389. [Google Scholar]
  3. Bretz F, Dette H, Pinheiro J. Practical considerations for optimal designs in clinical dose finding studies. Statistics in Medicine. 2010;29:731–742. doi: 10.1002/sim.3802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Burridge J, Sebastiani P. D-optimal designs for generalized linear models with variance proportional to the square of the mean. Biometrika. 1994;81:295–304. [Google Scholar]
  5. Chen RB, Wong WK, Li KY. Optimal minimax designs over a prespecified interval in a heteroscedastic polynomial model. Statistics and Probability Letters. 2008;78:1914–1921. doi: 10.1016/j.spl.2008.01.059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Chernoff H. Locally optimal designs for estimating parameters. The Annals of Mathematical Statistics. 1953;24:586–602. [Google Scholar]
  7. Dette H, Melas VB, Wong WK. Optimal designs for goodness of fit of the michaelis-menten enzyme kinetic function. Journal of American Statistical Association. 2005;100:1370–1381. [Google Scholar]
  8. Dette H, Wong WK. Bayesian optimal designs for models with partially specied heteroscedastic structure. The Annals of Statistics. 1996;24:2108–2127. [Google Scholar]
  9. Dette H, Wong WK. Optimal designs when the variance is a function of the mean. Biometrics. 1999;55:925–929. doi: 10.1111/j.0006-341x.1999.00925.x. [DOI] [PubMed] [Google Scholar]
  10. Dinse GE. An em algorithm for fitting a 4-parameter logisitc model to binary dose-response data. Journal of Agricultural, Biological, and Environmental Statistics. 2011;16:221–232. doi: 10.1007/s13253-010-0045-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Fedorov V. In: Theory of optimal experiments. Studden WJ, Klimko EM, translators and editors. New York: Academic, New York; 1972. [Google Scholar]
  12. Fedorov V, Leonov S. Optimal Design for Nonlinear Response Models. Chapman and Hall/CRC Biostatistics Series 2013 [Google Scholar]
  13. Gadakar SR, Call GB. Computational tools for fitting the hill equation to dose-response curves. Journal of Pharmacological and Toxicological Methods. 2015;71:68–76. doi: 10.1016/j.vascn.2014.08.006. [DOI] [PubMed] [Google Scholar]
  14. Hambleton RK, Swaminathan H. Item response theory: Principles and applications. Boston: Kluwer-Nijhoff; 1985. [Google Scholar]
  15. Harris D. Comparison of 1–2 and 3-parameter irt models. Educational Measurement: Issues and Practice. 1989;8:35–41. [Google Scholar]
  16. Hyun SW, Wong WK. Multiple objective optimal designs to study the interesting features in a dose-response relationship. International Journal of Biostatistics. 2015;11:253–271. doi: 10.1515/ijb-2015-0044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Hyun SW, Wong WK, Yang Y. Vnm: An r package for finding multiple-objective optimal designs for the 4-parameter logistic model. r package version 2.0. Journal of Statistical Software. 2015 url http://cran.r-project.org/package=vnm. Under review.
  18. Hyun SW, Yang M, Flournoy N. Optimal designs for response functions with a downturn. Journal of Statistical Planning and Inference. 2011;141:559–575. [Google Scholar]
  19. Hyun SW, Yang M, Flournoy N. A procedure for finding an improved upper bound on the number of optimal design points. Computational Statistics and Data Analysis. 2013;58:276–282. [Google Scholar]
  20. Jobson JD, Fuller WA. Least squares estimation when the covariance matrix and parameter vector are functionally related. Journal of the American Statistical Society. 1980;75:176–181. [Google Scholar]
  21. Khinkis LA, Levasseur L, Faessel H, Greco WR. Optimal design for estimating parameters of the 4-parameter hill model. Nonlinearity in Biology, Toxicology, and Medicine. 2003;1:363–377. doi: 10.1080/15401420390249925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Kiefer J. Jack Carl Kiefer collected papers III, Design of Experiments. Springer-Verlay; New York: 1985. [Google Scholar]
  23. Läuter E. Optimal multipurpose designs for regression models. Mathematische Operationsforschung und Statistik. 1974;7:51–68. [Google Scholar]
  24. Lee CMS. Constrained optimal designs for regression models. Communication and Statistics-Theory and Methods. 1987;16:765–783. [Google Scholar]
  25. Lee CMS. Constrained optimal designs. Journal of Statistical Planning and Inference. 1988;18:377–389. [Google Scholar]
  26. Lehmann EL. Elements of large-sample theory. Springer-Verlay; New York: 1999. [Google Scholar]
  27. Li G, Majumdar D. D-optimal designs for logistic models with 3 and four parameters. Journal of Statistical Planning and Inference. 2008;138:1950–1959. [Google Scholar]
  28. McCullagh P. Quasi-likelihood functions. The Annals of Statistics. 1983;11:59–67. [Google Scholar]
  29. McCullagh P, Nelder JA. Generalized linear models. 2. Chapman and Hall; London: 1989. [Google Scholar]
  30. Niaparast M. On optimal design for a poisson regression model with random intercept. Statistics and Probability Letters. 2009;79:741–747. [Google Scholar]
  31. Niaparast M, Schwabe R. Optimal design for quasi-likelihood estimation in poisson regression with random coefficients. Journal of Statistical Planning and Inference. 2013;143:296–306. [Google Scholar]
  32. Pazman A. Mathematics and Its Applications. D. Reidel Publishing Company, Kluwer Academic Publishers; Hingham MA, USA: 1986. Foundations of optimum experimental design. [Google Scholar]
  33. Pukelsheim F. Optimal design of experiment. Society for Industrial and Applied Mathematics (SIAM); Philadelphia, PA: 2006. [Google Scholar]
  34. Reeve R, Turner JR. Pharmacodynamic models: parameterizing the hill equation, michaelis-menten, the logistic curve, and relationships among these models. Journal of Biopharmaceutical Statistics. 2013;23:648–661. doi: 10.1080/10543406.2012.756496. [DOI] [PubMed] [Google Scholar]
  35. Song D, Wong WK. On the construction of Grm-optimal designs. Statistica Sinica. 1998a;9:263–272. [Google Scholar]
  36. Song D, Wong WK. Optimal two-point designs for the michaelis-menten model with heteroscedastic errors. Communications in Statistics- Theory and Methods. 1998b;27:1503–1516. [Google Scholar]
  37. Stigler S. Optimal experimental design for polynomial regression. Journal of the American Statistical Association. 1971;66:311–318. [Google Scholar]
  38. Studden WJ. Some robust-type d-optimal designs in polynomial regression. Journal of the American Statistical Association. 1982;77:916–921. [Google Scholar]
  39. Weddernburn RWM. Quasi-likelihood functions, generalized linear models, and the gauss-newton method. Biometrika. 1974;61:439–447. [Google Scholar]
  40. Wong WK, Lachenbruch PA. Designing studies for dose response. Statistics in Medicine. 1996;15:343–360. doi: 10.1002/(SICI)1097-0258(19960229)15:4<343::AID-SIM163>3.0.CO;2-F. [DOI] [PubMed] [Google Scholar]
  41. Yang M. On the de la garza phenomenon. Annals of Statistics. 2010;38:2499–2524. [Google Scholar]
  42. Yang M, Biedermann S, Tang E. On optimal designs for nonlinear models: a general and efficient algorithm. Journal of the American Statistical Association. 2013;108:1411–1420. [Google Scholar]

RESOURCES