Bioequivalence tests based on individual estimates using non-compartmental or model-based analyses: evaluation of estimates of sample means and type I error for different designs

Anne Dubois; Sandro Gsteiger; Etienne Pigeolet; France Mentré

doi:10.1007/s11095-009-9980-5

. Author manuscript; available in PMC: 2010 Oct 30.

Published in final edited form as: Pharm Res. 2009 Oct 30;27(1):92–104. doi: 10.1007/s11095-009-9980-5

Bioequivalence tests based on individual estimates using non-compartmental or model-based analyses: evaluation of estimates of sample means and type I error for different designs

Anne Dubois ^1,^*, Sandro Gsteiger ², Etienne Pigeolet ², France Mentré ¹

PMCID: PMC2881952 PMID: 19876723

Abstract

The main objective of this work is to compare the standard bioequivalence tests based on individual estimates of the area under the curve and the maximal concentration obtained by non compartmental analysis (NCA) to those based on individual empirical Bayes estimates (EBE) obtained by nonlinear mixed effects models. We evaluate by simulation the precision of sample means estimates and the type I error of bioequivalence tests for both approaches. Crossover trials are simulated under H₀ using different numbers of subjects (N) and of samples per subject (n). We simulate concentration-time profiles with different variability settings for the between-subject and within-subject variabilities and for the variance of the residual error. Bioequivalence tests based on NCA show satisfactory properties with low and high variabilities, except when the residual error is high which leads to a very poor type I error or when n is small which leads to biased estimates. Tests based on EBE lead to an increase of the type I error when the shrinkage is above 20% which occurs notably when NCA fails. In those cases, tests based on individual estimates cannot be used.

Keywords: pharmacokinetics, bioequivalence tests, non compartmental analysis, nonlinear mixed effects model, SAEM algorithm

1. Introduction

Pharmacokinetic (PK) bioequivalence studies are performed to compare different drug formulations. The most commonly used design for bioequivalence trials is the two-period two-sequence crossover design. This design is recommended by the Food and Drug Administration (FDA) (1) and the European Medicines Evaluation Agency (EMEA) (2). FDA and EMEA recommend to test bioequivalence from the log ratio of the geometric means of two parameters: the area under the curve (AUC) and the maximal concentration (C_max). These endpoints are usually estimated by non compartmental analysis (NCA) using the trapezoidal rule to evaluate AUC (3). NCA requires few hypotheses but a large number of samples per subject (usually between 10 and 20).

PK data can also be analyzed using nonlinear mixed effects models (NLMEM). This method is more complex than NCA but has several advantages: it takes benefit of the knowledge accumulated on the drug and can characterize the PK with few samples per subject. This allows to perform analyses in patients, the target population, and in whom pharmacokinetics can be different from healthy subjects. Non compartmental AUC is computed by trapezoidal rule which ignores assay error. NCA does not take into account non linear pharmacokinetics, which can bias the bioavailability estimation (4) and may amplify small bioavailability differences between drug products (5). The European guideline on similar biological medicinal products, which frequently exhibit non linear pharmacokinetics, recommends to estimate in the comparative PK studies, the elimination characteristics such as clearance (6). It is known that in these conditions, clearance is not accurately estimated by NCA. Models can also lead to better understanding of the biological system than a fully empirical approach and therefore help interpret ambiguous results.

However, the use of NLMEM is still rare in early phases of drug development or to analyze crossover studies. There are only seven published studies which use NLMEM to analyze bioequivalence trials (7, 8, 9, 10, 11, 12, 13) and except in Zhou et al (12), all analyze a dataset with many samples per subject. Five papers (7, 8, 9, 10, 13) compare tests based on individual NCA estimates to tests based on NLMEM and all conclude that the results are similar. Yet, they use different statistical approaches to test bioequivalence with NLMEM. Furthermore, none perform bioequivalence tests on individual estimates of AUC and C_max obtained from NLMEM. Pentikis et al (8) propose the estimation of AUC and C_max by standard nonlinear regression as an alternative to the NCA and Zhou et al (12) perform bioequivalence tests on the individual empirical Bayes estimates (EBE) of the volume of distribution and the steady-state through concentration. Otherwise, bioequivalence tests are performed on treatment effect parameters (7, 8, 9, 10, 11, 13). All authors agree that simulation studies are needed to evaluate bioequivalence tests based on NLMEM and to compare them to tests based on individual NCA estimates.

In this work, we compare the standard analysis of bioequivalence crossover trials based on NCA to the same usual analysis based on individual EBE obtained by NLMEM. We study the influence of the design for each approach. There is already one published simulation study of Panhard and Mentré which evaluates bioequivalence tests based on EBE estimated through NLMEM (14). Our present study relies on the work of Panhard and Mentré as starting point and adds several new features.

The major distinctness concerns the studied tests on the individual estimates (EBE or NCA). Panhard and Mentré perform the Student paired test and the Wilcoxon paired signed rank test whereas we use a linear mixed effects model (LMEM). As specified in the regulatory guidelines (1, 2), the bioequivalence analysis should take into account sources of variation that can be reasonably assumed to have an effect on the endpoints AUC and C_max. Therefore, LMEM including treatment, period, sequence and subject effects are usually used to analyze the log-transformed data (15).

Panhard and Mentré limit their comparison to bioequivalence tests on AUC and do not evaluate tests based on C_max. In the present study, both endpoints are analyzed; indeed we expect some issues for bioequivalence test performed on C_max as the estimation of C_max by NCA is sensitive to the design and the computation of C_max from EBE is more complex than for AUC. To simulate PK profiles and then to estimate individual parameters by NLMEM, Panhard and Mentré use a pharmacokinetic model parametrized using AUC as one of the PK parameters whereas we choose a more common parameterization, replacing AUC by the clearance of the drug.

For the estimation of NLMEM parameters, Panhard and Mentré use an algorithm based on a first order linearization with respect to the random effects, the first order conditional estimates (FOCE) algorithm (16) implemented in the R function nlme (17). The FOCE algorithm is the more widely used algorithm and corresponds to the industry standard for model-based PK analyses as it is implemented in NONMEM. Yet, this algorithm presents some convergence issues which could be avoided with the use of a stochastic algorithm using the exact maximum likelihood, such as the stochastic approximation expectation maximisation (SAEM) algorithm (18, 19, 20). The SAEM algorithm is implemented in the free software MONOLIX (21) (first version February 2005) and is applied to several population PK analyses (22, 23, 24).

The main objective of this work is to compare standard bioequivalence tests based on individual estimates of AUC and C_max obtained by NCA or by NLMEM. The comparison is based on the precision of the sample means of log(AUC) and log(C_max) and on the type I error of bioequivalence tests for both estimation methods. In section 2 of the article, we describe the model, the simulation study, both estimation methods (NCA and NLMEM), the evaluation of precsion of sample means, how bioequivalence tests are performed and how shrinkage on the tested parameters is estimated. The main results of the simulation are exposed in section 3. Finally, the study results and perspectives are discussed.

2. Methods

2.1. Simulation study

2.1.1. Simulation model

We analyze two-period two-sequence crossover PK trials where subjects are randomly allocated to one of two treatment sequences. In the first sequence (Ref – Test), subjects receive the reference treatment (Ref) and the test treatment (Test) in period one and two, respectively. In the second sequence (Test – Ref), subjects receive treatments in the reverse order (Test then Ref). Designs are balanced, i.e. there is the same number of subjects N/2 for each sequence.

In the following, we denote y_ijk the concentration for individual i (i = 1, ···, N) at sampling time j (j = 1, ···, n_ik) for period k (k = 1, 2). We also denote f the nonlinear pharmacokinetic function which links concentrations to sampling times. The nonlinear mixed effects model can be written as follows:

y_{ijk} = f (t_{ijk}, θ_{i k}) + ε_{ijk}

(1)

where θ_ik = (θ_ikl; l = 1, ···,p)′ is the p-vector of the PK parameters of subject i for period k. ε_ijk is the residual error assumed to be normally distributed with zero mean and variance $σ_{ijk}^{2}$ , with:

σ_{ijk}^{2} = {(a + b f (t_{ijk}, θ_{i k}))}^{2}

(2)

This is a combined error model with two parameters: a for the additive and b for the proportional part. We assume a multivariate log-normal distribution for the individual parameters θ_ik. In absence of covariates, the l^th individual parameter can be decomposed as:

θ_{ikl} = μ_{l} e^{η_{i l} + κ_{ikl}}

(3)

with μ = (μ_l; l = 1, ···,p)′ the p-vector of fixed effects, η_i = (η_il; l = 1, ···,p)′ the vector of random effects of subject i and κ_ik = (κ_ikl; l = 1, ···,p)′ the vector of random effects of subject i at period k. η_i represents the variability between individuals and it is named between-subject variability (BSV). κ_ik represents the variability between two periods of treatment for the same individual and it is called within-subject variability (WSV). η_i and κ_ik are assumed to be normally distributed with zero mean and with covariance matrices of size p × p denoted Ω and Γ, respectively. In this study we assume that Ω and Γ are diagonal. η_i, κ_ik and ε_ijk are assumed to be independent.

We introduce three categorical covariates into the statistical model: the treatment T_ik, the period P_k and the sequence S_i. The reference classes for each covariate are defined as follows: T_ik is fixed to zero for the treatment Ref and is equal to 1 for the treatment Test; P_k is fixed to zero for the first period and is equal to 1 for the second one; S_i is fixed to zero for the first sequence Ref — Test and is equal to 1 for the second one Test — Ref. β_T = (β_T,l; l = 1, ···,p)′, β_P = (β_P,l = 1, ···,p)′ and β_s = (β_s,l; l = 1, ···,p)′ correspond to vectors of the treatment, period and sequence effect. With these three covariates, μ_l of Eq. (3) is replaced by μ_ikl defined as:

μ_{ikl} = λ_{l} e^{β_{T, l} T_{i k} + β_{P, l} P_{k} + β_{S, l} S_{i}}

(4)

with λ = (λ_l; l = 1, ···,p)′ the p-vector of the fixed effects for the reference classes.

2.1.2. Theophylline pharmacokinetics

We use the concentration data of the anti-asthmatic drug theophylline to define the population PK model for the simulation study. These data are classical ones in population pharmacokinetics (17) and are used in previous simulation studies done by Panhard et al. (14, 25). The theophylline data include twelve subjects receiving a single oral dose of theophylline depending on their body weight (from 3 to 6 mg). For each patient, ten blood samples were taken at 0.25, 0.5, 1, 2, 3.5, 5, 7, 9, 12 and 24 h after administration and serum concentrations were measured. A one compartment model with first order absorption and first order elimination adequally describes the data and can be written as follows:

f (t, θ) = \frac{{FDk}_{a}}{C L - {V k}_{a}} (e^{{- k}_{a} t} - e^{- C L / V t})

(5)

where D is the dose, F the bioavailability, k_a the absorption rate constant, CL the clearance of the drug and V the volume of distribution. As only data after oral administration are obtained, the bioavailability cannot be estimated and, consequently, the vector θ of PK parameters is equal to (k_a, CL/F, V/F).

2.1.3. Simulation features

In this simulation study, we use rather similar settings as those of the simulation studies performed by Panhard et al. (14, 25). However we simulate two-period, two-sequence crossover pharmacokinetic trials whereas they simulate two-period, one-sequence crossover trials. For each trial, N/2 subjects are allocated to the sequence Ref – Test and N/2 subjects are allocated to the sequence Test – Ref. We fix the dose for all subjects to 4 mg which corresponds to the rounded median dose of the theophylline study. The vector of population parameters λ is composed of (λ_{k_a} = 1.48 h⁻¹, λ_CL/F = 40.36 mL/h, λ_V/F = 0.48 L) for the reference treatment. In order to mimic a change in bioavailability, we add a treatment effect β_T = (0,β_T,CL/F,β_T,V/F)′ on log(λ), i.e. we multiply λ_CL/F by e^β_T,CL/F and λ_V/F by e^β_T,V/F for the test treatment. The modification of bioavailability also affects AUC and C_max. Indeed, AUC = FD/CL and C_max is defined as:

\begin{array}{l} C_{\max} = f (t_{\max}, θ) = \frac{F D}{V} e^{- C L / V t_{\max}} \\ with t_{\max} = \frac{log (k_{a}) - log (C L / V)}{k_{a} - C L / V} \end{array}

(6)

We do not simulate a period effect or a sequence effect. We simulate with two levels of variability for the between-subject and within-subject variability. In the following, BSV and WSV are given as standard deviations of the log-transformed parameters multiply by 100 to be expressed in percent. The standard deviation on the log scale corresponds approximately to the coefficient of variation on the ordinary scale. For the low level, we fix BSV to 20% for k_a and CL/F and to 10% for V/F; WSV is fixed to half BSV for the three parameters. For the high level, we fix BSV to 50% and WSV to 15% for the three parameters. We also simulate with two levels of variability for the residual error: a = 0.1 mg/L, b = 10% for the low level, and a = 1 mg/L, b = 25% for the high level. The high level of residual error is only used with the high level of BSV and WSV. We call S_l,l, the variability setting with low variability for BSV and WSV and for the residual error, S_h,l, the variability setting with high variability for BSV and WSV and low for the residual error, and S_h,h, the variability setting with high variability for BSV and WSV and for the residual error. The three variability settings are summarized in Table I.

Table I.

Summary of the three variability settings used in the simulation study. The between-subject (BSV) and within-subject (WSV) variability are given as standard deviations of the log-parameters multiply by 100 and expressed in percent.

Variability	S_l_,_l	S_h_,_l	S_h_,_h
BSV	20% for k_a and CL/F 10% for V/F	50%	50%
WSV	10% for k_a and CL/F 5% for V/F	15%	15%
Residual error	a = 0.1 mg/L b = 10%	a = 0.1 mg/L b = 10%	a = 1 mg/L b = 25%

Open in a new tab

2.1.4. Simulation process

For each subject i = 1, ···, N of each simulated trial m = 1, ···, M, we simulate a vector of random effects η_i in Inline graphic (0, Ω) and two vectors of random effects κ_ik in (0, Γ), one for each period k = 1,2. To get the logarithm of each individual parameters log(θ_ikl), we add the logarithm of the mean parameter log(λ_l), the treatment effect β_T,l if needed (depending on the treatment group and the PK parameter considered), and both random effects η_il and κ_ikl. The concentrations f(t_ijk, θ_ik) predicted by the PK model at time t_ijk (j = 1, ···,n_ik) are then computed using the individual parameters. In these simulations, the sampling times for all subjects and both periods are similar. So j = 1, ···, n, where n is a fixed number of sampling times for each simulated design. Finally, we add a residual error, generated from a normal distribution Inline graphic (0, (a + b f(t_ijk, θ_ik))²), to each predicted concentration to obtain the simulated concentrations y_ijk. We do not incorporate in the simulation a limit of quantification (LOQ) because NCA cannot handle such data, contrary to the SAEM algorithm, and we do not want to favour the later. In the rare cases where the simulated concentration is below zero, we fix it to the value 0.1 mg/L.

We expect more of these fixed concentrations when variability increases but their proportion could also differ from a design to another if the sampling times differ. Consequently, for each simulated design and each variability setting, we compute the proportion of the concentrations fixed to 0.1 mg/L and study the corresponding sampling times.

2.1.5. Simulation designs

We simulate trials with four different designs, which are also used by Panhard et al (14, 25). We simulate with the original design with N = 12 subjects and n = 10 samples per subject and per period, taken at the times of the initial study (0.25, 0.5, 1, 2, 3.5, 5, 7, 9, 12 and 24 h after dosing). We also simulate with an intermediate design with N = 24 subjects and n = 5 samples, taken at 0.25, 1.5, 3.35, 12 and 24 h after dosing, a sparse design with N = 40 subjects and n = 3 samples, taken at 0.25, 3.35 and 24 h after dosing and a rich design N = 40 subjects and n = 10 samples, taken at the times of the initial study. For each design, we simulate using the variability settings S_l,l and S_h,l. We simulate using S_h,h only for the intermediate design. For each design and each variability setting, we simulate 1000 trials under two different hypotheses: H_0;80% where β_T = (0,log(0.8),log(0.8))′ and H_0;125% where β_T = (0,log(1.25),log(1.25))′. For each simulated trial, each simulated design and each variability setting, the simulated concentrations for the reference treatment are equal in both simulated hypotheses. In the following, we call simulation setting the association of one design with one variability setting and one hypothesis (H_0;80% or H_0;125%). Considering this, there are 18 different simulation settings (8 for S_l_,l and S_h,l, and 2 for S_h,h). All simulations are performed using the statistical software R 2.7.1. Figure 1 displays the individual data of one trial simulated under H_0;80% and H_0;125% with the intermediate design and three variability settings (S_l_,l, S_h,l and S_h,h).

Concentrations (*mg/L*) simulated for the intermediate design (N = 24, n = 5) for the reference treatment (left) and for the test treatment under H_0;80% (middle) and H_0;125% (right) using the variability settings *S_l*_,_l (top), *S_h*_,_l (middle) and *S_h*_,_h (bottom).

2.2. Estimation of individual parameters

2.2.1. Notations

We perform bioequivalence tests on AUC and C_max. To estimate the individual parameters using NCA or NLMEM, we do not consider periods or sequences. Only the treatment group (Ref or Test) is taken into account. For each simulated trial m = 1, ···, 1000 of one simulation setting, there are 2N individual AUC and 2N individual C_max, one for each subject i = 1, ···, N and each treatment group.

In the following, for one simulated trial, we call ${AUC}_{i}^{(Ref)}$ the true value of the individual AUC of subject i for the reference treatment and ${AUC}_{i}^{(Test)}$ the true value of the individual AUC of subject i for the test treatment; we also define ${\hat{AUC}}_{i}^{(Ref)}$ the estimated value of individual AUC of subject i for the treatment Ref obtained from NCA or NLMEM and ${\hat{AUC}}_{i}^{(Test)}$ the corresponding AUC for the treatment Test.

Same notations are applied to C_max. $C_{\max i}^{(Ref)}$ and $C_{\max i}^{(Test)}$ are the true value of the individual C_max of subject i f or the treatment Ref and Test, respectively. ${\hat{C_{\max}}}_{i}^{(Ref)}$ and ${\hat{C_{\max}}}_{i}^{(Test)}$ are the corresponding estimated value of individual C_max obtained from NCA or NLMEM.

In some cases, we may refer to these different individual parameters without specifying the treatment group. For each simulated trial, ${AUC}_{i}^{(Ref)}, {AUC}_{i}^{(Test)}, C_{\max i}^{(Ref)}$ and $C_{\max i}^{(Test)}$ are computed from the corresponding individual parameters k_a, CL/F and V/F simulated as described in section 2.1.4.

2.2.2. Estimation based on non compartmental analysis

First, we estimate AUC and C_max by non compartmental analysis (3) using a R function named mnca which we develop. For each simulated trial, this function provides the estimation of different NCA parameters for each subject and each treatment group. Different options have to be specified in mnca. In this study, we use the linear trapezoidal rule to compute the AUC_0–_last between the time of dose (equal to 0) and the last sampling time. To obtain the total AUC (between the time of dose and infinity), we compute the terminal slope equal to CL/V using the logarithm of the last concentrations to perform a linear regression. To do so, we use a fixed number of concentrations which depends on the number of samples per subject in the design.

To avoid biased estimation of the terminal slope, the first point used for its computation should be on the descending side of the concentration curve and not too close to C_max. Using the mean value of PK parameters, t_max, the sampling time corresponding to C_max, is about 2.06 h for both treatment groups (contrary to C_max, t_max is not affected by the change of bioavailability). Consequently, for the original and rich designs where n = 10, we use the last four concentrations which correspond to sampling times 7, 9, 12 and 24 h. NCA is normally performed on PK profiles containing ten sampling times per subject or more. For intermediate and sparse designs where n = 5 and n = 3 respectively, the total AUC is estimated by NCA for completeness. For these two designs, we use the last two concentrations which correspond to sampling times 12 and 24 h for the intermediate design, and to 3.35 and 24 h for the sparse design.

Figure 2 displays the individual concentration curves of one simulated trial for the original, intermediate and sparse designs and the two variability settings S_l,l and S_h,l. The bottom left graphic of the Figure 1 presents a similar graphic for the intermediate design and S_h,h, completing our illustration. For rich and intermediate designs, the number of concentrations used to compute the terminal slope seems reasonable. Same observation can be done for the rich design because the sampling times are similar to those of the original design, only the number of subjects differs. For sparse design, the number of concentration used to compute the terminal slope is chosen by default, first point being close to C_max.

Concentrations (*mg/L*) simulated for the original (N =12, n = 10, left), intermediate (N =24, n = 5, middle) and sparse (N = 40, n = 3, right) designs for the reference treatment using the variability settings *S_l*_,_l (top) and *S_h*_,_l (bottom).

Other assumptions are made to compute the terminal slope, to handle particular PK profiles, especially for the intermediate and sparse designs where only two points are used for the estimation. If the last two concentrations increase instead of decreasing or if they are similar up to the sixth digit, we consider the terminal slope be missing, i.e. there is no estimation of the total AUC for the subject and treatment concerned. The proportion of missing ${\hat{AUC}}_{i}$ should increase with variability and could differ from a design to another due to different sampling times. Consequently, for each design and each variability setting, we compute the proportion of missing ${\hat{AUC}}_{i}$ .

For all designs, C_max is estimated as the maximal concentration observed. Contrary to AUC, there is no missing C_max.

2.2.3. Estimation based on nonlinear mixed effects model

We also estimate AUC and C_max from the individual empirical Bayes estimates of the PK parameters after population analyses. In this study we use the SAEM algorithm implemented in MONOLIX 2.4 to estimate the NLMEM parameters (population and individual parameters). For each simulated trial, we analyze separately the concentrations of each treatment group using NLMEM without taking into account periods and sequences. As each subject receives both treatments, data of each treatment group contain observations from all subjects. In the following, we describe the statistical model used to fit the data of the reference treatment. We consider $y_{i j}^{(Ref)}$ the concentration for individual i (i = 1,···, N) at time t_ij (j = 1,···,n) and or the treatment Ref. Depending on the sequence of the subject i, $y_{i j}^{(Ref)}$ corresponds to concentration of the first or second period. The statistical model used has no covariate because no period or sequence effect are incorporated. Furthermore, since periods are not considered, WSV cannot be separated from BSV. Consequently, the l^th individual parameter is defined as:

θ_{i l}^{(Ref)} = μ_{l}^{(Ref)} e^{η_{i j}^{(Ref)}}

(7)

Ω(^Ref) is the covariance matrix of the vector of random effects $η_{i}^{(Ref)}$ . A similar statistical model is applied to fit the data of the treatment Test.

Of note, given the BSV and WSV, the overall variability is equal for both treatment groups, i.e. Ω⁽^Ref⁾ =Ω⁽^Test⁾. However, for each simulated trial, their estimates, Ω̂⁽^Ref⁾ and Ω̂⁽^Test⁾, are different. The overall simulated variability is 22.4% for k_a and CL/F and 11.2% for V/F under S_l_,_l, and 52.2% for the three PK parameters under S_h,l and S_h,h.

After having estimated the population parameters for the data of one treatment group of one simulated trial, we estimate the conditional modes of the corresponding individual parameters which are defined as the individual empirical Bayes estimates. These EBE provide the individual estimates of PK parameters (k_a, CL/F and V/F). We then derive individual ${\hat{AUC}}_{i}^{(Ref)}$ and ${\hat{C_{\max}}}_{i}^{(Ref)}$ or ${\hat{AUC}}_{i}^{(Test)}$ and ${\hat{C_{\max}}}_{i}^{(Test)}$ depending on the treatment group considered. Contrary to NCA, there is no missing ${\hat{AUC}}_{i}$ obtained by NLMEM using the SAEM algorithm.

2.2.4. Evaluation of estimates of sample means

In this study we compute individual ${\hat{AUC}}_{i}$ and $\hat{C_{\max i}}$ for 1000 replicates of different designs, different variabilities and different treatment groups using two types of estimation. To analyze and compare the accuracy and precision of the estimates of the sample means of log(AUC) and log(C_max) using NCA or EBE, we compute estimation error for each treatment group (Ref or Test) of each simulated trial. To take into account sampling variability, for each dataset we compute the estimation error as the difference between the sample mean of the estimates (NCA or EBE) and the sample mean of the true simulated values. In the following, definitions are given for ${\hat{AUC}}_{i}^{(Ref)}$ . Same definitions apply to ${\hat{AUC}}_{i}^{(Ref)}, {\hat{C_{\max}}}_{i}^{(Ref)}$ and ${\hat{C_{\max}}}_{i}^{(Test)}$ . For each simulated trial, the estimation error for the sample mean of log(AUC) for the reference treatment is computed as:

{e e}_{AUC}^{(Ref)} = \frac{1}{N^{*}} \sum_{i = 1}^{N^{*}} log ({\hat{AUC}}_{i}^{(Ref)}) - \frac{1}{N} \sum_{i = 1}^{N} log ({AUC}_{i}^{(Ref)})

(8)

with ${\hat{AUC}}_{i}^{(Ref)}$ the AUC estimated by NCA or derived from EBE for subjects i = 1, ···, N^*, and AUC_i⁽^Ref⁾ the true simulatd parameter for subjects i = 1, ···, N. For the estimation of individual parameters by NCA, there may be missing ${\hat{AUC}}_{i}$ , so that N^* ≤ N.

For one simulation setting, we call ${e e}_{AUC, m}^{(Ref)}$ the estimation error for the sample mean of log(AUC) computed for the reference treatment and the m^th simulated trial (m = 1, · · ·, 1000). We then define the bias and root mean square error (RMSE) computed from ${e e}_{AUC, m}^{(Ref)}$ over the 1000 replicates as:

\begin{array}{l} {bias}_{AUC}^{(Ref)} = \frac{1}{1000} \sum_{m = 1}^{1000} {e e}_{AUC, m}^{(Ref)} \\ {rmse}_{AUC}^{(Ref)} = \sqrt{\frac{1}{1000} \sum_{m = 1}^{1000} {({e e}_{AUC, m}^{(Ref)})}^{2}} \end{array}

(9)

As well as computing bias and RMSE, we compute the 95% confidence interval of ${bias}_{AUC}^{(Ref)}$ using the standard error of the mean and the 97.5% quantile of the Gaussian distribution. If zero does not belong to the 95% confidence interval of ${bias}_{AUC}^{(Ref)}$ , we can conclude that bias is significantly different from zero with a type I error of 5%.

2.3. Bioequivalence test

2.3.1. Implementation of the two one-sided tests

We perform the standard bioequivalence analysis recommended by FDA and EMEA (1, 2). The individual parameters are log-transformed and analyzed using a linear mixed effects model written as follows:

log (θ_{ikl}) = ν_{l} + β_{T, l} T_{i k} + β_{P, l} P_{k} + β_{S, l} S_{i} + ξ_{i l} + ε_{ikl}

(10)

where θ_ikl represents the l^th individual parameter (AUC if l = 1 or C_max if l = 2) for subject i (i = 1, · · ·, N) at period k (k = 1, 2). ν_l is the mean value for the studied log-transformed metric. The three covariates T_ik, P_k and S_i, for treatment, period and sequence are defined as before. It is assumed that the random subject effect ξ_il (l = 1,2) and the residual error ε_ikl (l = 1,2) are independently normally distributed with zero mean.

For each simulation setting, the individual estimates ${\hat{AUC}}_{i}$ and $\hat{C_{\max i}}$ obtained from NCA and NLMEM are analyzed by the LMEM described above. To check the properties of the TOST, we also analyze the true simulated value AUC_i and C_maxi. As specified before, for AUC estimated by NCA, they may be missing ${\hat{AUC}}_{i}$ . In that case, the LMEM is performed on less than 2N ${\hat{AUC}}_{i}$ .

After fitting the LMEM to individual metrics, a bioequivalence test is performed on the estimate of treatment effect β̂_T,l. The null hypothesis of the bioequivalence test recommended by the guidelines (1, 2) and performed on the l^th individual parameter is H₀: {β_T,l ≤ log(0.8) or β_T,l ≥ log(1.25)}. H₀ is rejected if the 90% confidence interval (90% CI) of β̂_T,l lies within [log(0.8); log(1.25)]. These limits of the bioequivalence test correspond to a ratio of the geometric mean falling within 80%–125%. This approach based on the 90% CI is equivalent to Schuirmann’s two one-sided tests (TOST) procedure (26). H₀ is composed of two unilateral hypotheses {β_T,l ≤ log(0.8)} and {β_T,l ≥ log(1.25)}. Both are tested separately by a one-sided test with a type I error of 5%. The p-value of the TOST is the maximum of both p-values of the one-sided tests and for each test the limit is the 95% quantile of the Student distribution with df degrees of freedom.

For balanced datasets, the N/2 subjects of each sequence are considered as two independent samples from normal populations with equal variances, and df = N − 2 (15, 27). For unbalanced datasets, i.e. when there is one or more missing ${\hat{AUC}}_{i}$ in a dataset for NCA, the determination of the degrees of freedom is more complex. Different approximations are available as for example the containment method (28), the Kenward-Roger adjustment (29) or the Satterthwaite’s procedure approximation (28, 29). In this study, we use the R function lme from the package nlme to perform the LMEM in which the degrees of freedom are estimated using the containment method (17). There, the degrees of freedom are calculated as: df = n_obs − N − 2 where n_obs is the total number of individual parameters. When there is no missing value, this approach coincides with the degrees of freedom computed in balanced datasets (because then n_obs = 2N).

2.3.2. Evaluation of the type I error

Bioequivalence tests are evaluated for ${\hat{AUC}}_{i}$ and $\hat{C_{\max i}}$ estimated by NCA or NLMEM on trials simulated under the composite null hypothesis H₀. Bioequivalence tests are also performed on the true simulated values AUC_i and C_maxi. The type I error of the TOST procedure is defined as the supremum of the type I errors over the null space (30). It corresponds to the supremum of the type I error of the two one-sided tests. As suggested by Liu and Weng (31), the type I error of the bioequivalence test can be evaluated for each boundary of H₀ space, i.e. log(0.8) and log(1.25). Consequently, we simulate for each design of each variability setting 1000 trials under each unilateral hypothesis H_0;80% and H_0;125% as specified before.

For each unilateral hypothesis H_0;80% and H_0;125%, the type I error is estimated by the proportion of the simulated trials for which the null hypothesis H₀ is rejected. If the bioequivalence tests were performed on the true parameters (AUC_i and C_maxi), the results of both type I errors should be identical because H_0;80% and H_0;125% are symetric but we are working with estimates. As proposed by Panhard and Mentré (14), we define the global type I error as the maximum value of both type I errors estimated. Due to the 1000 replicates, the 95% prediction interval (95% PI) for a type I error of 5% is [3.7%; 6.4%].

2.3.3. Shrinkage and tests based on empirical Bayes estimates

It is known in NLMEM that, with sparse individual information, the individual estimates of random effects shrink towards their mean value which is zero (32). For the reference treatment group of each simulated trial, the shrinkage on the l^th individual EBE (k_a, CL/F or V/F) can be defined as:

{S h}_{l}^{(Ref)} = 1 - \frac{var ({\hat{η}}_{i l}^{(Ref)})}{{\hat{ω}}_{l}^{(Ref) 2}}

(11)

where $var ({\hat{η}}_{i l}^{(Ref)})$ is the empirical variance of the l^th individual estimated random effects and ${\hat{ω}}_{l}^{(Ref) 2}$ is the estimated variance of the corresponding random effects.

AUC and C_max are secondary parameters of the NLMEM because they are defined as functions of the PK parameters, k_a, CL/F and V/F. As the shrinkage on individual EBE, the shrinkage on log(AUC) and log(C_max) can also be computed. Consequently, we can study the link between the type I error of bioequivalence tests based on EBE and the amount of shrinkage.

For log(AUC), Eq.(11) can be expressed as:

{S h}_{AUC}^{(Ref)} = 1 - \frac{var (log ({\hat{AUC}}_{i}^{(Ref)}))}{{\hat{ω}}_{{AUC}^{(Ref)}}^{2}}

(12)

where $var (log ({\hat{AUC}}_{i}^{(Ref)}))$ is the empirical variance of the individual estimates $log ({\hat{AUC}}_{i}^{(Ref)})$ and ${\hat{ω}}_{{AUC}^{(Ref)}}^{2}$ is its estimated variance in the model. As log(AUC) = log(D) − log(CL/F), $ω_{{AUC}^{(Ref)}}^{2} = ω_{C L / F^{(Ref)}}^{2}$ and ${\hat{ω}}_{{AUC}^{(Ref)}}^{2}$ is the estimated value ${\hat{ω}}_{C L / F^{(Ref)}}^{2}$ .

For one simulation setting, we call ${S h}_{AUC, m}^{(Ref)}$ the shrinkage on log(AUC) computed for the reference treatment for the m^th simulated trial (m = 1, · · ·, 1000). To summarize the 1000 ${S h}_{AUC, m}^{(Ref)}$ of each simulation setting, we compute the median shrinkage over these 1000 values.

Eq.(12) can be applied to log(C_max); $var (log ({\hat{C_{\max}}}_{i}^{(Ref)}))$ is computed from the individual estimates as for AUC. As the definition of C_max given in Eq.(6) is complex, the variance of log(C_max) for the reference treatment, $ω_{C_{\max}^{(Ref)}}^{2}$ cannot be computed from $ω_{k_{a}^{(Ref)}}^{2}, ω_{C L / F^{(Ref)}}^{2}$ and $ω_{V / F^{(Ref)}}^{2}$ . It must be approximated for instance using the delta method (33). The expression and details are given in Appendix. As for AUC, the median shrinkage over the 1000 values of ${S h}_{C_{\max, m}}^{(Ref)}$ is computed for each simulation setting.

3. Results

3.1. Simulated data and missing values

As explained in section 2.1.4, if the simulated concentration is below zero, it is fixed to 0.1 mg/L. As expected, the proportion of these fixed concentrations differs from one variability setting to another and from one design to another, except for the original and rich design where the sampling times are similar. The maximal proportion is rather small and is 0.03% for S_l_,l, 1.6% for S_h,l and 8.5% for S_h,h. For S_l,l, all fixed concentrations correspond to the last sampling time which is 24 h for all designs. For S_h,l, there are fixed concentrations corresponding to different sampling times but fixed concentrations at 24 h are majoritary, with a minimal proportion of 90%. For S_h,h, fixed concentrations corresponds mostly to 24 h (54%) and then mainly to 0.25 h (20%) and 12 h (19%).

Over all the simulations, some ${\hat{AUC}}_{i}$ estimated by NCA are missing due to particular individual PK profiles (see section 2.2.2). The proportion of missing ${\hat{AUC}}_{i}$ is similar in both hypotheses and remains rare for the four designs of S_l,l and S_h,l. For both variability settings, the maximal proportion corresponds to the intermediate design (N = 24, n = 5) with 0.02% and 3.3% for S_l,l and S_h,l, respectively. This proportion is 25% for S_h,h. Among missing ${\hat{AUC}}_{i}$ of S_h,h, 12% are due to concentrations fixed to 0.1 mg/L, i.e. due to two similar last concentrations. Other missing ${\hat{AUC}}_{i}$ are due to two last concentrations increasing instead of decreasing. As expected, there is no simulated trial where all ${\hat{AUC}}_{i}$ for both treatment groups are missing. In other words, the estimation error for the sample mean of log(AUC) or log(C_max) is computed on the 1000 simulated trial for each simulation setting, and the type I errors of bioequivalence test are estimated on 1000 replicates for AUC and C_max for both hypotheses H_0;80% and H_0;125%.

3.2. Evaluation of estimates of sample means

Figure 3 displays the bias (top) and RMSE (bottom) on sample mean estimates for log(AUC) (left) and log(C_max) (right) estimated for the reference treatment. Results are similar for both treatment groups (Ref and Test) and both unilateral hypotheses (results not shown). The 95% confidence interval of the bias is not shown in Figure 3 because this interval is tighter than the width of the displayed symbol and all biases are significantly different from zero. There is more bias and larger RMSE for NCA than for EBE for all designs and all variability settings. Note that biases and RMSE are computed on log scale, so that, for instance, a value of 0.038 corresponds approximatively to an error of 3.8% on the ordinary scale for the geometric mean. For NCA estimates, the bias and RMSE increase when the number of samples per subject decreases and are lower for S_l,l compared to S_h,l. For the intermediate design (N = 24, n = 5), the bias on the sample mean of log(AUC) is 0.038, 0.094 and 0.15 for S_l,l, S_h,l and S_h,h, respectively; RMSE is 0.044, 0.12 and 0.21, respectively.

Bias (top) and root mean square error (RMSE, bottom) of estimates of the sample mean for *log*(*AUC*) (left) and *log*(*C_max*) (right) for the reference treatment from 1000 trials for different designs (N: number of subjects, n: number of samples per subject) and different variability settings *S_l*_,_l (○), *S_h*_,_l (□) and *S_h*_,_h (△). The white symbols represent the individual estimates obtained from NCA and the grey ones the individual estimates obtained from EBE.

For individual estimates based on EBE, the bias is small (less than 0.02) for both parameters (log(AUC) and log(C_max)), all designs and all variability settings whereas RMSE increase when the number of samples per subject decreases and is majoritary lower for S_l,l compared to S_h,l. For instance, for the intermediate design, the bias on the sample mean of log(AUC) is −0.0096, −0.016 and −0.010 for S_l,l, S_h,l and S_h,h respectively; RMSE is 0.019, 0.031 and 0.10, respectively.

3.3. Bioequivalence test

Table II and Figure 4 provide the results of the type I error of bioequivalence tests performed on the treatment effect of log(AUC) and log(C_max). Table II contains the estimated type I error for each unilateral hypothesis, each design of each variability setting, for the true simulated values and both types of estimates (NCA and EBE). Figure 4 represents the global type I error for log(AUC) (top) and log(C_max) (bottom) versus the design for each variability setting and both types of estimates. The global type I error is defined as the supremum of both estimated type I errors.

Table II.

Type I error of the bioequivalence tests performed on the treatment effect of log(AUC) and log(C_max) for each unilateral hypothesis, H_0;80% and H_0;125%. The type I error is estimated from 1000 bioequivalence trials simulated under H_0;80% or H_0;125% for different designs (N: number of subjects, n: number of samples per subject), different variability settings S_l_,_l, S_h_,_l and S_h_,_h, for the true simulated values (SIM) and both types of estimates (NCA and EBE). Due to the 1000 replicates, the 95% PI for a type I error of 5% is [3.7%; 6.4%].

			N = 40, n = 10			N = 12, n = 10			N = 24, n = 5			N = 40, n = 3
			SIM	NCA	EBE	SIM	NCA	EBE	SIM	NCA	EBE	SIM	NCA	EBE
S_l_,_l	AUC	H_0;80%	3.9	4.0	5.5	5.4	5.2	7.7	4.3	4.3	8.0	3.9	5.9	14.8
		H_0;125%	4.6	5.1	5.8	5.4	5.2	7.4	4.4	3.8	7.5	4.6	5.1	16.2
	C_max	H_0;80%	4.5	6.6	10.0	5.7	5.1	9.0	5.8	5.3	14.6	4.5	6.8	30.6
		H_0;125%	4.9	6.3	9.1	5.2	5.6	10.9	5.3	5.2	16.2	4.9	5.5	29.1
S_h,l	AUC	H_0;80%	3.9	5.4	4.7	5.4	4.4	6.8	4.3	5.2	7.1	3.9	4.5	8.5
		H_0;125%	4.6	6.1	5.2	5.4	4.7	6.1	4.4	3.9	5.8	4.6	5.1	11.5
	C_max	H_0;80%	4.5	5.1	4.0	5.3	5.3	5.3	5.5	6.0	6.5	4.5	7.2	9.2
		H_0;125%	5.0	5.4	5.0	5.2	5.1	5.8	5.7	6.1	7.1	5.0	6.2	7.8
S_h_,_h	AUC	H_0;80%							4.3	0.8	20.6
		H_0;125%							4.4	0.4	22.2
	C_max	H_0;80%							5.5	7.0	13.8
		H_0;125%							5.7	9.3	17.0

Open in a new tab

Global type I error of the bioequivalence tests performed on the treatment effect of *log*(*AUC*) (top) and *log*(*C_max*) (bottom). The global type I error is estimated from 1000 bioequivalence trials simulated under H_0;80% and H_0;125% for different designs (N: number of subjects, n: number of samples per subject) and different variability settings *S_l*_,_l (○), *S_h*_,_l (□) and *S_h*_,_h (△). The white symbols represent the individual estimates obtained from NCA and the grey ones the individual estimates obtained from EBE. The dashed lines represent the nominal level at 5% and its 95% prediction interval ([3.7%; 6.4%]).

For the bioequivalence test performed on the true simulated values, the type I error for all designs, all variability settings and both null hypotheses lie in the 95% PI of the nominal level showing the good performance of the TOST. Mostly, for one type of estimates (NCA or EBE) and one design of one variability setting, the type I errors of both hypotheses are close.

For log(AUC), the global type I error of test based on NCA estimates lies between the 95% PI of the nominal level for the four designs of S_l,l and S_h,l and it is much too conservative for S_h,h. For instance, for the intermediate design, the global type I error is respectively 4.3%, 5.2% and 0.8% for S_l,l, S_h,l and S_h,h. For C_max, test based on NCA estimates has a correct global type I error for the original and intermediate designs simulated with S_l,l and S_h,l. The global type I error is above the 95% PI for the sparse design (N = 40,n = 3) simulated with S_l,l and S_h,l and the intermediate design simulated with S_h,h.

Surprisingly, tests based on EBE often lead to an increased type I error especially for the sparse design. For AUC, the global type I error remains at the nominal level for the rich design (N = 40, n = 10). For C_max, the global type I error lies between the 95% PI for the rich and the original designs simulated with S_l,l. The global type I error increases when the number of samples per subject decreases and is lower for S_h,l compared to S_l,l and S_h,h. Most of the type I errors are below 10% for S_l,l and S_h,l. For AUC and the intermediate design, the global type I error is respectively 8.0%, 7.1% and 22.2% for S_l,l, S_h,l and S_h,h.

Figure 5 represents the global type I errors of bioequivalence tests for the treatment effect on log(AUC) (top) and log(C_max) (bottom) obtained from NLMEM versus the median shrinkage on the corresponding parameter for the reference treatment. The distribution of the shrinkage is similar for both treatment (Ref and Test) and both unilateral hypotheses (results not shown). For both parameters, the median shrinkage is lower for S_h,l than for S_l,l. For log(AUC), the median shrinkage is also higher for S_h,h than for S_h,l. There is a clear relationship between the inflation of the global type I error and the amount of shrinkage with type I error greater than 15% for shrinkage greater than 20%.

Global Type I error of the bioequivalence tests performed on the treatment effect of *log*(*AUC*) (top) and *log*(*C_max*) (bottom) versus the median shrinkage on the parameter of interest for the reference treatment and different simulation settings *S_l*_,_l (○), *S_h,l* (□) and *S_h,h* (△). The rich design design (N = 40, n = 10) is represented by white symbols, the original design (N = 12, n = 10) by light grey symbols, the intermediate design (N = 24, n = 5) by dark grey symbols and the sparse design (N = 40, n = 3) by black symbols. The dashed lines represent the nominal level at 5% and its 95% prediction interval ([3.7%; 6.4%]).

4. Discussion

In this study, we compare the standard bioequivalence analysis performed on individual estimates of AUC and C_max obtained by NCA to the same bioequivalence analysis performed on individual EBE obtained by NLMEM. To do so, we perform a simulation study with different designs and different levels of variability. The estimation of parameters and the type I error are evaluated for both types of estimates.

Compared with the simulation study of Panhard and Mentré (14), we use the bioequivalence analysis recommended in the guidelines (1, 2) and we study both parameters (AUC and C_max). Besides, the simulation study of Panhard and Mentré is performed using the FOCE algorithm implemented in R function nlme. The FOCE algorithm is widely used to perform population PK analyses but, in simulation studies which compared different algorithms available, stochastic EM algorithms (like the SAEM algorithm) obtained the best results for accuracy and precision of estimates (34, 35).

As Panhard and Mentré, we simulate under both null hypotheses assuming a modification in the bioavailability F, i.e. assuming the same modification for CL/F and V/F which also affects similarly both tested parameters AUC and C_max. Consequently, the number of simulations are reduced because the unilateral hypothesis H_0;80% (H_0;125% respectively) for AUC corresponds to the unilateral hypothesis H_0;80% (H_0;125% respectively) for C_max; the same set of simulations is used for both parameters. However, other choices may be suitable as any PK parameter is likely to change between two formulations of the same drug. For instance, a change in the elimination rate CL/V due to interaction with excipient could be possible (36). Furthermore, we study only a one compartment model. We do not simulate multi-compartmental models. For both types of estimates (NCA and EBE), we perform bioequivalence test on AUC and C_max. Even with a multi-compartmental model, PK parameters would be summarized with these two endpoints even though the relationship between C_max and the PK parameters could be more complicated than for a one compartment model. As shown in Figure 5, the increase of the type I error of bioequivalence test based on EBE is linked to the shrinkage which already appears with one compartment model. We think this relationship should be similar for multi-compartmental models where more shrinkage is expected.

Conversely to the bias for estimates based on EBE, the bias for estimates based on NCA depends on the number of samples per subject and is large for sparse design (N = 40, n = 3) with high variability. Usually, NCA is used with rich designs where there are about ten to twenty samples per subject. This method is not well suited for trials performed in patients where the number of samples is often limited. In comparison to model-based approaches, the estimation of parameters through NCA has several drawbacks. It is giving equal weight to all concentrations without taking into account the measurement error. Furthermore, NCA is sensitive to missing data, especially for the determination of C_max and the computation of the terminal slope. Even without missing data, the interpolation of the AUC between the last sampling time and infinity is very sensitive to the number of samples used to compute the terminal slope and could be problematic for atypical concentration profiles. This later issue is perfectly illustrated by the simulation settings under S_h,h where 77% of the missing ${\hat{AUC}}_{i}$ are due to the two last concentrations increasing instead of decreasing. Contrary to NCA estimates, there is no missing ${\hat{AUC}}_{i}$ estimated by NLMEM due to this kind of PK profiles because all subjects are analyzed together and information given by classical PK profiles off-set information given by particular ones. NCA does not take into account all the knowledge accumulated on the PK of the studied drug as each new analysis by NCA erases the past contrary to NLMEM. Finally, although we do not simulate such data, NCA applied to nonlinear pharmacokinetics provides meaningless parameters and it cannot handle data below the limit of quantification. In this study, we choose to not introduce LOQ in the simulation because we do not want to favour the SAEM algorithm which can fit such data. We are aware that fixing some concentrations to 0.1 mg/L could introduce some bias. To avoid such arbitrary fixing, another common procedure is to resample until a valid value is obtained; however, resampling can also introduce a bias. Anyhow, the proportion of fixing value remains very low for S_l,l and S_h,l. It is more important for S_h,h but it is responsible for only 12% of the missing ${\hat{AUC}}_{i}$ estimated by NCA.

When the number of samples per subject is large and the variability is not too high, tests based on individual NCA estimates remain a good approach since they are simple and showed satisfactory properties for both tested parameters. For C_max and the sparse design, we expected an increase of the type I error because there is no sampling time corresponding to the maximal concentration which is close to 2 h. But even with poor sample mean estimates, the type I error is maintained at the nominal level of 5%. Though, for simulation with S_h,h, the type I error of AUC is very conservative (0.8%) which shows the limits of NCA for data with high residual error.

Tests based on individual EBE have higher type I error than tests based on NCA estimates. Our results on the type I error for S_l,l are consistent with the results obtained by Panhard and Mentré with the same variability setting. For the sparse design, the type I error of tests based on EBE is surprisingly high. In that case, EBE shrink towards their mean value and they are more similar in both treatment groups. Therefore, the discrimination of the AUC or C_max between both treatment groups is more difficult which leads to an increase of the type I error (bioequivalence is obtained more easily). These results are consistent with the results of the simulation study performed by Bertrand et al (37). In that work, they evaluate by simulation the analysis of variance (ANOVA) performed on individual EBE to test the influence of a single nucleotide polymorphism on a pharmacokinetic parameter of a drug. They show the impact of the shrinkage on the power of ANOVA. The power is reduced when the shrinkage increases. In other words, it is more difficult to discriminate between the genotypes with high shrinkage even when data are simulated with a difference.

As discussed by Schuirmann (26), the TOST procedure can be very conservative for highly variable drugs. Consequently, several improvements of this procedure have been proposed as in Berger et al (30), Brown et al (38) or Cao et al (39) to mention only a few. We are aware that there is still a great arguing on which bioequivalence test should be performed. However, we study only the classical TOST in this paper because our main objective is to compare the same standard bioequivalence analysis recommended in the guidelines (1, 2) and performed on individual estimates obtained by two estimation methods (NCA and EBE). Nevertheless, in this simulation study, the type I error of bioequivalence test performed on the true individual simulated values is always at the nominal level of 5%, even for S_h_,_h where the variability is particularly high. Therefore, we can conclude that, in this study, there is no issue about the TOST procedure. Consequently, liberal or conservative type I errors of bioequivalence tests performed on estimates cannot be imputed to the TOST but rather to the individual parameters estimation.

Tests based on individual estimates, NCA estimates or EBE, cannot be used for data with high residual error or when the number of samples per subject is small. In those cases, the type I error for tests based on NCA estimates is very poor or NCA estimates are biased and the shrinkage of EBE induces an increase of the type I error. In these situations, other tests based on a global analysis of all data should be considered. Panhard et al. already developed a global bioequivalence Wald test based on NLMEM (14, 25). This test is directly performed on the treatment effect parameter after fitting together the data of both treatment groups with the estimation of within-subject variability. In this study, they also used the FOCE algorithm implemented in nlme. Recently, Panhard and Samson developed an extension of the SAEM algorithm for NLMEM including the estimation of the within-subject variability (40). However, the likelihood ratio test for bioequivalence has not been developed, due to the composite null hypothesis. Additional methodological developments and simulations are needed to study bioequivalence tests after global analysis of all PK data. This will be especially useful for drugs with non linear pharmacokinetics and conditions where rich sampling is difficult to achieve, i.e. in pediatric studies or for drugs which cannot be administered in healthy subjects for safety reasons, such as oncology drugs.

Acknowledgments

We would like to thank the Modeling and Simulations group at Novartis Pharma AG, Basel, which supports by a grant Anne Dubois during this work.

Appendix

Approximation of the variance of log(C_max) by the delta method

For a one compartment model with first order absorption and first order elimination, C_max is defined in Eq.(6) as a function of the three PK parameters, k_a, CL/F and V/F. The variance of log(C_max), $ω_{C_{\max}}^{2}$ , is approximated by the delta method (33) as:

ω_{C_{\max}}^{2} \approx {(\frac{\partial log (C_{\max})}{\partial log (k_{a})})}_{log (μ)}^{2} ω_{k_{a}}^{2} + {(\frac{\partial log (C_{\max})}{\partial log (C L / F)})}_{log (μ)}^{2} ω_{C L / F}^{2} + {(\frac{\partial log (C_{\max})}{\partial log (V / F)})}_{log (μ)}^{2} ω_{V / F}^{2}

(13)

where log (μ) = (log(μ_{k_a}), log(μ_CL/F), log(μ_V/F))′. After computing the derivatives, $ω_{C_{\max}}^{2}$ can be approximated by:

\begin{array}{l} ω_{C_{\max}}^{2} \approx Δ^{2} (ω_{k_{a}}^{2} + ω_{C L / F}^{2}) + {(Δ - 1)}^{2} ω_{V / F}^{2} \\ with Δ = \frac{μ_{C L / F} (μ_{C L / F} - μ_{k_{a}} μ_{V / F}) + μ_{k_{a}} μ_{C L / F} μ_{V / F} log (\frac{μ_{k_{a}} μ_{V / F}}{μ_{C L / F}})}{{(μ_{k_{a}} μ_{V / F} - μ_{C L / F})}^{2}} \end{array}

(14)

In this simulation study, the general formula above is applied to approximate the variance of log(C_max) for both treatment groups (Ref and Test). Given the treatment effect we simulate for the treatment Test, both approximations, $ω_{C_{\max}^{(Ref)}}^{2}$ and $ω_{C_{\max}^{(Test)}}^{2}$ , are equal.

To approximate the variance of log(C_max) by the delta method, we use the true simulated values of μ⁽^Ref⁾ and Ω⁽^Ref⁾ described in section 2.2.3. To evaluate the delta method, we also estimate the variance of log(C_max), using the simulated parameter values of the rich design (N = 40, n = 10) for the reference treatment, under S_l_,_l and S_h,l. For both variability settings, $ω_{C_{\max}^{(Ref)}}^{2}$ is estimated as the empirical variance of the 40000 true simulated values of log(C_{max_i}⁽^Ref⁾). For S_l_,_l, the standard deviation of log(C_max) for the reference treatment expressed in percent is 10.5% both by simulation and the delta method. For S_h_,_l, it is 46.3% and 46.7% by simulation and the delta method, respectively.

These results on the true simulated values validate the approximation of the variance of log(C_max) by the delta method. Consequently, we apply it to the data of each treatment group for each simulated trial of the simulation study to approximate ${\hat{ω}}_{C_{\max}^{(Ref)}}^{2}$ ( ${\hat{ω}}_{C_{\max}^{(Test)}}^{2}$ respectively) using μ̂⁽^Ref⁾ (μ̂⁽^Test⁾ respectively) and Ω̂⁽^Ref⁾ (Ω̂⁽^Test⁾ respectively).

References

1.FDA. Technical report. FDA; 2001. Guidance for Industry - Statistical Approaches to establishing bioequivalence. [Google Scholar]
2.EMEA. Technical report. EMEA; 2001. Note for guidance on the investigation of bioavailability and bioequivalence. [Google Scholar]
3.Gabrielson J, Weiner D. Pharmacokinetic and pharmacodynamic data analysis: concepts and applications. Apotekarsocieteten; Stockholm: 2006. [Google Scholar]
4.Jusko WJ, Koup JR, Alván G. Nonlinear assessment of phenytoin bioavailability. Journal of Pharmacokinetics and Biopharmaceutics. 1976;4:327–336. doi: 10.1007/BF01063122. [DOI] [PubMed] [Google Scholar]
5.Hayashi N, Aso H, Higashida M, Kinoshita H, Ohdo S, Yukawa E, Hiquchi S. Estimation of rhG-CSF absorption kinetics after subcutaneous administration using a modified Wagner-Nelson method with a nonlinear elimination model. European Journal of Pharmaceutical Sciences. 2001;13:151–158. doi: 10.1016/s0928-0987(00)00219-0. [DOI] [PubMed] [Google Scholar]
6.EMEA. Technical report. EMEA; 2006. Guideline on similar biological medicinal products containing biotechnology-derived proteins as active substance: non-clinical and clinical issues. [Google Scholar]
7.Kaniwa N, Aoyagi N, Ogata H, Ishii M. Application of the NONMEM method to evaluation of the bioavailability of drug products. Journal of Pharmaceutical Sciences. 1990;79:1116–1120. doi: 10.1002/jps.2600791215. [DOI] [PubMed] [Google Scholar]
8.Pentikis H, Henderson J, Tran N, Ludden T. Bioequivalence: individual and population compartmental modeling compared to noncompartmental approach. Pharmaceutical Research. 1996;13:1116–1121. doi: 10.1023/a:1016083429903. [DOI] [PubMed] [Google Scholar]
9.Combrink M, McFadyen ML, Miller R. A comparison of standard approach and the NONMEM approach in the estimation of bioavailability in man. The Journal of Pharmacy and Pharmacology. 1997;49:731–733. doi: 10.1111/j.2042-7158.1997.tb06101.x. [DOI] [PubMed] [Google Scholar]
10.Maier GA, Lockwood GF, Oppermann JA, Wei G, Bauer P, Fedler-Kelly J, Grasela T. Characterization of the highly variable bioavailability of tiludronate in normal volunteers using population pharmacokinetic methodologies. European Journal of Drug Metabolism and Pharmacokinetics. 1999;24:249–254. doi: 10.1007/BF03190028. [DOI] [PubMed] [Google Scholar]
11.Hu C, Moore K, Kim Y, Sale M. Statistical issues in a modeling approach to assessing bioequivalence or PK similarity with presence of sparsely sampled subjects. Journal of Pharmacokinetics and Pharmacodynamics. 2003;31:312–339. doi: 10.1023/b:jopa.0000042739.44458.e0. [DOI] [PubMed] [Google Scholar]
12.Zhou H, Mayer P, Wajdula J, Fatenejad S. Unaltered etanercept pharmacokinetics with concurrent methotrexate in patients with rheumatoid arthritis. Journal of Clinical Pharmacology. 2004;44:1235–1243. doi: 10.1177/0091270004268049. [DOI] [PubMed] [Google Scholar]
13.Fradette C, Lavigne J, Waters D, Ducharme M. The utility of the population approach applied to bioequivalence in patients. Therapeutic Drug Monitoring. 2005;27:592–600. doi: 10.1097/01.ftd.0000174005.51383.2f. [DOI] [PubMed] [Google Scholar]
14.Panhard X, Mentré F. Evaluation by simulation of tests based on non-linear mixed-effects models in pharmacokinetic interaction and bioequivalence cross-over trials. Statistics in Medicine. 2005;24:1509–1524. doi: 10.1002/sim.2047. [DOI] [PubMed] [Google Scholar]
15.Hauschke D, Steinijans V, Pigeot I. Bioequivalence studies in drug development. John Wiley & sons; Chichester: 2007. [Google Scholar]
16.Lindstrom M, Bates D. Nonlinear mixed effects models for repeated measures data. Biometrics. 1990;46:673–687. [PubMed] [Google Scholar]
17.Pinheiro JC, Bates DM. Mixed-effects models in S and Splus. Springer; New-York: 2000. [Google Scholar]
18.Delyon B, Lavielle M, Moulines E. Convergence of a stochastic approximation version of EM algorithm. The Annals of Statistics. 1999;27:94–128. [Google Scholar]
19.Kuhn E, Lavielle M. Coupling a stochastic approximation version of EM with a MCMC procedure. ESAIM Probability and Statistics. 2004;8:115–131. [Google Scholar]
20.Samson A, Lavielle M, Mentré F. The SAEM algorithm for group comparison tests in longitudinal data analysis based on non-linear mixed-effects model. Statistics in Medicine. 2007;26:4860–4875. doi: 10.1002/sim.2950. [DOI] [PubMed] [Google Scholar]
21.The MONOLIX software . [accessed 05/07/09]. http://software.monolix.org/
22.Lavielle M, Mentré F. Estimation of population pharmacokinetic of saquinavir in HIV patients and covariate analysis with the SAEM algorithm. Journal of Pharmacokinetics and Pharmacodynamics. 2007;34:229–249. doi: 10.1007/s10928-006-9043-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Comets E, Verstuyft C, Lavielle M, Jaillon P, Becquemont L, Mentré F. Modelling the influence of MDR1 polymorphism on digoxin pharmacokinetic parameters. European Journal of Clinical Pharmacology. 2007;63:437–449. doi: 10.1007/s00228-007-0269-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Bertrand J, Treluyer J-M, Panhard X, Tran A, Auleley S, Rey E, Salmon-Céron D, Duval X, Mentré F the COPHAR2-ANRS 111 study group. Influence of pharmacogenetics on indinavir disposition and short-term response in HIV patients initiating HAART. European Journal of Clinical Pharmacology. 2009;65:667–678. doi: 10.1007/s00228-009-0660-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Panhard X, Taburet AM, Piketti C, Mentré F. Impact of modelling intra-subject variability on tests based on non-linear mixed-effects models in cross-over pharmacokinetic trials with application to the interaction of tenofovir on atazanavir in HIV patients. Statistics in Medicine. 2007;26:1268–1284. doi: 10.1002/sim.2622. [DOI] [PubMed] [Google Scholar]
26.Schuirmann DJ. A comparison of the two one-sided tests procedure and the power approach for assessing the equivalence of average bioavailability. Journal of Pharmacokinetics and Biopharmaceutics. 1987;15:657–680. doi: 10.1007/BF01068419. [DOI] [PubMed] [Google Scholar]
27.Chow SC, Liu JP. Design and analysis of bioavailability and bioequivalence studies. Marcel Dekker; 2000. [Google Scholar]
28.Verbeke G, Molenberghs G. Linear mixed models for longitudinal data. Springer; New-York: 2001. [Google Scholar]
29.Brown H, Prescott R. Applied mixed models in medicine. 2. John Wiley & sons; Chichester: 2006. [Google Scholar]
30.Berger R, Hsu J. Bioequivalence trials, intersection-union tests and equivalence confidence sets. Statistical Science. 1996;11:283–319. [Google Scholar]
31.Liu JP, Weng CS. Bias of two one-sided tests procedures in assessment of bioequivalence. Statistics in Medicine. 1995;14:853–861. doi: 10.1002/sim.4780140813. [DOI] [PubMed] [Google Scholar]
32.Savić R, Karlsson M. Shrinkage in empirical Bayes estimates for diagnostics and estimation. 2007. [accessed 05/07/09]. p. 16. Abstr 1087 available at http://www.pagemeeting.org/pdf_assets/9436-EBE_PAGE07_1_web.pdf. [DOI] [PMC free article] [PubMed]
33.Oehlert GW. A note on the delta method. The American Statistician. 1992;46:27–29. [Google Scholar]
34.Girard P, Mentré F. A comparison of estimation methods in nonlinear mixed effects models using a blind analysis. 2005. [accessed 05/07/09]. p. 14. Abstr 834 available at http://www.page-meeting.org/page/page2005/PAGE2005O08.pdf.
35.Bauer R, Guzy S, Ng C. Survey of population analysis methods and software for complex pharmacokinetic and pharmacodynamic models with examples. The AAPS Journal. 2007;9:60–83. doi: 10.1208/aapsj0901007. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Rescigno A, Powers J, Herderick EE. Bioequivalent or nonbioequivalent ? Pharmalogical Research. 2001;43:543–546. doi: 10.1006/phrs.2001.0820. [DOI] [PubMed] [Google Scholar]
37.Bertrand J, Comets E, Laffont C, Chenel M, Mentré F. Pharmacogenetics and population pharmacokinetics: impact of the design on three tests using the SAEM algorithm. Journal of Pharmacokinetics and Pharmacodynamics. 2009;36:317–339. doi: 10.1007/s10928-009-9124-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Brown LD, Hwang JTG, Munk A. An unbiased test for the bioequivalence problem. The Annals of Statistics. 1997;25:2345–2367. [Google Scholar]
39.Cao L, Mathew T. A simple numerical approach toward improving the two-one sided test for average bioequivalence. Biometrical Journal. 2008;50:205–211. doi: 10.1002/bimj.200710407. [DOI] [PubMed] [Google Scholar]
40.Panhard X, Samson A. Extension of the SAEM algorithm for nonlinear mixed models with two levels of random effects. Biostatistics. 2009;10:121–135. doi: 10.1093/biostatistics/kxn020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R1] 1.FDA. Technical report. FDA; 2001. Guidance for Industry - Statistical Approaches to establishing bioequivalence. [Google Scholar]

[R2] 2.EMEA. Technical report. EMEA; 2001. Note for guidance on the investigation of bioavailability and bioequivalence. [Google Scholar]

[R3] 3.Gabrielson J, Weiner D. Pharmacokinetic and pharmacodynamic data analysis: concepts and applications. Apotekarsocieteten; Stockholm: 2006. [Google Scholar]

[R4] 4.Jusko WJ, Koup JR, Alván G. Nonlinear assessment of phenytoin bioavailability. Journal of Pharmacokinetics and Biopharmaceutics. 1976;4:327–336. doi: 10.1007/BF01063122. [DOI] [PubMed] [Google Scholar]

[R5] 5.Hayashi N, Aso H, Higashida M, Kinoshita H, Ohdo S, Yukawa E, Hiquchi S. Estimation of rhG-CSF absorption kinetics after subcutaneous administration using a modified Wagner-Nelson method with a nonlinear elimination model. European Journal of Pharmaceutical Sciences. 2001;13:151–158. doi: 10.1016/s0928-0987(00)00219-0. [DOI] [PubMed] [Google Scholar]

[R6] 6.EMEA. Technical report. EMEA; 2006. Guideline on similar biological medicinal products containing biotechnology-derived proteins as active substance: non-clinical and clinical issues. [Google Scholar]

[R7] 7.Kaniwa N, Aoyagi N, Ogata H, Ishii M. Application of the NONMEM method to evaluation of the bioavailability of drug products. Journal of Pharmaceutical Sciences. 1990;79:1116–1120. doi: 10.1002/jps.2600791215. [DOI] [PubMed] [Google Scholar]

[R8] 8.Pentikis H, Henderson J, Tran N, Ludden T. Bioequivalence: individual and population compartmental modeling compared to noncompartmental approach. Pharmaceutical Research. 1996;13:1116–1121. doi: 10.1023/a:1016083429903. [DOI] [PubMed] [Google Scholar]

[R9] 9.Combrink M, McFadyen ML, Miller R. A comparison of standard approach and the NONMEM approach in the estimation of bioavailability in man. The Journal of Pharmacy and Pharmacology. 1997;49:731–733. doi: 10.1111/j.2042-7158.1997.tb06101.x. [DOI] [PubMed] [Google Scholar]

[R10] 10.Maier GA, Lockwood GF, Oppermann JA, Wei G, Bauer P, Fedler-Kelly J, Grasela T. Characterization of the highly variable bioavailability of tiludronate in normal volunteers using population pharmacokinetic methodologies. European Journal of Drug Metabolism and Pharmacokinetics. 1999;24:249–254. doi: 10.1007/BF03190028. [DOI] [PubMed] [Google Scholar]

[R11] 11.Hu C, Moore K, Kim Y, Sale M. Statistical issues in a modeling approach to assessing bioequivalence or PK similarity with presence of sparsely sampled subjects. Journal of Pharmacokinetics and Pharmacodynamics. 2003;31:312–339. doi: 10.1023/b:jopa.0000042739.44458.e0. [DOI] [PubMed] [Google Scholar]

[R12] 12.Zhou H, Mayer P, Wajdula J, Fatenejad S. Unaltered etanercept pharmacokinetics with concurrent methotrexate in patients with rheumatoid arthritis. Journal of Clinical Pharmacology. 2004;44:1235–1243. doi: 10.1177/0091270004268049. [DOI] [PubMed] [Google Scholar]

[R13] 13.Fradette C, Lavigne J, Waters D, Ducharme M. The utility of the population approach applied to bioequivalence in patients. Therapeutic Drug Monitoring. 2005;27:592–600. doi: 10.1097/01.ftd.0000174005.51383.2f. [DOI] [PubMed] [Google Scholar]

[R14] 14.Panhard X, Mentré F. Evaluation by simulation of tests based on non-linear mixed-effects models in pharmacokinetic interaction and bioequivalence cross-over trials. Statistics in Medicine. 2005;24:1509–1524. doi: 10.1002/sim.2047. [DOI] [PubMed] [Google Scholar]

[R15] 15.Hauschke D, Steinijans V, Pigeot I. Bioequivalence studies in drug development. John Wiley & sons; Chichester: 2007. [Google Scholar]

[R16] 16.Lindstrom M, Bates D. Nonlinear mixed effects models for repeated measures data. Biometrics. 1990;46:673–687. [PubMed] [Google Scholar]

[R17] 17.Pinheiro JC, Bates DM. Mixed-effects models in S and Splus. Springer; New-York: 2000. [Google Scholar]

[R18] 18.Delyon B, Lavielle M, Moulines E. Convergence of a stochastic approximation version of EM algorithm. The Annals of Statistics. 1999;27:94–128. [Google Scholar]

[R19] 19.Kuhn E, Lavielle M. Coupling a stochastic approximation version of EM with a MCMC procedure. ESAIM Probability and Statistics. 2004;8:115–131. [Google Scholar]

[R20] 20.Samson A, Lavielle M, Mentré F. The SAEM algorithm for group comparison tests in longitudinal data analysis based on non-linear mixed-effects model. Statistics in Medicine. 2007;26:4860–4875. doi: 10.1002/sim.2950. [DOI] [PubMed] [Google Scholar]

[R21] 21.The MONOLIX software . [accessed 05/07/09]. http://software.monolix.org/

[R22] 22.Lavielle M, Mentré F. Estimation of population pharmacokinetic of saquinavir in HIV patients and covariate analysis with the SAEM algorithm. Journal of Pharmacokinetics and Pharmacodynamics. 2007;34:229–249. doi: 10.1007/s10928-006-9043-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] 23.Comets E, Verstuyft C, Lavielle M, Jaillon P, Becquemont L, Mentré F. Modelling the influence of MDR1 polymorphism on digoxin pharmacokinetic parameters. European Journal of Clinical Pharmacology. 2007;63:437–449. doi: 10.1007/s00228-007-0269-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] 24.Bertrand J, Treluyer J-M, Panhard X, Tran A, Auleley S, Rey E, Salmon-Céron D, Duval X, Mentré F the COPHAR2-ANRS 111 study group. Influence of pharmacogenetics on indinavir disposition and short-term response in HIV patients initiating HAART. European Journal of Clinical Pharmacology. 2009;65:667–678. doi: 10.1007/s00228-009-0660-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R25] 25.Panhard X, Taburet AM, Piketti C, Mentré F. Impact of modelling intra-subject variability on tests based on non-linear mixed-effects models in cross-over pharmacokinetic trials with application to the interaction of tenofovir on atazanavir in HIV patients. Statistics in Medicine. 2007;26:1268–1284. doi: 10.1002/sim.2622. [DOI] [PubMed] [Google Scholar]

[R26] 26.Schuirmann DJ. A comparison of the two one-sided tests procedure and the power approach for assessing the equivalence of average bioavailability. Journal of Pharmacokinetics and Biopharmaceutics. 1987;15:657–680. doi: 10.1007/BF01068419. [DOI] [PubMed] [Google Scholar]

[R27] 27.Chow SC, Liu JP. Design and analysis of bioavailability and bioequivalence studies. Marcel Dekker; 2000. [Google Scholar]

[R28] 28.Verbeke G, Molenberghs G. Linear mixed models for longitudinal data. Springer; New-York: 2001. [Google Scholar]

[R29] 29.Brown H, Prescott R. Applied mixed models in medicine. 2. John Wiley & sons; Chichester: 2006. [Google Scholar]

[R30] 30.Berger R, Hsu J. Bioequivalence trials, intersection-union tests and equivalence confidence sets. Statistical Science. 1996;11:283–319. [Google Scholar]

[R31] 31.Liu JP, Weng CS. Bias of two one-sided tests procedures in assessment of bioequivalence. Statistics in Medicine. 1995;14:853–861. doi: 10.1002/sim.4780140813. [DOI] [PubMed] [Google Scholar]

[R32] 32.Savić R, Karlsson M. Shrinkage in empirical Bayes estimates for diagnostics and estimation. 2007. [accessed 05/07/09]. p. 16. Abstr 1087 available at http://www.pagemeeting.org/pdf_assets/9436-EBE_PAGE07_1_web.pdf. [DOI] [PMC free article] [PubMed]

[R33] 33.Oehlert GW. A note on the delta method. The American Statistician. 1992;46:27–29. [Google Scholar]

[R34] 34.Girard P, Mentré F. A comparison of estimation methods in nonlinear mixed effects models using a blind analysis. 2005. [accessed 05/07/09]. p. 14. Abstr 834 available at http://www.page-meeting.org/page/page2005/PAGE2005O08.pdf.

[R35] 35.Bauer R, Guzy S, Ng C. Survey of population analysis methods and software for complex pharmacokinetic and pharmacodynamic models with examples. The AAPS Journal. 2007;9:60–83. doi: 10.1208/aapsj0901007. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] 36.Rescigno A, Powers J, Herderick EE. Bioequivalent or nonbioequivalent ? Pharmalogical Research. 2001;43:543–546. doi: 10.1006/phrs.2001.0820. [DOI] [PubMed] [Google Scholar]

[R37] 37.Bertrand J, Comets E, Laffont C, Chenel M, Mentré F. Pharmacogenetics and population pharmacokinetics: impact of the design on three tests using the SAEM algorithm. Journal of Pharmacokinetics and Pharmacodynamics. 2009;36:317–339. doi: 10.1007/s10928-009-9124-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R38] 38.Brown LD, Hwang JTG, Munk A. An unbiased test for the bioequivalence problem. The Annals of Statistics. 1997;25:2345–2367. [Google Scholar]

[R39] 39.Cao L, Mathew T. A simple numerical approach toward improving the two-one sided test for average bioequivalence. Biometrical Journal. 2008;50:205–211. doi: 10.1002/bimj.200710407. [DOI] [PubMed] [Google Scholar]

[R40] 40.Panhard X, Samson A. Extension of the SAEM algorithm for nonlinear mixed models with two levels of random effects. Biostatistics. 2009;10:121–135. doi: 10.1093/biostatistics/kxn020. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Bioequivalence tests based on individual estimates using non-compartmental or model-based analyses: evaluation of estimates of sample means and type I error for different designs

Anne Dubois

Sandro Gsteiger

Etienne Pigeolet

France Mentré

Abstract

1. Introduction

2. Methods

2.1. Simulation study

2.1.1. Simulation model

2.1.2. Theophylline pharmacokinetics

2.1.3. Simulation features

Table I.

2.1.4. Simulation process

2.1.5. Simulation designs

Figure 1.

2.2. Estimation of individual parameters

2.2.1. Notations

2.2.2. Estimation based on non compartmental analysis

Figure 2.

2.2.3. Estimation based on nonlinear mixed effects model

2.2.4. Evaluation of estimates of sample means

2.3. Bioequivalence test

2.3.1. Implementation of the two one-sided tests

2.3.2. Evaluation of the type I error

2.3.3. Shrinkage and tests based on empirical Bayes estimates

3. Results

3.1. Simulated data and missing values

3.2. Evaluation of estimates of sample means

Figure 3.

3.3. Bioequivalence test

Table II.

Figure 4.

Figure 5.

4. Discussion

Acknowledgments

Appendix

Approximation of the variance of log(Cmax) by the delta method

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Approximation of the variance of log(C_max) by the delta method