Abstract
When no single outcome is sufficient to capture the multidimensional impairments of a disease, investigators often rely on multiple outcomes for comprehensive assessment of global disease status. Methods for assessing covariate effects on global disease status include the composite outcome and global test procedures. One global test procedure is the O’Brien’s rank-sum test, which combines information from multiple outcomes using a global rank-sum score. However, existing methods for the global rank-sum do not lend themselves to regression modeling. We consider sensible regression strategies for the global percentile outcome (GPO), under the transformed linear model and the monotonic index model. Posing minimal assumptions, we develop estimation and inference procedures that account for the special features of the GPO. Asymptotics are established using U-statistic and U-process techniques. We illustrate the practical utilities of the proposed methods via extensive simulations and application to a Parkinson’s disease study.
Keywords: Global percentile outcome, Rank-sum, Monotonic index model, Non-smooth objective function, Transformed linear model
1. Introduction
For many diseases with multifaceted symptoms and deteriorations, there does not exist a single outcome that can comprehensively quantify the global disease severity. One example is the NINDS Exploratory Trials in Parkinson’s Disease (NET-PD) Long-term Study 1 (LS-1), a multicenter, placebo-controlled clinical trial. The primary goal was to examine whether the nutritional supplement creatine could slow the progression of PD (Kieburtz et al., 2015). However, it is well known that PD is complex and multidimensional, and no single outcome can provide a thorough summary of the disease severity or progression. The NET-PD LS-1 steering committee defined the primary outcome as the rank-sum of five individual measures, namely change from baseline to five years in Schwab England Activities of Daily Living, 39-Item Parkinson’s Disease Questionnaire, Ambulatory Capacity, Symbol Digit Modalities, and the year-5 measurement of modified Rankin Scale (Elm and Investigators, 2012; Kieburtz et al., 2015). Situations when a single primary outcome is insufficient are also common for other diseases with complex symptoms, such as stroke (Tilley et al., 1996), Alzheimer’s disease (Lim et al., 2016), and heart failure (Felker and Maisel, 2010).
When scientific interest centers on an overall assessment of the global disease status, one may consider a combined analysis of the outcomes, instead of separate analyses of each outcome (Tilley et al., 2014; Pocock et al., 1987). One approach to achieve a combined analysis is by first integrating the multiple outcomes into a univariate composite outcome. When all the individual outcomes are numeric, composite scores are often formed as a linear combination of the outcomes through summation, averaging, or the composite z or t score. However, when some of the individual outcomes are skewed or involve outliers, the meaningfulness of such composite scores may be undermined (Strauss et al., 2006; Vanderploeg, 2014).
Another approach to achieve a combined analysis is to adopt a global test procedure. For continuous outcomes, the Hotelling’s T2 is analogous to the two-sample t-test and evaluates the equivalence in the mean vector of the multiple outcomes. O’Brien (1984) proposed a global statistical test (GST) by first transforming the multiple outcomes to a univariate rank-sum score. Subjects are first ranked with respect to each individual outcome; the rank-sum score is then calculated by summing up the outcome-specific ranks for each subject and then compared between different groups. The GST was shown to be robust to skewed distributions and was often more powerful than the Hotelling’s T2 (O’Brien, 1984). Huang et al. (2005) examined the theoretical properties of the GST and extended the methods to Behrens-Fisher problems. Ramchandani et al. (2016) studied a global rank test based on different composite scoring functions for multiple, possibly censored, outcomes.
The rank-sum GST has been successfully applied in many clinical studies (Fereshtehnejad et al., 2015; Simuni et al., 2015; Hinson et al., 2017; Luo et al., 2016), including the LS-1 study (Kieburtz et al., 2015). However, existing statistical methods for the global rank-sum are mainly designed for two-sample or K-sample hypothesis testings. They do not accommodate the identification of essential prognostic factors of the overall disease status from a collection of patient characteristics, the evaluation of global treatment efficacy while adjusting for confounders, or the examination of any potential moderators of treatment efficacy. Such analytical tasks are often of interest in clinical settings (Pocock et al., 1987) and would be greatly facilitated by solid statistical tools for regression modeling.
In this paper, we seek to fill this gap by developing sensible regression strategies. While the global rank-sum score is not directly interpretable, we focus on a global percentile outcome (GPO), defined as the average of the subjects’ percentile rank for each outcome. As such, the GPO ranges between 0 and 1 and facilitates easy interpretation. Considering the formulation of this global percentile outcome, it is important to adopt modeling frameworks that require weak restrictions on the distribution of the GPO and on the form of the relationship between the GPO and covariates. We consider herein the transformed linear model and the monotonic index model (Han 1987, Sherman 1993, Cavanagh and Sherman 1998, Fan et al. 2017), both of which are semiparametric and pose relaxed model assumptions. Research efforts have been devoted to estimation methods under these modeling frameworks, but existing works are only designed for a single outcome. The GPO, however, is a composite outcome formed by multiple individual outcomes. As we shall show later, the GPO is not directly observable and needs to be estimated, and the estimated GPO is no longer independent across subjects. As such, existing regression strategies do not account for the special feature of the GPO. Moreover, even with a single outcome, there remain some challenges in variance estimation and inference under the monotonic index model. Existing variance estimation methods often require smoothing, which can cause uncertainty due to the need to specify a bandwidth parameter.
In this work, our goal is to develop inferential strategies for the GPO under the flexible transformed linear model and the monotonic index model. In Section 2, we propose estimation and inference procedures that will account for the special features of the GPO. We first consider the situation when all individual outcomes are continuous, then accommodate scenarios when some of the individual outcomes involve ties or have a discrete distribution. The proposed methods are rigorously justified by theoretical results on consistency and asymptotic normality, and we develop consistent variance estimation procedures. The numerical performance of the proposed methods is evaluated via extensive simulations in Section 3. In Section 4, we apply the proposed methods to the NET-PD LS-1 study to examine the risk factors on global disease progression.
2. Methods
2.1. Formulation of the global percentile outcome
To illustrate the motivation, we first consider a commonly used transformed linear model for a single outcome Yi,
where is a vector of prognostic factors for subject i (i = 1, 2,…, n), f(·) is a monotonic increasing link function, and errors εi’s are of mean 0 and assumed to be independent, and identically distributed (i.i.d.). Note that this model can be equivalently written as
| (1) |
where is a random variable representing the percentile level of Yi in the study population, F(·) is the marginal cumulative distribution function (CDF) of Yi, and h(·) is the marginal quantile function of f(Yi).
Our focus lies in the situation when there are M outcomes Y1, …, YM, each with marginal CDF Fm(y) = P(Ym ≤ y), m = 1, 2, …, M, where M is an integer that does not grow with the sample size. Without loss of generality, suppose that higher values represent worse disease severity for all outcomes. To integrate information from these M outcomes, we propose a global percentile outcome (GPO) as
| (2) |
which represents the composite percentile across M outcomes for subject i.
If some of the individual outcomes are deemed more clinically important than the other outcomes, one could pre-specify weights based on prior knowledge and define a weighted version of the GPO as , where . Without loss of generality, we focus on the unweighted definition of the GPO below, which corresponds to the situation with wm ≡ 1/M, but all estimation and inference procedures also apply to the weighted version.
GPO ranges between [0, 1], and bears straightforward interpretation as the global level of subject i in the study population, when all M outcomes are considered simultaneously. When each of the M outcomes captures valuable information about certain aspects of the disease, but none of them is conclusive, the GPO effectively combines the information from them to achieve comprehensive assessments. Further, the GPO is robust to outliers and/or skewed distributions and invariant to monotone transformations. For now, we consider situations when all Ym’s are continuous outcomes and do not involve ties. The situation when some of these outcomes involve ties or have a discrete distribution will be discussed in Section 2.5.
As the average of M uniformly distributed random variables, the distribution of is generally non-uniform and depends on M as well as the association between Y1, …, YM. See the left column of Fig. 1 for the histogram of formed by three correlated outcomes, Y1, Y2 and Y3, that follow the Frank copula (top row) or the Clayton copula (bottom row). As it is challenging to correctly specify the distribution of the GPO, it is desirable to adopt modeling strategies that rely on weak distributional assumptions.
Fig. 1.

Histograms of and , where is formed by three correlated continuous outcomes that follow the Frank copula (top panel) or the Clayton copula (bottom panel).
In practice, the true marginal CDFs of the Ym’s in the population are often unknown, preventing us from observing the true . We can derive an empirically estimated as
| (3) |
where denotes the empirical CDF of Ym, m = 1, 2, …, M. Also note that is proportional to the global rank-sum in O’Brien (1984).
2.2. Transformed linear model
We first consider a transformed linear model (1), where εi has an unspecified distribution with zero mean and finite variance. The monotonic increasing link function h(·) can be pre-specified as the identity link or other commonly used link functions. The choice of h(·) is relevant to the model mis-specification and estimation efficiency. In preliminary simulations, we find that the quantile function of a truncated normal distribution with zero mean and variance 1, truncation points ±3, is a sensible choice of h(·). Other link functions, such as the logit link, may also be considered. Fig. 1 compares the histogram of and for correlated outcomes when h(·) is the truncated normal link.
Under model (1) and with a pre-specified h(·), we adopt the following estimating equation to obtain the estimator of regression coefficient, denoted as ,
| (4) |
While the equation can be easily solved using standard software, it is not straightforward to derive the asymptotic distribution and variance of . This is because are constructed based on the empirical CDFs, which use all the observations. As such, are not independent across subjects. In Supplemental Material A, we use the technique of Hájek projection for U-statistics to sort out the asymptotic distribution of , which leads to the theorem below.
Theorem 1. Under model (1) and regularity conditions A1–A3 in Supplemental Material A, we have and
where , , and .
The detailed proof, coupled with the definition of ξji and , can be found in Supplemental Material A. For matrix B, s2i represents the variability in the estimating function when is known, and s1i corresponds to the additional variability due to the estimated . The sandwich-type variance covariance matrix can be estimated explicitly, by replacing A and B with their respective empirical counterparts.
2.3. Monotonic index model
The transformed linear model features easy implementation and convenient variance estimation. However, one disadvantage is that it requires a prespecified link function. In practice, there is often little prior knowledge to inform the selection of h(·), and mis-specification of this link function may lead to misleading inference. To avoid the specification of h(·), we consider a monotonic index model (Han, 1987; Cavanagh and Sherman, 1998) for GPO as follows,
| (5) |
Here, εi is an i.i.d. error term with an unspecified distribution, and ζ(a1, a2) is an unspecified bivariate function that is monotonically increasing in both components. The regression coefficient β0 links covariates explicitly to the GPO under minimal assumptions. Model (5) includes many commonly used models as special cases, such as model (1) which corresponds to ζ(a1, a2) = h−1(a1 + a2). Because of the unspecified ζ(·), we require the that ∥β∥2 = 1 to ensure model identifiability. Therefore, number of unknown parameters in β0 equals p − 1, where p is the dimension of X. Without loss of generality, we shall treat θ ≡ (β2, …, βp)⊤ as free parameters below and let θ0 ≡ (β20, …, βp0)⊤.
Considering the monotonicity of the link function, we extend the rank estimation strategies (Han, 1987; Cavanagh and Sherman, 1998) for parameter estimation and set the objective function as
| (6) |
where gives the sample rank of among the n subjects. g(·) is a monotonic increasing function, and its choice does not affect the consistency of , but affects the efficiency. We set g(·) also as the quantile function of a truncated normal distribution with zero mean, σ2 = 1 and truncation points ±3, which rendered favorable numerical performance in our exploratory simulation studies.
We can obtain by maximizing (6) through optimization algorithms that accommodate non-smooth objective functions, such as the Nelder–Mead algorithm in R package optim, and then obtain the corresponding . The non-intercept part of can serve as a nice initial value after being standardized to have the unit norm. To ensure locating the global maxima, it is beneficial to implement the optimization algorithm with multiple initial values, which can be obtained by adding random errors to the estimator from the transformed linear model. Moreover, since the sign of β1 is uncertain given θ = (β2, …, βp)T, we may use a covariate with known direction as X1, after verifying this direction in the transformed linear estimator. When the direction is uncertain for all elements of X, one should implement the optimization twice, once with a positive sign and once with a negative sign for β1, and retain the maximizer with larger fitted objective function.
Deriving the asymptotic distribution of and was complicated by the non-smoothness of Gn(·) and the dependence between . To tackle these, we show that Γn(θ) = Gn(θ) − Gn(θ0) is asymptotically equivalent to in the vicinity of θ0. The first part is a zero-mean order-3 U process accounting for the difference between and , and the second part Γ2n(θ) is an order-2 U process involving the true only. Define , , and let V be the second derivative of Γ(θ) evaluated at θ0. We apply the U-process decomposition technique (Sherman, 1994) to show that in the vicinity of θ0, where Wn constitutes two parts that correspond to and Γ2n(θ) respectively and asymptotically follows a zero-mean normal distribution with variance covariance matrix Δ. Along these arguments, we obtained the following Theorem; the detailed proof is detailed in Supplemental Materials B.
Theorem 2. Under model (5) and conditions B1–B4 in Supplemental Material B, we have , and converges to a normal distribution with mean 0 and variance covariance matrix V−1ΔV−1.
In addition to the exact objective function in (6), we considered a smoothed objective function, which may be easier to solve numerically. Specifically, the indicator functions are approximated by Φ{(Xi − Xk)β/Cn}, where Cn is a bandwidth parameter, and Φ(·) is the CDF of a standard normal distribution. One may let Cn = C × n−1/3 for a constant C, and the resulting objective function is
| (7) |
which can be optimized via standard optimization algorithms. We evaluate the performance of this smoothed objective function in our numerical studies.
2.4. Variance estimation for the rank estimator
Although the variance covariance matrix of is derived theoretically in Supplemental Material B, the formula involves unknown components that are difficult to estimate explicitly. This issue often arises with non-smooth objective function or estimating equations. Moreover, the standard bootstrap technique does not apply here, as it creates many ties and complicates the pairwise comparisons in the objective function. To resolve these issues, following the resampling procedures in Jin et al. (2001) and Peng and Huang (2008), we propose a perturbation algorithm to achieve a consistent variance estimation for .
Let ωi be an i.i.d. positive random variable with mean 1 and variance 1 for i = 1, 2, …, n, such as ωi ~ Exponential(1). We propose to first perturb the estimated GPO as
Next, using the same set of , we perturb the objective function Gn(β) as
| (8) |
the maxima of which is denoted by . In practice, one can repeat the perturbation for B times to obtain , b = 1, 2, …, B, where B can be set as a reasonably large number such as 200 or 400. The sample variance covariance matrix of can serve as a desirable estimator for the variance covariance matrix of . The rationale of this perturbation scheme lies in the fact that the conditional distribution of given the observed data is asymptotically equivalent to the unconditional distribution of . Formal justifications are provided in Supplemental Material C.
2.5. Handling outcomes with ties and ordinal outcomes
Up to this point, we have focused on situations where all Ym, m = 1, 2, …, M are continuous without ties. Our estimation and inference procedures can be extended to situations where some of these outcomes involve ties or have a discrete distribution. First, consider the situation where all individual outcomes are continuous, but some of them involve ties. We can replace the empirical CDF by
| (9) |
where gives the rank of Yim in {Y1m, …, Ynm} using the fractional ranking strategy for ties. Next, consider the situation when an individual outcome, say Ym, is binary or ordinal. We adopt F(y) = {P(Ym ≤ y) + P(Ym < y)}/2 instead of the standard CDF, P(Ym ≤ y), in the definition of GPO in (2). Similarly, in the calculation of , we use (9) in place of the empirical CDF.
It is worth noting that one may have more than one individual outcome that involves ties or has discrete distributions, as long as the resulting GPO combining all individual outcomes remains continuous. With these modifications, the previous estimation and inference procedures continue to work.
3. Simulation study
We conducted extensive simulations to examine the finite-sample performance of the proposed transformed linear estimator and rank estimator . We considered four covariates, where X1 ~ Unif (−1, 1), X2 ~ Bernoulli(0.5), X3 ~ Normal(0, 12), and X4 ~ Gamma(shape = 2, rate 2). Four correlated outcomes, Y1, Y2, Y3 and Y4, were generated from the Clayton copula with parameter 2, and marginal distributions of these outcomes are Y1 ~ Normal(0, 12), Y2 ~ Exponential(1), Y3 ~ t(df = 3), and Y4 ~ Unif(−1, 1), respectively.
The following four different setups regarding the relationship between the GPO and covariates were considered:
Setup 1: , εi ~ Normal(0, 0.82),
Setup 2: , εi ~ Exponential(1),
Setup 3: , εi ~ Unif (0.5, 1.5),
Setup 4: , εi ~ Unif (0.5, 1.5),
where Hs(·) are all monotone functions for s = 1, 2, 3, 4. As such, the monotonic index model (5) is correctly specified under all four setups. The transformed linear model (1) is subject to mis-specification under all setups, and the degree of mis-specification is larger under Setups 3–4 than that under Setups 1–2. The first two setups still follow the transformed linear model, but the link function is subject to mis-specification. The true links under Setups 1–2 were plotted in Figure S1 of Supplemental Material D, along with the assumed link, to show the degree of mis-specification. Setups 3–4 represent more complex effect patterns than specified by the model. We let the true values β0 = (0.5, −0.707, 0, 0.5) with ∥β0∥2 = 1.
Under each setup, we ran 1000 simulations with sample sizes of n = 200 or n = 400. For each simulated dataset, we implemented the perturbation resampling method with B = 200 to obtain the standard error estimates. R codes for implementing the proposed method can be found at https://github.com/rli1010/GPOreg. Table 1 summarizes simulation results for the proposed estimators, including empirical biases (BIAS), empirical standard deviations (ESD), average of perturbation resampling-based standard errors (ASE), and the empirical coverage probabilities (ECP) of 95% Wald-type confidence intervals and empirical rejection rates (ERR) for testing the null hypotheses : for j = 1, 2, 3, 4. Without loss of generality, we adopted the quantile function of the truncated normal distribution with truncation points ±3 for the h(·) and g(·) functions. To facilitate the comparisons between and , we standardized the transformed linear estimator by its L2-norm; and obtained the variance estimation for the standardized estimator using the Delta method.
Table 1.
Simulation results for the transformed linear estimator and the rank estimator, where β0 = (0.5, −0.707, 0, 0.5).
| n | Transformed linear estimator | Rank estimator | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| BIAS | ESD | ASE | ECP | ERR | BIAS | ESD | ASE | ECP | ERR | ||
| Setup 1 | |||||||||||
| 200 | 0.000 | 0.089 | 0.086 | 0.941 | 1.000 | −0.011 | 0.102 | 0.102 | 0.942 | 0.986 | |
| −0.001 | 0.073 | 0.072 | 0.931 | 1.000 | 0.013 | 0.086 | 0.088 | 0.937 | 1.000 | ||
| −0.002 | 0.059 | 0.057 | 0.942 | 0.058 | −0.002 | 0.066 | 0.068 | 0.956 | 0.044 | ||
| −0.024 | 0.080 | 0.073 | 0.915 | 1.000 | −0.002 | 0.098 | 0.095 | 0.930 | 0.996 | ||
| 400 | 0.003 | 0.063 | 0.062 | 0.929 | 1.000 | −0.006 | 0.071 | 0.073 | 0.947 | 1.000 | |
| −0.005 | 0.053 | 0.051 | 0.927 | 1.000 | 0.005 | 0.061 | 0.062 | 0.947 | 1.000 | ||
| 0.000 | 0.043 | 0.041 | 0.934 | 0.066 | 0.000 | 0.047 | 0.048 | 0.948 | 0.052 | ||
| −0.022 | 0.053 | 0.052 | 0.916 | 1.000 | −0.003 | 0.066 | 0.068 | 0.950 | 1.000 | ||
| Setup 2 | |||||||||||
| 200 | 0.007 | 0.078 | 0.077 | 0.944 | 1.000 | −0.011 | 0.088 | 0.092 | 0.953 | 0.994 | |
| −0.017 | 0.063 | 0.061 | 0.901 | 1.000 | 0.003 | 0.072 | 0.078 | 0.958 | 1.000 | ||
| 0.001 | 0.055 | 0.053 | 0.936 | 0.064 | 0.000 | 0.058 | 0.062 | 0.960 | 0.040 | ||
| −0.051 | 0.064 | 0.063 | 0.845 | 1.000 | −0.007 | 0.082 | 0.087 | 0.953 | 1.000 | ||
| 400 | 0.012 | 0.056 | 0.054 | 0.923 | 1.000 | −0.003 | 0.060 | 0.064 | 0.962 | 1.000 | |
| −0.015 | 0.045 | 0.043 | 0.904 | 1.000 | 0.005 | 0.051 | 0.053 | 0.952 | 1.000 | ||
| 0.000 | 0.038 | 0.038 | 0.955 | 0.045 | 0.000 | 0.039 | 0.043 | 0.969 | 0.031 | ||
| −0.044 | 0.044 | 0.044 | 0.806 | 1.000 | −0.001 | 0.057 | 0.060 | 0.953 | 1.000 | ||
| Setup 3 | |||||||||||
| 200 | 0.002 | 0.034 | 0.033 | 0.938 | 1.000 | −0.001 | 0.037 | 0.040 | 0.956 | 1.000 | |
| −0.013 | 0.028 | 0.026 | 0.893 | 1.000 | 0.003 | 0.030 | 0.031 | 0.952 | 1.000 | ||
| 0.000 | 0.025 | 0.023 | 0.936 | 0.064 | 0.000 | 0.027 | 0.028 | 0.952 | 0.048 | ||
| −0.025 | 0.028 | 0.027 | 0.828 | 1.000 | 0.001 | 0.035 | 0.036 | 0.959 | 1.000 | ||
| 400 | 0.001 | 0.024 | 0.024 | 0.938 | 1.000 | −0.002 | 0.026 | 0.027 | 0.964 | 1.000 | |
| −0.016 | 0.019 | 0.019 | 0.834 | 1.000 | 0.001 | 0.021 | 0.022 | 0.949 | 1.000 | ||
| 0.000 | 0.017 | 0.016 | 0.943 | 0.057 | −0.001 | 0.018 | 0.019 | 0.963 | 0.037 | ||
| −0.026 | 0.020 | 0.019 | 0.720 | 1.000 | 0.000 | 0.023 | 0.025 | 0.957 | 1.000 | ||
| Setup 4 | |||||||||||
| 200 | 0.016 | 0.074 | 0.072 | 0.929 | 1.000 | −0.004 | 0.084 | 0.086 | 0.957 | 0.993 | |
| −0.009 | 0.063 | 0.060 | 0.912 | 1.000 | 0.009 | 0.073 | 0.074 | 0.949 | 1.000 | ||
| 0.001 | 0.051 | 0.049 | 0.935 | 0.065 | 0.002 | 0.054 | 0.056 | 0.946 | 0.054 | ||
| −0.049 | 0.068 | 0.065 | 0.865 | 1.000 | −0.006 | 0.085 | 0.085 | 0.939 | 0.996 | ||
| 400 | 0.017 | 0.052 | 0.051 | 0.912 | 1.000 | 0.000 | 0.057 | 0.060 | 0.952 | 1.000 | |
| −0.015 | 0.042 | 0.042 | 0.913 | 1.000 | 0.005 | 0.048 | 0.051 | 0.955 | 1.000 | ||
| 0.000 | 0.034 | 0.035 | 0.951 | 0.049 | 0.000 | 0.037 | 0.039 | 0.964 | 0.036 | ||
| −0.049 | 0.049 | 0.046 | 0.801 | 1.000 | −0.003 | 0.056 | 0.059 | 0.942 | 1.000 | ||
We observe in Table 1 that the rank estimator is virtually unbiased under all settings, and the empirical bias tends to shrink when the sample size increases. The perturbation resampling-based standard errors agree with the empirical standard deviations quite well and, as expected, decrease with the sample size at the rate. The ECPs are close to the nominal level of 95% under both sample sizes. The ERR’s for testing are close to 0.05 when . The ERR’s are quite high when equals 0.5 or −0.707, suggesting satisfactory power of the proposed Wald test.
The transformed linear estimator has reasonable performance, but it tends to have larger bias than . This is likely because the transformed linear model poses stronger model assumptions that are subject to mis-specification. The ESDs tend to be slightly smaller than their counterparts for the rank estimator, and the ASEs provide good approximation to the ESD. However, the ECPs are sometimes lower than the nominal level and do not improve when the sample size increases.
We conducted additional simulation studies for the transformed linear model under Setups 1 and 3, using various link functions h(·). The results were displayed in Table S1 of Supplemental Material D. Under Setup 1 where the transformed linear model holds, the true link used in the data generation led to the best performance in terms of efficiency and coverage. When the link function and/or the model form were mis-specified, the truncated normal link and the logit link showed better performance than the identity link. The performance of the transformed linear estimator was reasonable when the degree of mis-specification is small (Setup 1) but worsened when the degree of mis-specification increases (Setup 3).
Furthermore, we have conducted simulations for the rank estimator by adopting other choices of g(·), including the identity link and the rank of . The results are summarized in Table S2 of Supplemental Material D. The performance of the rank estimator was satisfactory and largely comparable using different choices of g(·). The truncated normal link rendered ESD and root mean squared error (RMSE) that were slightly smaller, though the difference was very small. The identity link was slightly inferior than the truncated normal link and the rank in terms of efficiency.
We also examined the performance of the kernel-smoothed objective function in (7). Results of the corresponding estimators are displayed in Table 2, where the top panel represents the results under the exact objective function, and the bottom panels correspond to the kernel-smoothed objective function with different C’s in Cn = C × n−1/3. We observe a bias–variance trade-off when n = 200, with increasing bias and decreasing variance when C gets larger. When the bandwidth is too large (e.g., C = 3), the bias becomes non-ignorable, and the convergence rates (CR) decrease dramatically. However, the bias–variance trade-off becomes less noticeable when n = 400, where the results are not sensitive to the choice of C. These results suggest that the performance of the kernel-smoothed estimator is very similar to that of the original estimator when the bandwidth is not too large.
Table 2.
Simulation results of the rank estimator under Setup 4, with and without kernel smoothing (bandwidth Cn = C × n−1/3), where β0 = (0.5,−0.707, 0, 0.5). Column CR reports the convergence rate.
| C | n = 200 | n = 400 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BIAS | ESD | ASE | ECP | ERR | CR | BIAS | ESD | ASE | ECP | ERR | CR | ||
| Exact | −0.004 | 0.084 | 0.086 | 0.957 | 0.993 | 0.997 | 0.000 | 0.057 | 0.060 | 0.952 | 1.000 | 0.995 | |
| 0.009 | 0.073 | 0.074 | 0.949 | 1.000 | 0.005 | 0.048 | 0.051 | 0.955 | 1.000 | ||||
| 0.002 | 0.054 | 0.056 | 0.946 | 0.054 | 0.000 | 0.037 | 0.039 | 0.964 | 0.036 | ||||
| −0.006 | 0.085 | 0.085 | 0.939 | 0.996 | −0.003 | 0.056 | 0.059 | 0.942 | 1.000 | ||||
| 0.1 | −0.009 | 0.079 | 0.087 | 0.960 | 0.998 | 1.000 | −0.004 | 0.059 | 0.060 | 0.948 | 1.000 | 1.000 | |
| 0.007 | 0.072 | 0.075 | 0.956 | 1.000 | 0.002 | 0.050 | 0.051 | 0.954 | 1.000 | ||||
| 0.000 | 0.053 | 0.055 | 0.963 | 0.037 | 0.001 | 0.038 | 0.038 | 0.964 | 0.036 | ||||
| −0.002 | 0.082 | 0.084 | 0.955 | 0.999 | −0.003 | 0.057 | 0.059 | 0.945 | 1.000 | ||||
| 0.25 | −0.008 | 0.077 | 0.084 | 0.960 | 0.999 | 1.000 | −0.004 | 0.058 | 0.058 | 0.941 | 1.000 | 1.000 | |
| 0.006 | 0.069 | 0.073 | 0.960 | 1.000 | 0.003 | 0.049 | 0.050 | 0.949 | 1.000 | ||||
| 0.000 | 0.052 | 0.054 | 0.965 | 0.035 | 0.001 | 0.037 | 0.037 | 0.961 | 0.039 | ||||
| −0.003 | 0.080 | 0.082 | 0.948 | 1.000 | −0.002 | 0.056 | 0.057 | 0.940 | 1.000 | ||||
| 0.5 | −0.006 | 0.074 | 0.079 | 0.957 | 1.000 | 1.000 | −0.004 | 0.057 | 0.055 | 0.929 | 1.000 | 1.000 | |
| 0.008 | 0.067 | 0.069 | 0.950 | 1.000 | 0.003 | 0.048 | 0.047 | 0.942 | 1.000 | ||||
| 0.001 | 0.051 | 0.051 | 0.950 | 0.050 | 0.001 | 0.036 | 0.036 | 0.953 | 0.047 | ||||
| −0.002 | 0.077 | 0.078 | 0.942 | 1.000 | −0.002 | 0.055 | 0.055 | 0.932 | 1.000 | ||||
| 1 | −0.004 | 0.071 | 0.074 | 0.946 | 1.000 | 1.000 | −0.002 | 0.056 | 0.052 | 0.925 | 1.000 | 1.000 | |
| 0.010 | 0.065 | 0.065 | 0.945 | 1.000 | 0.004 | 0.047 | 0.045 | 0.939 | 1.000 | ||||
| 0.001 | 0.051 | 0.050 | 0.943 | 0.057 | 0.001 | 0.036 | 0.035 | 0.947 | 0.053 | ||||
| 0.000 | 0.075 | 0.073 | 0.935 | 1.000 | −0.001 | 0.054 | 0.052 | 0.930 | 1.000 | ||||
| 2 | 0.005 | 0.067 | 0.069 | 0.948 | 1.000 | 0.978 | 0.001 | 0.055 | 0.051 | 0.921 | 1.000 | 1.000 | |
| 0.020 | 0.065 | 0.064 | 0.943 | 1.000 | 0.009 | 0.046 | 0.045 | 0.936 | 1.000 | ||||
| 0.002 | 0.055 | 0.053 | 0.941 | 0.059 | 0.001 | 0.038 | 0.036 | 0.948 | 0.052 | ||||
| 0.005 | 0.073 | 0.070 | 0.938 | 1.000 | 0.002 | 0.052 | 0.050 | 0.930 | 1.000 | ||||
| 3 | 0.046 | 0.058 | 0.064 | 0.914 | 1.000 | 0.595 | 0.010 | 0.053 | 0.049 | 0.913 | 1.000 | 0.967 | |
| 0.051 | 0.067 | 0.066 | 0.889 | 1.000 | 0.019 | 0.047 | 0.045 | 0.934 | 1.000 | ||||
| 0.005 | 0.060 | 0.059 | 0.933 | 0.067 | 0.001 | 0.040 | 0.039 | 0.946 | 0.054 | ||||
| 0.004 | 0.073 | 0.069 | 0.924 | 1.000 | 0.008 | 0.051 | 0.050 | 0.925 | 1.000 | ||||
Finally, we examined the performance of proposed estimators in the situation where some of the outcomes are discrete. The data generation scheme for Y1 and Y2 remained the same. The marginal distribution of Y3 follows Bernoulli(0.5), and Y4 ∈ {1, 2, 3, 4} marginally follows Multinomial{4; (0.1, 0.5, 0.15, 0.25)}. The resulting is still continuously distributed, and we generated data to follow the aforementioned Setup 4. As seen in Table 3, the performance of both estimators is comparable to their counterparts when all the individual outcomes are continuous.
Table 3.
Simulation results under Setup 4, where some of the outcomes are discrete and β0 = (0.5, −0.707, 0, 0.5).
| n | Linear transformation estimator | Rank regression estimator | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| BIAS | ESD | ASE | ECP | ERR | BIAS | ESD | ASE | ECP | ERR | ||
| 200 | 0.007 | 0.080 | 0.076 | 0.925 | 1.000 | −0.011 | 0.085 | 0.088 | 0.948 | 0.996 | |
| −0.020 | 0.065 | 0.061 | 0.889 | 1.000 | 0.007 | 0.074 | 0.074 | 0.941 | 1.000 | ||
| −0.003 | 0.052 | 0.050 | 0.940 | 0.060 | −0.002 | 0.056 | 0.057 | 0.957 | 0.043 | ||
| −0.057 | 0.069 | 0.067 | 0.843 | 1.000 | −0.002 | 0.085 | 0.087 | 0.935 | 0.998 | ||
| 400 | 0.013 | 0.054 | 0.054 | 0.939 | 1.000 | −0.003 | 0.058 | 0.062 | 0.963 | 1.000 | |
| −0.023 | 0.043 | 0.044 | 0.878 | 1.000 | 0.003 | 0.050 | 0.052 | 0.943 | 1.000 | ||
| 0.002 | 0.038 | 0.036 | 0.940 | 0.060 | 0.001 | 0.040 | 0.040 | 0.953 | 0.047 | ||
| −0.058 | 0.049 | 0.048 | 0.749 | 1.000 | −0.004 | 0.060 | 0.060 | 0.945 | 1.000 | ||
4. Real data example
We applied the proposed methods to the aforementioned NET-PD LS-1 study (Kieburtz et al., 2015), where the primary outcome of global disease progression was defined as the rank-sum of five individual measures at the 5-year visit (Elm and Investigators, 2012). Some of the individual outcomes were reversely coded, such that higher values were worse for all individual outcomes. Application of the O’Brien’s GST yielded a two-sided p-value of p = 0.45, suggesting no significant difference between the creatine arm and the placebo arm (Kieburtz et al., 2015). In the following analysis, we aimed to evaluate whether and how baseline covariates affect the disease progression for early stage PD patients. Our analytic data consisted of n = 704 subjects with complete data. Besides the treatment arm (creatine vs. placebo), we considered the following baseline covariates in the regression model: race (non-Hispanic White vs. others), gender (male vs. female), age of symptom onset (OnsetAge), Total Functional Capacity (TFC), levodopa equivalent dose (LED), Beck Depression Inventory score (BDI), and the Scales for Outcomes of Parkinson’s Disease Cognition (SCOPA-COG) score. These covariates were identified as potentially important risk factors in Bega et al. (2015) and/or our exploratory analyses. The continuous covariates were centered at the mean and standardized by their respective standard deviation.
Table 4 presents the estimated regression coefficients (COEF), standard errors (SE), and p-values under the transformed linear model (left panel) and the monotonic index model (right panel), respectively. Both estimators were restricted to have the unit norm, such that the COEFs represent the relative importance of individual risk factors. The results under the two models, as shown in Table 4, were largely consistent, though some differences were also noted. The SEs were comparable under the two models. Both methods identified younger age of symptom onset, lower baseline functional capacity, and worse baseline cognition status as prognostic factors of faster PD progression. The use of creatine (treatment) was insignificant under both models, and the COEF was closer to 0 under the monotonic index model. The monotonic index model also detected significant racial effect, where the non-Hispanic White race was associated with slower disease progression. This association was not captured by the transformed linear model.
Table 4.
Data analysis: estimated regression coefficients, standard error, and p-values.
| Transformed linear model | Monotonic index model | |||||
|---|---|---|---|---|---|---|
| COEF | SE | p | COEF | SE | p | |
| OnsetAge | 0.571 | 0.139 | <.001 | 0.480 | 0.138 | 0.001 |
| TFC | −0.318 | 0.125 | 0.011 | −0.214 | 0.102 | 0.036 |
| SCOPACOG | −0.555 | 0.150 | <.001 | −0.550 | 0.134 | <.001 |
| BDI | 0.151 | 0.118 | 0.203 | 0.161 | 0.117 | 0.167 |
| LED | 0.133 | 0.107 | 0.214 | 0.150 | 0.143 | 0.293 |
| Race | −0.458 | 0.311 | 0.142 | −0.611 | 0.286 | 0.033 |
| Treatment | −0.121 | 0.218 | 0.580 | 0.000 | 0.203 | 0.998 |
Fig. 2 displays the scatter plots with the estimated score on the x-axis and Table S2 of or on the y-axis. The solid curve represents the smoothed Lowess estimate. In both columns, the slope showed some curvature and appeared flatter for scores below −1. This pattern suggests potential model mis-specification under a (transformed) linear model, and the results under the monotonic index model are thus preferred for this data application.
Fig. 2.

Data analysis: scatter plot of vs. (left column) and vs. (right column), overlaid with the lowess curve.
5. Discussion
In this paper, we proposed regression strategies for the global percentile outcome (GPO), defined as the composite percentile of multiple individual outcomes. Regression models were formulated to associate the covariates directly to the GPO under relaxed and realistic assumptions, accounting for the fact that the distribution of GPO is non-normal and non-uniform. While the transformed linear model is appealing due to its simple implementation and easy interpretation, the monotonic index model has advantages in flexibility and robustness. Both estimators have been shown to have satisfactory numerical performance and asymptotic properties. Although the proposed methods were motivated by Parkinson’s disease studies, they can be readily applied to many disease fields where it is of interest to examine covariate effects on the global disease burden.
When the extent of mis-specification is small in our simulation studies, the transformed linear model may render smaller standard errors, which is desirable for small datasets. It is also easy to fit with a unique zero-crossing and an explicit variance formula. However, due to its stronger model assumptions, it may show larger bias and lower coverage rates when the degree of mis-specification increases. In practice, there is usually limited prior information about the underlying link function. Therefore, it is desirable to conduct some diagnostic procedures regarding the form of the link function, similar to what we did in Fig. 2 for the motivating example, to evaluate the plausibility of the assumed link. When the sample size is moderate to large, we recommend to adopt the rank estimator for more robust conclusions.
In special cases where an additional condition is satisfied for the covariate vector X, the transformed linear estimator is also robust to the mis-specified link. Li and Duan (1989) showed that if the distribution of X is elliptical symmetric and satisfies condition (A.2) therein, the slope part of is consistent up to a multiplicative scalar. Specifically, the slope part would be consistent after standardized to the unit norm. In applications where the elliptical symmetric assumption can be verified for the regressors, the transformed linear model can be fitted without concerns about the link violation.
Our modeling procedure examines covariate effects on the global disease level, as captured by a composite outcome based on ranks and percentiles. The direction of covariate effects on the individual outcomes may not always agree with its counterpart on the global outcome, especially in the presence of additional covariates and/or small sample sizes. Therefore, the results only pertain to the global disease level and may not apply simultaneously to all the individual outcomes. If the effect of a covariate on one outcome does not agree with those on other outcomes, the proposed method can still be applied, with the understanding that the inclusion of this outcome in the GPO may lead to a smaller covariate effect estimate. Following the regression modeling of GPO, one could analyze the individual outcomes in secondary analyses to gain more specific insights.
We have focused on complete data, but the proposed method could be extended to handle missing data when combined with multiple imputation. Moreover, it is of interest to extend the proposed modeling strategies to correlated observations in clustered data and longitudinal data. These directions are beyond the scope of the current work but merit future research.
Supplementary Material
Acknowledgments
This work was partially supported by awards from the National Institutes of Health (R03NS108136, R01DK117209, U24CA230144, R01CA269696), the National Science Foundation (DMS-1914496), and Cancer Prevention and Research Institute of Texas (RP200633). The authors acknowledge the Texas Advanced Computing Center (TACC) at The University of Texas at Austin for providing HPC resources that have contributed to the research results reported within this paper.
Footnotes
Appendix A. Supplementary data
Supplementary material related to this article can be found online at https://doi.org/10.1016/j.jspi.2022.06.009.
References
- Bega D, Kim S, Zhang Y, Elm J, Schneider J, Hauser R, Fraser A, Simuni T, 2015. Predictors of functional decline in early Parkinson’s disease: NET-PD LS1 cohort. J. Parkinson’s Dis 5 (4), 773–782. [DOI] [PubMed] [Google Scholar]
- Cavanagh C, Sherman RP, 1998. Rank estimators for monotonic index models. J. Econometrics 84 (2), 351–381. [Google Scholar]
- Elm JJ, Investigators T.N.N.-P., 2012. Design innovations and baseline findings in a long-term parkinson’s trial: The national institute of neurological disorders and stroke exploratory trials in Parkinson’s Disease long-term study–1. Mov. Disord 27 (12), 1513–1521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fan C, Lu W, Song R, Zhou Y, 2017. Concordance-assisted learning for estimating optimal individualized treatment regimes. J. R. Stat. Soc. Ser. B Stat. Methodol 79 (5), 1565–1582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Felker GM, Maisel AS, 2010. A global rank end point for clinical trials in acute heart failure, Circ.: Heart Fail 3 (5), 643–646. [DOI] [PubMed] [Google Scholar]
- Fereshtehnejad S-M, Romenets SR, Anang JB, Latreille V, Gagnon J-F, Postuma RB, 2015. New clinical subtypes of parkinson disease and their longitudinal progression: a prospective cohort comparison with other phenotypes. JAMA Neurol. 72 (8), 863–873. [DOI] [PubMed] [Google Scholar]
- Han AK, 1987. Non-parametric analysis of a generalized regression model: the maximum rank correlation estimator. J. Econometrics 35 (2–3), 303–316. [Google Scholar]
- Hinson VK, Delambo A, Elm J, Turner T, 2017. A randomized clinical trial of atomoxetine for mild cognitive impairment in Parkinson’s disease. Mov. Disord. Clin. Pract 4 (3), 416–423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang P, Tilley BC, Woolson RF, Lipsitz S, 2005. Adjusting o’brien’s test to control type I error for the generalized nonparametric Behrens–Fisher problem. Biometrics 61 (2), 532–539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jin Z, Ying Z, Wei L-J, 2001. A simple resampling method by perturbing the minimand. Biometrika 88 (2), 381–390. [Google Scholar]
- Kieburtz K, Tilley BC, Elm JJ, Babcock D, Hauser R, Ross GW, Augustine AH, Augustine EU, Aminoff MJ, Bodis-Wollner IG, et al. , 2015. Effect of creatine monohydrate on clinical progression in patients with Parkinson disease: a randomized clinical trial. JAMA 313 (6), 584–593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li K-C, Duan N, 1989. Regression analysis under link violation. Ann. Statist 1009–1052. [Google Scholar]
- Lim YY, Snyder PJ, Pietrzak RH, Ukiqi A, Villemagne VL, Ames D, Salvado O, Bourgeat P, Martins RN, Masters CL, et al. , 2016. Sensitivity of composite scores to amyloid burden in preclinical Alzheimer’s disease: Introducing the Z-scores of attention, verbal fluency, and Episodic memory for Nondemented older adults composite score. Alzheimer’s Dement.: Diagn. Assess. Dis. Monit 2, 19–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luo S, Lawson AB, He B, Elm JJ, Tilley BC, 2016. BayesIan multiple imputation for missing multivariate longitudinal data from a parkinson’s disease clinical trial. Stat. Methods Med. Res 25 (2), 821–837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Brien PC, 1984. Procedures for comparing samples with multiple endpoints. Biometrics 40 (4), 1079–1087. [PubMed] [Google Scholar]
- Peng L, Huang Y, 2008. Survival analysis with quantile regression models. J. Amer. Statist. Assoc 103 (482), 637–649. [Google Scholar]
- Pocock SJ, Geller NL, Tsiatis AA, 1987. The analysis of multiple endpoints in clinical trials. Biometrics 487–498. [PubMed] [Google Scholar]
- Ramchandani R, Schoenfeld DA, Finkelstein DM, 2016. Global rank tests for multiple, possibly censored, outcomes. Biometrics 72 (3), 926–935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sherman RP, 1993. The limiting distribution of the maximum rank correlation estimator. Econometrica 61 (1), 123–137. [Google Scholar]
- Sherman RP, 1994. Maximal inequalities for degenerate U-processes with applications to optimization estimators. Ann. Statist 439–459. [Google Scholar]
- Simuni T, Kieburtz K, Tilley B, Elm JJ, Ravina B, Babcock D, Emborg M, Hauser R, Kamp C, Morgan JC, et al. , 2015. Pioglitazone in early Parkinson’s disease: a phase 2, multicentre, double-blind, randomised trial. Lancet Neurol. 14 (8), 795–803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Strauss E, Sherman EM, Spreen O, 2006. A Compendium of Neuropsychological Tests: Administration, Norms, and Commentary. American Chemical Society. [Google Scholar]
- Tilley B, Huang P, O’Brien PC, 2014. Global assessment variables. Wiley; StatsRef: Statist. Ref. Online. [Google Scholar]
- Tilley BC, Marler J, Geller NL, Lu M, Legler J, Brott T, Lyden P, Grotta J, et al. , 1996. Use of a global test for multiple outcomes in stroke trials with application to the National Institute of Neurological Disorders and Stroke t-PA Stroke Trial. Stroke 27 (11), 2136–2142. [DOI] [PubMed] [Google Scholar]
- Vanderploeg RD, 2014. Clinician’s Guide to Neuropsychological Assessment. Psychology Press. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
