Skip to main content
Biostatistics (Oxford, England) logoLink to Biostatistics (Oxford, England)
. 2014 Apr 4;15(4):745–756. doi: 10.1093/biostatistics/kxu012

Almost efficient estimation of relative risk regression

Garrett M Fitzmaurice 1,*, Stuart R Lipsitz 2, Alex Arriaga 2, Debajyoti Sinha 3, Caprice Greenberg 4, Atul A Gawande 5,6
PMCID: PMC4168315  PMID: 24705141

Abstract

Relative risks (RRs) are often considered the preferred measures of association in prospective studies, especially when the binary outcome of interest is common. In particular, many researchers regard RRs to be more intuitively interpretable than odds ratios. Although RR regression is a special case of generalized linear models, specifically with a log link function for the binomial (or Bernoulli) outcome, the resulting log-binomial regression does not respect the natural parameter constraints. Because log-binomial regression does not ensure that predicted probabilities are mapped to the [0,1] range, maximum likelihood (ML) estimation is often subject to numerical instability that leads to convergence problems. To circumvent these problems, a number of alternative approaches for estimating RR regression parameters have been proposed. One approach that has been widely studied is the use of Poisson regression estimating equations. The estimating equations for Poisson regression yield consistent, albeit inefficient, estimators of the RR regression parameters. We consider the relative efficiency of the Poisson regression estimator and develop an alternative, almost efficient estimator for the RR regression parameters. The proposed method uses near-optimal weights based on a Maclaurin series (Taylor series expanded around zero) approximation to the true Bernoulli or binomial weight function. This yields an almost efficient estimator while avoiding convergence problems. We examine the asymptotic relative efficiency of the proposed estimator for an increase in the number of terms in the series. Using simulations, we demonstrate the potential for convergence problems with standard ML estimation of the log-binomial regression model and illustrate how this is overcome using the proposed estimator. We apply the proposed estimator to a study of predictors of pre-operative use of beta blockers among patients undergoing colorectal surgery after diagnosis of colon cancer.

Keywords: Bernoulli likelihood, Convergence problems, Maclaurin series, Poisson regression, Quasi-likelihood

1. Introduction

We consider prospective study designs where it is of scientific interest to estimate relative risks (RRs) conditional on covariates. Interestingly, in many studies where RRs are the parameters of primary scientific interest, odds ratios (ORs) are reported instead. This can be explained in part by the technical advantages of logistic regression (e.g. no constraints on the regression parameters) and the widespread availability of appropriate software. Although RR regression is a special case of generalized linear models, specifically with a log link function for the binomial (or Bernoulli) outcome, the resulting log-binomial regression does not respect the natural parameter constraints. Because log-binomial regression does not ensure that predicted probabilities are mapped to the [0,1] range, maximum likelihood (ML) estimation is often subject to numerical instability that leads to convergence problems. It has been noted by several authors that convergence problems are especially likely to arise when the predicted probabilities are close to 1 (Wacholder, 1986; Lu and Tilley, 2001; Zou, 2004; Carter and others, 2005). Several approaches have been proposed to circumvent the convergence problems associated with ML estimation of log-binomial regression. These include: (1) directly estimating RR using a constrained ML method that truncates the range of the probabilities (Wacholder, 1986); (2) adding a small constant to each subject's Bernoulli outcome in the sample (Clogg and others, 1991; Deddens and others, 2003); (3) indirectly estimating RR using the mathematical relationship between OR and RR for a single binary covariate (Zhang and Yu, 1998); and (4) quasi-likelihood method of moments techniques (Traissac and others, 1999; McNutt and others, 2003; Zou, 2004; Carter and others, 2005).

One approach that has been widely studied is the use of Poisson regression estimating equations (Traissac and others, 1999; McNutt and others, 2003; Zou, 2004; Carter and others, 2005). That is, the Poisson likelihood equations are used to estimate the RR regression parameters without constraints. The estimating equations for Poisson regression yield consistent, asymptotically normal (CAN) estimators of the RR regression parameters. Moreover, Carter and others (2005) found that they alleviate convergence problems associated with ML estimation of log-binomial regression parameters. Although estimating equations for Poisson regression yield consistent estimators of the RR regression parameters, they are inefficient because the “weight function” is misspecified as Poisson instead of bionomial or Bernoulli. As we demonstrate later, the loss of efficiency tends to be greatest in the very setting where their use is required, i.e. when predicted probabilities are far from zero.

We consider the relative efficiency of the Poisson regression estimator and develop an alternative, almost efficient estimator for the RR regression parameters. The proposed method uses near-optimal weights based on a Maclaurin series (Taylor series expanded around 0) approximation to the true Bernoulli weight function. If the Maclaurin series is truncated at its first term, this yields the Poisson regression estimator. Truncation at higher terms in the series yields near-optimal weights and almost efficient estimators. Using method-of-moments, assuming the RR regression model is correctly specified, the estimators are consistent for any given truncation of the Maclaurin series approximation to the optimal weight function. We examine the asymptotic relative efficiency (ARE) for an increase in the number of terms in the series. We also make recommendations for choice of the number of terms to avoid similar finite sample convergence problems as with ML estimation of log-binomial regression parameters. We present results of a simulation study that highlight the potential gains in efficiency in finite samples.

The proposed method is motivated by a study of best practices for patients undergoing surgery for colorectal cancer (Arriaga and others, 2009). A panel of colorectal and general surgeons was assembled to ascertain a set of 37 evidence-based practices that they considered to be the most pertinent to the evaluation and management of a patient undergoing colorectal surgery after diagnosis of colon cancer. Patients with known heart disease who are given beta blockers prior to surgery have been found to have a significantly reduced risk of post-operative death (Poldermans and others, 1999). Thus, one of the key practices is giving beta blockers when indications (heart disease) are present. In this study, the medical records of 210 cancer patients with cardiac conditions from three hospitals were reviewed (due to confidentiality, hospital names must remain anonymous). Here the binary outcome of interest is whether the patient was given beta blockers (yes, no) prior to surgery. Upon review of the medical records, it was found that beta blockers were given to 130 out of the 210 Inline graphic patients, indicating that many doctors are not meeting the best practices guideline for almost Inline graphic of the patients. The goal of this study is to determine predictors of pre-operative use of beta blockers in these patients. The main predictors of interest are the patient's age (in years), race (categorized as white versus other), gender, number of comorbidities, and “American Society of Anesthesiologists (ASA) score.” The ASA score is a global assessment of the physical status of the patient (Owens and others, 1978) and yields a two-level indicator of a patient's pre-operative disease status at diagnosis (Inline graphic disease, Inline graphic-threatening disease). Table 1 presents the overall distributions of beta blocker use, age, race, gender, number of comorbidities, and ASA score. Although all of these cardiac patients should receive pre-operative beta blockers, it is of interest to explore whether pre-operative disease status (ASA score), age, race, gender, and number of comorbidities are predictive of those patients who were given beta blockers. The study investigators conjectured that patients with higher risks of complications, i.e. patients who are older, with worse ASA score and more comorbidities, are more likely to receive pre-operative beta blockers. It is also of interest to examine whether there are any differences by race and gender.

Table 1.

Descriptive statistics for patient characteristics from study of pre-operative use of beta blockers among patients undergoing colorectal surgery

Variable Level Number (Inline graphic)
Pre-operative beta blockers Yes 130 Inline graphic
No 80 Inline graphic
ASA score 0 92 Inline graphic
1 118 Inline graphic
Gender Male 98 Inline graphic
Female 112 Inline graphic
Race White 182 Inline graphic
Other 28 Inline graphic
Number of comorbidities 0 30 Inline graphic
1 62 Inline graphic
2 58 Inline graphic
3 36 Inline graphic
4 17 Inline graphic
5 4 Inline graphic
6 3 Inline graphic
Age, median (range) 68.5 (26–95)

In Section 2, we describe the RR regression model, the corresponding Bernoulli likelihood, and the proposed estimating equations for the RR regression parameters. In Section 3, we present results of a study examining the ARE of the proposed estimator for increasing number of terms in the series. In Section 4, we present results of a simulation study that demonstrate the potential gains in efficiency in finite samples.

2. RR regression model

Let Inline graphic denote the binary response (success or failure) for subject Inline graphic, Inline graphic, where Inline graphic is the number of independent subjects. Then Inline graphic is the success probability, where Inline graphic is a Inline graphic vector of covariates. In RR regression, the success probability is modeled using the log link,

2.

or, equivalently, Inline graphic, where Inline graphic is the vector of regression parameters. One can easily show that the elements of Inline graphic (with the exception of the intercept) have interpretation as log-RRs (see, for example, Jewell, 2003). For the remainder of this paper, we assume that the main interest centers on estimating the regression parameter vector Inline graphic.

The Bernoulli likelihood is

2. (2.1)

The ML estimating equations for Inline graphic are Inline graphic, where

2. (2.2)

The MLE is the asymptotically efficient estimate. However, when the success probability approaches 1, the denominator of (2.2) approaches 0, resulting in convergence problems. Wacholder (1986) constrained the likelihood to prevent Inline graphic from approaching 0; however, these modifications are still subject to convergence problems (Baumgarten and others, 1989).

In general, to estimate Inline graphic one can use estimating equations (or quasi-likelihood equations; Wedderburn, 1974) of the form Inline graphic, where

2. (2.3)

with Inline graphic being a “weight” function of Inline graphic Assuming that the regression model for Inline graphic has been correctly specified, i.e. Inline graphic these estimating equations yield CAN estimators of Inline graphic for any bounded weight function Inline graphic (see, for example, Rotnitzky, 2009). Specifically, it can be shown that the asymptotic distribution of Inline graphic, the estimator for Inline graphic with a particular choice of Inline graphic, satisfies

2. (2.4)

where

2.

and

2.

Note that the asymptotically efficient estimate is the MLE, with weight function Inline graphic and asymptotic covariance determined by Inline graphic. Consistent estimators of the asymptotic covariance of the estimated regression parameters can be obtained using the empirical estimator of Inline graphic proposed by Huber (1967), White (1982), and Royall (1986). The empirical variance estimator is obtained by evaluating Inline graphic at Inline graphic and substituting Inline graphic for Inline graphic; this is widely known as the sandwich variance estimator.

The use of Poisson regression for estimating Inline graphic has received much attention recently (e.g. Traissac and others, 1999; McNutt and others, 2003; Zou, 2004; Carter and others, 2005). The Poisson regression estimating equations are Inline graphic, where

2. (2.5)

It is apparent that the Poisson regression estimating equations are simply a special case of (2.3) with Inline graphic Thus, although the Poisson regression estimating equations produce consistent estimators of Inline graphic, they can be quite inefficient because Inline graphic is not the optimal or asymptotically efficient weight function. In general, weight functions closer to Inline graphic will have higher efficiency. Thus, the goal of this paper is to choose a Inline graphic close to Inline graphic but one that also avoids the convergence problems associated with ML. Lumley and others (2006) considered the weights Inline graphic Inline graphic and Inline graphic and estimated their relative efficiency (relative to the MLE). Lumley and others found that with Inline graphic close to 1 these three estimating equations can give inefficient estimates; this is due to the fact that the three weight functions considered do not closely approximate the optimal weight function.

To develop more efficient estimating equations, we first note that the Maclaurin series (Abramowitz and Stegun, 1970) of Inline graphic is

2.

This series converges for Inline graphic. For our proposed estimating equations for Inline graphic, we consider using weight functions of the form

2. (2.6)

for different finite values of Inline graphic. Note that the Poisson regression estimating equations can be considered the Maclaurin series truncated at Inline graphic (i.e. constant weights). Higher values of Inline graphic will more closely approximate the optimal weights associated with the MLE and should yield more efficient estimates.

For any finite value of Inline graphic the proposed weight function can be implemented in standard statistical software for generalized linear models that allows user-defined variance functions (e.g. PROC GLIMMIX in SAS); there is negligible increase in computational time for larger values of Inline graphic However, some care must be exercised in the choice of value for Inline graphic; for very large values of Inline graphic Inline graphic will be very close to Inline graphic and the resulting estimating equations will be unstable when Inline graphic is close to 1. Recall that convergence problems with ML estimation arise when Inline graphic is close to 1 and the Inline graphicth observation receives excessively large weight in the estimating equations. We note here that, because Inline graphic Inline graphic will always be positive. Also, for success probabilities close to 1 the weights are bounded, with largest Inline graphic, so that convergence problems are less likely unless very large values of Inline graphic are chosen. In the next section, we examine the ARE for increasing number of terms (Inline graphic) in the series and make recommendations for the choice of the number of terms to avoid similar finite sample convergence problems as with ML estimation. The challenge is to find some minimum value of Inline graphic that provides near-optimal weights but essentially bounds the largest weights when Inline graphic is close to 1.

3. Asymptotic relative efficiency

The goal is to find a weight function that approximates the optimal weight function, Inline graphic, but avoids assigning extremely large weights when Inline graphic is close to 1. Using the truncated Maclaurin series of Inline graphic, Inline graphic, we examine the ARE for increasing finite values of Inline graphic. For this study of ARE, we consider a log RR regression model with a single covariate,

3.

where, for simplicity, we let Inline graphic have a discrete uniform distribution on the set Inline graphic. We let Inline graphic be negative, so that Inline graphic is the maximum value for any Inline graphic; in contrast, Inline graphic is the minimum value for any Inline graphic. For the study of ARE, we first specify the maximum, Inline graphic, and minimum, Inline graphic, values for Inline graphic, which in turn fully specifies the parameters Inline graphic and Inline graphic Specification of the model in this way allows us to explore properties of the estimators as Inline graphic approaches 1. For different choices of values of Inline graphic and Inline graphic, we examine the ARE of the estimator of Inline graphic based on Inline graphic for increasing values of Inline graphic ranging from 0 to 100. Recall that, when Inline graphic the weights are constant and equivalent to the Poisson regression estimator. Fixing Inline graphic we let Inline graphic range from 0.2 to 0.8.

Given a discrete uniform distribution for Inline graphic, and a Bernoulli distribution for Inline graphic given Inline graphic, the asymptotic variance of Inline graphic can be obtained from (2.4) by simply considering an artificial sample comprised of one properly weighted observation for each possible realization of (Inline graphic). The weights are determined by the respective joint probabilities of the given realizations. Following Rotnitzky and Wypij (1994), the asymptotic variance of Inline graphic can be ascertained by weighting each contribution to Inline graphic and Inline graphic by its respective probability. That is, we take the expectations of Inline graphic and Inline graphic, the components of Inline graphic in (2.4), by summing all of the possible realizations weighted by their respective probabilities.

A plot of the ARE for increasing values of Inline graphic is given in Figure 1. The four panels of Figure 1 display the AREs for Inline graphic, respectively. As Inline graphic increases, the concentration of success probabilities that are close to 1 increases as the median of the probabilities increases from 0.44 when Inline graphic to 0.89 when Inline graphic. In the four panels of Figure 1, the ARE for the Poisson regression estimator (Inline graphic) is in the 60–70% range. This highlights the loss of efficiency associated with the use of constant weights when the probabilities are not small. These results are in close agreement with those reported by Lumley and others (2006). As anticipated, the ARE increases monotonically with increasing values of Inline graphic. AREs of approximately 95–97% are obtained for the proposed estimator when Inline graphic is between 40 and 60. We note that, for Inline graphic close to 1, Inline graphic and Inline graphic bound the maximum weights at approximately 41 and 61, respectively. For larger values of Inline graphic, there appears to be diminishing returns in terms of increases in efficiency. More importantly, however, larger values of Inline graphic seem far more likely to produce problems with convergence in finite samples due to excessive weight assigned to observations with Inline graphic close to 1. In the next section, we examine the finite sample performance of the proposed estimator when Inline graphic and Inline graphic.

Fig. 1.

Fig. 1.

ARE for increasing number of terms (Inline graphic) in Maclaurin series expansion of the optimal Bernoulli weight function.

4. Simulation study

In this section, we consider the finite sample properties of the proposed estimator. For the simulations, we used a similar configuration as in the study of ARE presented in Section 3:

4.

where, for simplicity, we let Inline graphic have a uniform (0,1) distribution. For the different values of Inline graphic and Inline graphic (or equivalently, Inline graphic and Inline graphic), we conducted simulations for Inline graphic with 2500 simulation replications performed for each configuration. We performed simulations fixing Inline graphic and letting Inline graphic range from 0.2 to 0.8 in 0.2 unit increments. The simulations were used to compare the MLE (unconstrained), the Poisson regression estimator (Inline graphic), and the proposed estimator based on a Maclaurin series approximation with Inline graphic and Inline graphic.

Table 2 presents the relative bias, defined as Inline graphic the root mean square error, the coverage probabilities of 95% Wald confidence intervals for Inline graphic, as well as the percentage of simulation replications in which the convergence criterion was met. From Table 2, we see that the percent convergence for ML is less than Inline graphic for all configurations; for ML, we report the results only for those simulations that converged for ML; the latter results can be considered “conditional on the likelihood convergence criterion”. In contrast, there were no convergence problems for any of the estimating equations approaches. In terms of bias, standard ML has non-negligible relative bias, and the relative bias increases as Inline graphic increases. The relative bias varies from approximately 8% to 30%. All other approaches have negligible (Inline graphic5%) relative biases. For the estimating equations approaches, the relative efficiencies can be estimated as the square of the ratio of the root mean square errors. The relative efficiencies of Poisson regression versus the Maclaurin series with Inline graphic is between Inline graphic and Inline graphic increasing from Inline graphic when Inline graphic to Inline graphic when Inline graphic For these simulations, the relative efficiency of the Maclaurin series with Inline graphic versus the Maclaurin series with Inline graphic is above Inline graphic for all configurations (Inline graphic when Inline graphic and Inline graphic when Inline graphic Also, the relative efficiency of the Maclaurin series with Inline graphic versus the Maclaurin series with Inline graphic is approximately Inline graphic for all configurations. Interestingly, for these simulation configurations, even use of Inline graphic terms in the Maclaurin series yields high efficiency.

Table 2.

Simulation results for estimation of Inline graphic for Inline graphic Inline graphic and Inline graphic

Inline graphic Method Percent relative bias of Inline graphic Root MSE Coverage probability (%) Percent converged (%)
0.2 Standard ML Inline graphic7.51 3.103 95.6 47.9
Poisson 2.64 3.767 95.8 100.0
MS (Inline graphic) 0.97 3.097 95.7 100.0
MS (Inline graphic) 0.50 3.019 96.1 100.0
MS (Inline graphic) 0.21 2.985 96.0 100.0
0.4 Standard ML Inline graphic12.02 2.278 94.0 48.2
Poisson 1.60 2.657 94.5 100.0
MS (Inline graphic) Inline graphic0.45 2.183 94.2 100.0
MS (Inline graphic) Inline graphic0.97 2.131 94.4 100.0
MS (Inline graphic) Inline graphic1.33 2.106 94.3 100.0
0.6 Standard ML Inline graphic17.53 1.668 93.3 48.0
Poisson 0.81 1.877 95.0 100.0
MS (Inline graphic) Inline graphic1.50 1.553 95.3 100.0
MS (Inline graphic) Inline graphic2.10 1.505 95.0 100.0
MS (Inline graphic) Inline graphic2.51 1.482 94.9 100.0
0.8 Standard ML Inline graphic31.42 1.153 90.5 46.7
Poisson 1.22 1.244 93.6 100.0
MS (Inline graphic) Inline graphic2.07 1.109 93.2 100.0
MS (Inline graphic) Inline graphic3.02 1.078 93.2 100.0
MS (Inline graphic) Inline graphic3.65 1.063 93.2 100.0

5. Application to study of pre-operative use of beta blockers in patients with colon cancer

We apply the proposed methodology to the analysis of pre-operative use of beta blockers (yes/no) among patient undergoing colorectal surgery after diagnosis of colon cancer (Arriaga and others, 2009). To examine the relationship between the binary outcome and the five patient-level predictors of interest, we fit the following RR regression model:

5. (5.1)

where Inline graphic is the conditional probability that the Inline graphicth patient receives pre-operative beta blockers; Inline graphic is 1 if life-threatening disease and 0 otherwise; Inline graphic is the number of comorbidities; Inline graphic is 1 if male and 0 if female; Inline graphic is 1 if White race and 0 if otherwise; and Inline graphic is age in years (although the reported effect of age is multiplied by 10 for easier comparison of results in Table 3).

Table 3.

Comparison of (log) RR regression estimates for the probability of pre-operative use of beta blockers among patients undergoing colorectal surgery

Effect Approach Estimate SE Inline graphic Inline graphic-value
Intercept Standard ML Inline graphic1.186 0.310 Inline graphic3.82 Inline graphic0.001
Poisson Inline graphic1.542 0.321 Inline graphic4.81 Inline graphic0.001
MS (Inline graphic) Inline graphic1.445 0.262 Inline graphic5.51 Inline graphic0.001
MS (Inline graphic) Inline graphic1.445 0.253 Inline graphic5.71 Inline graphic0.001
MS (Inline graphic) Inline graphic1.439 0.251 Inline graphic5.73 Inline graphic0.001
ASA Standard ML Inline graphic0.009 0.097 Inline graphic0.10 0.923
Poisson Inline graphic0.017 0.119 Inline graphic0.15 0.884
MS (Inline graphic) Inline graphic0.007 0.116 Inline graphic0.06 0.950
MS (Inline graphic) Inline graphic0.007 0.114 Inline graphic0.07 0.948
MS (Inline graphic) Inline graphic0.008 0.114 Inline graphic0.07 0.946
Comorbid Standard ML 0.065 0.044 1.46 0.145
Poisson 0.080 0.040 2.00 0.047
MS (Inline graphic) 0.078 0.030 2.60 0.010
MS (Inline graphic) 0.076 0.028 2.74 0.007
MS (Inline graphic) 0.074 0.027 2.77 0.006
Male Standard ML Inline graphic0.068 0.098 Inline graphic0.69 0.488
Poisson Inline graphic0.120 0.107 Inline graphic1.11 0.266
MS (Inline graphic) Inline graphic0.053 0.088 Inline graphic0.60 0.552
MS (Inline graphic) Inline graphic0.048 0.080 Inline graphic0.60 0.551
MS (Inline graphic) Inline graphic0.047 0.076 Inline graphic0.61 0.541
White Standard ML Inline graphic0.105 0.131 Inline graphic0.80 0.422
Poisson Inline graphic0.157 0.152 Inline graphic1.03 0.303
MS (Inline graphic) Inline graphic0.139 0.148 Inline graphic0.94 0.349
MS (Inline graphic) Inline graphic0.138 0.148 Inline graphic0.93 0.351
MS (Inline graphic) Inline graphic0.138 0.148 Inline graphic0.93 0.352
Age Standard ML 0.111 0.044 2.52 0.013
Poisson 0.160 0.047 3.41 0.001
MS (Inline graphic) 0.139 0.037 3.76 Inline graphic0.001
MS (Inline graphic) 0.139 0.034 4.08 Inline graphic0.001
MS (Inline graphic) 0.139 0.033 4.18 Inline graphic0.001

Table 3 presents the estimates of Inline graphic obtained using standard ML (as implemented in SAS PROC GENMOD), the Poisson quasi-likelihood approach (McNutt and others, 2003; Zou, 2004; Carter and others, 2005), and the Maclaurin series approach with Inline graphic Inline graphic and Inline graphic Of note, there were convergence problems with ML but not with any of the other approaches. In particular, for ML SAS PROC GENMOD produced a warning message that “The relative Hessian convergence criterion of 0.0198000361 is greater than the limit of 0.0001. The convergence is questionable”.

As conjectured by the study investigators, the results in Table 3 indicate that older patients and patients with more comorbidities are significantly more likely to receive pre-operative beta blockers. From the results presented in Table 3, it is also apparent that the quantitative variable age and the ordinal variable “number of comorbidities” show the largest estimated efficiency gains when comparing the Poisson (Inline graphic) to the Maclaurin series estimators. The relative efficiencies can be estimated as the square of the ratio of the estimated standard errors. For the covariate “number of comorbidities”, the estimated relative efficiency of Poisson regression versus the Maclaurin series with Inline graphic is approximately Inline graphic For the covariate age, the estimated relative efficiency of Poisson regression versus the Maclaurin series with Inline graphic is approximately Inline graphic Thus, in this particular application, Poisson regression appears to be quite inefficient compared with the proposed Maclaurin series approach.

For the covariate “number of comorbidities”, the estimated relative efficiency of Maclaurin series with Inline graphic versus the Maclaurin series with Inline graphic is greater than Inline graphic similarly, for the covariate age, it is above Inline graphic Thus, in this applications, there appears to be discernible gains from using a Maclaurin series with Inline graphic instead of Inline graphic However, there is no appreciable difference between Maclaurin series with Inline graphic and Inline graphic This reinforces the results from Sections 3 and 4 where it was found that there are diminishing returns in terms of increases in efficiency when Inline graphic is greater than 40–60.

6. Conclusion

To circumvent the usual convergence problems associated with the ML estimator, we propose estimators that approximate the optimal weight function based on the truncated Maclaurin series of Inline graphic. This use of a near-optimal weight function that bounds the largest weights yields estimators with relatively high efficiency that also avoid convergence problems. In our study of asymptotic efficiency, the proposed estimator with weight function based on 40–60 terms from the Maclaurin series was 95–97% efficient relative to the MLE. This compares favorably to the Poisson regression estimator that was found to be only 60–70% efficient. In simulations with samples of size 50 (data not shown) and 100, we found similar gains in relative efficiency and no convergence problems with the proposed estimator based on 40–60 terms. In addition, the proposed estimator, using any finite number of terms, is straightforward to implement in standard statistical software for generalized linear models that allows user-defined variance functions (e.g. PROC GLIMMIX in SAS). Finally, we note that estimators that approximate the optimal weight function based on truncated series expansions may also be useful for other generalized linear models in which the link function does not respect the natural parameter constraints (e.g. linear or linear-expit models for binary data (Kovalchik and others, 2013)).

In general, RR regression is most useful when the scientific goal is to estimate the association between an exposure or intervention and a commonly occurring binary outcome, with appropriate adjustment for additional covariates. However, we note that although the estimating equations (2.3) yield consistent estimators of Inline graphic for any choice of weight function, including the optimal weight function and the approximation to it proposed in this paper, the estimators are not constrained to produce estimated Inline graphic. As a result, RR regression should be avoided altogether when the scientific goal is to make predictions; when the goal is prediction, models where the constraints on the probabilities are automatically satisfied (e.g. logistic regression) should be adopted instead.

Funding

We are grateful for the support provided by grants MH054693 and CA160679 from the United States National Institutes of Health (NIH).

Acknowledgements

Conflict of Interest: None declared.

References

  1. Abramowitz M., Stegun I. A. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. New York: Dover Publications; 1970. 9th printing. [Google Scholar]
  2. Arriaga A. F., Lancaster R. T., Berry W. R., Regenbogen S. E., Lipsitz S. R., Kaafarani H. M., Elbardissi A. W., Desai P., Ferzoco S. J., Bleday R., others The better colectomy project: association of evidence-based best-practice adherence rates to outcomes in colorectal surgery. Annals of Surgery. 2009;250:507–513. doi: 10.1097/SLA.0b013e3181b672bc. [DOI] [PubMed] [Google Scholar]
  3. Baumgarten M., Seliske P., Goldberg M. S. Warning regarding the use of GLIM macros for the estimation of risk ratios. American Journal of Epidemiology. 1989;130:1065. doi: 10.1093/oxfordjournals.aje.a115407. [DOI] [PubMed] [Google Scholar]
  4. Carter R. E., Lipsitz S. R., Tilley B. C. Quasi-likelihood estimation for relative risk regression models. Biostatistics. 2005;6:39–44. doi: 10.1093/biostatistics/kxh016. [DOI] [PubMed] [Google Scholar]
  5. Clogg C. C., Rubin D. B., Schenker N., Schultz B., Weidman L. Multiple imputation of industry and occupation codes in census public-use samples using Bayesian logistic regression. Journal of the American Statistical Association. 1991;86:68–78. [Google Scholar]
  6. Deddens J. A., Petersen M. R., Lei X. Proceeding of the 28th Annual SAS Users Group International Conference. Cary, NC: SAS Institute Inc.; 2003. Estimation of prevalence ratios when PROC GENMOD does not converge; pp. 270–278. [Google Scholar]
  7. Huber P. J. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability. Berkeley: University of California Press; 1967. The behavior of maximum likelihood estimates under nonstandard conditions; pp. 221–233. Volume 1. [Google Scholar]
  8. Jewell N. P. Statistics for Epidemiology. New York: Chapman & Hall/CRC Press; 2003. [Google Scholar]
  9. Kovalchik S. A., Varadhan R., Fetterman B., Poitras N. E., Wacholder S., Katki H. A. A general binomial regression model to estimate standardized risk differences from binary response data. Statistics in Medicine. 2013;32:808–821. doi: 10.1002/sim.5553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Lu M., Tilley B. C. Use of odds ratio or relative risk to measure a treatment effect in clinical trials with multiple correlated binary outcomes: data from the NINDS t-PA stroke trial. Statistics in Medicine. 2001;20:1891–1901. doi: 10.1002/sim.841. [DOI] [PubMed] [Google Scholar]
  11. Lumley T., Kronmal R., Ma S. Relative risk regression in medical research: models, contrasts, estimators, and algorithms. UW Biostatistics Working Paper Series. 2006 Working Paper 293. http://biostats.bepress.com/uwbiostat/paper293 . [Google Scholar]
  12. McNutt L. A., Wu C., Xue X., Hafner J. P. Estimating relative risk in cohort studies and clinical trials of common events. American Journal of Epidemiology. 2003;157:940–943. doi: 10.1093/aje/kwg074. [DOI] [PubMed] [Google Scholar]
  13. Owens W. D., Felts J. A., Spitznagel E. L., Jr ASA physical status classifications: a study of consistency of ratings. Anesthesiology. 1978;49:239–243. doi: 10.1097/00000542-197810000-00003. [DOI] [PubMed] [Google Scholar]
  14. Poldermans D., Boersma E., Bax J. J., Thomson I. R., van de Ven L. L., Blankensteijn J. D., Baars H. F., Yo T. I., Trocino G., Vigna C., Roelandt J. R., van Urk H. The effect of bisoprolol on perioperative mortality and myocardial infarction in high-risk patients undergoing vascular surgery. The New England Journal of Medicine. 1999;341:1789–1794. doi: 10.1056/NEJM199912093412402. [DOI] [PubMed] [Google Scholar]
  15. Rotnitzky A. Inverse probability weighted methods. In: Fitzmaurice G., Davidian M., Verbeke G., Molenberghs G., editors. Longitudinal Data Analysis. Boca Raton: Chapman & Hall/CRC Press; 2009. pp. 453–476. [Google Scholar]
  16. Rotnitzky A., Wypij D. A note on the bias of estimators with missing data. Biometrics. 1994;50:1163–1170. [PubMed] [Google Scholar]
  17. Royall R. M. Model robust confidence intervals using maximum likelihood estimators. International Statistical Review. 1986;54:221–226. [Google Scholar]
  18. Traissac P., Martin-Prevel Y., Delpeuch F., Maire B. Regression logistique vs autres modeles lineaires generalises pour l'estimation de rapports de prevalences. Rev Epidemiol Sante Publique. 1999;47:593–604. [PubMed] [Google Scholar]
  19. Wacholder S. Binomial regression in GLIM: estimating risk ratios and risk differences. American Journal of Epidemiology. 1986;123:174–184. doi: 10.1093/oxfordjournals.aje.a114212. [DOI] [PubMed] [Google Scholar]
  20. Wedderburn R. W. M. Quasi-likelihood functions, generalized linear models, and the gauss newton method. Biometrika. 1974;61:439–447. [Google Scholar]
  21. White H. Maximum likelihood estimation under mis-specified models. Econometrica. 1982;50:1–26. [Google Scholar]
  22. Zhang J., Yu K. F. What's the relative risk?: a method of correcting the odds ratio in cohort studies of common outcomes. Journal of the American Medical Association. 1998;280:1690–1691. doi: 10.1001/jama.280.19.1690. [DOI] [PubMed] [Google Scholar]
  23. Zou G. A modified Poisson regression approach to prospective studies with binary data. American Journal of Epidemiology. 2004;159:702–706. doi: 10.1093/aje/kwh090. [DOI] [PubMed] [Google Scholar]

Articles from Biostatistics (Oxford, England) are provided here courtesy of Oxford University Press

RESOURCES