Skip to main content
Health Services Research logoLink to Health Services Research
letter
. 2016 Apr 18;51(3):1114–1116. doi: 10.1111/1475-6773.12493

Response to “Inference Using Sample Means of Parametric Nonlinear Data Transformations”

William H Greene 1, Bryan E Dowd 2,, Edward C Norton 3
PMCID: PMC4874813  PMID: 27091682

We thank Joseph Terza for his careful read (Terza 2016) of our paper (Dowd, Greene, and Norton 2014). There is no disagreement about how to evaluate the asymptotic distribution of a nonlinear function (γ^) of estimated parameters (β^) or about the first term (A) in Terza's equation 6. The discussion is entirely about the second term (B) in equation 6.

The function in question is γ^k=1ni=1nγ^ik, for example, the partial effect of the k th explanatory variable on the probability that a binary‐dependent variable is equal to 1 in a logit or probit model, averaged across the observations, X i, in the sample. The subscript i refers to the individual observation in a sample of size n. Our discussion of the standard error of γ^k was based on the equation γ^ik=g(β^)|Xi, consistent with the assumption that the data are “fixed in repeated samples.” This means that it is possible to collect additional samples of data with the same values of X. Different samples may produce different values of the dependent variable conditional on X, but the difference will be due solely to the observations having different error terms.

The expression γ^ik=g(β^)|Xi is easy to understand when g(β^)|Xi is the partial effect of X k evaluated for a specific type of individual whose values of X are equal to X o. Asymptotic theory has a clear interpretation in this case. In ever larger samples of people with X = X o, β^ converges to the true population β; the variance of g(β^)|X=Xo shrinks to zero; and g(β^)|X=Xo converges to the true value for the entire population for whom X = X o.

Our discussion was based on the assumption that the sample of observations that produced the estimates of β^ is a collection of individuals, each with their own value of X=Xio and moving from γ^k=g(β^)|X=Xo for a subset of the sample to γ^k=1ni=1ng(β^)|Xi=Xio for the full sample changes nothing about the assumptions underlying the explanatory variable values, or the interpretation of asymptotic theory—that is, that the effect of larger samples is to add more error terms to each value of Xi=Xio.

Terza takes issue with the assumption of fixed X. If the overarching assumption is changed to random sampling from the population of (y i, X i), then a second term that accounts for the sampling variation of the average over X i enters the asymptotic variance of γ^k. This is the ‘B’ term in equation 6, which, after accommodating the common definition of the asymptotic variance, is 1n1ni=1ng(β^,Xi)g¯(β^,X)2.

Researchers can come to different conclusions on this issue. Terza's approach has intuitive appeal because when every X i truly is an element of a random sample, estimates of the population mean of functions of X i , that is, 1ni=1ng(β,Xi), converge in probability to the population mean of g(βX i) as the sample size increases. However, in our discussion, the analogy to larger samples is an increasing number of samples drawn from the same values of X i as in the original estimation sample, that is, bootstrap samples. As we showed in our original article, bootstrapping produces standard error estimates that are virtually identical to those produced by the delta method without Terza's term B, and to those produced by the method of Krinsky and Robb.

Analysts will decide which approach best suits their purposes. We do note the following about recent common practice. If the analyst requests γ^k=1ni=1nγ^ik from ‘margins’ in Stata or ‘PARTIALS’ in NLOGIT, the software will compute the standard error of γ^k using the delta method, that is, 1ni=1ng(β^)|Xi=Xio, term A in Terza's equation 6, rather than 1ni=1ng(β^,Xi), that is, both terms A and B.

Supporting information

Appendix SA1: Author Matrix.

Acknowledgments

Joint Acknowledgment/Disclosure Statement: No financial support.

Disclosures: None.

Disclaimers: None.

References

  1. Dowd, B. E. , Greene W. H., and Norton E. C.. 2014. “Computation of Standard Errors.” Health Services Research 49 (2): 731–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Terza, J. V. 2016. “Inference Using Sample Means of Parametric Nonlinear Data Transformations.” Health Services Research 51 (3 Pt 1): 1109–13. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix SA1: Author Matrix.


Articles from Health Services Research are provided here courtesy of Health Research & Educational Trust

RESOURCES