Skip to main content
Philosophical transactions. Series A, Mathematical, physical, and engineering sciences logoLink to Philosophical transactions. Series A, Mathematical, physical, and engineering sciences
. 2008 Apr 11;366(1874):2377–2388. doi: 10.1098/rsta.2008.0030

A method of assessing the sensitivity of the Cochran–Mantel–Haenszel test to an unobserved confounder

Binbing Yu 1,*, Joseph L Gastwirth 2
PMCID: PMC2446605  NIHMSID: NIHMS54121  PMID: 18407900

Abstract

Observational studies, including the case-control design frequently used in epidemiology, are subject to a number of biases and possible confounding factors. Failure to adjust with them may lead to an erroneous conclusion about the existence of a causal relationship between exposure and disease. The Cochran–Mantel–Haenszel (CMH) test is widely used to measure the strength of the association between an exposure and disease or response, after stratifying on the observed covariates. Thus, observed confounders are accounted for in the analysis. In practice, there may be causal variables that are unknown or difficult to obtain. Hence, they are not incorporated into the analysis. Sensitivity analysis enables investigators to assess the robustness of the findings. A method for assessing the sensitivity of the CMH test to an omitted confounder is presented here. The technique is illustrated by re-examining two datasets: one concerns the effect of maternal hypertension as a risk factor for low birth weight infants and the other focuses on the risk of allopurinol on having a rash. The computer code performing the sensitivity analysis is provided in appendix A.

Keywords: Cochran–Mantel–Haenszel test, confounders, sensitivity analysis, stratification

1. Introduction

Observational studies are commonly used to evaluate the effect of treatments or chemical exposure, e.g. the association between risk factors and a subsequent event. Unlike randomized experiments, researchers cannot control the assignment of treatments to subjects to ensure that subjects receiving different treatments or exposure are similar with respect to major covariates. The Cochran–Mantel–Haenszel (CMH; Cochran 1954; Mantel & Haenszel 1959) test is widely used to test the strength of the association after controlling for the observed confounders. The data are stratified into multiple 2×2 tables by known confounders, where in each stratum the distributions of the confounders in both groups are similar.

Since observational studies remain subject to unobserved confounding, one should explore how sensitive the results are to a possible omitted variable. Extensive work has been done on how to perform sensitivity analysis for observational data by Cornfield et al. (1959), Rosenbaum & Rubin (1983, 1984), Rosenbaum & Krieger (1990), Gastwirth (1992), Rosenbaum (2002) and others. Most of the literature models the association between an unobserved confounder, denoted by U, and treatment assignment, and then recomputes the test statistics and p values for a range of plausible differences in the prevalences of the confounder in the two groups. If a moderate degree of confounding could change the inference about the treatment effect on the outcome, then one questions the soundness of the conclusions. Gastwirth et al. (1998) proposed a dual and simultaneous sensitivity analysis for matched pair data, which extended a strengthened form of the original Cornfield inequality (Gastwirth 1988, p. 296). Both the relationship between U and treatment assignment and that between U and response are jointly modelled. This simultaneous form allows one to incorporate subject matter knowledge about both relationships into the sensitivity analysis.

This paper extends the ‘simultaneous’ approach to examine the sensitivity of the CMH test to an unobserved confounder. The rest of this paper is organized as follows. Section 2 reviews the CMH test and describes the stratified model for the simultaneous relationships. Then we apply the sensitivity analysis to a dataset examining maternal hypertension as a risk factor for low birth weight (LBW) infants and another dataset concerned with a side effect (rash) of allopurinol in §3. We discuss the potential use and future development of sensitivity analysis in observational studies in §4.

2. Sensitivity analysis for the CMH test

Let X be the exposure, e.g. environmental hazard, genetic factor, radiation exposure; and Y be the outcome, e.g. disease, mortality, or in an employment discrimination case, being promoted or passing a pre-employment exam. We are interested in testing the causal relationship between X and Y using the CMH test to control for other known factors related to the response. Usually the data are organized into J 2×2 tables, where in each stratum the covariates have similar values. The data for the jth stratum (j=1, …, J) are shown below.

outcome Y

exposure group X event (1) non-event (0) total
exposed (1) aj bj m1j
not exposed (0) cj dj m0j
total n1j n0j Nj

Let π1j (π0j), j=1, …, J, be the probability that subjects who are exposed (not exposed) in stratum j has an event. In terms of these parameters, the odds ratio between exposure and outcome in stratum j is

θj=π1j(1π0j)/[π0j(1π1j)],j=1,,J.

Furthermore, assume that the odds ratio has a common value θ. Let D={aj,bj,cj,dj,j=1,,J} be the observed data for the J 2×2 tables. When the responses for each group in all strata follow independent binomial distributions, the full data have the product binomial likelihood function

L(θ|D)=j=1J(m1jaj)π1jaj(1π1j)bj(m0jcj)π0jcj(1π0j)dj. (2.1)

From (2.1), we have E(aj)=m1jπ1j, V(aj)=m1jπ1j(1π1j), E(cj)=m0jπ0j and V(cj)=m0jπ0j(1π0j).

If all the confounders are observed and controlled by stratification, then the null hypothesis of no-exposure effect is H0: π0j=π1j, j=1, …, J or H0: θ=1. Cochran's (1954) statistic for testing H0: θ=1 versus H1: θ>1 is

W0=j=1Jwj(p1jp0j)[j=1Jwjp¯j(1p¯j)]1/2, (2.2)

where wj=m1jm0j/Nj, p¯j=(m1jp1j+m0jpˆ0j)/Nj and p1j=aj/m1j, p0j=cj/m0j are the estimates of π1j and π0j, respectively. Using the conditional central hypergeometric likelihood rather than the product binomial (2.1), Mantel & Haenszel (1959) proposed a similar test. The two tests are asymptotically equivalent and are often referred to as the CMH test. The CMH test is known to be the optimal method of combining multiple 2×2 tables with common odds ratio (Woolson et al. 1986).

The large sample distribution of (2.2) converges to a standard normal distribution under the null hypothesis that θ=1 (Cochran 1954). More generally, the statistic

W=j=1Jwj[(p1jp0j)(π1jπ0j)]{j=1Jwj2[π1j(1π1j)m1j+π0j(1π0j)m0j]}1/2, (2.3)

converges to a standard normal distribution as Nj→∞ (Woolson et al. 1986).

Sometimes, information on known risk factors is not available or other risk factors related to the response have not been discovered. Here we assume that there is an unobserved factor U and that the relationship between binary outcome Y and exposure X and confounder U is modelled by

πxj|u=P(Y=1|X=x,S=j,U=u)=exp(αj+βx+γu)1+exp(αj+βx+γu), (2.4)

where γ is the effect of U, which is called the strength parameter (Yu & Gastwirth 2005), and β is the true effect of exposure X. The null hypothesis of no-exposure effect is H0: β=0 and the alternative hypothesis is H1: β≠0.

The unobserved variable U may also be associated with the exposure. Assuming that U is binary, the prevalence of U by exposure level in each stratum is modelled by

P(U=1|X=x,S=j)=exp(λj+δjx)1+exp(λj+δjx). (2.5)

Here, the δj, indicating the association between X and U in each stratum, are called the imbalance parameters (Yu & Gastwirth 2005). To simplify the calculation, we assume that the imbalance parameters δj are equal to δ. Then, qj=eλj/(1+eλj) is the prevalence of U in the non-exposed group in stratum j. When the confounder U is omitted in the analysis, the marginal probability of response at each exposure level x in stratum j becomes

πxj=exp(αj+βx+γ)1+exp(αj+βx+γ)exp(λj+δx)1+exp(λj+δx)+exp(αj+βx)1+exp(αj+βx)11+exp(λj+δx).

The true null hypothesis of no-exposure effect is H0: β=0, which is really the hypothesis of interest. Note that H0: π0j=π1j, j=1, …, J, and H0: β=0 are not equivalent when there is an omitted confounder. If U is not a confounder, i.e. δ=0 or γ=0, then the equalities π1j=π0j, j=1, …, J, still hold under the null hypothesis H0: β=0, so that the CMH test C0 is indeed testing the effect of exposure. When U is indeed a confounder, it has been shown that, if both γ>0 and δ>0, then π1j=π0j, j=1, …, J, even when the true null hypothesis holds (Yu & Gastwirth 2005) and the CMH test would be biased. This implies that confounder U could induce spurious positive association between exposure and outcome. However, if γ>0 and δ<0 then the true association between X and Y will be underestimated if the confounder is omitted.

If the strength parameter γ, the imbalance parameter δ and the prevalences q1, …, qJ of the confounder are known or can be reliably estimated from other sources, the estimates of π1j and π0j can be calculated from the likelihood (2.1) under the null hypothesis H0: β=0 (see appendix A). Then the correct CMH statistic is obtained by substituting the estimates (πˆ1j,πˆ0j) for (π0j,π1j) in (2.3) as the effect of U is now incorporated into the response probabilities πxj. In practice these quantities are unknown, so one assumes a possible range for (q1, …, qJ, γ, δ); the test statistic and the p value are calculated using (2.3). A table showing the p values with respect to different parameters for confounding can be used to assess the plausibility that the current inference could be altered due to the effect of an unobserved factor.

3. Application

The proposed sensitivity analysis will be applied to data from two observational studies. In these applications, C=exp (γ) measures the effect of U on binary response Y and D=exp (δ) measures the increasing odds of U being positive when X increases by one unit.

(a) Sensitivity analysis of maternal hypertension as a risk factor for LBW infants

Data on 500 singleton births in a London Hospital (Hills & De Stavola 2002) were used to examine the sensitivity of the association of maternal hypertension as a risk factor for LBW infants. The dataset provides the birth information of 500 singletons with the following eight variables: identity number for mother and baby, birth weight of baby (indicator for birth weight less than 2500 g), gestation period (indicator for gestation period less than 37 weeks), maternal age, indicator for maternal hypertension and sex of baby (1, male; and 2, female). Ten mothers with missing gestation period information were dropped from the analysis. The exposure of interest (X) is maternal hypertension and the event of interest is LBW infant (Y).

‘Advanced’ maternal age is defined as any expectant mother who will have reached her 35th birthday by delivery date. The mothers are divided into ‘young’ and ‘advanced’ age group according to this criterion. The number of infants with normal birth weight (NBW) and LBW by sex, gestation status, maternal age and maternal hypertension are shown in table 1. The CMH test is significant with p=0.015 and the odds ratio is 2.74 with 95% CI (1.23, 6.14). The Breslow–Day test for homogeneity shows that the odds ratio estimates are homogeneous across different strata (p=0.785). Is it safe to draw the conclusion that maternal hypertension is causally related to LBW?

Table 1.

Infants' birth weight by sex, gestation status, maternal age and hypertension.

normal blood pressure hypertension


sex gestation maternal age NBW LBW NBW LBW
male termed young 98 2 15 2
advanced 87 6 13 2
preterm young 7 5 2 4
advanced 7 2 1 3
female termed young 91 5 13 1
advanced 83 3 6 0
preterm young 4 6 2 5
advanced 2 11 0 2

Despite substantial reductions in US infant mortality during the past several decades, black–white disparities in infant mortality rates persist. Important determinants of racial/ethnic differences in infant mortality are LBW, defined as less than 2500 g (US Department of Health and Human Services 2000). The number of LBW infants among 1000 live births for the whites are 5.7–6.5 and for the blacks are 12.7–13.0 from 1980 to 2000 (Centers for Disease Control and Prevention 2002; http://www.cdc.gov/mmwr/preview/mmwrhtml/mm5127a1.htm). Hence, race is an important risk factor with an apparent odds ratio of approximately 2.0–2.4 for LBW (David & Collins 1997). A recent study (Geronimus et al. 2007) also showed that being black increases the odds ratio of hypertension by 2.11–4.04 for women between 15 and 65 years of age. Also, LBW is related to many other risk factors, e.g. the use of assisted reproductive technology, use of alcohol or drugs. Here, we focus on the race as the unobserved confounder (U) because it is a recognized risk factor. We assume that the mother's race is not related to the sex of their babies, but that the percentage of mothers who are black varies across the maternal age groups as well as gestation term. The possible scenarios for the percentage of black mothers by maternal age and gestation term are shown in table 2. In scenarios 1 and 2, we assume that mothers with preterm babies consist of a higher percentage of black women. In scenarios 3 and 4, we assume that there are more black mothers with normal maternal age. In scenario 5, the black mothers tend to be younger and have more preterm babies than whites.

Table 2.

The prevalences of black mothers by maternal age and gestation term for five possible scenarios.

scenario (%)

gestation maternal age prevalence 1 2 3 4 5
termed normal q1 10 5 10 20 10
advanced q2 10 5 5 10 5
preterm normal q3 20 10 10 20 20
advanced q4 20 10 5 10 10

Table 3 shows the two-sided p values of the adjusted CMH test with respect to different values of C and D for scenarios 1–5. Note that the strength parameter C=exp (γ) is the odds ratio of a black mother having a LBW baby compared with a white mother. The imbalance parameter D=exp (γ) is the increased odds ratio of a mother being black if she has hypertension. When C=1 or D=1, race is not a confounder and the CMH test is unchanged.

Table 3.

The effect of U on the p value of the CMH test for different values of C and D.

D=exp (δ)

scenario C=exp (γ) 2 3 4 5
1 2 0.064 0.079 0.094 0.107
3 0.078 0.110 0.140 0.170
4 0.091 0.138 0.186 0.230
5 0.103 0.165 0.227 0.287
2 2 0.057 0.066 0.074 0.083
3 0.065 0.083 0.101 0.119
4 0.072 0.099 0.127 0.156
5 0.079 0.115 0.153 0.192
3 2 0.060 0.070 0.081 0.091
3 0.070 0.092 0.114 0.136
4 0.079 0.112 0.147 0.181
5 0.087 0.131 0.177 0.223
4 2 0.067 0.085 0.100 0.114
3 0.084 0.119 0.152 0.182
4 0.098 0.150 0.199 0.244
5 0.111 0.178 0.242 0.300
5 2 0.060 0.070 0.081 0.091
3 0.070 0.092 0.114 0.136
4 0.079 0.112 0.147 0.181
5 0.087 0.131 0.177 0.223

Another way to perform the sensitivity analysis is to obtain the threshold values of the sensitivity parameters C and D, which increase the p value of the CMH test to 0.05. Figure 1 shows the plots of sensitivity parameters C and D, which corresponds to p=0.05. From figure 1, we see that when C=1.5 and D=1.1, then the two-sided p value would reach the 0.05 level. Thus, both table 3 and figure 1 show that the p value of the CMH test is sensitive to moderate changes in C and D, indicating that the significant association between maternal hypertension and LBW might have arisen from a hidden bias due to confounding. Hence, other studies, including race and ethnicity information, are needed to reach the conclusion that maternal hypertension causes LBW. If the studies demonstrate that increased incidence of black mothers with hypertension was slight with odds ratio less than 1.5, then the significance of the London study would not be affected by the omission of race.

Figure 1.

Figure 1

A plot of the values of the sensitivity parameters C and D required to raise the two-sided p value of the CMH test to 0.05.

Although in Rubin's causal model, sex and race are not used as ‘treatment’ variables in causal inference because it is difficult to ascribe causality to variables that cannot be altered (Holland 1986), here race is used as a proxy for other factors reflecting socio-economic disparities. More recently, race and sex have been used in the propensity score method to form ‘similar strata’ (Zanutto et al. 2005) but should not be the main potential outcome of a study as they cannot be altered.

(b) Sensitivity analysis of allopurinol as a cause of rash

Rosenbaum (2002) examined the sensitivity of a study indicating that allopurinol can cause rash (Boston Collaborative Drug Project 1972). Allopurinol is commonly used for the treatment and prevention of attacks of gout and certain types of kidney stones. It is also used to treat elevated uric acid levels in the blood and urine, which can occur in patients receiving chemotherapy for the treatment of leukaemia, lymphoma and other types of cancer. Allopurinol is usually well tolerated by most patients; however, some may experience possible side effects of skin rash, hives and itching.

Table 4 shows the joint frequencies of drug use and getting a rash case for males and females, respectively. The CMH test is highly significant with p<0.0001. In a randomized trial or in the absence of unobserved confounders, this indicates strong evidence that allopurinol causes rash (Rosenbaum 2002, p. 131). Because the patients could also be taking amoxicillin and ampicillin, which are known to cause rash, is it possible that the rash was caused by an unobserved factor, e.g. use of amoxicillin or other drugs, instead of allopurinol?

Table 4.

Drug use and rash cases by male and female.

sex drug no rash rash
males allopurinol 33 5
other drugs 645 36
females allopurinol 19 10
other drugs 518 58

Although information about the other drug use and users' allergic history was not collected, one can assess the sensitivity of the CMH test with respect to an unobserved confounder. Let U be the binary indicator for amoxicillin use, e.g. use amoxicillin (1) and no amoxicillin (0). Again, the prevalences of using amoxicillin among the non-allopurinol users are denoted by qj, j=1, …, J. The parameter δ indicates the imbalance of U between allopurinol users and non-users. The parameter γ indicates the increasing probability of having rash among users of amoxicillin. We assume two possible scenarios for the prevalences of amoxicillin use among the non-allopurinol users, i.e. qj=10% or 20%, where j=1, 2.

The increasing odds ratio of having rash for amoxicillin users compared with non-users is not known, but it is likely to be below 4. Based on table 5, we can see that the CMH test of allopurinol causing rash is not sensitive to the omitted variable (use of amoxicillin). Even when the strength parameter C=4 and the imbalance parameter D=4, the CMH test remains highly significant with p=0.002. This example also illustrates the additional information provided by the simultaneous sensitivity analysis compared with the earlier approaches by Cornfield et al. (1959) and Rosenbaum (2002), which focus primarily on the imbalance parameter and assume a confounder with infinitely large strength. Rosenbaum (2002, p. 131) used the parameter Γ, which is similar to D in our analysis, to measure the imbalance of treatment assignment with respect to the unobserved factor. The upper bounds on the p values based on his analysis are 0.036 and 0.30, for Γ=2 and 3, respectively, which indicates that the causal relationship between allopurinol use and rash could have arisen from an unobserved confounder that triples the risk of rash. While such a confounding is unlikely to exist, the conclusion based on the simultaneous analysis is stronger. Even when C=∞, the test barely reached non-significance for D=3 and qj=10%. This sensitivity analysis is more stringent, indicating the conclusion that allopurinol causes rash is very robust to an unobserved confounder. This is consistent with the current medical practice that lists rash as the one common side effect of allopurinol use (http://www.nlm.nih.gov/medlineplus/druginfo/medmaster/a682673.html).

Table 5.

One-sided p values of the CMH test for different values of (γ, δ).

p value (q1, q2, q3, q4)

(10%, 10%, 10%, 10%) (20%, 20%, 20%, 20%)


C=exp (γ) C=exp (γ)


D=exp (δ) 2 3 4 2 3 4
2 <0.001 <0.001 <0.001 0.005 <0.001 <0.001 <0.001 0.004
3 <0.001 <0.001 <0.001 0.057 <0.001 <0.001 <0.001 0.033
4 <0.001 <0.001 0.001 0.176 <0.001 <0.001 0.002 0.094
0.004 0.046 0.161 0.999 0.005 0.045 0.130 0.890

4. Discussion

Because the CMH test is widely used to analyse the data obtained in epidemiological studies, legal cases and social sciences, it is important to conduct a sensitivity analysis to answer whether that inference could plausibly be explained by an unobserved confounder before drawing a firm conclusion.

The proposed simultaneous sensitivity analysis approach is similar to the primal sensitivity analysis method by Rosenbaum (2002). The major difference is the introduction of the strength parameter. The primal sensitivity analyses closely parallel the theory of randomized experiments and have been developed for many statistical tests and estimators, e.g. the McNemar and Wilcoxon signed-rank tests and the Hodges–Lehmann estimator (Rosenbaum 2002). The simultaneous approach allows one to use more subject matter knowledge. However, it also requires more assumptions about both relationships. The simultaneous approach is especially useful when an observational study is challenged owing to failure to control for a particular unobserved variable, when there are plausible ranges for the values of γ and δ (Gastwirth et al. 1998).

At the design stage, it is also useful to examine the size and power of the corrected CMH test in the presence of omitted confounders in a range of realistic values of the strength and imbalance parameters. The calculation of size and power for the Cochran–Armitage trend test has been derived by Yu & Gastwirth (2005). A similar technique can be used for the CMH test with omitted variable.

In future studies, we plan to extend this methodology of sensitivity analysis to data where the unobserved confounders are continuous and ordinal. In real-world observational studies, there may be more than one omitted confounder; background information on the joint distribution of the multiple confounders can then be used to replace the univariate distribution (2.5) in a sensitivity analysis.

Acknowledgments

B.Y. was supported in part by the Intramural Research Programme of the National Institute on Aging and J.L.G. was supported in part by grant SES-0317956 from the National Science Foundation.

Appendix A. Estimation of (π1j,π0j)

This appendix describes the estimation of (π1j0j) under the product-binomial likelihood (2.1) with fixed values of (q1, …, qJ, β, γ, δ) for the allopurinol data in §3b. For stratum j, the prevalence of U in the non-exposed group is qj=exp (λj)/[1+exp (λj)]. Then the prevalence of U in the exposed group is rj=qjeδ/(1qj+qjeδ). Let Aj=exp(αj),B=exp(β),C=exp(γ). Then for fixed values of (β, δ, γ, qj), the marginal probabilities of response are

π1j=AjBC1+AjBCrj+AjB1+AjB(1rj) (A1)

and

π0j=AjC1+AjCqj+Aj1+Aj(1qj). (A2)

The log-likelihood of function (2.1) is logL(θ|D)=j=1Js(θ|D), where

j(θ|D)=ajlogπ1j+bjlog(1π1j)+cjlogπ0j+djlog(1π0j),

for stratum s. The maximum-likelihood estimate (MLE) of αj, and (π1j,π0j) could be obtained from SAS PROC NLP. The code for the following 2×2×2 table from the allopurinol study with q1=q2=0.2, β=0, γ=log (2) and δ=log (2) is shown below. The MLEs of (π1j,π0j) are given in PIEST.

graphic file with name rsta20080030if01.jpg

Footnotes

One contribution of 13 to a Theme Issue ‘Mathematical and statistical methods for diagnoses and therapies’.

References

  1. Boston Collaborative Drug Project. Excess of ampicillin rashes associated with allopurinol or hyperuricemia. N. Engl. J. Med. 1972;286:505–507. doi: 10.1056/NEJM197203092861002. [DOI] [PubMed] [Google Scholar]
  2. Centers for Disease Control and Prevention. Infant mortality and low birth weight among black and white infants—United States, 1980–2000. Morb. Mortal. Wkly Rep. 2002;51:589–592. [PubMed] [Google Scholar]
  3. Cochran W.G. Some methods for strengthening the common X2 tests. Biometrics. 1954;10:417–451. doi: 10.2307/3001616. [DOI] [Google Scholar]
  4. Cornfield J., Haenszel W., Lilienfeld A., Shimkin M., Wynder E. Smoking and lung cancer. J. Natl Cancer Inst. 1959;22:173–203. [PubMed] [Google Scholar]
  5. David R.J., Collins J.W. Differing birth weight among infants of U.S. born blacks, African-born blacks, and U.S. born whites. N. Engl. J. Med. 1997;337:1209–1214. doi: 10.1056/NEJM199710233371706. [DOI] [PubMed] [Google Scholar]
  6. Gastwirth J.L. Academic Press; San Diego, CA: 1988. Statistical reasoning in law and public policy. [Google Scholar]
  7. Gastwirth J.L. Methods for assessing the sensitivity of statistical comparisons used in Title VII cases to omitted variables. Jurimetrics. 1992;33:19–34. [Google Scholar]
  8. Gastwirth J.L., Krieger A.M., Rosenbaum P.R. Dual and simultaneous sensitivity analysis for matched pairs. Biometrika. 1998;85:907–920. doi: 10.1093/biomet/85.4.907. [DOI] [Google Scholar]
  9. Geronimus A.T., Keene D., Hicken M., Bound J. Black–white differences in age trajectories of hypertension prevalence among adult women and men, 1999–2002. Ethn. Dis. 2007;17:40–48. [PubMed] [Google Scholar]
  10. Hills M., De Stavola B. Timberlake Consultants Ltd; London, UK: 2002. A short introduction to Stata 8 for biostatistics.http://www.timberlake.co.uk [Google Scholar]
  11. Holland P.W. Statistics and causal inference. J. Am. Stat. Assoc. 1986;81:945–960. doi: 10.2307/2289064. [DOI] [Google Scholar]
  12. Mantel N., Haenszel W. Statistical aspects of retrospective studies of disease. J. Natl Cancer Inst. 1959;22:719–748. [PubMed] [Google Scholar]
  13. Rosenbaum P.R. 2nd edn. Springer; New York, NY: 2002. Observational studies. [Google Scholar]
  14. Rosenbaum P.R., Krieger A. Sensitivity analysis for two-sample permutation inference in observational studies. J. Am. Stat. Assoc. 1990;85:493–498. doi: 10.2307/2289789. [DOI] [Google Scholar]
  15. Rosenbaum P.R., Rubin D.B. Assessing sensitivity to an unobserved binary covariate in an observational study with binary outcomes. J. R. Stat. Soc. Ser. B. 1983;45:212–218. [Google Scholar]
  16. Rosenbaum P.R., Rubin D.B. Reducing bias in observational studies using subclassification on the propensity score. J. Am. Stat. Assoc. 1984;79:516–524. doi: 10.2307/2288398. [DOI] [Google Scholar]
  17. US Department of Health and Human Services, 2000 Healthy people 2010, vols. 2 (conference ed.). Washington, DC: US Department of Health and Human Services.
  18. Woolson R.F., Bean J.A., Rojas P.B. Sample size for case-control studies using Cochran's statistic. Biometrics. 1986;42:927–932. doi: 10.2307/2530706. [DOI] [PubMed] [Google Scholar]
  19. Yu B., Gastwirth J.L. Sensitivity analysis for trend tests: application to the risk of radiation exposure. Biostatistics. 2005;6:201–209. doi: 10.1093/biostatistics/kxi003. [DOI] [PubMed] [Google Scholar]
  20. Zanutto E.L., Lu B., Hornik R. Using propensity score subclassification for multiple treatment doses to evaluate a national anti-drug media campaign. J. Educ. Behav. Stat. 2005;30:59–73. doi: 10.3102/10769986030001059. [DOI] [Google Scholar]

Articles from Philosophical transactions. Series A, Mathematical, physical, and engineering sciences are provided here courtesy of The Royal Society

RESOURCES