Abstract
Background. Known and unknown/unmeasured risk factors are the main sources of confounding effects in observational studies and can lead to false observations of elevated protective or hazardous effects. In this study, we investigate an alternative approach of analysis that is operated on field-specific knowledge rather than pure statistical assumptions. Method. The proposed approach introduces a proxy outcome into the estimation system. A proxy outcome possesses the following characteristics: (i) the exposure of interest is not a cause for the proxy outcome; (ii) causes of the proxy outcome and the study outcome are subsets of a collection of correlated variables. Based on these two conditions, the confounding-effect-driven association between the exposure and proxy outcome can then be measured and used as a proxy estimate for the effects of unknown/unmeasured confounders on the outcome of interest. Performance of this approach is tested by a simulation study, whereby 500 different scenarios are generated, with the causal factors of a proxy outcome and a study outcome being partly overlapped under low-to-moderate correlations. Results. The simulation results demonstrate that the conventional approach only led to a correct conclusion in 21% of the 500 scenarios, as compared to 72.2% for the alternative approach. Conclusion. The proposed method can be applied in observational studies in social science and health research that evaluates the health impact of behaviour and mental health problems.
1. Background
Due to lack of randomization, estimates obtained from observational studies are often affected by uncontrolled or unmeasured confounding effects. Several methods have been proposed to deal with the problem [1–11], but their application relies on assumptions about the distribution of the unknown confounding factor(s) in relation to the outcome, the exposure, and other known covariates.
The present study investigates an alternative approach of analysis that makes use of field-specific knowledge to determine a proxy outcome, which then will be employed to estimate uncontrolled confounding effects. A proxy outcome should satisfy the following conditions: (i) the exposure of interest is not a cause for the proxy outcome; and (ii) causes of the proxy outcome and the main outcome are subsets of a collection of correlated variables. If condition one is satisfied, then it is certain that the observed association between the exposure and the proxy outcome is completely driven by confounding effects. Nevertheless, condition one may be relaxed to some extent, for example, when it is certain that the confounding effect is by far stronger than possible causal effect. When condition two is satisfied, confounders for the proxy outcome and the outcome of interest are similar or at least correlated. For example, various forms of physical and mental health outcomes can be affected by a cluster of socioeconomic, behavioural, psychological, and genetic factors [12–22]. Researchers can apply their field knowledge and experience to determine the best proxy outcome for their outcome and exposure of interest.
Intuitively, let y and be the main outcome and the proxy outcome, respectively; r denotes the possible risk of y due to a given level of exposure; t 0 and t 1 the unexposed and exposed person-time at risk; d 0 and d 1 represent the number of cases of y observed in t 0 and t 1, whereas d 0′ and d 1′ are the number of cases of observed in t 0 and t 1, respectively. Further, suppose that P(d 0) and P(d 1) define the expected probability of one or more sufficient causes that the exposure of interest is not involved, occurring for y in a unit of person-time at risk within t 0 and t 1, respectively; correspondingly, P(d 0′) and P(d 1′) define the expected probability of one or more sufficient causes occurring for in a unit of person-time at risk within t 0 and t 1, respectively. Then, the observed crude risk ratios are
| (1) |
If the sufficient causes of y and are the same, largely overlapped, or strongly correlated, then
| (2) |
will be observed if the exposure is not causal for y (r = 0), whereas
| (3) |
will be observed if the exposure is causal for y (r > 0). Given this strict assumption, the method has been successfully applied in time series analysis recently, whereby the proxy outcome was described as a control series [23, 24]. For example, the study by Herttua and colleagues investigated the effect of alcohol price (exposure) on alcohol-related mortality (outcome of interest), while coronary operations were used as the control series (proxy outcome) [23]. Nevertheless, such a method may remain valid when assumptions are relaxed and can be applied to different study designs. Because the details of underlying causal mechanisms for an outcome (i.e., all sufficient causes) are typically unknown, it is best to ascertain the validity of the method through simulations. Therefore, we conduct a simulation study to test its application, focusing on situations when the causes of y and are only partly overlapped and have only low to moderate strength of association.
2. Method
2.1. Simulation Design
The simulation process follows the sufficient cause model [25]. In the simulation for an event to occur, at least one sufficient cause has to occur, which comprises the occurrence of two matched causal components and absence of any competing event. In addition, a randomly distributed small error term is introduced to ensure that perfect prediction (which interrupts the computing process) will not occur. To account for the fact that only certain real causal factors are known yet some of the noncausal factors are mistaken as causal factors, a collection of variables are included to encompass exposure, causal factors, and noncausal factors, while a subset from the pool provides the known variables. All simulations are performed within the STATA package release 12.
The simulations contain 500 replicates, with each replicate being generated through an independent process. There are seven simulation steps involved in each replicate.
Generate a collection of correlated binary variables, X 40×50000 = {X in}, i = (1, 2, 3,…, 40), and n = (1, 2, 3,…, 50000). For each n, X in is set to 1 if an intermediate process variable T in ≥ 0.75 and 0 otherwise, where T in = V in P i + U n(1 − P i) is a combination of a variable component (V i) and an unique component (U) for each i, both being uniform [0,1) distributed random variables, and P i is a random proportion drawn from a uniform [0.3, 0.8) distribution. The range [0.3, 0.8) is chosen in order to set a low to moderate level of correlation among X. The mean (standard deviation), 25th, 50th, and 75th percentiles of the correlation coefficients for the matrix X are 0.29 (0.12), 0.20, 0.26, and 0.34, respectively.
Determine the real causes for outcomes of interest A, B, and proxy outcome C. Real causes for A, B, and C are subsets of X in which X i and X i+10, i = (1,2, 3,…, 10) form ten matched pairs. Let F j,i indicate the factual causes, j = (A, B, C) and i = (1,2, 3,…, 10). In this simulation, we force F A,1 = 1 and F B,1 = F C,1 = 0; that is, X 1 is causal for A, but not causal for B and C. For i = (2, 3,…, 10), F j,i is a random value drawn from the Bernoulli distribution with probability of success = 0.5, value of success = 1, and value of failure = 0. For example if F A,2 = 1, then the pair of X 2 and X 12 is a cause of A.
Generate competing events for outcomes A, B, and C. Let E j,n denote the competing events for outcomes A, B, and C, j = (A, B, C) and n = (1, 2, 3,…, 50000). E is a Bernoulli distributed random variable with a probability of success = 0.1, value of success = 1, and value of failure = 0. E is also independent of X.
Generate background errors for outcomes A, B, and C. Let Q j,n denote “background” sufficient causes, j = (A, B, C) and n = (1, 2, 3,…, 50000). Q is a Bernoulli distributed random variable with a probability of success = 0.05, value of success = 1, and value of failure = 0. Q is independent of E and X. Q services as small random error, and it is introduced to smooth the computing process only.
Determine the status (occur or not occur) of outcomes A, B, and C. Let Y j,n where j = (A, B, C) and n = (1, 2, 3,…, 50000) denote the status of outcomes A, B, and C. Value of each Y j,n is determined as follows. For each observation n, Y j,n = 1 (outcome occurred) if Q j,n = 1, or for i = (1, 2, 3,…, 10), ∑i X in X (i+10)n F j,i ≥ 1 when E j,n = 0; Otherwise Y j,n = 0 (outcome not occurred).
Determine the known/suspected (not necessary the fact) “causal” factors (except X 1) for outcomes A and B. The known/suspected “causal” factors for outcomes A, B, and C are determined for X i for i = (1,2, 3,…, 40). Let K j,i denote the researcher's knowledge (not necessary the fact) of causes for outcomes A and B. Let j = (A, B), and K j,i indicates a known “causal” factor. Because X 1 is the exposure of interest, so we force each K j,i = 1 when i = 1. For i = (2, 3,…, 20), K j,i is a random value drawn from a Bernoulli distribution with a probability of success = 0.5. For i = (21, 22, 23,…, 40), K j,i is a random value drawn from the Bernoulli distribution with a probability of success = 0.15, value of success = 1, and value of failure = 0. The difference in the success rates between the two groups indicates that a real causal factor is more likely to be acknowledged than a noncausal factor.
Estimate the effects of X 1 on outcomes A and B based on the fact model, and compare the conventional approach with the proposed approach. Let G j,n, j = (A, B), n = (1, 2, 3,…, 50000), be the presence of causes (except X 1 and X 11) for outcomes A and B for each observation. For each n, G j,n = 1 for i = (2, 3 …, 10), if ∑i X in X (i+10)n F j,i ≥ 1; G j,n = 0 otherwise.
The true effects of X 1 on outcomes A and B based on the fact model are estimated by
| (4) |
where β j,i is the estimated real effect of X i on outcome j for j = (A, B). To estimate the effects of X 1 on outcomes A and B based on known/suspected confounders and applying standard multivariate logistic regression as the adjustment method, we have
| (5) |
where β j,i′ is the estimated effect of X i on outcome j for j = (A, B). To estimate the confounding effects on X 1 on outcomes A and B using the proxy outcome C, the logistic model for adjustment becomes
| (6) |
where is the estimated effect of X i on outcome C (proxy outcome) for j = (A, B).
The adjusted effects of X 1 on A and B are and , respectively.
2.2. Classification of Effect of X 1 on A and B
Based on the fact model, X 1 increases risk of outcome A if β A,1 > 0.05 and P value for β A,1 < 0.05; otherwise X 1 has no effect on A. Also, X 1 increases risk of outcome B if β B,1 > 0.05 and P value for β B,1 < 0.05; otherwise X 1 has no effect on B. The same effect patterns hold analogously for the conventional model and the alternative approach by replacing the regression coefficient β j,1 with β j,1′ and , respectively. Classifications of the effect of X 1 based on the fact model are then used as the gold standard to compare with the classifications based on the conventional approach and the alternative approach.
2.3. Empirical Application
A simple example is provided to clarify the methodology. For additional illustration of the proxy outcome method to adjust for residual confounding effects, interested readers are referred to the first author's recently published research [26]. Briefly, when investigating the effect of alcohol use (exposure) on general health status (outcome), both measured and unmeasured confounding factors are involved. Many of these confounding factors are clustered within the family such as socioeconomic determinants, environmental factors, lifestyle, and genetic susceptibility. Although current alcohol use by adults does not produce any physiological effect on their children's current health, observed effect of current alcohol use (exposure) on their children's health status (proxy outcome) can be used as an approximation of confounding effects.
This example used the data from the 2010 National Health Interview Survey. A first logistic regression model was fitted to compare the likelihood of having undesirable (poor or fair) health status (outcome) between lifetime abstainers and current light drinkers (exposure). A second logistic regression model was then applied to compare the likelihood of having undesirable health status in the children (proxy outcome) in relation to the drinking status of their parents. To adjust for confounding effects, natural logarithm of the odds ratios from the second model was introduced as an offset variable into the first model.
3. Results
Estimates based on the knowledge and conventional model from one replicate are shown in Tables 1 and 2 as an example. Both outcomes A and B are treated as the outcomes of interest, while outcome C is used as the proxy outcome. In this replicate, between outcomes A and C, there are four common causal factors. These account for 33% and 40% of all causal factors for A and C, respectively. Except for the exposure of interest (X 1), 54% of causal factors for A are known. The true effect of X 1 on outcome A (β A,1) based on the fact model is 1.27 (P value < 0.001) indicating that X 1 is a real causal factor to A. The estimated effect based on the conventional approach (β A,1′) is 0.79 (P value < 0.001). The estimated effect based on the alternative approach () is 0.79 − 0.09 = 0.70. Both the conventional approach and alternative approach lead to the same correct conclusion that X 1 is a causal factor to A.
Table 1.
Data example of a replicate/scenario, estimated effects (coefficients from logistic models) of exposure (X 1), and known “causal”/confounding factors of A on A and proxy outcome C.
| Effects on A | Effects on C |
Indicators for real causal factors (1 = Yes, 0 = No) |
||||
|---|---|---|---|---|---|---|
| Coefficient | P value | Coefficient | P value | Causal to A | Causal to C | |
| X 1 | 0.79 | 0.000 | 0.09 | 0.011 | 1 | 0 |
| X 2 | 0.19 | 0.000 | 0.92 | 0.000 | 0 | 1 |
| X 3 | 0.69 | 0.000 | 0.03 | 0.360 | 1 | 0 |
| X 4 | 0.46 | 0.000 | 0.50 | 0.000 | 0 | 0 |
| X 5 | ∗ | ∗ | 1 | 1 | ||
| X 6 | ∗ | ∗ | 1 | 1 | ||
| X 7 | 0.87 | 0.000 | 0.29 | 0.000 | 1 | 0 |
| X 8 | 0.20 | 0.000 | 0.84 | 0.000 | 0 | 1 |
| X 9 | 0.04 | 0.293 | 0.80 | 0.000 | 0 | 1 |
| X 10 | 0.77 | 0.000 | 0.13 | 0.000 | 1 | 0 |
| X 11 | ∗ | ∗ | 1 | 0 | ||
| X 12 | 0.15 | 0.000 | 0.79 | 0.000 | 0 | 1 |
| X 13 | ∗ | ∗ | 1 | 0 | ||
| X 14 | ∗ | ∗ | 0 | 0 | ||
| X 15 | 0.85 | 0.000 | 0.91 | 0.000 | 1 | 1 |
| X 16 | ∗ | ∗ | 1 | 1 | ||
| X 17 | 0.73 | 0.000 | 0.10 | 0.009 | 1 | 0 |
| X 18 | 0.15 | 0.000 | 0.89 | 0.000 | 0 | 1 |
| X 19 | ∗ | ∗ | 0 | 1 | ||
| X 20 | 0.95 | 0.000 | 0.35 | 0.000 | 1 | 0 |
| X 21 | ∗ | ∗ | 0 | 0 | ||
| X 22 | 0.27 | 0.000 | 0.16 | 0.000 | 0 | 0 |
| X 23 | 0.25 | 0.000 | 0.19 | 0.000 | 0 | 0 |
| X 24 | 0.20 | 0.000 | 0.18 | 0.000 | 0 | 0 |
| X 25 | ∗ | ∗ | 0 | 0 | ||
| X 26 | ∗ | ∗ | 0 | 0 | ||
| X 27 | ∗ | ∗ | 0 | 0 | ||
| X 28 | ∗ | ∗ | 0 | 0 | ||
| X 29 | 0.50 | 0.000 | 0.47 | 0.000 | 0 | 0 |
| X 30 | ∗ | ∗ | 0 | 0 | ||
| X 31 | ∗ | ∗ | 0 | 0 | ||
| X 32 | ∗ | ∗ | 0 | 0 | ||
| X 33 | ∗ | ∗ | 0 | 0 | ||
| X 34 | ∗ | ∗ | 0 | 0 | ||
| X 35 | ∗ | ∗ | 0 | 0 | ||
| X 36 | ∗ | ∗ | 0 | 0 | ||
| X 37 | ∗ | ∗ | 0 | 0 | ||
| X 38 | 0.13 | 0.001 | 0.17 | 0.000 | 0 | 0 |
| X 39 | ∗ | ∗ | 0 | 0 | ||
| X 40 | 0.16 | 0.000 | 0.17 | 0.000 | 0 | 0 |
*indicates variable is not known as a “causal” factor for A, therefore is not included in the models.
Table 2.
Data example of a replicate/scenario, estimated effects (coefficients from logistic models) of exposure (X 1), and known “causal”/confounding factors of B on B and proxy outcome C.
| Effects on B | Effects on C |
Indicators for real causal factors (1 = Yes, 0 = No) |
||||
|---|---|---|---|---|---|---|
| Coefficient | P value | Coefficient | P value | Causal to B | Causal to C | |
| X 1 | 0.12 | 0.001 | 0.16 | 0.000 | 0 | 0 |
| X 2 | ∗ | ∗ | 1 | 1 | ||
| X 3 | ∗ | ∗ | 1 | 0 | ||
| X 4 | 0.39 | 0.000 | 0.62 | 0.000 | 0 | 0 |
| X 5 | ∗ | ∗ | 0 | 1 | ||
| X 6 | ∗ | ∗ | 0 | 1 | ||
| X 7 | 0.24 | 0.000 | 0.43 | 0.000 | 0 | 0 |
| X 8 | ∗ | ∗ | 1 | 1 | ||
| X 9 | ∗ | ∗ | 0 | 1 | ||
| X 10 | ∗ | ∗ | 0 | 0 | ||
| X 11 | ∗ | ∗ | 0 | 0 | ||
| X 12 | 1.22 | 0.000 | 0.88 | 0.000 | 1 | 1 |
| X 13 | 1.15 | 0.000 | 0.17 | 0.000 | 1 | 0 |
| X 14 | 0.23 | 0.000 | 0.27 | 0.000 | 0 | 0 |
| X 15 | 0.16 | 0.000 | 1.04 | 0.000 | 0 | 1 |
| X 16 | 0.31 | 0.000 | 1.56 | 0.000 | 0 | 1 |
| X 17 | 0.13 | 0.000 | 0.19 | 0.000 | 0 | 0 |
| X 18 | ∗ | ∗ | 1 | 1 | ||
| X 19 | ∗ | ∗ | 0 | 1 | ||
| X 20 | 0.28 | 0.000 | 0.43 | 0.000 | 0 | 0 |
| X 21 | ∗ | ∗ | 0 | 0 | ||
| X 22 | 0.17 | 0.000 | 0.24 | 0.000 | 0 | 0 |
| X 23 | ∗ | ∗ | 0 | 0 | ||
| X 24 | ∗ | ∗ | 0 | 0 | ||
| X 25 | ∗ | ∗ | 0 | 0 | ||
| X 26 | ∗ | ∗ | 0 | 0 | ||
| X 27 | ∗ | ∗ | 0 | 0 | ||
| X 28 | 0.23 | 0.000 | 0.26 | 0.000 | 0 | 0 |
| X 29 | 0.31 | 0.000 | 0.58 | 0.000 | 0 | 0 |
| X 30 | ∗ | ∗ | 0 | 0 | ||
| X 31 | ∗ | ∗ | 0 | 0 | ||
| X 32 | ∗ | ∗ | 0 | 0 | ||
| X 33 | ∗ | ∗ | 0 | 0 | ||
| X 34 | ∗ | ∗ | 0 | 0 | ||
| X 35 | ∗ | ∗ | 0 | 0 | ||
| X 36 | ∗ | ∗ | 0 | 0 | ||
| X 37 | ∗ | ∗ | 0 | 0 | ||
| X 38 | ∗ | ∗ | 0 | 0 | ||
| X 39 | ∗ | ∗ | 0 | 0 | ||
| X 40 | ∗ | ∗ | 0 | 0 | ||
*indicates variable is not known as a “causal” factor for B, therefore is not included in the models.
Between outcomes B and C there are also four common factors. These account for 67% and 40% of all causal factors for B and C, respectively. Except the exposure of interest (X 1), 33% of causal factors for B are known. The true effect of X 1 on outcome B (β B,1) based on the fact model is −0.024 (P value = 0.668) indicating that X 1 is not a causal factor to B. The estimated effect based on the conventional approach (β B,1′) is 0.12 (P value < 0.001). The estimated effect based on the alternative approach () is 0.12 − 0.16 = −0.04. Therefore, based on estimation from the conventional approach one would mistakenly draw the conclusion that “X 1 is a causal factor to B.” However, given that , the alternative approach has led to the correct conclusion that X 1 is not a causal factor to B.
Table 3 summarises findings from the 500 replicates. Based on the fact model, the exposure of interest (X 1) is classified as a causal factor for outcome A in all replicates. This is in perfect agreement with the simulation process that X 1 is set to be a causal factor for A. In the simulation process, X 1 is set to be a noncausal factor for B; in 97.4% of the 500 replicates, the fact model concludes that X 1 is not a causal factor for B, but in 2.6% of the replicates the fact model concludes that X 1 is a causal factor for B. The disagreement between the simulation process and the fact model in these 2.6% replicates is a result of type I error (setting the two-sided confidence interval to 95%). Nevertheless, in all replicates, both the conventional and alternative approaches have classified X 1 as a causal factor for A. When comparing the estimates between the conventional approach and the fact model, the two models have led to the same conclusion in only 21% of the replicates, while in 72.2% of the replicates, the alternative approach has led to the same conclusion as the fact model. When the simulation process is used as the gold standard for classification (i.e., X 1 is causal for A, but not causal for B), the sensitivity of the new approach is 100%, and the specificity is 70.2%.
Table 3.
Effect of X 1 on A and B based on the fact model, conventional approach, and the alternative approach.
| Fact model | Conventional approach | Alternative approach (%) | ||
|---|---|---|---|---|
| X 1 causal to A but not B (%) | X 1 causal to A and B (%) | X 1 causal to A but not B (%) | X 1 causal to A and B (%) | |
| X 1 causal to A but not B | 93 (18.6%) | 394 (78.8%) | 347 (69.4%) | 140 (28.0%) |
| X 1 causal to A and B | 1 (0.2%) | 12 (2.4%) | 4 (0.8%) | 9 (1.8%) |
Table 4 presents results of the example. Alcohol use by adults has similar effects on their health status and their children's health status, whereas the effect on children's health status is mediated by confounding factors. To account for the uncontrolled confounding effects when estimating the effect of adults' alcohol use on their health status, an offset variable, which takes on the value of the natural logarithm of 1 for lifetime abstainers and 0.60 for current light drinkers, is added to the model. The adjustment changes the odds ratio from 0.54 (P < 0.001) to 0.90 (P = 0.38).
Table 4.
Empirical application, predicting adults' health status (outcome of interest) and their children's health status (proxy outcome) by alcohol use (proxy outcome).
| Have undesirable health status in adults | Have undesirable health status in children | |||||
|---|---|---|---|---|---|---|
| Odds Ratio | 95% confidence interval | Odds ratio | 95% confidence interval | |||
| Parents' drinking behaviour | ||||||
| Lifetime abstainers | 1.00 | Reference | 1.00 | Reference | ||
| Current light drinkers | 0.54 | 0.42 | 0.69 | 0.60 | 0.50 | 0.71 |
4. Discussion
In this study, we introduce a new analysis approach for causal effect. Although the new approach is only applicable to measurements of relative effects (i.e., risk ratios, odds ratios), it does not require any distributional assumption for the confounding variables in relation to the outcome, the exposure, and other known confounding variables. Instead, the approach merely assumes that the causes for the outcome of interest and proxy outcome are partly overlapped and correlated. The choice of an optimal proxy outcome is achievable by directly applying field expertise without advanced knowledge in statistics. The simulation results show that the alternative approach is far more accurate than the conventional approach in classifying causal associations, even under conditions of low to moderate correlation between the causes for the outcome and causes for the proxy outcome. The proposed approach appears to be suitable for observational studies in social science and health research that evaluate the health impact of behaviour and mental health problems, especially where clusters of causes for various outcomes are strongly correlated and overlapped in these fields [12, 27, 28].
It should be remarked that the analysis can only be performed when effects are measured by relative risk difference such as risk ratio or odds ratio. Another limitation is that false classification remains possible, even though the proposed method appears to have an advantage over the conventional approach. In this study, we demonstrate a new simulation process that incorporates the component causes, competing events, difference between the fact and the knowledge, to model realistic scenarios in observational studies. This simulation process could be further developed and used to determine how knowledge that deviates from the fact can introduce bias in estimates.
5. Conclusion
In conclusion, the proposed proxy outcome approach can be applied in observational studies in social science and health research that evaluate the health impact of behaviour and mental health problems.
Conflict of Interests
The authors declare that they have no conflict of interests.
References
- 1.Lin DY, Psaty BM, Kronmal RA. Assessing the sensitivity of regression results to unmeasured confounders in observational studies. Biometrics. 1998;54(3):948–963. [PubMed] [Google Scholar]
- 2.Greenland S. The impact of prior distributions for uncontrolled confounding and response bias: a case study of the relation of wire codes and magnetic fields to childhood leukemia. Journal of the American Statistical Association. 2003;98(461):47–54. [Google Scholar]
- 3.Greenland S, Copas J, Jones DR, et al. Multiple-bias modelling for analysis of observational data. Journal of the Royal Statistical Society A. 2005;168(2):267–306. [Google Scholar]
- 4.McCandless LC, Gustafson P, Levy A. Bayesian sensitivity analysis for unmeasured confounding in observational studies. Statistics in Medicine. 2007;26(11):2331–2347. doi: 10.1002/sim.2711. [DOI] [PubMed] [Google Scholar]
- 5.Arah OA, Chiba Y, Greenland S. Bias formulas for external adjustment and sensitivity analysis of unmeasured confounders. Annals of Epidemiology. 2008;18(8):637–646. doi: 10.1016/j.annepidem.2008.04.003. [DOI] [PubMed] [Google Scholar]
- 6.Palmer TM, Thompson JR, Tobin MD, Sheehan NA, Burton PR. Adjusting for bias and unmeasured confounding in Mendelian randomization studies with binary responses. International Journal of Epidemiology. 2008;37(5):1161–1168. doi: 10.1093/ije/dyn080. [DOI] [PubMed] [Google Scholar]
- 7.VanderWeele TJ, Hernán MA, Robins JM. Causal directed acyclic graphs and the direction of unmeasured confounding bias. Epidemiology. 2008;19(5):720–728. doi: 10.1097/EDE.0b013e3181810e29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chiba Y. The sign of the unmeasured confounding bias under various standard populations. Biometrical Journal. 2009;51(4):670–676. doi: 10.1002/bimj.200800195. [DOI] [PubMed] [Google Scholar]
- 9.Lee W-C. Bounding the bias of unmeasured factors with confounding and effect-modifying potentials. Statistics in Medicine. 2011;30(9):1007–1017. doi: 10.1002/sim.4151. [DOI] [PubMed] [Google Scholar]
- 10.Vanderweele TJ, Arah OA. Bias formulas for sensitivity analysis of unmeasured confounding for general outcomes, treatments, and confounders. Epidemiology. 2011;22(1):42–52. doi: 10.1097/EDE.0b013e3181f74493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.McCandless LC, Gustafson P, Levy AR, Richardson S. Hierarchical priors for bias parameters in Bayesian sensitivity analysis for unmeasured confounding. Statistics in Medicine. 2012;31(4):383–396. doi: 10.1002/sim.4453. [DOI] [PubMed] [Google Scholar]
- 12.Jacobi F, Wittchen H-U, Hölting C, et al. Prevalence, co-morbidity and correlates of mental disorders in the general population: results from the German Health Interview and Examination Survey (GHS) Psychological Medicine. 2004;34(4):597–611. doi: 10.1017/S0033291703001399. [DOI] [PubMed] [Google Scholar]
- 13.Chan Y-F, Dennis ML, Funk RR. Prevalence and comorbidity of major internalizing and externalizing problems among adolescents and adults presenting to substance abuse treatment. Journal of Substance Abuse Treatment. 2008;34(1):14–24. doi: 10.1016/j.jsat.2006.12.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Teesson M, Hall W, Slade T, et al. Prevalence and correlates of DSM-IV alcohol abuse and dependence in Australia: findings of the 2007 National Survey of Mental Health and Wellbeing. Addiction. 2010;105(12):2085–2094. doi: 10.1111/j.1360-0443.2010.03096.x. [DOI] [PubMed] [Google Scholar]
- 15.Liang W, Chikritzhs T. Does light alcohol consumption during pregnancy improve offspring’s cognitive development? Medical Hypotheses. 2012;78(1):69–70. doi: 10.1016/j.mehy.2011.09.043. [DOI] [PubMed] [Google Scholar]
- 16.Liang W, Chikritzhs T. Alcohol consumption and health status of family members: health impacts without ingestion. Internal Medicine Journal. 2013;43(9):1012–1016. doi: 10.1111/imj.12041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Petridou E, Zavitsanos X, Dessypris N, et al. Adolescents in high-risk trajectory: clustering of risky behavior and the origins of socioeconomic health differentials. Preventive Medicine. 1997;26(2):215–219. doi: 10.1006/pmed.1996.0130. [DOI] [PubMed] [Google Scholar]
- 18.Lynch JW, Kaplan GA, Salonen JT. Why do poor people behave poorly? Variation in adult health behaviours and psychosocial characteristics by stages of the socioeconomic lifecourse. Social Science and Medicine. 1997;44(6):809–819. doi: 10.1016/s0277-9536(96)00191-8. [DOI] [PubMed] [Google Scholar]
- 19.Rozanski A, Blumenthal JA, Davidson KW, Saab PG, Kubzansky L. The epidemiology, pathophysiology, and management of psychosocial risk factors in cardiac practice: the emerging field of behavioral cardiology. Journal of the American College of Cardiology. 2005;45(5):637–651. doi: 10.1016/j.jacc.2004.12.005. [DOI] [PubMed] [Google Scholar]
- 20.Suzuki K-J, Nakaji S, Tokunaga S, Shimoyama T, Umeda T, Sugawara K. Confounding by dietary factors in case-control studies on the efficacy of cancer screening in Japan. European Journal of Epidemiology. 2005;20(1):73–78. doi: 10.1007/s10654-004-0872-z. [DOI] [PubMed] [Google Scholar]
- 21.Groenwold RHH, Hoes AW, Hak E. Confounding in publications of observational intervention studies. European Journal of Epidemiology. 2007;22(7):413–415. doi: 10.1007/s10654-007-9126-1. [DOI] [PubMed] [Google Scholar]
- 22.Lagiou P, Adami H-O, Trichopoulos D. Causality in cancer epidemiology. European Journal of Epidemiology. 2005;20(7):565–574. doi: 10.1007/s10654-005-7968-y. [DOI] [PubMed] [Google Scholar]
- 23.Herttua K, Mäkelä P, Martikainen P. An evaluation of the impact of a large reduction in alcohol prices on alcohol-related and all-cause mortality: time series analysis of a population-based natural experiment. International Journal of Epidemiology. 2011;40(2):441–454. doi: 10.1093/ije/dyp336.dyp336 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Poirier E, Watier L, Espie E, Weill F-X, de Valk H, Desenclos J-C. Evaluation of the impact on human salmonellosis of control measures targeted to Salmonella Enteritidis and Typhimurium in poultry breeding using time-series analysis and intervention models in France. Epidemiology and Infection. 2008;136(9):1217–1224. doi: 10.1017/S0950268807009788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Rothman KJ, Greenland S, Poole C, Lash TL. Causation and causal inference. Modern Epidemiology. 1998;2:7–28. [Google Scholar]
- 26.Liang W, Chikritzhs T. Observational research on alcohol use and chronic disease outcome: new approaches to counter biases. The Scientific World Journal. 2013;2013:14 pages. doi: 10.1155/2013/860915.860915 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zuckerman M, Kuhlman DM. Personality and risk-taking: common biosocial factors. Journal of Personality. 2000;68(6):999–1029. doi: 10.1111/1467-6494.00124. [DOI] [PubMed] [Google Scholar]
- 28.Kessler RC, Nelson CB, McGonagle KA, Edlund MJ, Frank RG, Leaf PJ. The epidemiology of co-occurring addictive and mental disorders: implications for prevention and service utilization. American Journal of Orthopsychiatry. 1996;66(1):17–31. doi: 10.1037/h0080151. [DOI] [PubMed] [Google Scholar]
