Skip to main content
American Journal of Human Genetics logoLink to American Journal of Human Genetics
. 2003 Aug 15;73(3):612–626. doi: 10.1086/378208

Novel Case-Control Test in a Founder Population Identifies P-Selectin as an Atopy-Susceptibility Locus

Catherine Bourgain 1,,*, Sabine Hoffjan 1, Raluca Nicolae 1, Dina Newman 1, Lori Steiner 3, Karen Walker 3, Rebecca Reynolds 3, Carole Ober 1, Mary Sara McPeek 1,2
PMCID: PMC1180685  PMID: 12929084

Abstract

To avoid problems related to unknown population substructure, association studies may be conducted in founder populations. In such populations, however, the relatedness among individuals may be considerable. Neglecting such correlations among individuals can lead to seriously spurious associations. Here, we propose a method for case-control association studies of binary traits that is suitable for any set of related individuals, provided that their genealogy is known. Although we focus here on large inbred pedigrees, this method may also be used in outbred populations for case-control studies in which some individuals are relatives. We base inference on a quasi-likelihood score (QLS) function and construct a QLS test for allelic association. This approach can be used even when the pedigree structure is far too complex to use an exact-likelihood calculation. We also present an alternative approach to this test, in which we use the known genealogy to derive a correction factor for the case-control association χ2 test. We perform analytical power calculations for each of the two tests by deriving their respective noncentrality parameters. The QLS test is more powerful than the corrected χ2 test in every situation considered. Indeed, under certain regularity conditions, the QLS test is asymptotically the locally most powerful test in a general class of linear tests that includes the corrected χ2 test. The two methods are used to test for associations between three asthma-associated phenotypes and 48 SNPs in 35 candidate genes in the Hutterites. We report a highly significant novel association (P=2.10-6) between atopy and an amino acid polymorphism in the P-selectin gene, detected with the QLS test and also, but less significantly (P=.0014), with the transmission/disequilibrium test.

Introduction

Association studies are an essential step in the genetic dissection of complex traits. Whereas linkage studies yield relatively broad locations for susceptibility loci, association studies can be used to test the role of particular candidate genes. However, classical case-control tests might detect differences between cases and controls owing to ignored population substructure or improperly accounted relatedness among individuals and not necessarily owing to a true association between a locus and a trait. We focus here on the problem of performing valid association studies for binary traits in samples with related individuals in which the relationships are known. Such a situation may be encountered in outbred populations (Risch and Teng 1998; Slager and Schaid 2001). In isolated populations, the relatedness among individuals may be considerable, with many or all individuals related through multiple lines of descent. Neglecting such correlations among individuals can lead to seriously spurious associations. This has been illustrated by Newman et al. (2001) in a study conducted in a sample of Hutterites from South Dakota. The ∼750 members of their sample are descendants of just 64 founders and are related to each other by a 13-generation, 1,623-member pedigree. Using an association test for quantitative traits developed by Abney et al. (2002) that takes the pedigree structure into account, Newman et al. (2001) illustrated the dramatic effect on false-positive rates of neglecting interindividual correlations.

To overcome this problem, family-based association tests have been widely, if not systematically, used, even though they have some disadvantages. In particular, the need for genotype information on family members, such as parents or sibs, can drastically reduce the number of cases available for a study, a concern that may be particularly relevant for late-onset diseases. Devlin and Roeder (1999) have proposed to use genomic controls to correct association tests for unknown relatedness among individuals. However, when the genealogy of the sample is entirely known, it is preferable to use this information. Slager and Schaid (2001) have derived a correction factor for the Armitage trend test to account for the presence of close relatives (e.g., siblings and cousins) from outbred populations, but this method cannot handle complex inbred pedigrees. Furthermore, little is known about the relative power of these different approaches.

Here, we propose a method for case-control association studies of binary traits, a method suitable for any set of related individuals, provided that their genealogy is known. In particular, it can be used in large inbred pedigrees. The method takes into account interindividual correlations, as well as intraindividual correlations due to inbreeding, by conditioning on the pedigree structure. Because we want the method to be suitable even when the pedigree structure is far too complex to use an exact-likelihood calculation, we base inference on the quasi-likelihood score (QLS) function (also known as the “quasi-score function” [e.g., see McCullagh and Nelder 1989; Heyde 1997]). This quasi-score function has been proposed by Wu et al. (2000) and by M.S.M., X. Wu, and C.O. (unpublished data) for estimation of allele frequencies in large inbred pedigrees. In that case, it results in the best linear unbiased estimator. We extend their approach to construct a QLS test for allelic association. The exact computation of the correlations among alleles requires inbreeding coefficients for all individuals and kinship coefficients for all pairs of individuals included in the analysis.

We also present an alternative approach to this test, in which we derive a correction factor for the case-control association χ2 test based on the variance that appropriately accounts for inter- and intraindividual correlations. As for the QLS test, this correction factor is performed conditional on the pedigree structure. The latter strategy is similar to the one used by Slager and Schaid (2001), although they used a variance computed conditional on the identity-by-descent (IBD) information obtained using the marker genotype data rather than an unconditional variance. We note that exact computation of the conditional variance is generally infeasible in complex pedigrees. By deriving the noncentrality parameters of both the QLS and corrected χ2 statistics, we obtain analytical results that allow us to compare the power of the two approaches. Furthermore, we show that both tests belong to a general class of linear tests, with the QLS test being asymptotically the locally most powerful test of this class under certain regularity conditions.

In what follows, we shall first describe the QLS test statistic for allelic association and derive the variance correction of the case-control χ2 test. We will then study the null distributions of the tests and compare their power in Hutterite samples. Finally we will use these statistics to test, in Hutterite samples, the association between three different asthma-related phenotypes (asthma [MIM 600807], bronchial hyperresponsiveness [BHR], and atopy) and a set of 48 SNPs located in 35 candidate genes that were selected because of their known or suspected role in the inflammatory process.

Methods

The QLS Test for Allelic Association

We develop a test for association between a single marker and a binary trait, based on case-control data from a founder population but also useful in outbred populations with sampled relatives. We first focus on the case of a biallelic marker and then extend the method to the multiallelic case.

Biallelic Case

Consider a group of N subjects sampled from a population of known genealogy. Consider a biallelic marker with alleles labeled “0” and “1.” We start by considering the situation in which the marker has no association with the trait (the null model), and we briefly review the results of Wu et al. (2000) and M.S.M., X. Wu, and C.O. (unpublished data) on allele-frequency estimation for this case. We then present the alternative model and derive the QLS statistic. Let Y=(Y1,…Yi…,YN)T be a vector with element Yi equal to  1/2× (the number of alleles of type 1 in individual i). Yi=0,  1/2, or 1. Let p be the frequency of allele 1, 0<p<1, and let Σ be the covariance matrix of Y. It can be shown that Σ= 1/2p(1-p)K, where

graphic file with name AJHGv73p612df1.jpg

with hi being the inbreeding coefficient of individual i and φij the kinship coefficient between individuals i and j. We note that K will be invertible, provided that each MZ twin pair (if any) is entered as a single individual in the matrix.

Define m=E(Y). By construction, m is a column vector of length N (N-vector) with m=p1, where 1 is an N-vector of 1s. Let Dp= ∂m/∂p=1. Wu et al. (2000) and M.S.M., X. Wu, and C.O. (unpublished data) have proposed to use the quasi-score function U=DTpΣ-1(Y-m) to estimate p by setting Inline graphic (e.g., see McCullagh and Nelder 1989). The solution to this equation is

graphic file with name AJHGv73p612df2.jpg

which M.S.M. and X. Wu (unpublished data) have shown to be the best linear unbiased estimator of p.

We now consider the case in which the marker is associated with the trait (the alternative model). Suppose that Nc subjects among the N are cases and that Nt are controls so that Nc+Nt=N. To test for an allelic association between the marker and the disease, we propose to consider the following model: E(Y)=μ=(μ1,…μi,…μN)T with

graphic file with name AJHGv73p612df3.jpg

Under the null hypothesis of no association, r=0, whereas, under the alternative hypothesis, r≠0. p is now considered a nuisance parameter. We allow the covariance matrix of Y, Ω, to depend on both p and r. However, it turns out that we do not need to specify the exact form of Ω. We simply require that Ω=Σ when r=0 and that Ω be differentiable and invertible. The quasi-score function corresponding to our model is

graphic file with name AJHGv73p612df4.jpg

where

graphic file with name AJHGv73p612df778.jpg

Dp and Dr are N-vectors with Dp as previously described and Dr=(d1dN)T where di=1 if i is a case and di=0 if i is a control. We propose to use this quasi-score function to build a QLS statistic. The classical score statistic when the null hypothesis is composite (r=r0 and p is a nuisance parameter), as described by Cox and Hinkley (1974), has the following form

graphic file with name AJHGv73p612df5.jpg

where Inline graphic is the maximum likelihood estimate of p when r=r0. Inline graphic and Inline graphic are, respectively, the derivative over r of the log-likelihood and the (r,r)th entry of the inverse of the information matrix, both computed with Inline graphic and r=r0. This statistic does not involve the existence of a likelihood function from which the score function is derived by differentiation. It can therefore be generalized to the case of quasi likelihood (Heyde 1997), where the derivative of the log-likelihood is replaced by the quasi-score function U and the information matrix is replaced by E(UUT). In our case, this substitution results in the statistic

graphic file with name AJHGv73p612df6.jpg

where Inline graphic and Σ0 are, respectively, the expectation and the covariance matrix of Y evaluated at r=0 and Inline graphic, where Inline graphic is the maximum quasi-likelihood estimate of p when r=0. Thus, Inline graphic and Inline graphic with Inline graphic calculated using eq. (2). As demonstrated by Heyde (1997), WQLS should follow a χ2 distribution with 1 df under the null hypothesis, provided that Inline graphic under the null. Simulations testing the accuracy of this null distribution will be presented in the “Results” section.

Suppose, now, that the N subjects belong to F independent families sampled in an outbred population. For each family f of size Nf, we define Σf and Kf, its covariance and correlation matrices. Σf and Kf are Nf×Nf matrices with entries given in eq. (1) for Kf and Σf= 1/2p(1-p)Kf. When all the individuals in the sample are outbred, Kf has all its diagonal elements equal to 1. It comes from eq. (2) that

graphic file with name AJHGv73p612df7.jpg

where Yf is the Nf- vector of allele indicators for the Nf members of the fth family and Dpf is an Nf-vector of 1s. Similarly, WQLS is computed as in eq. (4), with all the terms of the form XTΣ-10B in this formula (where X and B are N-vectors: either Dp Dr or Inline graphic), computed as Inline graphic, where Xf and Bf are the Nf-subvectors of X and B corresponding to the Nf members of the fth family.

Multiallelic Case

Consider a locus with a different alleles. Let Y=(Y1,…,Ya-1)T be an [(a-1)N]-vector with Yk=(Yk1,…,YkN)T an N-vector and Yki equal to  1/2(the number of alleles of type k in individual i). If a particular allele k is suspected to be associated with the disease, the locus might be treated as biallelic k/non-k and the test performed just as described in the previous case. Arguably, when there is no prior idea of which allele might be associated with the disease, a more general alternative model should be considered. As a generalization of the biallelic case, we consider E(Y)=μ=(μ1,…,μa-1)T, where μk=(pk+s1rk,…,pk+sNrk)T with si=1 if i is a case and si=0 if i is a control. Here, we write p=(p1,…,pa-1)T and r=(r1,…,ra-1)T, and we assume 0<pk+rk<1 and 0<pk<1 for all k. As in the biallelic case, we allow for a very general form for the covariance matrix of Y. When r=0, this matrix is Σ=FK ( is defined in appendix A), where F is the (a-1)×(a-1) matrix of the form

graphic file with name AJHGv73p612df8.jpg

Note that, in the biallelic case, F reduces to  1/2p(1-p) and Σ= 1/2p(1-p)K, as given in the “Biallelic Case” subsection above. We show in appendix A that, in this case,

graphic file with name AJHGv73p612df9.jpg

where, for all k, Inline graphic, and Inline graphic is the (i,k) entry of F evaluated at Inline graphic. Here, Inline graphic is the quasi-likelihood estimator of p when r=0. For each k, Inline graphic is calculated using eq. (2) with Y replaced by Yk (Wu et al. [2000] and M.S.M., X. Wu, and C.O., unpublished data). In this case, we also expect the null distribution of WQLS to be ∼χ2 with (a-1) df, and we examine the accuracy of this approximation in the “Results” section. In a manner similar to the biallelic case, when the N subjects belong to F independent families, WQLS is computed as in eq. (7) with all the terms of the form XTK-1B in this formula—where X and B are N-vectors: either Dp, Dr or Inline graphic—computed as Inline graphic, where Xf and Bf are the Nf-subvectors of X and B corresponding to the Nf members of the f family.

General Framework for the QLS and χ2 Tests

The WQLS statistic can be seen as a particular case of a more general class of linear statistics of the form Inline graphic, where S=VTY with V≠0 a known [N(a-1)×(a-1)] matrix and E0(S)=0. E0(S) and Inline graphic are, respectively, the expectation and the variance of S when r=r0. Inline graphic depends on p and, in practice, is computed at Inline graphic. In what follows, we refer to this class of linear statistics as the “W class.” In the biallelic case, we can define VQLS=V1, where V1=K-1Dr-(DTrK-1Dp)(DTpK-1Dp)-1K-1Dp. Then, Inline graphic, where SQLS=VTQLSY (this result can be obtained by inserting eq. [2] into eq. [4] and moving all the factors of Inline graphic into the variance term). More generally, we show, in appendix B, that, in the multiallelic case, we have a similar result with VQLS=Ia-1V1, where Ia-1 is the identity matrix of dimension (a-1).

In the special case when the correlations among all the individuals, as well as between the two alleles of an individual, are zero, the classical case-control χ2 test for association also fits in the W class of statistics. For a biallelic locus, Vχ2=V2, where V2=Dr-(DTrDp)(DTpDp)-1Dp (note that V2 is just V1 with K replaced by I). Indeed, Inline graphic with Sχ2=VTχ2Y is equal to the classical case-control χ2 test statistic, as shown in appendix B. The multiallelic case is similar, with Vχ2=Ia-1V2 (see appendix B).

Correction Factor for the Classical χ2 Test

One way to extend the classical χ2 test so that it is valid when the above correlations are not zero, is to use the same Sχ2=VTχ2Y and recompute Inline graphic to take into account the correlations. In the biallelic case, this is done by making use of the fact that Inline graphic, rather than  1/2p(1-p)I. We call the resulting statistic “Wχ2corr,” and we have

graphic file with name AJHGv73p612df10.jpg

where Inline graphic has been substituted for p.

Let ρχ2corr be the correction factor to be applied to the Wχ2 to have a valid test. Wχ2corrχ2corrWχ2 with

graphic file with name AJHGv73p612df11.jpg

Note that ρχ2corr depends only on the sample composition (i.e., who are the cases and who are the controls), not on allele-frequency estimates. It can be shown that the same correction applies when there are a alleles at a locus, so that ρχ2corrWχ2 approximately follows a χ2 distribution with (a-1) df under the null hypothesis. Here again, if the N individuals belong to F independent families, Wχ2corr and ρχ2corr can be computed as above with all the terms of the form XTKB (where X and B are N-vectors) in these formulas, computed as Inline graphic.

Null Distribution of the WQLS and Wχ2corr Statistics

To determine whether the χ2 approximation provides the correct type I error for the tests based on WQLS and Wχ2corr, we performed simulations based on three different real case-control samples of Hutterites from South Dakota. The Hutterites are a North American religious isolate that originated in eastern Europe and whose entire population can be traced back to 90 ancestors in the 1700s/1800s. Sample 1 consisted of 701 Hutterites who were phenotyped for atopy, defined as a positive skin-prick testing (+SPT) to at least 1 of 14 airborne allergens. This sample included 310 individuals with atopy and 391 controls (no +SPT to any of the 14 allergens tested). Sample 2 consisted of 156 individuals with BHR and 434 controls without any asthma symptoms or BHR. Sample 3 consisted of the same 434 controls and 76 individuals out of the 156 cases of sample 2 with self-reported asthma symptoms and a doctor’s diagnosis of asthma and BHR. The latter phenotype will be referred to as “asthma” in what follows. The details of the phenotype have been described elsewhere (Ober et al. 2000). The complete genealogy of these 719 different individuals was constructed from a Hutterite pedigree of ⩾12,000 individuals. This yielded a 1,623-person pedigree that included all known ancestors of the individuals in the three samples. The inbreeding coefficients for all 719 individuals, as well as the kinship coefficients between any pair of individuals in each sample, were computed on the basis of the 1,623-person pedigree, using the algorithm of Boyce (1983). Mean values of the inbreeding and kinship coefficients for the three samples are presented in table 1. Two smaller subsamples were also considered, to evaluate the effect of sample size on the type I error. Sample 4 consisted of the 76 cases from sample 3 and 76 controls randomly drawn from the 434 corresponding controls. Sample 5 consisted of 30 cases randomly drawn from the 76 cases in sample 3 and 30 controls randomly drawn from the corresponding 434 controls.

Table 1.

Mean Inbreeding and Kinship Coefficients in Cases and in Controls from Three Hutterite Samples

Sample Size KinshipCoefficient InbreedingCoefficient
+SPT:
 Cases 310 .0436 .0363
 Controls 391 .0419 .0321
BHR:
 Cases 156 .0423 .0337
 Controls 434 .0426 .0348
Asthma:
 Cases 76 .0445 .0336
 Controls 434 .0426 .0348

Genotype information for markers unlinked to the phenotypes under study was simulated by randomly drawing alleles for the founders of the 1,623-person pedigree with fixed allele frequencies and then simulating the Mendelian transmission of these alleles throughout the pedigree. The validity of the χ2 null distribution was assessed by comparing the proportion of simulations showing a statistic whose value is greater than the χ2 threshold for a nominal type I error and the value of this nominal type I error.

Power of the WQLS and Wχ2corr Statistics

We show, in appendix C, that, under certain regularity conditions, WQLS is asymptotically the locally most powerful test of the W class of linear tests described earlier. We provide here analytical power calculations for WQLS and Wχ2corr to quantify the difference between the two tests. The basic assumption underlying these calculations is that, under the alternative hypothesis H1, both WQLS and Wχ2corr have a noncentral χ2 distribution with the respective noncentrality parameters λQLS and λχ2corr. (For instance, this would hold asymptotically for local alternatives [i.e., alternatives that are close to the null] under certain conditions on K.) λQLS is obtained by calculating WQLS with Y replaced by EH1(Y), where EH1(Y) is the expectation of Y under the alternative hypothesis H1. Similarly, λχ2corr is equal to Wχ2corr computed with Y replaced by EH1(Y). We focus, in what follows, on the biallelic case. Using the expression of WQLS as a function of SQLS given earlier, we have

graphic file with name AJHGv73p612df12.jpg

The alternative presented in (3) may also be written as EH1(Y)=pDp+rDr. Thus,

graphic file with name AJHGv73p612df13.jpg

Similarly, it comes from eq. (8) that

graphic file with name AJHGv73p612df14.jpg

For fixed values of (p,r) defining an alternative, the power of WQLS and Wχ2corr for a nominal type I error α is βQLS=1-ℛλQLS,1(Kα,1) and βχ2corr=1-ℛλχ2corr,1(Kα,1), respectively, where Kα,1 is the upper αth quantile of a χ21 distribution and where λ,1 is the distribution function of a noncentral χ21 with noncentrality parameter λ.

Testing Candidate Genes for Asthma in the Hutterites

The WQLS and the Wχ2corr statistics were used to test the association between asthma, BHR, and atopy and 48 biallelic markers located in 35 different genes that were selected because of their known or suspected role in the inflammatory process. Typing was done by multiplex PCR and an immobilized-probe linear-array system (LAS) (Mirel et al. 2002). These genes (n = number of polymorphisms per gene) included interleukin 4 (IL4 [MIM 147780], n=1), interleukin 4 receptor α chain (IL4RA [MIM 147730], n=3), interleukin 13 (IL13 [MIM 147683], n=1), β2-adrenergic receptor (ADRB2 [MIM 109690], n=3), intercellular adhesion molecule 1 (ICAM1 [MIM 147840], n=2), vascular cell adhesion molecule 1 (VCAM1 [MIM 192225], n=1), E-selectin (SELE [MIM 131210], n=1), P-selectin (SELP [MIM 173610], n=2), Fcε receptor β chain (FCERB1 [MIM 147138], n=1), monocyte differentiation antigen CD14 (CD14 [MIM 158120], n=1), uteroglobin (UGB [MIM 192020], n=1), transforming growth factor β1 (TGFB1 [MIM 190180], n=1), Eotaxin (SCYA11 [MIM 601156], n=2), chemokine receptor 2 (CCR2 [MIM 601267], n=1), chemokine receptor 3 (CCR3 [MIM 601268], n=1), chemokine receptor 5 (CCR5 [MIM 601373], n=2), T cell–specific transcription factor 7 (TCF7 [MIM 189908], n=1), interleukin 9 (IL9 [MIM 146931], n=1), interleukin 1 α chain (IL1A [MIM 147760], n=1), interleukin 1 β chain (IL1B [MIM 147720], n=2), interleukin 5 receptor α chain (IL5RA [MIM 147851], n=1), interleukin 6 (IL6 [MIM 147620], n=2), interleukin 10 (IL10 [MIM 124092], n=1), complement 3 (C3 [MIM 120700], n=1), complement 5 (C5 [MIM 120900], n=1), colony-stimulating factor 2 (CSF2 [MIM 138960], n=1), cytotoxic T lymphocyte–associated protein (CTLA4 [MIM 123890], n=2), leukotriene C4 synthase (LTC4S [MIM 246530], n=1), nitrous oxide synthetase 3 (NOS3 [MIM 163729], n=2), nitrous oxide synthetase 2A (NOS2A [MIM 163730], n=1), stromal cell–derived factor 1 (SDF1 [MIM 600835], n=1), lymphotoxin α (LTA [MIM 153440], n=1), tumor necrosis factor (TNF [MIM 191160], n=2), vitamin D receptor (VDR [MIM 601769], n=1), and group-specific component (GC [MIM 139200], n=1). Full descriptions of the polymorphisms included in this study are available at the authors' Web site (Association Studies in Hutterites). The samples 1, 2, and 3 (described in the “Null Distribution of the WQLS and Wχ2corr Statistics” subsection above) corresponding to the three phenotypes—atopy, BHR, and asthma, respectively—were used to conduct the analysis. The association between these 48 SNPs and the three phenotypes was also tested in the Hutterites, using the transmission/disequilibrium test (TDT), as implemented in ASPEX (Hinds and Risch 1999), considering all the cases for which parental genotypes were available.

Results

Null Distribution of the WQLS and Wχ2corr Statistics

Table 2 presents the results of simulation studies in samples 1 (+SPT), 2 (BHR), and 3 (asthma), assessing the empirical type I error of both the QLS and the corrected χ2 statistics when the χ2 distribution is used as the null distribution of the statistics. Four different choices of allele-frequency distribution are considered. The results of the noncorrected χ2 test are also displayed, to highlight the increase in type I error when interindividual correlations are neglected. For a given sample, this increase shows nonnegligible variation from one allele case to another, even though the correction factor ρχ2corr does not depend on allele frequency. This presumably reflects the fact that Wχ2corr is only χ2 distributed and that, in finite samples, the accuracy of the χ2 approximation varies slightly, depending on the allele-frequency distribution. In our largest sample, sample 1, the real type I error for the noncorrected χ2 with a nominal P value of 5% may be as large as 18% in the three-allele case. For both the QLS and the corrected χ2 tests, the nominal type I error lies within the 95% CI of the real type I errors in all three samples for two of the three biallelic situations considered and for the triallelic situation. Similar results are obtained for the two smaller Hutterite subsamples, as can be seen in table 3. When the allele frequency becomes low (0.05), the χ2 approximation seems to be slightly conservative or anticonservative for both tests in all five samples, a deviation possibly due to small numbers of observations of the minor allele when its frequency is low. Even though not exhaustive, these results tend to confirm that the χ2 distribution is a reasonably good approximation of the null distribution of these two tests, as long as neither the allele frequency nor the sample size is too small. The sample size and allele frequency required for the χ2 distribution to hold depend on the relationship among the individuals of the sample and are likely to differ from one population sample to another. We would recommend that exact simulations be performed for confirmation in case of a significant result associated with a small number of alleles in the case and/or the control sample.

Table 2.

Empirical Type I Error of the QLS Test, the Corrected Case-Control χ2 Test, and the Noncorrected χ2 Test Estimated with 5,000 Simulations in Three Hutterite Samples[Note]

Empirical Type I Error with Nominal Type I Error of
.05
.01
Number of Alleles (Frequency) and Test +SPT BHR Asthma +SPT BHR Asthma
2 (.5/.5):
 QLS .052 .046 .055 .013 .011 .012
 Corrected χ2 .055 .050 .054 .009 .013 .011
 Noncorrected χ2 .145
.071
.071
.054
.021
.019
2 (.2/.8):
 QLS .045 .050 .047 .010 .010 .010
 Corrected χ2 .055 .056 .049 .010 .010 .010
 Noncorrected χ2 .140
.072
.069
.054
.016
.017
2 (.05/.95):
 QLS .041
.038
.045 .013 .013 .014
 Corrected χ2 .045 .050 .042
.008 .008 .012
 Noncorrected χ2 .123
.066
.060
.043
.043
.016
3 (.3/.3/.4):
 QLS .055 .055 .056 .013 .011 .012
 Corrected χ2 .054 .055 .051 .010 .014
.012
 Noncorrected χ2 .180
.082
.079
.073
.020
.020

Note.— Values outside the 95% CI of the nominal type I error are underlined.

Table 3.

Empirical Type I Error of the QLS Test and the Corrected χ2 Test Estimated with 5,000 Simulations, for a Nominal Type I Error of .01 in Smaller Hutterite Samples and for Biallelic Markers[Note]

Empirical Type I Error in
76 Cases/76 Controlswith Allele Frequency of
30 Cases/30 Controlswith Allele Frequency of
Test .5 .2 .05 .5 .2 .05
QLS .012 .012 .019
.012 .01 .01
Corrected χ2 .012 .012 .004
.012 .009 .0024

Note.— Values outside the 95% CI of the nominal type I error are underlined.

Power of the WQLS and Wχ2corr Statistics

Figure 1 displays the power of both the WQLS and Wχ2corr statistics based on the analytical power calculations, for various alternatives defined by (p,r) pairs (allele frequency in the controls and difference in allele frequency between cases and controls), using the three different Hutterite samples described above. Note that these power calculations are expected to be more accurate for smaller values of r, which represent alternative models close to the null.

Figure 1.

Figure  1

Power with a 5% nominal type I error of the QLS test (solid lines) and the corrected χ2 test (dotted lines) for different alternative models defined by (p, r) pairs in three different Hutterite case-control samples: +SPT, BHR, and asthma. Power is presented as a function of r for two different values of p.

The corresponding noncentrality parameter ratios (λQLS/λχ2corr) are 4.01 for sample 1, 2.36 for sample 2, and 2.14 for sample 3. These ratios depend only on the sample composition (i.e., the actual choice of cases and controls) and not on the alternative model, as can be seen from eqs. (10) and (11). In every situation considered, the approximate power calculated for WQLS is higher than that calculated for Wχ2corr, and, in fact, we show in appendix C that λQLS⩾λχ2corr. The difference in power between the two tests tends to become smaller for small values of r, but the gain in power when using WQLS instead of Wχ2corr remains nonnegligible. This point is of particular interest, because small values of r are more likely to occur in real data sets. Indeed, the difference in allele frequencies between cases and controls would reach its maximum when the marker is the functional variant. However, the most common situation is that the marker has an allele that is in linkage disequilibrium with the functional variant, corresponding to smaller values of r.

Testing Candidate Genes for Asthma in the Hutterites

The results for all the 48 SNPs with a noncorrected P value <.05 for at least one of the three tests (QLS test, corrected χ2 test, or TDT) are presented in table 4 for asthma, BHR, and atopy. As expected, the number of observations available for the two case-control tests is larger than that for the TDT. Although, for example, 269 atopy cases and 323 controls are genotyped for the Val640Leu amino acid polymorphism in SELP (SELP_640), only 136 heterozygous parents are available for the TDT. Two association signals reached the 5% significance threshold after adjustment for 105 tests (35 different genes tested for three phenotypes), using the Bonferroni correction (P=.000476): SELP_640 and atopy, using the QLS test, and SCYA11_−1328 and BHR, using the corrected χ2 test. If we consider the uncorrected threshold of .05, association signals were detected when the QLS test was used, with 15 other polymorphisms when the corrected χ2 test was used but with only 3 polymorphisms when the TDT was used and none of the polymorphisms reached the significance threshold. Interestingly, the smallest P values obtained using the corrected χ2 test and the QLS test were not systematically observed at the same loci. Except for the associations between CSF2_117 and atopy, which had a P value <.01 with both tests (P=.0087 with the corrected χ2 test and P=.0038 with the QLS test) and between NOS3_298 and asthma, which had a P value close to .01 (P=.0084 with the corrected χ2 test and P=.011 with the QLS test), all other association signals with a P value <.01 were observed with only one of the two tests (SCYA11_−1328 and asthma, SCYA11_−1328 and BHR, and SDF1_3UTR and asthma, using the corrected χ2 test; LTC4S_444 and BHR, C3_102 and atopy, and SELP_640 and atopy, using the QLS test). Even though the QLS test is locally more powerful than the corrected χ2 test under certain regularity conditions, the probability that the corrected χ2 test provides a smaller P value than the QLS test in any particular case is not negligible. Furthermore, we have not studied the power of both tests when the alternative is not local (e.g., in the event of strong difference in allele frequency between cases and controls; note that analytic power calculations are not feasible for that case). In particular, the results shown on figure 1 were valid for values of r close to 0. We cannot rule out the possibility that the corrected χ2 test might perform better for some alternatives, as suggested by the results of our data analysis.

Table 4.

Results of Association Studies with Asthma, BHR, and Atopy, Using the QLS Test, the Corrected χ2 Test, and the TDT[Note]

P
Phenotypeand RS Number Marker Nc Nt p χ2corr QLS TDT TR:NT
Asthma:
 1801157 SDF1_3UTRa 65 357 .64 .00087
.037
.032
22:9
 4795895 SCYA11_−1328b 65 362 .79 .0096
.14 .38 14:20
 1799983 NOS3_298c 72 404 .52 .0084
.011
.89 26:24
 25882 CSF2_117d 66 372 .84 .037
.04
.83 11:9
 17611 C5_802e 66 362 .53 .034
.04
1.0 21:20
 1800872 IL10_−571f 65 366 .55 .033
.077 .65 18:22
 1042713 ADRB2_16g 71 400 .64 .025
.97 1.0 27:27
BHR:
 361525 TNF_−238h 147 409 .87 .043
.46 .74 23:20
 4795895 SCYA11_−1328b 135 262 .79 .00038
.099 .33 29:38
 1799983 NOS3_298c 144 404 .52 .013
.018
.26 55:43
 1137933 NOS2A_346i 136 370 .76 .046
.12 .61 28:33
 730012 LTC4S_−444j 133 354 .8 .06 .002
.47 22:28
 25882 CSF2_117d 136 372 .83 .21 .04
.4 29:22
 1800872 IL10_−571f 134 366 .56 .035
.19 .19 42:45
 6133 SELP_640k 137 372 .82 .11 .012
.027
43:24
Atopy:
 2569190 CD14_−260l 259 319 .61 .27 .03
.2 87:70
 1800779 NOS3_−922m 285 361 .56 .36 .034
1.0 89:89
 5742909 CTLA4_−318n 261 320 .77 .013
.013
.51 59:67
 25882 CSF2_117d 265 330 .81 .0087
.0038
.05 63:42
 2230199 C3_102o 238 297 .72 .64 .0049
.23 46:34
 2290608 IL5R_−80p 258 326 .93 .028
.074 .45 34:27
 6133 SELP_640k 269 323 .82 .068 .000002
.0014
87:49
 1041163 VCAM1_−1594q 271 330 .91 .49 .017
.48 40:33

Note.— SNPs with an associated P value <.05 for at least one of the three tests are reported, and P values <.05 are underlined. Descriptions of the SNPs can be found on the dbSNP Home Page, using their reference SNP (RS) number. The associated alleles are underlined in the footnotes below. The number of genotypes available in cases (Nc) and controls (Nt), the major allele frequency in the case-control sample as a whole (p) and the number of transmitted:nontransmitted major alleles in the TDT sample (TR:NT) are displayed.

a

G→A in 3′ UTR in SDF1.

b

G→A in promoter region position −1328 in SCYA11 (eotaxin).

c

Glu 298Asp in NOS3.

d

Ile 117Thr in CSF2.

e

Val 802Ile in C5.

f

C→A in promoter region position −571 in IL10.

g

Gly16 Arg in ADRB2.

h

G→A in promoter region position −238 in TNF.

i

C→T synonymous change (Asp346) in NOS2A.

j

A→C in promoter region position −444 in LTC4S.

k

Val640Leu in SELP.

l

C→T in promoter region position −260 in CD14.

m

A→G in promoter region position −922 in NOS3.

n

T→C in promoter region position −318 in CTLA4.

o

Arg102Gly in C3.

p

G→A in promoter region position −80 in IL5R.

q

T→C in promoter region position −1594 in VCAM1.

Discussion

Recent progress in unraveling the genetic complexity of common diseases suggests that susceptibility is due to numerous genetic factors with modest effects. In this context, the study of isolated populations with negligible migration will continue to be important, because the relative genetic (and often environmental) homogeneity may result in less complex underlying models of susceptibility. In such studies, however, limited sample sizes can be a serious problem. Making use of all affected individuals, not only those for whom parents are available (as in the TDT), will increase the power to detect susceptibility loci. However, to use case-control association tests, rather than family-based association tests, one needs to correct for the relatedness among individuals. In populations with known genealogy, it is preferable to use this information. This is the rationale for developing the QLS test described in the present article. Indeed, making use of the quasi-likelihood framework, we were able to derive a valid test for allelic association in the presence of strong but known correlations among alleles. We showed that this approach may be more powerful than simply correcting the variance of the χ2 test under certain conditions and is asymptotically the locally most powerful test in a general class of linear tests. Furthermore, we detected a highly significant association by use of this test.

Recent attempts to correct association tests either for unknown population stratification and cryptic relatedness (Devlin and Roeder 1999) or for the sampling of related subjects in outbred populations (Slager and Schaid 2001) used the Armitage trend test (Armitage 1955), which is genotype based rather than allele based. Indeed, as shown by Sasieni (1997) and further explored by Devlin and Roeder (1999), both the Armitage trend test and the allele-based test contrast allelic frequencies between cases and controls, while considering an additive effect for alleles. In addition, the Armitage trend test corrects for possible departure from Hardy-Weinberg equilibrium in the sample, whereas the allelic test does not. Apart from genotyping errors, departures from Hardy-Weinberg equilibrium in isolated populations such as the Hutterites are mainly due to nonnegligible inbreeding (Bourgain et al. 2002). The QLS test presented here, which is performed conditional on the pedigree structure and explicitly models inbreeding, is thus likely to be a correct test for allelic association even though it is performed at the level of alleles rather than genotypes. We showed how the QLS test may also be used in outbred populations when relatives of any kind are sampled. We should stress that, for this approach to be correct in outbred pedigrees, no departure from Hardy-Weinberg equilibrium should be observed at the loci under study. We did not compare the power of the Slager and Schaid (2001) approach with ours, because we focused on pedigrees that are too complex to be handled by their method. We believe that in simpler pedigrees the Slager and Schaid (2001) approach might perform better than the corrected χ2 test presented here, because the former method uses a corrected variance, computed conditional on the IBD information obtained from the pedigree data, whereas our method uses an unconditional corrected variance (exact computation of the conditional variance is infeasible in complex pedigrees). With regard to the comparison between the corrected χ2 test and the QLS test in small pedigrees, it is not obvious which test would be more powerful. The outcome might depend on the kind of correlations among the individuals in the sample, as well as on the informativeness of the markers used in the analysis. We note that, if desired, a QLS-type version of the Slager and Schaid (2001) approach could be performed, in small pedigrees, that should be more powerful than both our QLS approach and their approach.

The controversy as to whether case-control tests are preferable to family-based association tests has been ongoing for several years (Morton and Collins 1998; Risch and Teng 1998; McGinnis 2000). In the present article, we did not formally compare the power of the QLS test and the TDT. We note that the TDT can be expressed as a conditional-likelihood–score test (Clayton 1999), whereas our approach is an unconditional QLS test, which can be viewed as an approximation to the unconditional-likelihood–score test. The unconditional approach would be expected to perform better in the absence of confounding population substructure. However, the formal comparison of these two tests is not straightforward, because the QLS test is only an approximation to the unconditional-likelihood–score test. Furthermore, the correction for relatedness among individuals may be interpreted as a reduction of the effective sample size. Indeed, the weight associated with an allele in the WQLS statistic decreases as the amount of relatedness of this allele with other alleles of the sample increases. The difference in effective sample size between the QLS test and the TDT is thus not as large as it might first appear. No general statement on power comparison can be made from our analysis of real data, because none of the genes investigated have been definitively established as risk factors for asthma, BHR, or atopy. Nonetheless, no associations were detected by the TDT that were not detected by the QLS test, and, in each case, the association signal was stronger when the QLS test was used. A number of quite significant associations would have been missed if we used only the TDT.

The most significant association among the 48 markers examined in the present study was between a polymorphism at amino acid 640 (ValLeu) in SELP and atopy, detected by the QLS test (P=.000002) and the TDT (P=.0014). Although associations between polymorphisms in SELP and asthma-related phenotypes have not been reported previously, P-selectin is an outstanding functional candidate. Indeed, P-selectin is an adhesion molecule expressed on the surface of activated platelets and endothelial cells. It contributes to both bronchoconstriction and inflammation in murine models of allergic airway reactivity (Lukacs et al. 2002). The results of this study indicate that the common Val630 allele is a risk allele for atopy. The other significant association observed in this study was between a promoter polymorphism in SCYA11 (−1328GA) and BHR, detected by the corrected χ2 test (P=.000383). The −1328A allele was significantly associated with BHR and with asthma (P=.00956), although the latter association likely reflects the fact that our definition of asthma included BHR. The SCYA11 gene encodes eotaxin, the predominant eosinophil chemoattractant involved in allergic inflammation. Another variant in the promoter region of this gene was associated with IgE level in patients with atopic dermatitis (AD) but not with AD itself or asthma (Tsunemi et al. 2002). Thus, variation in this gene may influence a variety of atopic phenotypes. Other novel associations identified in the present study are between asthma and the 880G allele in the 3′ UTR of SDF1 and between atopy and the 102Gly allele in C3. These genes are both good functional candidates for asthma-related phenotypes. SDF1 encodes a small chemokine (C-K-C motif) that is a highly potent lymphocyte chemoattractant and is the principal ligand for CXCR4, which is also a coreceptor for CD4. Furthermore, the 3′ UTR polymorphism investigated in this study has been associated with delayed progression to AIDS (Winkler et al. 1998), suggesting that its role as a viral receptor might influence asthma susceptibility. C3 deficiency in an allergen-induced model of airway allergy was associated with diminished airway responsiveness and lung eosinophilia (Drouin et al. 2001). Thus, variation in this gene may influence allergic responses, as indicated by our study. Most of the remaining associations detected in the present study, including many of the more modest associations, have been reported elsewhere for the same or related phenotypes (Genetic Association Database.

Finally, we believe that the two tests presented here are not only of general interest for studies involving related individuals but may also be particularly interesting tools to take full advantage of founder populations for gene mapping of complex traits.

Acknowledgments

The authors acknowledge Dr. Mark Abney for discussion and critical comments. This study was supported by National Institutes of Health grants HL56399, HL63533, HG001645, and DK55889 and an Institut National de la Santé et de la Recherche Medicale postdoctoral fellowship (to C.B.).

Appendix A : Expression of WQLS in the Multiallelic Case

Notation: Given the n×m matrix A with (i,j)th element aij and the p×q matrix B with (i,j)th element bij, their Kronecker product, denoted AB, is the np×mq matrix with block structure

graphic file with name AJHGv73p612df15.jpg

Consider a locus with a distinct alleles. Define p=(p1,…pa-1)T as the (a-1)-vector of control allele frequencies, and define r=(r1,…ra-1)T as the (a-1)-vector of differences in allele frequencies between cases and controls. Our model stipulates that under the null hypothesis, r=r0=0, in which case we obtain E0(Y)=μ0=pDp and Inline graphic with K defined in eq. (1) and F defined in eq. (6). Our model for the alternative hypothesis specifies E(Y)=μ=pDp+rDr, and, as in the biallelic case, we allow Var(Y)=Ω to depend on both r and p, provided that, when r=0, Ω=Σ and that Ω be differentiable and invertible. Let Dπ=∂μ/∂p and Dρ=∂μ/∂r. Dπ and Dρ are N(a-1)×(a-1) matrices with the kth column equal to ∂μ/∂pk and ∂μ/∂rk, respectively. Then Dπ=Ia-1Dp and Dρ=Ia-1Dr, where Ia-1 is the identity matrix of dimension (a-1). Ur(r0,p)=DTρΣ-1(Y0) becomes Ur(r0,p)=(Ia-1Dr)T(FK)-1(Y-pDp). From the properties of the Kronecker product (e.g., see Schott 1996), it follows that Inline graphic and Inline graphic.

Similarly, Inline graphic becomes Inline graphic. Then, evaluating Inline graphic at Inline graphic (the quasi-likelihood estimator of p when r=r0), we get

graphic file with name AJHGv73p612df16.jpg

where Inline graphic corresponds to matrix F-1 evaluated at Inline graphic. Eq. (7) is easily derived from the latter expression of WQLS.

Appendix B : General Framework for the QLS and the χ2 Test in the Multiallelic Case

In the multiallelic case, using the framework presented in appendix A, we note that Inline graphic. Define VQLS=Ia-1V1 with V1 given in the text and let SQLS=VTQLSY. We note that Inline graphic. We have Inline graphic. Because p is unknown, we evaluate Inline graphic at Inline graphic to obtain Inline graphic. Then Inline graphic. Note that Inline graphic, which matches the formula for Inline graphic given in appendix A once Inline graphic is substituted in for p. Thus, Inline graphic.

A possible expression for the classical χ2 test for case-control association in the biallelic case is

graphic file with name AJHGv73p612df501.jpg

with Inline graphic. We note that Tχ2 may also be written

graphic file with name AJHGv73p612df502.jpg

with Sχ2=[DTr-(DTrDp)(DTpDp)-1DTp]Y=VT2Y. When the correlations among all the individuals as well as between the two alleles of an individual are zero then the variance of Y when r=r0 is Inline graphic where IN is the N×N identity matrix. Thus, Inline graphic. If Inline graphic is used as an estimator of p to compute Inline graphic, then

graphic file with name AJHGv73p612df503.jpg

Similarly, in the multiallelic case, Inline graphic. Thus, Sχ2=VTχ2Y with Vχ2=Ia-1V2 and Inline graphic. Finally,

graphic file with name AJHGv73p612df504.jpg

Appendix C : Proof That the QLS Test Is Asymptotically the Locally Most Powerful Test of the W Class

We focus here on the biallelic situation. Consider the statistics of the W class described in the text, that is, Inline graphic with S=VTY, V≠0 and known, E0(S)=0. Under the alternative hypothesis, EH1(Y)=pDp+rDr. Under the null hypothesis, r=0. E0(S)=0 implies that pVTDp=0, which must hold for any p, so that VTDp=0. As described in the text, Inline graphic, where K is the correlation matrix described in eq. (1). Thus, Inline graphic and

graphic file with name AJHGv73p612df505.jpg

If we assume that W has a χ2 distribution under the null hypothesis and a noncentral χ2 distribution with noncentrality parameter λ under local alternatives, the locally most powerful statistic of this class is the one maximizing the value of λ over all V≠0, such that VTDp=0. By definition Inline graphic is the expectation of Inline graphic under the alternative hypothesis; thus,

graphic file with name AJHGv73p612df506.jpg

In practice, p is a nuisance parameter, and we use its estimator Inline graphic. Under certain regularity conditions, this is asymptotically equivalent to the case in which the true p is used. In what follows, we use the true value of p and will thus derive the asymptotic value of λ:

graphic file with name AJHGv73p612df507.jpg

Maximizing λ is thus equivalent to maximizing

graphic file with name AJHGv73p612df508.jpg

over all V≠0, such that VTDp=0.

We first consider a modified version of the problem by maximizing

graphic file with name AJHGv73p612df509.jpg

over all ω≠0 such that ωTR=0 with Z and R being N-vectors and R≠0. By definition, TZ|=|ω||Z||cos(ω,Z))|, where θ(ω,Z) is the angle between ω and Z. Thus, m=|Z||cos(ω,Z))|, and we need only to maximize cos(ω,Z)) over all ω≠0 subject to ωTR=0. By geometry, the maximizing ω, ωmax, is any scalar multiple of the projection of Z onto the subspace orthogonal to R. Thus, ωmax∝(IN-R(RTR)-1RT)Z, where IN is the N×N identity matrix (e.g., see Schott 1996).

We consider now the initial problem. K being symmetric positive definite, we can derive its Cholesky decomposition K=CTC, where C is an invertible upper triangular matrix. Define ω by ω=CV. Because C is invertible, V≠0⇔ω≠0. Furthermore, VTDp=0⇔ωTC-TDp=0, and we have

graphic file with name AJHGv73p612df500.jpg

Define R=C-TDp and Z=C-TDr. We get from the previous modified version of the problem that

Inline graphic and the corresponding V is

Inline graphic. Note that VmaxVQLS so that

graphic file with name AJHGv73p612df510.jpg

It follows that, when p is known, the noncentrality parameter for WQLS is larger than that for any other statistic in the W class. When p is unknown, we argue that replacement of p by

graphic file with name AJHGv73p612df666.jpg

makes WQLS asymptotically the locally most powerful test in the W class. Note that all our data are correlated with an n×n correlation matrix K. Thus, whether or not we have (i) W asymptotically χ2 under the null and (ii) noncentral χ2 for local alternatives and (iii)

graphic file with name AJHGv73p612df667.jpg

for local alternatives, will depend on assumptions about how K behaves as n→∞. Subject to regularity conditions on K ensuring that statements (i), (ii), and (iii) hold, then WQLS will be asymptotically the locally most powerful test in the W class.

Electronic-Database Information

URLs for data presented herein are as follows:

  1. Association Studies in Hutterites, http://www.genes.uchicago.edu/hutterite/inflasnps/asthma (for full descriptions of the polymorphisms included in this study)
  2. dbSNP Home Page, http://www.ncbi.nlm.nih.gov/SNP/index.html
  3. Genetic Association Database, http://geneticassociationdb.nih.gov/
  4. Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/Omim/ (for IL4, IL4RA, IL13, ADRB2, ICAM1, VCAM1, SELE, SELP, FCERB1, CD14, UGB, TGFB1, SCYA11, CCR2, CCR3, CCR5, TCF7, IL9, IL1A, IL1B, IL5RA, IL6, IL10, C3, C5, CSF2, CTLA4, LTC4S, NOS3, NOS2A, SDF1, LTA, TNF, VDR, and GC)

References

  1. Abney M, Ober C, McPeek MS (2002) Quantitative-trait homozygosity and association mapping and empirical genome-wide significance in large, complex pedigrees: fasting serum-insulin level in the Hutterites. Am J Hum Genet 70:920–934 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Armitage P (1955) Tests for linear trends in proportions and frequencies. Biometrics 11:375–386 [Google Scholar]
  3. Bourgain C, Newmann D, Ober C, McPeek MS (2002) Performing classical statistical genetic tests in founder populations. Am J Hum Genet Suppl 71:177 [Google Scholar]
  4. Boyce AJ (1983) Computation of inbreeding and kinship coefficients on extended pedigrees. J Hered 74:400–404 [Google Scholar]
  5. Clayton D (1999) A generalization of the transmission/disequilibrium test for uncertain-haplotype transmission. Am J Hum Genet 65:1170–1177 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Cox DR, Hinkley DV (1974) Theoretical statistics. Chapman and Hall, London [Google Scholar]
  7. Devlin B, Roeder K (1999) Genomic control for association studies. Biometrics 55:997–1004 [DOI] [PubMed] [Google Scholar]
  8. Drouin SM, Kildsgaard J, Haviland J, Zabner J, Jia HP, McCray PB, Tack BF, Wetsel RA (2001) Expression of the complement anaphylatoxin C3a and C5a receptors on bronchial epithelial and smooth muscle cells in models of sepsis and asthma. J Immunol 166:2025–2032 [DOI] [PubMed] [Google Scholar]
  9. Heyde C (1997) Quasi-likelihood and its application: a general approach to optimal parameter estimation. Springer, New York [Google Scholar]
  10. Hinds D, Risch N (1996) The ASPEX package: affected sib pair exclusion mapping. ftp://lahmed.stanford.edu/pub/aspex/
  11. Lukacs NW, John A, Berlin A, Bullard DC, Knibbs R, Stoolman LM (2002) E- and P-selectins are essential for the development of cockroach allergen–induced airway responses. J Immunol 169:2120–2125 [DOI] [PubMed] [Google Scholar]
  12. McCullagh P, Nelder JA (1989) Generalized linear models, 2nd ed. Chapman and Hall, London [Google Scholar]
  13. McGinnis R (2000) General equations for Pt, Ps, and the power of the TDT and the affected–sib-pair test. Am J Hum Genet 67:1340–1347 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Mirel DB, Valdes AM, Lazzeroni LC, Reynolds RL, Erlich HA, Noble JA (2002) Association of IL4R haplotypes with type 1 diabetes. Diabetes 51:3336–3341 [DOI] [PubMed] [Google Scholar]
  15. Morton NE, Collins A (1998) Tests and estimates of allelic association in complex inheritance. Proc Natl Acad Sci USA 95:11389–11393 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Newman DL, Abney M, McPeek MS, Ober C, Cox NJ (2001) The importance of genealogy in determining genetic associations with complex traits. Am J Hum Genet 69:1146–1148 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Ober C, Leavitt SA, Tsalenko A, Howard TD, Hoki DM, Daniel R, Newman DL, Wu X, Parry R, Lester LA, Solway J, Blumenthal M, King RA, Xu J, Meyers DA, Bleecker ER, Cox NJ (2000) Variation in the interleukin 4–receptor α gene confers susceptibility to asthma and atopy in ethnically diverse populations. Am J Hum Genet 66:517–526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Risch N, Teng J (1998) The relative power of family-based and case-control designs for linkage disequilibrium studies of complex human diseases. I. DNA pooling. Genome Res 8:1273–1288 [DOI] [PubMed] [Google Scholar]
  19. Sasieni PD (1997) From genotypes to genes: doubling the sample size. Biometrics 53:1253–1261 [PubMed] [Google Scholar]
  20. Schott JR (1996) Matrix analysis for statistics. John Wiley, New York [Google Scholar]
  21. Slager SL, Schaid DJ (2001) Evaluation of candidate genes in case-control studies: a statistical method to account for related subjects. Am J Hum Genet 68:1457–1462 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Tsunemi Y, Saeki H, Nakamura K, Sekiya T, Hirai K, Fujita H, Asano N, Tanida Y, Kakinuma T, Wakugawa M, Torii H, Tamaki K (2002) Eotaxin gene single nucleotide polymorphisms in the promoter and exon regions are not associated with susceptibility to atopic dermatitis, but two of them in the promoter region are associated with serum IgE levels in patients with atopic dermatitis. J Dermatol Sci 29:222–228 [DOI] [PubMed] [Google Scholar]
  23. Winkler C, Modi W, Smith MW, Nelson GW, Wu X, Carrington M, Dean M, Honjo T, Tashiro K, Yabe D, Buchbinder S, Vittinghoff E, Goedert JJ, O’Brien TR, Jacobson LP, Detels R, Donfield S, Willoughby A, Gomperts E, Vlahov D, Phair J, O’Brien SJ (1998) Genetic restriction of AIDS pathogenesis by an SDF-1 chemokine gene variant: ALIVE Study, Hemophilia Growth and Development Study (HGDS), Multicenter AIDS Cohort Study (MACS), Multicenter Hemophilia Cohort Study (MHCS), San Francisco City Cohort (SFCC). Science 279:389–393 [DOI] [PubMed] [Google Scholar]
  24. Wu X, Ober C, McPeek MS (2000) A quasi-likelihood method for allele frequency estimation. Am J Hum Genet Suppl 67:A1810 [Google Scholar]

Articles from American Journal of Human Genetics are provided here courtesy of American Society of Human Genetics

RESOURCES