SUMMARY
A large number of linkage and association studies of complex diseases focus on analysis of a more common or more easily measured disease endophenotype. The motivation for this approach is that there is a pleiotropic locus common to both the disease and the endophenotype and that this locus is a major genetic determinant of the endophenotype. In this paper we determine the conditions in which the risk of the endophenotype in siblings of affected probands with disease equals the risk of the endophenotype in the offspring(parents) of affected parents(offspring) with disease. In doing so we prove that this equality holds if and only if the penetrance of either the endophenotype or the disease (but not necessarily both) is additive.
Keywords: linkage, sibpair, endophenotype, recurrence risk, additivity
INTRODUCTION
Linkage analysis based on affected sib pairs with disease has long been considered an effective and reliable approach for finding major disease loci. The power of such a linkage analysis, however, is determined by the extent to which a locus determines the disease of interest. Investigators have frequently estimated the power of these linkage analyses based on simulations. In these simulations they have assumed a major disease allele and set the penetrance values and gene frequencies to values that coincided with the disease prevalence and observed risks in siblings of affected individuals [SLINK Ott (1989) and Weeks et al. (1990)]. As an alternative to simulation, Risch (1990) derived some very straightforward and extremely useful expressions for numerically computing the power of a linkage analysis for a given number of affected sibling pairs that was based only on disease recurrence risk in siblings and the population risk. However these equations were derived based on the assumption that the disease recurrence risk in siblings equaled disease recurrence risk in parents and offspring of an affected proband. This latter assumption holds whenever the disease allele has an additive effect on the penetrance of the trait; that is, whenever the penetrance (f) of the disease for the individual with one disease allele (f1), was midway between the penetrance of the disease for those with no disease alleles (f0) and 2 disease alleles (f2) that is, f1 = (f0 + f2) / 2.
More recently, the study of disease related endophenotypes has been of interest with a popular design being to ascertain the family members through an affected proband with disease and to study the distribution of marker and endophenotype in sib pairs. With this in mind, Sung et al. (2009) derived numerical expressions for the power of a linkage analysis of the endophenotype based on sib pairs in which one sibling is affected with disease and the other is chosen at random. In this case, assuming that there was one pleiotropic allele that affected both disease and endophenotype, there were three additional penetrance values to be incorporated into the computation. Assuming that the penetrance values for the disease were lower than the penetrance values for the endophenotype, calculations from Sung et al. (2009) indicated (as expected) that for a given sample size a linkage analysis based on the disease endophenotype will have substantially greater power than one based on the disease itself.
Sung et al. (2009) also noted that similarly to Risch’s (1990) study, expressions for the power could also be derived and simplified greatly if the risk of the endophenotypes in offspring(parents) of affected parents(offspring) with disease equals risk of the endophenotypes in siblings of affected individuals with disease. Moreover, they noted that this finding holds if and only if either the penetrance of the endophenotype is additive or the penetrance of the disease is additive. It is not necessary that both sets of penetrance values be additive. The observations by Sung et al. (2009) were based on computer simulations. We now present an algebraic proof of these equalities which is considered to be a very useful finding for the field of human quantitative genetics that we have not to date noted in textbooks or other papers.
NOTATION AND ASSUMPTIONS
We consider a biallelic locus which is pleiotropic for a presence of both a disease (D+) and an abnormal endophenotype (E+). We denote by the letter A, the allele that increases the susceptibility to D+ and assume that A also increases the susceptibility to E+. We denote the allele frequency of A, by p. Thus, associated with this locus are six penetrance values, with fkt denoting the penetrance of the phenotype t (t=D+, E+) given one has the genotype with k copies (k = 0,1,2) of the disease/endophenotype susceptibility allele. We let Gk denote the genotype with k copies of the disease/endophenotype susceptibility allele and then let Md⊗m denote a parental mating type in which the father has genotype Gd and the mother has genotype Gm ( d , m = 0,1,2 ). We let equal 1 if the individual has the disease (abnormal endophenotype) and 0 otherwise for an individual with position R in a family. We set R = pro (proband) or sib (sibling of proband). We let KD+ and K E+ denote the population risks of D+ and E+, respectively. We also denote the risk of E+ conditional on an affected sibling with the disease as . Similarly we denote the risk of E+ in an offspring(parents) conditional on affected parent(offspring) with the disease as , and let denote the risk of E+ given a monozygotic twin has D+. Thus in terms of our notation:
(1) |
Well known results from quantitative genetics are the relationships between sibling recurrence risk, parent-offspring recurrence risk, monozygotic twin recurrence risk and the number of alleles shared IBD at the locus [Pak Sham (1998)]. We refer to this latter variable, which we denote by IL (where IL = 0,1,2 ), as ‘alleles IBD’. These relationships can be stated:
(2) |
(3) |
(4) |
Equations (2) and (4) are quite logical. That is a sibling who shares no loci IBD with the proband has the same risk of abnormal endophenotype as a random member of the population and a sibling who shares 2 alleles IBD has the same risk as a monozygotic cotwin of a proband (who also shares 2 alleles IBD). Equation (3) follows from the fact that parent and offspring always share exactly one at a locus. Thus we extend the logic of equations (2) and (4) to this situation.
We assume random mating with respect to D+ and E+. Thus, P(Md⊗m) = P(Gd) · P(Gm). We also assume that sibling genotypes conditional on parents’ genotypes are independent: . We assume Hardy Weinberg Equilibrium and obtain the values of P(Gk) for k = 0,1,2 as a function of p, the frequency of the disease/endophenotype predisposing allele, as P(G0) = (1− p)2 , P(G1) = 2p(1− p) P(G2) = p2 .
The values of and for h, k, d, m = 0,1,2 and simply the Mendelian probabilities of offspring genotypes conditional on parents’ genotypes. For completeness, these are given in Table 1.
TABLE 1.
Values for P () for Md⊗m = Gd ⊗ Gm, d, m, h = 0, 1 ,2
Md ⊗ m | h | |
---|---|---|
G0 ⊗G0 | 0 | 1 |
G0 ⊗G2 | 1 | 1 |
G2 ⊗G2 | 2 | 1 |
| ||
G0 ⊗G1 | 0,1 | 0.5 |
G1 ⊗G2 | 1,2 | 0.5 |
G1 ⊗G1 | 1 | 0.5 |
| ||
G1 ⊗G1 | 0,2 | 0.25 |
| ||
OTHERWISE | OTHERWISE | 0 |
RESULTING EQUATIONS
Expressions for and are obtained as a function of disease/endophenotype predisposing allele frequency (p) and the D+ and E+ penetrance values (fkt , k = 0,1,2 and t = D+, E+ ) as follows.
(5) |
From equation (3) we obtain
(6) |
Above we have noted that the disease population risk, and that P(IL = 1) = 0.5 to obtain equation (5) and (6). The conditional probabilities, , required to reduce the expressions in (6) are given in Table 2.
TABLE 2.
Values for P() for Md⊗m = Gd ⊗ Gm, d, m = 0,1,2
Parents’ Mating Type |
Proband and Sibling Genotypes | IL | ||
---|---|---|---|---|
Md ⊗ m | 0 | 1 | 2 | |
| ||||
G1 ⊗G0 | (G1, G0) | 0.5 | 0.5 | 0 |
(G1, G1), (G0, G0) | 0 | 0.5 | 0.5 | |
| ||||
G1 ⊗G2 | (G1, G2) | 0.5 | 0.5 | 0 |
(G1, G1), (G2, G2) | 0 | 0.5 | 0.5 | |
| ||||
G1 ⊗G1 | (G0, G0), (G2, G2) | 0 | 0 | 1 |
(G1, G0), (G1, G2) | 0 | 1 | 0 | |
(G0, G2) | 1 | 0 | 0 | |
(G1, G1) | 0.5 | 0 | 0.5 |
THE PROOF
As indicated in Table 2, we have except when both parents are heterozygous. Thus if we note that
(7) |
where
and
(8) |
where
then
(9) |
Equations (7) and (8) imply that what remains is to show an equivalence for the case where both parents are heterozygous. That is, we substitute the results of (9) into (7) and (8) and obtain the following expressions
(10) |
(11) |
Thus, if or equivalently , substituting into (10) and (11), we have
Therefore, if , then f0E+ − 2f1E+ + f2E+, or f0D+ − 2f1D+ + f2D+ = 0 , or equivalently f1t = (f0t + f2t) / 2 for either t = E+ or D+, but not necessarily both. Alternatively that is, if either the endophenotype is additive or disease is additive (but not necessarily both), we have .
DISCUSSION
It has been well known for quite a while that additivity in allele penetrance results in equality of parent/offspring recurrence risk and sibling recurrence risk [Risch (1990), Pak Sham (1998)]. However, this extension to two traits determined by a single pleiotropic locus did not necessarily follow and needed to be documented. What is particularly noteworthy is the finding that the parent/offspring recurrence risk equals the sibling recurrence risk if and only if either the penetrance of the disease is additive or the penetrance of the abnormal endophenotype is additive, but not necessarily both [Sung et al. (2009)]. It is also a potentially very useful result. The motivation for investigating a complex disease via an endophenotype is the idea that while many genes may contribute to the complex disease, the genetics of the abnormal endophenotype involves a major gene. The criteria for a disease endophenotype to be useful for genetic analysis have been defined in the literature [Gottesman and Gould (2003), Cannon and Keller (2006)] as follows: 1) The endophenotype should be heritable. 2) The endophenotype is associated with causes of the disease. 3) The endophenotype appears independently whether or not disease is in effect. 4) The endophenotype and disease cosegregate within families. 5) The endophenotype found in affected family members is found in nonaffected family members at a higher rate than in the general population. Given these criteria, one would not want to be confined to the assumption of additive penetrance in the case of the endophenotype. However, it might be reasonable to assume that this same gene may have a minor effect with an additive low disease penetrance value considering that the population risk of endophenotype is higher than the population risk of disease. Additionally, the multifactorial mode of inheritance which has been proposed for several complex diseases [Murphy and Chase (1975)] involves the assumption of many genes with small equal and additive effects. It is reassuring to note that additive penetrance in the case of the disease is sufficient in order to be able to assume equality of parent-offspring and siblings’ risk of the abnormal endophenotype in relatives of affected probands with disease. For example, Sung et al. (2009), noted that upon assuming this equality, the power of a linkage study based on the distribution of the endophenotype in siblings of D+ probands could be expressed as an explicit function of the difference between (the risk of abnormal endophenotype conditional on a sibling having the disease) and KE+ (the population risk of abnormal endophenotype).
The purpose of Sung et al. (2009) was to give a general expression for the calculation of the identity by decent distribution in siblings of probands, with and without the abnormal endophenotypes. Thus if the disease and the endophenotype are not additive one needs to use their general equations and/or have estimates of the risk of the endophenotype in offspring(parents) of disease affected parents(offspring) as well as the sibling risk of abnormal endophenotype. However, Sung et al (2009) did explore several situations where neither the disease nor the endophenotype has additive penetrance. They noted that for those cases where the risk of the abnormal endophenotype in offspring(parents) of the disease affected parents(offspring) did not equal the sibling risk, their findings on the relationship between power and risk difference, i.e. observed for the case of additive either disease or endophenotype penetrance were still observed for those situations where the pleiotropic allele frequency is less than 0.06.
ACKNOWLEDGEMENTS
Funding for this project was supported in part by the Division of Intramural Research, National Human Genome Research Institute, National Institutes of Health and also part by Division of Extramural Research by National Institute of Mental Health R01 44292, National Institute of Mental Health R01 071523, the National Alliance for Research on Schizophrenia and Depression (NARSAD) and by the Sidney R. Baer, Jr. Foundation. The authors truly appreciate Dr. Steven Matthysse’s suggestions and valuable comments through this work.
REFERENCES
- Cannon TD, Keller MC. Endophenotypes in the genetic analyses of mental disorders. Annu. Rev. Clin. Psychol. 2006;2:267–90. doi: 10.1146/annurev.clinpsy.2.022305.095232. [DOI] [PubMed] [Google Scholar]
- Gottesman II, Gould TD. The endophenotype concept in psychiatry: Etymology and strategic intentions. Am J Psychiatry. 2003;160(4):636–645. doi: 10.1176/appi.ajp.160.4.636. [DOI] [PubMed] [Google Scholar]
- Murphy EA, Chase GA. Principles of Genetic Counseling. Year Book Medical Publisher; Chicago: 1975. [Google Scholar]
- Ott J. Computer-simulation methods in human linkage analysis. Proc. Natl. Acad. Sci. USA. 1989;86:4175–4178. doi: 10.1073/pnas.86.11.4175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sham Pak. Statistics in human genetics. Edward Arnold; London: 1998. [Google Scholar]
- Risch N. Linkage strategies for genetically complex traits. II. The power of affected relative pairs. Am. J. Hum. Genet. 1990;46(2):229–41. [PMC free article] [PubMed] [Google Scholar]
- Sung H, et al. The power of linkage analysis of a disease-related endophenotype using asymmetrically ascertained sib pairs. Comput. Stat. Data. Anal. 2009;53(5):1829–1842. doi: 10.1016/j.csda.2008.08.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weeks DE, et al. SLINK: a general simulation program for linkage analysis. Am. J. Hum. Genet. 1990;47:A204. abstract. [Google Scholar]