Skip to main content
American Journal of Human Genetics logoLink to American Journal of Human Genetics
letter
. 2002 Feb;70(2):556–558. doi: 10.1086/338923

HLODs Remain Powerful Tools for Detection of Linkage in the Presence of Genetic Heterogeneity

Susan E Hodge 1, Veronica J Vieland 3, David A Greenberg 1,2
PMCID: PMC384930  PMID: 11791217

To the Editor:

Recently, Whittemore and Halpern (2001) investigated whether one can meaningfully estimate the admixture parameter α (their p) in the admixture LOD score (i.e., “HLOD”) when certain assumptions of HLODs are violated. They argued that such estimates of α are fundamentally problematic, a conclusion with which we agree. However, they then went on to suggest that, in such circumstances, investigators should not use HLODs to detect linkage in the presence of suspected heterogeneity (i.e., should not treat α as a “nuisance parameter”) but should, instead, use “nonparametric” methods. Unfortunately, they do not cite any evidence to support this final recommendation.

We are writing this letter because, in fact, there is a fair body of evidence, from numerous simulation studies, supporting the use of HLODs to detect linkage, even when the assumed heterogeneity model is incorrect. Some of these studies have been published; others are currently in press and were not available to Whittemore and Halpern (2001) when they did their work. We summarize some of these studies.

Several already-published studies have investigated various violations of HLOD model assumptions: Goldin (1992) compared the magnitude of HLODs to the magnitude of two-locus (2L) LODs in data sets in which many families were segregating both the linked and unlinked forms of the disease (these are called “mixed” families). The assumptions made by the 2L analyses exactly matched the reality, whereas the assumptions of the HLODs did not, because the latter assume, incorrectly, that every family has either the linked form or the unlinked form but not both. Goldin found that the HLODs were almost as high as the “correct” 2L LODs. This finding, in conjunction with the work of Abreu et al. (in press; see below), implies, in turn, that the power of the HLOD is also almost as good as the power of analyses performed under the correct model. Durner et al. (1992) made similar comparisons, in data sets with varying proportions of “mixed” families, but under generating models different than those which Goldin had used, and found similar results. Vieland et al. (2001) and Huang and Vieland (2001a) considered yet another violation of HLOD assumptions—namely, that proportions of linked families may differ across different data sets. They showed that, for affected-sib-pair (ASP) data, a simple adaptation of the HLOD maintained higher power than did both the homogeneity LOD and certain nonparametric tests (the ASP mean test, as well as Risch’s [1990] maximum LOD score under Holmans’s [1993] “possible triangle” constraint) in these situations.

We would also like to bring to readers’ attention several relevant just-published or not-yet-published studies: Greenberg and Abreu (in press) show that the multipoint HLOD has excellent power to detect linkage—and better power than that of the nonparametric NPL statistic of GENEHUNTER (Kruglyak et al. 1996). Their simulations include generating models, such as epistatic and additive models, that violate the assumptions of the HLOD. Vieland and Logue (in press) focus on another way in which the assumptions of the HLOD are commonly violated when the genetic models at the linked and unlinked loci differ. Their work shows, in agreement with that of Whittemore and Halpern (2001), that estimates of α are problematic, but simultaneously it indicates that the maximum HLOD provides a directly interpretable and powerful measure of the strength of evidence for linkage in a data set, despite the problems with α. Abreu et al. (in press) demonstrate that one does not pay much of a price in type I error by using HLODs to detect linkage—in most cases, considerably less than even “half” a degree of freedom to the corresponding asymptotic χ2 statistic; also see the work of Faraway (1993) and Huang and Vieland (2001b).

Of necessity, our list of studies is not exhaustive, and our summaries of the papers’ findings are oversimplified. For more details, interested readers can consult the papers. Also, we have mentioned only those papers that use HLODs. Another body of published work demonstrates that simple single-locus LOD scores have better power than nonparametric methods do—even without inclusion of the admixture parameter (e.g., see Durner and Greenberg 1992; Goldin and Weeks 1993; Abreu et al. 1999; Durner et al. 1999).

One more point: Whittemore and Halpern (2001) also advocate looking for heterogeneity by using “subgroup analyses”—that is, subdividing the families on the basis of known attributes that might be genetically relevant (e.g., age at onset, ethnic background, etc.) We agree wholeheartedly that this is a valuable approach when one has some idea of how to subdivide families. This approach has been dubbed the “predivided-sample,” as opposed to the “admixture” approach (Hodge et al. 1983; Ott 1983). But in a situation in which one suspects genetic heterogeneity but in which this heterogeneity does not appear to fall along ethnic and other lines, the admixture approach provides an essential tool. Whittemore and Halpern’s suggestion of waiting until we have identified the gene before concerning ourselves with heterogeneity does not take into account the effect of heterogeneity on identifying the gene’s location in the first place.

In conclusion, this letter should not be misconstrued as attacking Whittemore and Halpern’s (2001) findings concerning estimation of the admixture parameter. We agree with Whittemore and Halpern that interpretation of Inline graphic is problematic in many circumstances. However, as we have summarized here, despite the well-known and -recognized problems with parameter estimation, a good deal of evidence indicates that the HLOD can provide a robust and powerful tool for detection of linkage in the presence of heterogeneity, even when the assumptions of the HLOD are violated. Whittemore and Halpern (2001, p. 457) stated that “nevertheless, we do not recommend the use of parametric heterogeneity models in linkage analysis, even merely as a tool for increasing the statistical power to detect linkage. … because the assumptions required by these models cannot be verified, and their violation could actually decrease power.” Violation of assumptions “could” decrease power, but so far all the evidence is that the power is still greater than that of nonparametric methods. There may well be situations in which such a power loss could be serious, but so far we have not seen these situations, in contrast to numerous demonstrations that simple LOD scores, especially when they incorporate admixture, have good power to detect linkage in the presence of heterogeneity, including situations in which many different assumptions of the model are violated. Whittemore and Halpern’s recommendation not to use HLODs to detect linkage in these circumstances may have seemed reasonable in the light of the problems in interpretation of α, but that recommendation is not, in fact, supported by the evidence available so far. Certainly, this topic of robustness of HLODs could use more research. However, given the difficulties in dealing with complex diseases, as well as our need for as many good analytic tools as we can find, it would be a shame if readers of Whittemore and Halpern avoided this particular useful tool, the HLOD, unnecessarily.

Acknowledgments

The authors have received support from National Institutes of Health grants DK-31813, DK-31775, MH-48858, NS-27941, MH-52841, and K02-01432.

References

  1. Abreu PC, Greenberg DA, Hodge SE (1999) Direct power comparisons between simple LOD scores and NPL scores for linkage analysis in complex diseases. Am J Hum Genet 65:847–857 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Abreu PC, Hodge SE, Greenberg DA. Quantification of type I error probabilities for heterogeneity lod scores. Genet Epidemiol (in press) [DOI] [PubMed] [Google Scholar]
  3. Durner M, Greenberg DA (1992) Effect of heterogeneity and assumed mode of inheritance on lod scores. Am J Med Genet 42:271–275 [DOI] [PubMed] [Google Scholar]
  4. Durner M, Greenberg DA, Hodge SE (1992) Inter- and intrafamilial heterogeneity: effective sampling strategies and comparison of analysis methods. Am J Hum Genet 51:859–870 [PMC free article] [PubMed] [Google Scholar]
  5. Durner M, Vieland VJ, Greenberg DA (1999) Further evidence for the increased power of LOD scores compared with nonparametric methods. Am J Hum Genet 64:281–289 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Faraway JJ (1993) Distribution of the admixture test for the detection of linkage under heterogeneity. Genet Epidemiol 10:75–83 [DOI] [PubMed] [Google Scholar]
  7. Goldin LR (1992) Detection of linkage under heterogeneity: comparison of the two-locus vs. admixture models. Genet Epidemiol 9:61–66 [DOI] [PubMed] [Google Scholar]
  8. Goldin LR, Weeks DE (1993) Two-locus models of disease: comparison of likelihood and nonparametric linkage methods. Am J Hum Genet 53:908–915 [PMC free article] [PubMed] [Google Scholar]
  9. Greenberg DA, Abreu P (2001) Determining trait locus position from multipoint analysis: accuracy and power of three different statistics. Genet Epidemiol 21:299–314 [DOI] [PubMed] [Google Scholar]
  10. Hodge SE, Anderson CE, Neiswanger K, Sparkes RS, Rimoin DL (1983) The search for heterogeneity in insulin-dependent diabetes mellitus (IDDM): linkage studies, two-locus models, and genetic heterogeneity. Am J Hum Genet 35:1139–1155 [PMC free article] [PubMed] [Google Scholar]
  11. Holmans P (1993) Asymptotic properties of affected-sib-pair linkage analysis. Am J Hum Genet 52:362–374 [PMC free article] [PubMed] [Google Scholar]
  12. Huang J, Vieland VJ (2001a) Comparison of “model-free” and “model-based” linkage statistics in the presence of locus heterogeneity: single data set and multiple data set applications. Hum Hered 51:217–225 [DOI] [PubMed] [Google Scholar]
  13. ——— (2001b) The null distribution of the heterogeneity lod scores does depend on the assumed genetic model for the trait. Hum Hered 52:217–222 [DOI] [PubMed] [Google Scholar]
  14. Kruglyak L, Daly MJ, Reeve-Daly MP, Lander ES (1996) Parametric and nonparametric linkage analysis: a unified multipoint approach. Am J Hum Genet 58:1347–1363 [PMC free article] [PubMed] [Google Scholar]
  15. Ott J (1983) Linkage analysis and family classification under heterogeneity. Ann Hum Genet 47:311–320 [DOI] [PubMed] [Google Scholar]
  16. Risch N (1990) Linkage strategies for genetically complex traits. III. The effect of marker polymorphism on analysis of affected relative pairs. Am J Hum Genet 46:242–253 [PMC free article] [PubMed] [Google Scholar]
  17. Vieland VJ, Logue M. HLODs, trait models, and ascertainment: implications of admixture for parameter estimation and linkage detection. Hum Hered (in press) [DOI] [PubMed] [Google Scholar]
  18. Vieland VJ, Wang K, Huang J (2001) Power to detect linkage based on multiple sets of data in the presence of locus heterogeneity: comparative evaluation of model-based linkage methods for ASP data. Hum Hered 51:199–208 [DOI] [PubMed] [Google Scholar]
  19. Whittemore AS, Halpern J (2001) Problems in the definition, interpretation, and evaluation of genetic heterogeneity. Am J Hum Genet 68:457–465 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from American Journal of Human Genetics are provided here courtesy of American Society of Human Genetics

RESOURCES