Skip to main content
Briefings in Functional Genomics logoLink to Briefings in Functional Genomics
. 2014 Jul 9;13(5):371–377. doi: 10.1093/bfgp/elu022

Using population isolates in genetic association studies

Konstantinos Hatzikotoulas, Arthur Gilly, Eleftheria Zeggini
PMCID: PMC4168662  PMID: 25009120

Abstract

The use of genetically isolated populations can empower next-generation association studies. In this review, we discuss the advantages of this approach and review study design and analytical considerations of genetic association studies focusing on isolates. We cite successful examples of using population isolates in association studies and outline potential ways forward.

Keywords: isolated populations, rare variants, complex disease, genetic association studies


Genome-wide association studies (GWAS) have met with widespread success in identifying common-frequency variants associated with complex diseases and medically relevant quantitative traits. Technological advances in genotyping and sequencing have recently enabled access to low-frequency and rare variant genotypes at the population level. The identification of modest effects at individual low frequency and rare variants requires very large sample sizes. Power can be boosted by using statistical approaches to aggregate rare variants across chromosomal regions or functional units. Power can also be increased by leveraging the unique characteristics of isolated, or founder, populations in genetic association studies. In the past, population isolates have typically been used in family-based genetic studies of Mendelian traits [1, 2]. Advances in genotyping and sequencing technologies have catalysed the use of founder populations in genetic association studies of complex traits. Here, we review the population genetics characteristics of isolated populations, outline study design and analytical considerations and discuss examples of next-generation association studies in population isolates.

PROPERTIES OF POPULATION ISOLATES

Population isolates can be defined as subpopulations derived from a small number of individuals who became isolated because of a founding event (e.g. settlement of a new territory, famine, war, environmental disruption, infectious disease epidemics, social and/or cultural barriers) and have stayed so for many generations. The resulting geographical and/or cultural isolation of these populations has genetic consequences, as, for example, endogamy (within community marriage) along with very restricted gene flow (immigration) from neighbouring populations can often be observed. Thus, genomes tend to show higher homogeneity in isolates compared with cosmopolitan populations, which is reflected by a reduced effective population size (Ne or the effective number of individuals required to explain the observed genetic variability) [3] (Figure 1).

Figure 1:

Figure 1:

Factors that shape population isolate characteristics.

Another potentially advantageous property of population isolates is environmental and cultural homogeneity. In addition to reduced genetic complexity, individuals from an isolated population tend to share a common lifestyle, including diet, physical activity levels and other cultural habits, and, importantly, are exposed to similar environmental and sanitary conditions and disease vectors. Phenotype definition and diagnosis harmonization can also be achieved through standardized clinician training, a model adopted, for example, by Finland.

Isolates can branch and develop independently; geographical proximity and common origins do not imply identical evolution. Many isolates experience more than one founding event, which can result in population bottlenecks [4] or the creation of regional subisolates [5]. A typical example is Finland, where both older (∼2000 years) and younger (∼500 years) population isolates have appeared within one geographical region due to internal migration [6]. Various factors such as the total number of founders, number and intensity of bottlenecks, levels of endogamy, age and duration of isolation shape the demographic history of isolates (Figure 1). Detailed genealogical records are sometimes available for isolated populations. For example, the Icelandic population has an extensive genealogical and disease history database [7, 8]. Given genome-wide data availability, the individual characteristics of isolates can be examined using population genetics analysis tools [9–12].

GENETIC CONSEQUENCES OF ISOLATION

Reduced haplotype complexity

Thanks to the HapMap [13] and 1000 Genomes Projects [14], we now have a good understanding of linkage disequilibrium (LD) patterns across the human genome for several human ethnic groups. In isolated populations, LD tends to extend over longer distances compared with non-isolated populations, as exemplified by studies in the populations of the Central Valley of Costa Rica [15], Palau [16] and Val Borbera in northwest Italy [9]. As expected through ancestral recombination, the LD intervals of older isolates tend to be shorter than those of younger isolates [17]. Moreover, relatively higher levels of LD are observed in isolates with a small number of founders that experience slow growth during the early generations following the initial bottleneck [18]. Longer stretches of LD in isolates mean longer haplotypes, thus facilitating disease association studies and empowering imputation approaches that infer genotypes at untyped variants based on regional LD information [19]. The disadvantage of high levels of correlation among sequence variants is a reduction in resolution of the localization of causal variants within a wide association peak signal. Trans-ethnic meta-analysis has been proposed as an approach to synthesize data across populations with diverse LD patterns to enable fine-mapping of the causal variant(s) [20].

Reduced allelic variability and genetic drift

Isolation can influence the patterns and prevalence of disease. Notably, owing to the enrichment of some rare alleles resulting from the combined effect of endogamy, bottlenecks, genetic drift, recurrent mutation and selection, isolates have been shown to potentially exhibit an increased incidence of recessive disorders. Each isolate shows a unique profile of rare disease alleles [21], which can be expressed through a higher prevalence of some diseases and lower incidence of others [22, 23]. For example, the Pima Indians of Arizona have a very high prevalence of type 2 diabetes (∼20%) [24, 25] and near absence of type 1 diabetes [26].

In population isolates, certain alleles reach fixation or extinction at a particular locus, thus reducing the extent of genetic variability [27, 28]. Some variants that contribute to complex traits/diseases are rare in the parent population and drifted to higher frequency in the isolate. The enrichment of low-frequency alleles in the study population can empower the identification of these variants with smaller discovery sets. For example, a null mutation in APOC3 that had risen in frequency in the Amish founder population was found to be associated with a favourable plasma lipid profile. Association with this variant (previously thought to be private to the Amish) was recently replicated in an exome chip-wide association scan for lipid traits in ∼1200 individuals from a Greek population isolate [29]. To achieve 80% power to detect the observed effect size in the general European population (in which the variant has 40-fold lower frequency), a sample size of 67 000 individuals would have been required. The phenomenon of reduced allelic variability, combined with extended LD, is expected to improve power for trait association at rare variants compared with populations with wider allelic diversity (noting that other rare variants will be lost).

STUDY DESIGN

Population choice is an important consideration in designing a genetic association study focusing on isolates. Factors such as the number of founding haplotypes, age of divergence from the parent population, effective sample size and degree of admixture with neighbouring populations, all play a role in the population’s allelic architecture. For initial GWAS as well as rare Mendelian gene discovery, the study of young founder populations with recent expansion (e.g. late-settlement Finland) is a powerful strategy [30] because of their higher degree of LD and reduced genetic diversity. It has been suggested that small populations that have remained stable throughout most of their history could lead to more economical locus discovery efforts [31]. Drift of rare alleles occurs at random and for a small set of variant sites; therefore, the power of a genetic association study in a population isolate will depend on the enrichment of alleles that are relevant to a phenotype of interest [32–34], which requires the alleles in question to have passed through the initial population bottleneck. An association study in an isolate can often be motivated by a suspected higher prevalence of a trait or disease in that particular population.

TO TYPE OR TO SEQUENCE?

GWAS arrays by definition assay primarily variants selected to represent common frequency variation across the genome. Low-frequency and rare variants genome-wide cannot be easily captured on a genotyping array because of their large numbers and low levels of correlation. Recently, a genotyping chip focused on likely functional coding low-frequency and rare variants—the exome chip—has been used successfully in founder populations to associate rare variants with proinsulin [35] and HDL cholesterol [29] levels. The decreasing cost of sequencing makes it increasingly easier to study the complete variation landscape irrespective of allele frequency [36].

Whole exome sequencing has the advantage of reduced cost compared with whole-genome sequencing, but does not capture variation in non-genic regions. Previous experience from GWAS strongly indicates that the majority of complex trait signals reside outside of genes. High-depth sequencing is considered necessary to call high-quality variants across the allele frequency spectrum [37]. However, it has been shown that in the context of a population study, whole-genome sequencing many individuals at low depth can have variant detection power advantages over fewer individuals sequenced at higher depth [38, 39]. This approach was tested by the 1000 Genomes Project pilot and has been used to generate the widely used phase 1 and phase 2 variant sets.

When not all samples can be sequenced, whole-genome sequencing of a subset of cases and controls following an initial GWAS has proven to be a successful strategy for empowering rare variant association in dichotomous trait studies [40, 41]. Variants from the sequenced samples are phased using long-range haplotype phasing (LRP; see Imputation), then imputed back into the whole sample set, which is equivalent to using the sequenced subset as a reference panel for imputation.

ANALYTICAL CONSIDERATIONS

Relatedness

One intrinsic consequence of genetic isolation is relatedness among individuals, which can conflict with the assumptions of independence of many commonly used analysis tools and inflate test statistics affecting association signals. An efficient approach is to account for relatedness in the association analysis through the use of a linear mixed model (LMM). Until recently, computation of an exact association test statistic such as the Wald statistic or likelihood ratio (implemented in EMMA [42]) was computationally impractical. Tools that compute approximate solutions, either by using the residuals from the LMM under the null model as phenotypes, such as GenABEL [43], or by avoiding the repeated estimation of variance components, such as TASSEL [44] or EMMAX [45], have recently been developed. Mathematically optimized versions of the exact test, such as GEMMA [46] or FaST-LMM [47], have also been developed and are widely used.

Several methods have been proposed to improve on the power of single-point tests for rare variants by combining information across multiple sites in a chromosomal region and testing for association with the trait of interest [48–51]. Relatedness information can be incorporated in the model, such as in famSKAT [52] or other tools [53–57].

Imputation

When performing association based on genotyping arrays, it is common practice to impute untyped variants based on a reference panel (e.g. the 1000 Genomes Project (www.1000genomes.org) and/or the UK10K study data (www.uk10k.org) to enhance the resolution of the study. This approach, where positions that were not genotyped in the sample are added using phase information in the reference set, is also relevant to refining genotype calls for low-depth sequencing data.

Imputation is closely related to phasing, a procedure that infers haplotypes based on identity by state (IBS), with other phased individuals. Relatedness is helpful for phasing because it increases the likelihood of finding a long IBS string of variants; the more related the samples, the more certain it is that these IBS sequences are actually inherited identical-by-descent (IBD), and the probability that unobserved positions are IBS becomes quantifiable [58, 59]. Kong et al. [60] proposed LRP, a method that uses regions of IBD between related individuals within the sample, to phase and impute variants. This approach has been generalized and improved to achieve higher accuracy around recombination sites in e.g. SLRP [61], ANCHAP [62] and other methods [63].

Meta-analysis

The synthesis of data through meta-analysis can increase the power of association studies. Two different classes of methods have been typically applied in traditional meta-analysis of GWAS: P-value-based tests and effect size-based methods, which can be further subdivided into fixed or random effects models [64]. Fixed effects models assume that the same underlying effect is present in all studies, whereas random effects models allow for effect sizes to be different. These approaches can be applied to meta-analysing data across isolates. However, in the era of rare variant association testing, allelic heterogeneity can decrease power either because of the presence of similarly associated multiple rare variants or different ethnic backgrounds in the populations being meta-analyzed [65]. In addition, meta-analysis generally assumes independence of the study samples, which does not hold in the case of within-isolate meta-analysis. Research in this field is still ongoing [20], and a continued effort in method development is needed.

COMPLEX TRAIT LOCUS DETECTION IN FOUNDER POPULATIONS

Several studies from Iceland have been successful in identifying low- /rare-frequency variants associated with sick sinus syndrome, gout, prostate cancer and Alzheimer’s disease [41, 66–68]. In a recent study in Finland, four novel loci were found to be associated with saccular intracranial aneurysms, a complex trait with a sporadic and a familial form [69]. One of these variants has drifted up 15 times in frequency compared with the Dutch general population and is virtually non-existent in other populations from the 1000 Genomes Project. A genome-wide significant risk locus for schizophrenia and bipolar disorder has been identified in an ethnically homogeneous cohort of Ashkenazi Jewish individuals [70]. The top signal (rs 11098403) is an inter-genic variant located in NDST3 and was replicated in 11 independent cohorts of varying ethnicities. Recently, a Greek isolated population replicated a genome-wide significant association between R19X, a cardioprotective variant in APOC3, and low blood triglyceride levels [29]. This study also demonstrated that associations discovered in population isolates can be generalizable, as the same variant (R19X) had previously been discovered in the Amish founder population [71].

FUTURE DIRECTIONS

Population isolates are uniquely positioned to usher in the new era of sequence-based association studies. The relative power advantages afforded by studying isolates have been well-documented and recently exemplified in the literature through successful identification of complex trait loci that replicate in other populations. Not all novel discoveries in isolates can be recapitulated in other populations. However, private variants detected in isolates can importantly point to novel biology, identifying potential pathways and loci involved in the aetiopathogenesis of clinically relevant traits. Large-scale efforts to synthesize genome-wide data across isolated populations are an intuitive next step in the field, and hold the promise of catalysing the discovery of further complex trait-associated variants.

Key points.

  • The population genetics characteristics of an isolate depend on demographic history and the number of bottlenecks/founding events, total number of founders, levels of endogamy, age and duration of isolation.

  • Genetic consequences of isolation such as reduced haplotype diversity and genetic drift can enhance the power for locus identification in genetic association studies of complex traits.

FUNDING

This work is supported by the Wellcome Trust (098051) and the European Research Council (ERC-2011-StG 280559-SEPI).

Biographies

Konstantinos Hatzikotoulas is a statistical geneticist; his research interests are on underlying genetic mechanisms of common diseases.

Arthur Gilly is a statistical geneticist; his work focuses on statistical analyses and methods development for high-throughput sequencing data.

Eleftheria Zeggini is a group leader; her work aims to help elucidate the genetic determinants of complex human traits.

References

  • 1.Sheffield VC, Stone EM, Carmi R. Use of isolated inbred human populations for identification of disease genes. Trends Genet. 1998;14:391–6. doi: 10.1016/s0168-9525(98)01556-x. [DOI] [PubMed] [Google Scholar]
  • 2.Puffenberger EG, Hosoda K, Washington SS, et al. A missense mutation of the endothelin-B receptor gene in multigenic Hirschsprung's disease. Cell. 1994;79:1257–66. doi: 10.1016/0092-8674(94)90016-7. [DOI] [PubMed] [Google Scholar]
  • 3.Charlesworth B. Fundamental concepts in genetics: effective population size and patterns of molecular evolution and variation. Nat Rev Genet. 2009;10:195–205. doi: 10.1038/nrg2526. [DOI] [PubMed] [Google Scholar]
  • 4.Peltonen L, Pekkarinen P, Aaltonen J. Messages from an isolate: lessons from the Finnish gene pool. Biol Chem Hoppe Seyler. 1995;376:697–704. doi: 10.1515/bchm3.1995.376.12.697. [DOI] [PubMed] [Google Scholar]
  • 5.Jorde LB, Watkins WS, Kere J, et al. Gene mapping in isolated populations: new roles for old friends? Hum Hered. 2000;50:57–65. doi: 10.1159/000022891. [DOI] [PubMed] [Google Scholar]
  • 6.Peltonen L, Palotie A, Lange K. Use of population isolates for mapping complex traits. Nat Rev Genet. 2000;1:182–90. doi: 10.1038/35042049. [DOI] [PubMed] [Google Scholar]
  • 7. deCODE genetics. http://www.decode.com (May 2004, date last accessed)
  • 8. Statistics Iceland. http://www.statice.is/ (May 2004, date last accessed)
  • 9.Colonna V, Pistis G, Bomba L, et al. Small effective population size and genetic homogeneity in the Val Borbera isolate. Eur J Hum Genet. 2013;21:89–94. doi: 10.1038/ejhg.2012.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Esko T, Mezzavilla M, Nelis M, et al. Genetic characterization of northeastern Italian population isolates in the context of broader European genetic diversity. Eur J Hum Genet. 2013;21:659–65. doi: 10.1038/ejhg.2012.229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Price AL, Helgason A, Palsson S, et al. The impact of divergence time on the nature of population structure: an example from Iceland. PLoS Genet. 2009;5:e1000505. doi: 10.1371/journal.pgen.1000505. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Vitart V, Biloglav Z, Hayward C, et al. 3000 years of solitude: extreme differentiation in the island isolates of Dalmatia, Croatia. Eur J Hum Genet. 2006;14:478–7. doi: 10.1038/sj.ejhg.5201589. [DOI] [PubMed] [Google Scholar]
  • 13.International HapMap C, Altshuler DM, Gibbs RA, et al. Integrating common and rare genetic variation in diverse human populations. Nature. 2010;467:52–8. doi: 10.1038/nature09298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.1000 Genomes Project Consortium. Abecasis GR, Auton A, et al. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491:56–65. doi: 10.1038/nature11632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Service SK, Ophoff RA, Freimer NB. The genome-wide distribution of background linkage disequilibrium in a population isolate. Hum Mol Genet. 2001;10:545–51. doi: 10.1093/hmg/10.5.545. [DOI] [PubMed] [Google Scholar]
  • 16.Devlin B, Roeder K, Otto C, et al. Genome-wide distribution of linkage disequilibrium in the population of Palau and its implications for gene flow in Remote Oceania. Hum Genet. 2001;108:521–8. doi: 10.1007/s004390100511. [DOI] [PubMed] [Google Scholar]
  • 17.Varilo T, Peltonen L. Isolates and their potential use in complex gene mapping efforts. Curr Opin Genet Dev. 2004;14:316–23. doi: 10.1016/j.gde.2004.04.008. [DOI] [PubMed] [Google Scholar]
  • 18.Kruglyak L. Prospects for whole-genome linkage disequilibrium mapping of common disease genes. Nat Genet. 1999;22:139–44. doi: 10.1038/9642. [DOI] [PubMed] [Google Scholar]
  • 19.Marchini J, Howie B, Myers S, et al. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet. 2007;39:906–13. doi: 10.1038/ng2088. [DOI] [PubMed] [Google Scholar]
  • 20.Morris AP. Transethnic meta-analysis of genomewide association studies. Genet Epidemiol. 2011;35:809–22. doi: 10.1002/gepi.20630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Arcos-Burgos M, Muenke M. Genetics of population isolates. Clin Genet. 2002;61:233–47. doi: 10.1034/j.1399-0004.2002.610401.x. [DOI] [PubMed] [Google Scholar]
  • 22.Charrow J. Ashkenazi Jewish genetic disorders. Fam Cancer. 2004;3:201–6. doi: 10.1007/s10689-004-9545-z. [DOI] [PubMed] [Google Scholar]
  • 23.Norio R. Finnish Disease Heritage I: characteristics, causes, background. Hum Genet. 2003;112:441–56. doi: 10.1007/s00439-002-0875-3. [DOI] [PubMed] [Google Scholar]
  • 24.Knowler WC, Bennett PH, Hamman RF, et al. Diabetes incidence and prevalence in Pima Indians: a 19-fold greater incidence than in Rochester, Minnesota. Am J Epidemiol. 1978;108:497–505. doi: 10.1093/oxfordjournals.aje.a112648. [DOI] [PubMed] [Google Scholar]
  • 25.Baier LJ, Hanson RL. Genetic studies of the etiology of type 2 diabetes in Pima Indians: hunting for pieces to a complicated puzzle. Diabetes. 2004;53:1181–6. doi: 10.2337/diabetes.53.5.1181. [DOI] [PubMed] [Google Scholar]
  • 26.Dabelea D, Hanson RL, Bennett PH, et al. Increasing prevalence of Type II diabetes in American Indian children. Diabetologia. 1998;41:904–10. doi: 10.1007/s001250051006. [DOI] [PubMed] [Google Scholar]
  • 27.Otto SP, Whitlock MC. The probability of fixation in populations of changing size. Genetics. 1997;146:723–33. doi: 10.1093/genetics/146.2.723. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Andrews CA. Natural Selection, Genetic Drift and Gene Flow do Not Act in Isolation in Natural Populations. Nature Education Knowledge. 2010;3:5. [Google Scholar]
  • 29.Tachmazidou I, Dedoussis G, Southam L, et al. A rare functional cardioprotective APOC3 variant has risen in frequency in distinct population isolates. Nat Commun. 2013;4:2872. doi: 10.1038/ncomms3872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kristiansson K, Naukkarinen J, Peltonen L. Isolated populations and complex disease gene identification. Genome Biol. 2008;9:109. doi: 10.1186/gb-2008-9-8-109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Terwilliger JD, Zollner S, Laan M, et al. Mapping genes through the use of linkage disequilibrium generated by genetic drift: ‘drift mapping’ in small populations with no demographic expansion. Hum Hered. 1998;48:138–54. doi: 10.1159/000022794. [DOI] [PubMed] [Google Scholar]
  • 32.Wright AF, Carothers AD, Pirastu M. Population choice in mapping genes for complex diseases. Nat Genet. 1999;23:397–404. doi: 10.1038/70501. [DOI] [PubMed] [Google Scholar]
  • 33.Stephens JC, Bamshad M. Population choice as a consideration for genetic analysis study design. Cold Spring Harb Protoc. 2011;2011:917–22. doi: 10.1101/pdb.top122. [DOI] [PubMed] [Google Scholar]
  • 34.Zuk O, Schaffner SF, Samocha K, et al. Searching for missing heritability: designing rare variant association studies. Proc Natl Acad Sci USA. 2014;111:E455–64. doi: 10.1073/pnas.1322563111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Huyghe JR, Jackson AU, Fogarty MP, et al. Exome array analysis identifies new loci and low-frequency variants influencing insulin processing and secretion. Nat Genet. 2013;45:197–201. doi: 10.1038/ng.2507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kiezun A, Garimella K, Do R, et al. Exome sequencing and the genetic basis of complex traits. Nat Genet. 2012;44:623–30. doi: 10.1038/ng.2303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Do R, Kathiresan S, Abecasis GR. Exome sequencing and complex disease: practical aspects of rare variant association studies. Hum Mol Genet. 2012;21:R1–9. doi: 10.1093/hmg/dds387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Le SQ, Durbin R. SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples. Genome Res. 2011;21:952–60. doi: 10.1101/gr.113084.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Li Y, Sidore C, Kang HM, et al. Low-coverage sequencing: implications for design of complex trait association studies. Genome Res. 2011;21:940–51. doi: 10.1101/gr.117259.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Zeggini E. Next-generation association studies for complex traits. Nat Genet. 2011;43:287–88. doi: 10.1038/ng0411-287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Holm H, Gudbjartsson DF, Sulem P, et al. A rare variant in MYH6 is associated with high risk of sick sinus syndrome. Nat Genet. 2011;43:316–20. doi: 10.1038/ng.781. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Kang HM, Zaitlen NA, Wade CM, et al. Efficient control of population structure in model organism association mapping. Genetics. 2008;178:1709–23. doi: 10.1534/genetics.107.080101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Aulchenko YS, Ripke S, Isaacs A, et al. GenABEL: an R library for genome-wide association analysis. Bioinformatics. 2007;23:1294–6. doi: 10.1093/bioinformatics/btm108. [DOI] [PubMed] [Google Scholar]
  • 44.Zhang Z, Ersoz E, Lai CQ, et al. Mixed linear model approach adapted for genome-wide association studies. Nat Genet. 2010;42:355–60. doi: 10.1038/ng.546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Kang HM, Sul JH, Service SK, et al. Variance component model to account for sample structure in genome-wide association studies. Nat Genet. 2010;42:348–54. doi: 10.1038/ng.548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Zhou X, Stephens M. Genome-wide efficient mixed-model analysis for association studies. Nat Genet. 2012;44:821–4. doi: 10.1038/ng.2310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Lippert C, Listgarten J, Liu Y, et al. FaST linear mixed models for genome-wide association studies. Nat Methods. 2011;8:833–5. doi: 10.1038/nmeth.1681. [DOI] [PubMed] [Google Scholar]
  • 48.Chen H, Hendricks AE, Cheng Y, et al. Comparison of statistical approaches to rare variant analysis for quantitative traits. BMC Proc. 2011;5(Suppl 9):S113. doi: 10.1186/1753-6561-5-S9-S113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Asimit J, Zeggini E. Rare variant association analysis methods for complex traits. Annu Rev Genet. 2010;44:293–308. doi: 10.1146/annurev-genet-102209-163421. [DOI] [PubMed] [Google Scholar]
  • 50.Bansal V, Libiger O, Torkamani A, et al. Statistical analysis strategies for association studies involving rare variants. Nat Rev Genet. 2010;11:773–85. doi: 10.1038/nrg2867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Basu S, Pan W. Comparison of statistical tests for disease association with rare variants. Genet Epidemiol. 2011;35:606–19. doi: 10.1002/gepi.20609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Chen H, Meigs JB, Dupuis J. Sequence kernel association test for quantitative traits in family samples. Genet Epidemiol. 2013;37:196–204. doi: 10.1002/gepi.21703. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Schaid DJ, McDonnell SK, Sinnwell JP, et al. Multiple genetic variant association testing by collapsing and kernel methods with pedigree or population structured data. Genet Epidemiol. 2013;37:409–18. doi: 10.1002/gepi.21727. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Wang X, Lee S, Zhu X, et al. GEE-based SNP set association test for continuous and discrete traits in family-based association studies. Genet Epidemiol. 2013;37:778–86. doi: 10.1002/gepi.21763. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Listgarten J, Lippert C, Kang EY, et al. A powerful and efficient set test for genetic markers that handles confounders. Bioinformatics. 2013;29:1526–33. doi: 10.1093/bioinformatics/btt177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Oualkacha K, Dastani Z, Li R, et al. Adjusted sequence kernel association test for rare variants controlling for cryptic and family relatedness. Genet Epidemiol. 2013;37:366–76. doi: 10.1002/gepi.21725. [DOI] [PubMed] [Google Scholar]
  • 57.Jiang D, McPeek MS. Robust rare variant association testing for quantitative traits in samples with related individuals. Genet Epidemiol. 2014;38:10–20. doi: 10.1002/gepi.21775. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Powell JE, Visscher PM, Goddard ME. Reconciling the analysis of IBD and IBS in complex trait studies. Nat Rev Genet. 2010;11:800–805. doi: 10.1038/nrg2865. [DOI] [PubMed] [Google Scholar]
  • 59.Carmi S, Palamara PF, Vacic V, et al. The variance of identity-by-descent sharing in the Wright-Fisher model. Genetics. 2013;193:911–28. doi: 10.1534/genetics.112.147215. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Kong A, Masson G, Frigge ML, et al. Detection of sharing by descent, long-range phasing and haplotype imputation. Nat Genet. 2008;40:1068–75. doi: 10.1038/ng.216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Palin K, Campbell H, Wright AF, et al. Identity-by-descent-based phasing and imputation in founder populations using graphical models. Genet Epidemiol. 2011;35:853–60. doi: 10.1002/gepi.20635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Glodzik D, Navarro P, Vitart V, et al. Inference of identity by descent in population isolates and optimal sequencing studies. Eur J Hum Genet. 2013;21:1140–5. doi: 10.1038/ejhg.2012.307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Uricchio LH, Chong JX, Ross KD, et al. Accurate imputation of rare and common variants in a founder population from a small number of sequenced individuals. Genet Epidemiol. 2012;36:312–9. doi: 10.1002/gepi.21623. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Zeggini E, Ioannidis JP. Meta-analysis in genome-wide association studies. Pharmacogenomics. 2009;10:191–201. doi: 10.2217/14622416.10.2.191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Asimit J, Day-Williams A, Zgaga L, et al. An evaluation of different meta-analysis approaches in the presence of allelic heterogeneity. Eur J Hum Genet. 2012;20:709–12. doi: 10.1038/ejhg.2011.274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Sulem P, Gudbjartsson DF, Walters GB, et al. Identification of low-frequency variants associated with gout and serum uric acid levels. Nat Genet. 2011;43:1127–30. doi: 10.1038/ng.972. [DOI] [PubMed] [Google Scholar]
  • 67.Jonsson T, Atwal JK, Steinberg S, et al. A mutation in APP protects against Alzheimer's disease and age-related cognitive decline. Nature. 2012;488:96–9. doi: 10.1038/nature11283. [DOI] [PubMed] [Google Scholar]
  • 68.Gudmundsson J, Sulem P, Gudbjartsson DF, et al. A study based on whole-genome sequencing yields a rare variant at 8q24 associated with prostate cancer. Nat Genet. 2012;44:1326–9. doi: 10.1038/ng.2437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Kurki MI, Gaál EI, Kettunen J, et al. High risk population isolate reveals low frequency variants predisposing to intracranial aneurysms. PLoS Genet. 2014;10:e1004134. doi: 10.1371/journal.pgen.1004134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Lencz T, Guha S, Liu C, et al. Genome-wide association study implicates NDST3 in schizophrenia and bipolar disorder. Nat Commun. 2013;4:2739. doi: 10.1038/ncomms3739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Pollin TI, Damcott CM, Shen H, et al. A null mutation in human APOC3 confers a favorable plasma lipid profile and apparent cardioprotection. Science. 2008;322:1702–5. doi: 10.1126/science.1161524. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Briefings in Functional Genomics are provided here courtesy of Oxford University Press

RESOURCES