Genetical Genomics: Spotlight on QTL Hotspots

Rainer Breitling; Yang Li; Bruno M Tesson; Jingyuan Fu; Chunlei Wu; Tim Wiltshire; Alice Gerrits; Leonid V Bystrykh; Gerald de Haan; Andrew I Su; Ritsert C Jansen

doi:10.1371/journal.pgen.1000232

. 2008 Oct 24;4(10):e1000232. doi: 10.1371/journal.pgen.1000232

Genetical Genomics: Spotlight on QTL Hotspots

Rainer Breitling ¹, Yang Li ¹, Bruno M Tesson ¹, Jingyuan Fu ^1,², Chunlei Wu ³, Tim Wiltshire ⁴, Alice Gerrits ⁵, Leonid V Bystrykh ⁵, Gerald de Haan ⁵, Andrew I Su ^3,^*, Ritsert C Jansen ^1,^2,^*

Editor: Gary A Churchill⁶

PMCID: PMC2563687 PMID: 18949031

Genetical genomics aims at identifying quantitative trait loci (QTLs) for molecular traits such as gene expression or protein levels (eQTL and pQTL, respectively). One of the central concepts in genetical genomics is the existence of hotspots [1], where a single polymorphism leads to widespread downstream changes in the expression of distant genes, which are all mapping to the same genomic locus. Several groups have hypothesized that many genetic polymorphisms—e.g., in major regulators or transcription factors—would lead to large and consistent biological effects that would be visible as eQTL hotspots.

Rather surprisingly, however, there have been only very few verified hotspots in published genetical genomics studies to date. In contrast to local eQTLs, which coincide with the position of the gene and are presumably acting in cis—e.g., by polymorphisms in the promoter region—distant eQTLs have been found to be more elusive. They seem to show smaller effect sizes and are less consistent, perhaps due to the indirect regulation mechanism, resulting in lower statistical power to detect them and, consequently, an inability to reliably delimit hotspots [2]. While there are typically hundreds to thousands of strong local eQTLs per study, the number of associated hotspots is much lower. For example, a recent very large association study in about 1,000 humans did not find a single significant hotspot [3]. Other studies have reported up to about 30 hotspots, far less than the number of significant local eQTLs (Table 1). The molecular basis is known for less than a handful of cases. An example is the Arabidopsis ERECTA locus, which leads to a drastic phenotypic change in the plant and has broad pleiotropic effects on many molecular (and morphological) traits [4].

Table 1. eQTL Hotspots Reported in Selected Genetical Genomics Studies.

Paper	Organism	Population Size	Number of Local eQTLs	Number of Distant eQTLs	Threshold for eQTLs	Number of Hotspots
Brem et al., Science, 2002 [23]	yeast	40	185	385	p<5×10⁻⁵	8
Yvert et al., Nat Genet, 2003 [13]	yeast	86	578	1,716	p<3.4×10⁻⁵	13
Schadt et al., Nature, 2003 [1]	mouse	111	1,022	1,985	LOD>4.3	7
Kirst et al., Plant Physiol, 2004 [24]	eucalyptus	91	1	8	experiment-wise α = 0.10	2
Monks et al., AJHG, 2004 [25]	human	15 CEPH families (167)	13	20	p<5×10⁻⁵	0
Morley et al., Nature, 2004 [26]	human	14 CEPH families	29	118	p<4.3×10⁻⁷	2
Cheung et al., Nature, 2005 [27]	human	57	65	0	p<0.001	0
Stranger et al., PLoS Genet, 2005 [28]	human	60	10–40	3	corrected p-value = 0.05	0
Chesler et al., Nat Genet, 2005 [29]	mouse	35	83	5	FDR = 0.05	7
Bystrykh et al., Nat Genet, 2005 [30]	mouse	30	478	136	genome-wide p<0.005	“multiple”
Hubner et al., Nat Genet, 2005 [31]	rat	259	622	1,211	p<0.05	2
Mehrabian et al., Nat Genet, 2005 [32]	mouse	111	20,107 total	20,107 total	LOD>2	1
DeCook et al., Genetics, 2006 [33]	Arabidopsis	30	3,525 total	3,525 total	FDR = 2.3%	5
Lan et al., PLoS Genet, 2006 [34]	mouse	60	723	5,293	LOD>3.4	15
Wang et al., PLoS Genet, 2006 [35]	mouse	312	2,118	4,556	p<5×10⁻⁵	7
Li et al., PLoS Genet, 2006 [36]	C. elegans	80	414	308	p<0.001; FDR = 0.04	1
Keurentjes et al., PNAS, 2007 [4]	Arabidopsis	160	1,875	1,958	FDR = 0.05	∼29
McClurg et al., Genetics, 2007 [37]	mouse	32	N.A.	N.A.	N.A.	25
Emilsson et al., Nature, 2008 [3]	human	470	1,970	52	FDR = 0.05	0
Schadt et al., PLoS Biol, 2008 [38]	human	427	3,210	242	p<1.6×10⁻¹²	23
Ghazalpour et al., PLoS Genet, 2008 [39]	mouse	110	471	701	FDR = 0.1	4
Wu et al., PLoS Genet, 2008 [5]	mouse	28	600	885,840 (C. Wu and A. I. Su, unpublished data)	p<0.003	1,659

Open in a new tab

The numbers are based on the statistical procedure and threshold used in the original publication, which can vary widely between papers. Where results based on multiple thresholds were reported, we included the most conservative one in the table.

N.A., not reported in the original paper. FDR, false discovery rate.

Recently, Wu et al. [5] reported the large-scale identification of hotspots. They studied gene expression in adipose tissue of 28 inbred mouse strains and performed eQTL analysis by genome-wide association analysis. The paper reports the identification of over 1,600 candidate hotspots, each with a minimum hotspot size of 50 target genes. Furthermore, they demonstrated that these hotspots are biologically coherent by showing that in about 25% of cases, the hotspot targets are enriched for functional gene sets derived from Gene Ontology, the KEGG pathways database, and the Ingenuity Pathways Knowledge Base. These findings suggested that genetic polymorphisms can indeed lead to large and consistent biological effects that are visible as eQTL hotspots.

However, the authors chose a relatively permissive threshold of p = 0.003 for QTL detection, uncorrected for multiple testing. In total, 886,440 eQTLs were identified at this threshold, i.e., 134 per gene. A permutation test (C. Wu and A. I. Su, unpublished data) shows that this results in a false discovery rate of 64%, largely resulting from multiple testing across 157,000 SNPs and 6,601 probe sets. This relatively permissive threshold was chosen because the focus of the analysis was on patterns of eQTL hotspots and not on individual eQTL associations. Analysis of eQTL patterns is relatively robust to individual false positives, and a permissive threshold allows for relatively greater sensitivity in detecting signal [6]. The authors observed an enrichment of specific biological functions among the genes in the reported hotspots. The study also reported that enriched categories tended to match the annotation of candidate regulators. Moreover, one predicted regulator was experimentally validated. In sum, these data seem to support the hypothesis that hotspots are downstream of a common master regulator linked to the eQTL.

However, we suggest here that these observations may also be explained by clusters of genes with highly correlated expression. If one gene shows a spurious eQTL, many correlated genes will show the same spurious eQTL, in particular if the false discovery rate for individual eQTLs is very high [2], [7]–[9]. There are many nongenetic mechanisms that can create strongly correlated clusters of functionally related genes. On the one hand, such clusters may be a result of a concerted response to some uncontrolled environmental factor. On the other hand, dissected tissue samples can contain slightly varying fractions of individual cell types, leading to cell-type–specific gene clusters, which vary in a correlated manner. The resulting correlation patterns represent potentially confounding effects, both for the correct determination of a significance threshold and for the biological interpretation of the resulting hotspots.

Consequently, a key consideration in eQTL analysis is in the effective design of a permutation strategy to assess statistical significance. The approach used in [5] permuted the observed eQTLs among genes (Figure 1B). However, this approach has the disadvantage of ignoring the expression correlation between genes so that their spurious eQTLs no longer cluster along the genome. This permutation strategy leads to a potentially severe underestimate of the null distribution of the size of hotspots, when there are correlated clusters as described above.

(A) The top panel shows the original data. The genotype matrix contains information about the genotype of each strain (S₁…S_n) at each marker position along the genome (M₁…M_n). For each strain, the expression of genes G₁…G_n is measured. Linkage or association mapping combines these two sources of information to yield the eQTL matrix, where each purple entry indicates a significant linkage or association for a gene at a particular locus. The bottom panel illustrates the permutation strategy advocated here, where the strain labels are permuted, so that each strain is assigned the genotype vector of another random strain, while the expression matrix is unchanged. When the mapping is repeated on these permuted data, the correlation structure of gene expression is maintained, leading to an accurate estimate of the clustered distribution of false eQTLs along the genome. (B) shows the permutation strategy used in [5], where the original eQTL matrix is permuted by assigning the same number of eQTLs to genes randomly. The correlation of gene expression is lost, leading to an underestimate of the clustered pattern of spurious eQTLs.

An alternative strategy would have been to permute the strain labels as shown in Figure 1A, maintaining the correlation of the expression traits while destroying any genetic association [2],[10]. As discussed above, it is expected that this would result in a more realistic significance threshold and a much smaller number of significant hotspots. Reanalysis of the data from [5] confirmed this idea: when permuting the strain labels (i.e., randomly swapping the genotypes between animals), the average maximum size of hotspots in the permuted data increases from less than 50 to 986. Consequently, even the largest hotspot in the real data only has a multiple testing corrected p-value of 0.23. This reanalysis demonstrates that expression correlation can indeed explain a large part of the co-mapping between genes. Such effects may also underlie some of the higher numbers of hotspots reported by some earlier studies (Table 1), especially where no appropriate permutation tests were applied to determine the statistical significance of hotspots [2].

Of course, this does not imply that all hotspots are necessarily false positives. As described above, about 5% of the co-mapping clusters in [5] are not only functionally coherent but also map to a locus that contains a gene of the same functional class. This number is not statistically significant, but it is still suggestive of an enrichment of functional associations (p<0.16, false discovery rate = 67%; C. Wu and A. I. Su, unpublished data). Some of these prioritized hotspots could correspond to true hotspots, and indeed one of them has been verified experimentally: cyclin H was validated as a new upstream regulator of cellular oxidative phosphorylation, as well as a transcriptional regulator of genes composing a hotspot [5].

Other studies, which used much stricter thresholds for defining their hotspots, also demonstrated the potential of interpreting putative hotspots by a closer study of the associated genetic locus [11],[12]. An example is the recent work of Zhu et al. [12]: by combining eQTL information, transcription factor binding sites, and protein–protein interaction data in a Bayesian network approach, they were able to predict causal regulators for nine out of the 13 hotspots (69%) originally reported in [13]. With integrated methods like these, it should be possible to identify those hotspots that are more than just clusters of co-expressed genes. As a result, the number of identified, functionally relevant hotspots could ultimately increase beyond the small numbers reported in Table 1. This would create new opportunities for gene regulatory network reconstruction.

In any case, for the time being it seems that distant eQTLs and their hotspots are still scarce and hard to find, and that those that are reported should be interpreted with caution. This rarity of convincing hotspots in genetical genomics studies is intriguing. It could be due to the limited power of the initial studies, but it could also have a more profound reason. For example, it might well be that biological systems are so robust against subtle genetic perturbations that the majority of heritable gene expression variation is effectively “buffered” and does not lead to downstream effects on other genes, protein, metabolites, or phenotypes [14]–[17]. Experimental evidence for phenotypic buffering of protein coding polymorphisms is well established [18],[19].

In fact, it has been shown that phenotypic buffering is a general property of complex gene-regulatory networks [20]. Also, if small heritable changes in transcript levels were transmitted unbuffered throughout the system, there would be a grave danger that genetic recombination would lead to unhealthy combinations of alleles and, consequently, to systems failure. Hotspots with large pleiotropic effects are thus more likely to be removed by purifying selection. If, as thus expected, common alleles are predominantly buffered by the robust properties of the system and hence largely inconsequential for the rest of the molecules in the system, this will have profound consequences for the design and interpretation of genetical genomics studies of complex diseases. Most importantly, it could turn out that even so-called common diseases—like diabetes, asthma, or rheumatoid arthritis—are not necessarily the result of common, small-effect variants in a large number of genes, but are rather caused by changes at a few crucial fragile points of the system (hotspots), which cause large, system-wide disturbances [21],[22]. Future studies in genetical genomics should aim at further elucidating the striking rarity of eQTL hotspots.

References

1.Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, et al. Genetics of gene expression surveyed in maize, mouse and man. Nature. 2003;422:297–302. doi: 10.1038/nature01434. [DOI] [PubMed] [Google Scholar]
2.de Koning DJ, Haley CS. Genetical genomics in humans and model organisms. Trends Genet. 2005;21:377–381. doi: 10.1016/j.tig.2005.05.004. [DOI] [PubMed] [Google Scholar]
3.Emilsson V, Thorleifsson G, Zhang B, Leonardson AS, Zink F, et al. Genetics of gene expression and its effect on disease. Nature. 2008;452:423–428. doi: 10.1038/nature06758. [DOI] [PubMed] [Google Scholar]
4.Keurentjes JJ, Fu J, Terpstra IR, Garcia JM, van den Ackerveken G, et al. Regulatory network construction in Arabidopsis by using genome-wide gene expression quantitative trait loci. Proc Natl Acad Sci U S A. 2007;104:1708–1713. doi: 10.1073/pnas.0610429104. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Wu C, Delano DL, Mitro N, Su SV, Janes J, et al. Gene set enrichment in eQTL data identifies novel annotations and pathway regulators. PLoS Genet. 2008;4(5):e1000070. doi: 10.1371/journal.pgen.1000070. doi:10.1371/journal.pgen.1000070. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Wessel J, Zapala MA, Schork NJ. Accommodating pathway information in expression quantitative trait locus analysis. Genomics. 2007;90:132–142. doi: 10.1016/j.ygeno.2007.03.003. [DOI] [PubMed] [Google Scholar]
7.Peng J, Wang P, Tang H. Controlling for false positive findings of trans-hubs in expression quantitative trait loci mapping. BMC Proc. 2007;1(Suppl 1):S157. doi: 10.1186/1753-6561-1-s1-s157. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Perez-Enciso M. In silico study of transcriptome genetic variation in outbred populations. Genetics. 2004;166:547–554. doi: 10.1534/genetics.166.1.547. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Wang S, Zheng T, Wang Y. Transcription activity hot spot, is it real or an artifact? BMC Proc. 2007;1(Suppl 1):S94. doi: 10.1186/1753-6561-1-s1-s94. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Churchill GA, Doerge RW. Naive application of permutation testing leads to inflated type I error rates. Genetics. 2008;178:609–610. doi: 10.1534/genetics.107.074609. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Stylianou IM, Affourtit JP, Shockley KR, Wilpan RY, Abdi FA, et al. Applying gene expression, proteomics and single-nucleotide polymorphism analysis for complex trait gene identification. Genetics. 2008;178:1795–1805. doi: 10.1534/genetics.107.081216. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Zhu J, Zhang B, Smith EN, Drees B, Brem RB, et al. Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks. Nat Genet. 2008;40:854–861. doi: 10.1038/ng.167. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Yvert G, Brem RB, Whittle J, Akey JM, Foss E, et al. Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat Genet. 2003;35:57–64. doi: 10.1038/ng1222. [DOI] [PubMed] [Google Scholar]
14.Le Rouzic A, Carlborg O. Evolutionary potential of hidden genetic variation. Trends Ecol Evol. 2008;23:33–37. doi: 10.1016/j.tree.2007.09.014. [DOI] [PubMed] [Google Scholar]
15.Gibson G, Wagner G. Canalization in evolutionary genetics: a stabilizing theory? Bioessays. 2000;22:372–380. doi: 10.1002/(SICI)1521-1878(200004)22:4<372::AID-BIES7>3.0.CO;2-J. [DOI] [PubMed] [Google Scholar]
16.Gibson G, Dworkin I. Uncovering cryptic genetic variation. Nat Rev Genet. 2004;5:681–690. doi: 10.1038/nrg1426. [DOI] [PubMed] [Google Scholar]
17.Carlborg O, Haley CS. Epistasis: too often neglected in complex trait studies? Nat Rev Genet. 2004;5:618–625. doi: 10.1038/nrg1407. [DOI] [PubMed] [Google Scholar]
18.Queitsch C, Sangster TA, Lindquist S. Hsp90 as a capacitor of phenotypic variation. Nature. 2002;417:618–624. doi: 10.1038/nature749. [DOI] [PubMed] [Google Scholar]
19.Rutherford SL, Lindquist S. Hsp90 as a capacitor for morphological evolution. Nature. 1998;396:336–342. doi: 10.1038/24550. [DOI] [PubMed] [Google Scholar]
20.Bergman A, Siegal ML. Evolutionary capacitance as a general feature of complex gene networks. Nature. 2003;424:549–552. doi: 10.1038/nature01765. [DOI] [PubMed] [Google Scholar]
21.Iyengar SK, Elston RC. The genetic basis of complex traits: rare variants or “common gene, common disease”? Methods Mol Biol. 2007;376:71–84. doi: 10.1007/978-1-59745-389-9_6. [DOI] [PubMed] [Google Scholar]
22.Bodmer W, Bonilla C. Common and rare variants in multifactorial susceptibility to common diseases. Nat Genet. 2008;40:695–701. doi: 10.1038/ng.f.136. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Brem RB, Yvert G, Clinton R, Kruglyak L. Genetic dissection of transcriptional regulation in budding yeast. Science. 2002;296:752–755. doi: 10.1126/science.1069516. [DOI] [PubMed] [Google Scholar]
24.Kirst M, Myburg AA, De Leon JP, Kirst ME, Scott J, et al. Coordinated genetic regulation of growth and lignin revealed by quantitative trait locus analysis of cDNA microarray data in an interspecific backcross of eucalyptus. Plant Physiol. 2004;135:2368–2378. doi: 10.1104/pp.103.037960. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Monks SA, Leonardson A, Zhu H, Cundiff P, Pietrusiak P, et al. Genetic inheritance of gene expression in human cell lines. Am J Hum Genet. 2004;75:1094–1105. doi: 10.1086/426461. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Morley M, Molony CM, Weber TM, Devlin JL, Ewens KG, et al. Genetic analysis of genome-wide variation in human gene expression. Nature. 2004;430:743–747. doi: 10.1038/nature02797. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Cheung VG, Spielman RS, Ewens KG, Weber TM, Morley M, et al. Mapping determinants of human gene expression by regional and genome-wide association. Nature. 2005;437:1365–1369. doi: 10.1038/nature04244. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Stranger BE, Forrest MS, Clark AG, Minichiello MJ, Deutsch S, et al. Genome-wide associations of gene expression variation in humans. PLoS Genet. 2005;1:e78. doi: 10.1371/journal.pgen.0010078. doi:10.1371/journal.pgen.0010078. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Chesler EJ, Lu L, Shou S, Qu Y, Gu J, et al. Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function. Nat Genet. 2005;37:233–242. doi: 10.1038/ng1518. [DOI] [PubMed] [Google Scholar]
30.Bystrykh L, Weersing E, Dontje B, Sutton S, Pletcher MT, et al. Uncovering regulatory pathways that affect hematopoietic stem cell function using ‘genetical genomics’. Nat Genet. 2005;37:225–232. doi: 10.1038/ng1497. [DOI] [PubMed] [Google Scholar]
31.Hubner N, Wallace CA, Zimdahl H, Petretto E, Schulz H, et al. Integrated transcriptional profiling and linkage analysis for identification of genes underlying disease. Nat Genet. 2005;37:243–253. doi: 10.1038/ng1522. [DOI] [PubMed] [Google Scholar]
32.Mehrabian M, Allayee H, Stockton J, Lum PY, Drake TA, et al. Integrating genotypic and expression data in a segregating mouse population to identify 5-lipoxygenase as a susceptibility gene for obesity and bone traits. Nat Genet. 2005;37:1224–1233. doi: 10.1038/ng1619. [DOI] [PubMed] [Google Scholar]
33.DeCook R, Lall S, Nettleton D, Howell SH. Genetic regulation of gene expression during shoot development in Arabidopsis. Genetics. 2006;172:1155–1164. doi: 10.1534/genetics.105.042275. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Lan H, Chen M, Flowers JB, Yandell BS, Stapleton DS, et al. Combined expression trait correlations and expression quantitative trait locus mapping. PLoS Genet. 2006;2:e6. doi: 10.1371/journal.pgen.0020006. doi:10.1371/journal.pgen.0020006. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Wang S, Yehya N, Schadt EE, Wang H, Drake TA, et al. Genetic and genomic analysis of a fat mass trait with complex inheritance reveals marked sex specificity. PLoS Genet. 2006;2:e15. doi: 10.1371/journal.pgen.0020015. doi:10.1371/journal.pgen.0020015. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Li Y, Alvarez OA, Gutteling EW, Tijsterman M, Fu J, et al. Mapping determinants of gene expression plasticity by genetical genomics in C. elegans. PLoS Genet. 2006;2:e222. doi: 10.1371/journal.pgen.0020222. doi:10.1371/journal.pgen.0020222. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.McClurg P, Janes J, Wu C, Delano DL, Walker JR, et al. Genomewide association analysis in diverse inbred mice: power and population structure. Genetics. 2007;176:675–683. doi: 10.1534/genetics.106.066241. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Schadt EE, Molony C, Chudin E, Hao K, Yang X, et al. Mapping the genetic architecture of gene expression in human liver. PLoS Biol. 2008;6:e107. doi: 10.1371/journal.pbio.0060107. doi:10.1371/journal.pbio.0060107. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Ghazalpour A, Doss S, Kang H, Farber C, Wen PZ, et al. High-resolution mapping of gene expression using association in an outbred mouse stock. PLoS Genet. 2008;4:e1000149. doi: 10.1371/journal.pgen.1000149. doi:10.1371/journal.pgen.1000149. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Schadt1] 1.Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, et al. Genetics of gene expression surveyed in maize, mouse and man. Nature. 2003;422:297–302. doi: 10.1038/nature01434. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-deKoning1] 2.de Koning DJ, Haley CS. Genetical genomics in humans and model organisms. Trends Genet. 2005;21:377–381. doi: 10.1016/j.tig.2005.05.004. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Emilsson1] 3.Emilsson V, Thorleifsson G, Zhang B, Leonardson AS, Zink F, et al. Genetics of gene expression and its effect on disease. Nature. 2008;452:423–428. doi: 10.1038/nature06758. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Keurentjes1] 4.Keurentjes JJ, Fu J, Terpstra IR, Garcia JM, van den Ackerveken G, et al. Regulatory network construction in Arabidopsis by using genome-wide gene expression quantitative trait loci. Proc Natl Acad Sci U S A. 2007;104:1708–1713. doi: 10.1073/pnas.0610429104. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Wu1] 5.Wu C, Delano DL, Mitro N, Su SV, Janes J, et al. Gene set enrichment in eQTL data identifies novel annotations and pathway regulators. PLoS Genet. 2008;4(5):e1000070. doi: 10.1371/journal.pgen.1000070. doi:10.1371/journal.pgen.1000070. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Wessel1] 6.Wessel J, Zapala MA, Schork NJ. Accommodating pathway information in expression quantitative trait locus analysis. Genomics. 2007;90:132–142. doi: 10.1016/j.ygeno.2007.03.003. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Peng1] 7.Peng J, Wang P, Tang H. Controlling for false positive findings of trans-hubs in expression quantitative trait loci mapping. BMC Proc. 2007;1(Suppl 1):S157. doi: 10.1186/1753-6561-1-s1-s157. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-PerezEnciso1] 8.Perez-Enciso M. In silico study of transcriptome genetic variation in outbred populations. Genetics. 2004;166:547–554. doi: 10.1534/genetics.166.1.547. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Wang1] 9.Wang S, Zheng T, Wang Y. Transcription activity hot spot, is it real or an artifact? BMC Proc. 2007;1(Suppl 1):S94. doi: 10.1186/1753-6561-1-s1-s94. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Churchill1] 10.Churchill GA, Doerge RW. Naive application of permutation testing leads to inflated type I error rates. Genetics. 2008;178:609–610. doi: 10.1534/genetics.107.074609. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Stylianou1] 11.Stylianou IM, Affourtit JP, Shockley KR, Wilpan RY, Abdi FA, et al. Applying gene expression, proteomics and single-nucleotide polymorphism analysis for complex trait gene identification. Genetics. 2008;178:1795–1805. doi: 10.1534/genetics.107.081216. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Zhu1] 12.Zhu J, Zhang B, Smith EN, Drees B, Brem RB, et al. Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks. Nat Genet. 2008;40:854–861. doi: 10.1038/ng.167. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Yvert1] 13.Yvert G, Brem RB, Whittle J, Akey JM, Foss E, et al. Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat Genet. 2003;35:57–64. doi: 10.1038/ng1222. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-LeRouzic1] 14.Le Rouzic A, Carlborg O. Evolutionary potential of hidden genetic variation. Trends Ecol Evol. 2008;23:33–37. doi: 10.1016/j.tree.2007.09.014. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Gibson1] 15.Gibson G, Wagner G. Canalization in evolutionary genetics: a stabilizing theory? Bioessays. 2000;22:372–380. doi: 10.1002/(SICI)1521-1878(200004)22:4<372::AID-BIES7>3.0.CO;2-J. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Gibson2] 16.Gibson G, Dworkin I. Uncovering cryptic genetic variation. Nat Rev Genet. 2004;5:681–690. doi: 10.1038/nrg1426. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Carlborg1] 17.Carlborg O, Haley CS. Epistasis: too often neglected in complex trait studies? Nat Rev Genet. 2004;5:618–625. doi: 10.1038/nrg1407. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Queitsch1] 18.Queitsch C, Sangster TA, Lindquist S. Hsp90 as a capacitor of phenotypic variation. Nature. 2002;417:618–624. doi: 10.1038/nature749. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Rutherford1] 19.Rutherford SL, Lindquist S. Hsp90 as a capacitor for morphological evolution. Nature. 1998;396:336–342. doi: 10.1038/24550. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Bergman1] 20.Bergman A, Siegal ML. Evolutionary capacitance as a general feature of complex gene networks. Nature. 2003;424:549–552. doi: 10.1038/nature01765. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Iyengar1] 21.Iyengar SK, Elston RC. The genetic basis of complex traits: rare variants or “common gene, common disease”? Methods Mol Biol. 2007;376:71–84. doi: 10.1007/978-1-59745-389-9_6. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Bodmer1] 22.Bodmer W, Bonilla C. Common and rare variants in multifactorial susceptibility to common diseases. Nat Genet. 2008;40:695–701. doi: 10.1038/ng.f.136. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Brem1] 23.Brem RB, Yvert G, Clinton R, Kruglyak L. Genetic dissection of transcriptional regulation in budding yeast. Science. 2002;296:752–755. doi: 10.1126/science.1069516. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Kirst1] 24.Kirst M, Myburg AA, De Leon JP, Kirst ME, Scott J, et al. Coordinated genetic regulation of growth and lignin revealed by quantitative trait locus analysis of cDNA microarray data in an interspecific backcross of eucalyptus. Plant Physiol. 2004;135:2368–2378. doi: 10.1104/pp.103.037960. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Monks1] 25.Monks SA, Leonardson A, Zhu H, Cundiff P, Pietrusiak P, et al. Genetic inheritance of gene expression in human cell lines. Am J Hum Genet. 2004;75:1094–1105. doi: 10.1086/426461. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Morley1] 26.Morley M, Molony CM, Weber TM, Devlin JL, Ewens KG, et al. Genetic analysis of genome-wide variation in human gene expression. Nature. 2004;430:743–747. doi: 10.1038/nature02797. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Cheung1] 27.Cheung VG, Spielman RS, Ewens KG, Weber TM, Morley M, et al. Mapping determinants of human gene expression by regional and genome-wide association. Nature. 2005;437:1365–1369. doi: 10.1038/nature04244. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Stranger1] 28.Stranger BE, Forrest MS, Clark AG, Minichiello MJ, Deutsch S, et al. Genome-wide associations of gene expression variation in humans. PLoS Genet. 2005;1:e78. doi: 10.1371/journal.pgen.0010078. doi:10.1371/journal.pgen.0010078. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Chesler1] 29.Chesler EJ, Lu L, Shou S, Qu Y, Gu J, et al. Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function. Nat Genet. 2005;37:233–242. doi: 10.1038/ng1518. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Bystrykh1] 30.Bystrykh L, Weersing E, Dontje B, Sutton S, Pletcher MT, et al. Uncovering regulatory pathways that affect hematopoietic stem cell function using ‘genetical genomics’. Nat Genet. 2005;37:225–232. doi: 10.1038/ng1497. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Hubner1] 31.Hubner N, Wallace CA, Zimdahl H, Petretto E, Schulz H, et al. Integrated transcriptional profiling and linkage analysis for identification of genes underlying disease. Nat Genet. 2005;37:243–253. doi: 10.1038/ng1522. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-Mehrabian1] 32.Mehrabian M, Allayee H, Stockton J, Lum PY, Drake TA, et al. Integrating genotypic and expression data in a segregating mouse population to identify 5-lipoxygenase as a susceptibility gene for obesity and bone traits. Nat Genet. 2005;37:1224–1233. doi: 10.1038/ng1619. [DOI] [PubMed] [Google Scholar]

[pgen.1000232-DeCook1] 33.DeCook R, Lall S, Nettleton D, Howell SH. Genetic regulation of gene expression during shoot development in Arabidopsis. Genetics. 2006;172:1155–1164. doi: 10.1534/genetics.105.042275. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Lan1] 34.Lan H, Chen M, Flowers JB, Yandell BS, Stapleton DS, et al. Combined expression trait correlations and expression quantitative trait locus mapping. PLoS Genet. 2006;2:e6. doi: 10.1371/journal.pgen.0020006. doi:10.1371/journal.pgen.0020006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Wang2] 35.Wang S, Yehya N, Schadt EE, Wang H, Drake TA, et al. Genetic and genomic analysis of a fat mass trait with complex inheritance reveals marked sex specificity. PLoS Genet. 2006;2:e15. doi: 10.1371/journal.pgen.0020015. doi:10.1371/journal.pgen.0020015. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Li1] 36.Li Y, Alvarez OA, Gutteling EW, Tijsterman M, Fu J, et al. Mapping determinants of gene expression plasticity by genetical genomics in C. elegans. PLoS Genet. 2006;2:e222. doi: 10.1371/journal.pgen.0020222. doi:10.1371/journal.pgen.0020222. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-McClurg1] 37.McClurg P, Janes J, Wu C, Delano DL, Walker JR, et al. Genomewide association analysis in diverse inbred mice: power and population structure. Genetics. 2007;176:675–683. doi: 10.1534/genetics.106.066241. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Schadt2] 38.Schadt EE, Molony C, Chudin E, Hao K, Yang X, et al. Mapping the genetic architecture of gene expression in human liver. PLoS Biol. 2008;6:e107. doi: 10.1371/journal.pbio.0060107. doi:10.1371/journal.pbio.0060107. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1000232-Ghazalpour1] 39.Ghazalpour A, Doss S, Kang H, Farber C, Wen PZ, et al. High-resolution mapping of gene expression using association in an outbred mouse stock. PLoS Genet. 2008;4:e1000149. doi: 10.1371/journal.pgen.1000149. doi:10.1371/journal.pgen.1000149. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Genetical Genomics: Spotlight on QTL Hotspots

Rainer Breitling

Yang Li

Bruno M Tesson

Jingyuan Fu

Chunlei Wu

Tim Wiltshire

Alice Gerrits

Leonid V Bystrykh

Gerald de Haan

Andrew I Su

Ritsert C Jansen

Roles

Table 1. eQTL Hotspots Reported in Selected Genetical Genomics Studies.

Figure 1. Alternative Permutation Strategies for Determining the Significance of eQTL Hotspots in Linkage and Association Studies.

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Genetical Genomics: Spotlight on QTL Hotspots

Rainer Breitling

Yang Li

Bruno M Tesson

Jingyuan Fu

Chunlei Wu

Tim Wiltshire

Alice Gerrits

Leonid V Bystrykh

Gerald de Haan

Andrew I Su

Ritsert C Jansen

Roles

Table 1. eQTL Hotspots Reported in Selected Genetical Genomics Studies.

Figure 1. Alternative Permutation Strategies for Determining the Significance of eQTL Hotspots in Linkage and Association Studies.

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases