Skip to main content
Genetics logoLink to Genetics
. 2021 Nov 13;220(1):iyab189. doi: 10.1093/genetics/iyab189

Genome-wide association mapping of transcriptome variation in Mimulus guttatus indicates differing patterns of selection on cis- versus trans-acting mutations

Keely E Brown 1,✉,2, John K Kelly 1
Editor: S Wright
PMCID: PMC8733635  PMID: 34791192

Abstract

We measured the floral bud transcriptome of 151 fully sequenced lines of Mimulus guttatus from one natural population. Thousands of single nucleotide polymorphisms (SNPs) are implicated as transcription regulators, but there is a striking difference in the allele frequency spectrum of cis-acting and trans-acting mutations. Cis-SNPs have intermediate frequencies (consistent with balancing selection) while trans-SNPs exhibit a rare-alleles model (consistent with purifying selection). This pattern only becomes clear when transcript variation is normalized on a gene-to-gene basis. If a global normalization is applied, as is typically in RNAseq experiments, asymmetric transcript distributions combined with “rarity disequilibrium” produce a superabundance of false positives for trans-acting SNPs. To explore the cause of purifying selection on trans-acting mutations, we identified gene expression modules as sets of coexpressed genes. The extent to which trans-acting mutations influence modules is a strong predictor of allele frequency. Mutations altering expression of genes with high “connectedness” (those that are highly predictive of the representative module expression value) have the lowest allele frequency. The expression modules can also predict whole-plant traits such as flower size. We find that a substantial portion of the genetic (co)variance among traits can be described as an emergent property of genetic effects on expression modules.

Keywords: eGWAS, coexpression modules, selection, floral traits, transcriptome, regulatory, monkeyflower


Brown and Kelly perform a genome-wide association study on the transcriptome of yellow monkeyflower using previously sequenced inbred lines. They find a striking difference in the minor allele frequency spectrum of SNPs affecting gene expression in cis vs. in trans. The authors show that gene co-expression modules can predict fitness-related life-history phenotypes that experience fitness tradeoffs in field studies.

Introduction

Genetically controlled gene expression variation is prevalent within and between species, across different tissues, environments, and treatment contexts (Harding et al. 1989; Whitehead and Crawford 2006; McManus et al. 2010; Meiklejohn et al. 2014; Signor and Nuzhdin 2018). Changes in gene expression can facilitate divergence between species (Johnson and Porter 2000; Tulchinsky et al. 2014; Mack and Nachman 2017; McGirr and Martin 2020) and provide a mechanism for a population to rapidly adapt to a new environment (Morris et al. 2014; Ghalambor et al. 2015; Campbell-Staton et al. 2017; Margres et al. 2017; Mack et al. 2018; Hamann et al. 2021). Standing genetic variation and plasticity in gene expression can buffer a population against environmental fluctuations (Podrabsky and Somero 2004; Stern et al. 2007; Acar et al. 2008; López-Maury et al. 2008). While much has been learned about the regulation of particular genes, genome-wide patterns in the evolutionary dynamics of gene expression are just beginning to be explored (e.g., Josephs et al. 2020). It also remains unclear how gene expression, as a molecular phenotype, might mediate the genetic underpinnings of quantitative trait variation, and ultimately fitness.

Evolutionary dynamics of transcriptional effectors

Mutations can alter gene expression in many ways, and we expect selection to act differently on different types of variants (Lawrence et al. 2016; Bewick and Schmitz 2017; Duren et al. 2017). Gene expression can be affected by mutations acting either in cis or in trans. Cis-acting variants affect a closely linked gene directly, perhaps by altering sequences normally bound by transcription factors or other regulatory machinery. In contrast, trans-acting regulatory variants change the cellular environment in which transcription happens, say by altering diffusible products like the transcription factors themselves (Wittkopp et al. 2004; Emerson and Li 2010).

Natural selection could differ systematically between cis- and trans-acting variants for several reasons. First, the mutational target for trans-acting effectors of a gene could be a substantial fraction of the genome (Boyle et al. 2017) while a more limited set of sites are available for cis-acting mutations (Gruber et al. 2012; Metzger et al. 2016). Second, trans-acting variants have the potential to affect multiple genes and may thus have negative consequences on finely tuned pathways (Stern and Orgogozo 2008; but see Hoekstra and Coyne 2007). If trans-mutations have opposing pleiotropic effects on many genes (antagonism), they may still increase in frequency in a conditional manner (Hall et al. 2010; Anderson et al. 2011). Third, if a trans-acting variant with weakly deleterious effects on the expression of one or more target genes increases substantially in frequency (due to drift or selection), cis-variants specific to each affected gene might then act as a buffer, leading to positive directional selection on cis-compensatory mutations. This often occurs, for example, with pleiotropic mutations associated with antibiotic resistance (Maisnier-Patin and Andersson 2004; Brandis et al. 2012) and compensatory pairs of cis- and trans-effectors have been documented in several systems (Coolon et al. 2014; Wang et al. 2015; Fear et al. 2016; Mack et al. 2016; Verta et al. 2016; Metzger et al. 2017). These theories generally suggest that trans-acting variants should be under stronger negative selection than cis-acting variants based on the premise that gene expression should usually experience strong stabilizing selection (Denver et al. 2005; Rifkin et al. 2005; Whitehead and Crawford 2006; Hodgins-Davis et al. 2015). If this is correct, then any mutation with broad effects on expression, regardless of cis- or trans-effect, will more likely be deleterious, perhaps through cascading effects on connected pathways or networks (Fisher 1930).

Broad patterns of selection can be inferred from the allele frequency spectrum (AFS) of cis- and trans-acting variants. When compared with the neutral expectation, an excess of intermediate frequency variants suggests balancing selection while an excess of rare variants suggests purifying selection (Tajima 1989). Demographic events, such as population expansions or contractions, can perturb the AFS away from the neutral expectation (Hartl and Clark 1997). However, since demographic effects are genome-wide, we can make inferences about selection by comparing the AFS for a particular class of polymorphism (e.g., cis-effectors of gene expression) to that of the entire genome. Of course, this is just a first step; inferences about selection require corroboration from multiple lines of evidence (Beaumont and Balding 2004; Bigham et al. 2010). In this study, we find an AFS consistent with purifying selection for trans-acting expression variants and corroborate this pattern by showing that the skew toward extreme allele frequencies is greatest at loci with the broadest effects on expression. In contrast, cis-acting single nucleotide polymorphism (SNPs) exhibit an AFS suggesting balancing selection. The processes most likely to generate balancing selection on cis-SNPs depend on the specific ways that these mutations affect whole organism phenotypes, and also on the complicated and variable mapping from phenotype to fitness in nature.

Transcriptional mutations generating genetic (co)variation in traits

How important are transcriptional regulators in modifying fitness-related traits? Case studies of specific genes with known mutant phenotypes provide many examples where gene expression influences fitness relevant traits of plants (Streisfeld and Rausher 2009; Sobel and Streisfeld 2013; Ning et al. 2017; Kremling et al. 2018; Alonge et al. 2020). The quantitative importance of transcriptional mutations relative to those that effect enzymatic or structural protein function remains a point of contention (Hoekstra and Coyne 2007; Stern and Orgogozo 2008), but a steady increase of evidence from human eQTL/eGWAS research suggests a predominant role for gene expression variation in generating quantitative trait variation (Nicolae et al. 2010; Maurano et al. 2012; Torres et al. 2014; Farh et al. 2015; Boyle et al. 2017). As a first step to understanding the relationships between mutations affecting transcription, quantitative trait variation, and fitness, we here use observed gene expression variation to predict variation and covariation among a set of quantitative traits. These traits correlate with field fitness components in yellow monkeyflower (Mimulus guttatus) and were previously analyzed as part of a GWAS that predicted trait and fitness measures directly from SNPs (Troth et al. 2018).

In this study, we associate SNPs segregating within inbred lines derived from the Iron Mountain (IM) population with gene expression variation in flower buds. Allele frequencies in the inbred lines accurately represent those in the natural population (Troth et al. 2018). We first document strikingly different patterns of apparent selection from the AFS of cis- and trans-acting regulatory SNPs. We then show that modules of coexpressed genes predict the trait means of the inbred lines, despite that we measured gene expression and traits on different plants grown in different places. The stability of the relationship between transcriptome and trait is surprising, given the frequently cited “noisiness” of transcriptome data (Arias and Hayward 2006; Raj and van Oudenaarden 2008). Finally, we demonstrate that correlations, including tradeoffs between fitness-related traits can be predicted from gene expression variation.

Methods

Study system

We used randomly derived inbred lines of the yellow monkeyflower, M.guttatus (syn Erythranthe guttata, Phrymaceae) from the IM population in the Cascade Mountains of Oregon (Willis 1999; Kelly 2003; 44.402217N, 122.153317W). This population is predominantly outcrossing with little internal population structure (Willis 1993; Sweigart et al. 1999). Due to its annual/winter annual lifespan and short growing season, the IM population experiences a fitness tradeoff caused by variation in flower size and life-history phenotypes (Mojica et al. 2012). In 2018, Troth et al. sequenced whole genomes of 187 IM inbred lines and phenotyped them for 13 flower size and developmental timing traits known to influence fitness in the field.

RNAseq

We grew plants from 151 of the genome-sequenced inbred lines in the University of Kansas greenhouse under standard conditions (Monnahan and Kelly 2015) in three different cohorts. For each cohort, we grew more plants than needed for tissue collection and randomly selected plants for sampling soon after germination. We chose a recognizable and consistent stage at which to collect tissue, which we call the late floral bud stage. These are unopened flower buds (approximately 2–6 mm in length) on the first flowering node (so the corolla is presumably not fully expanded), but are advanced enough that buds on the second flowering node are visible. We chose bud tissue to enrich for transcripts related to flower size. When beginning the first cohort, it was unclear if this tissue type/amount would yield enough RNA for adequate sequencing. We thus pooled bud tissue from three plants of the same line in each tube prior to RNA extraction. Biological replicates were then multiple tubes of pooled tissue, all from the same line. Pooling was done randomly with regard to flowering time (i.e., if six plants per line were sequenced, three in each of two tubes, one tube was not all three earliest flowering plants). This process was not repeated in cohorts 2 and 3, for which one to three biological replicates (plants) of each line were collected and sequenced separately. We collected tissue into liquid nitrogen at the same time of day (with regard to both actual time and hours after greenhouse lights turn on) within a 2-h window that was consistent between cohorts.

We ground the collected tissue finely with a plastic micropestle and extracted RNA using the Qiagen RNeasy Plant Mini Kit (Hilden, Germany). We generated sequencing libraries using the QuantSeq 3’mRNA-Seq Library Prep Kit for Illumina (Lexogen, Vienna, Austria) per protocol, modified to perform half reactions, and we sequenced the libraries using NextSeq HO-SR75bp (Illumina, San Diego, CA, USA) at the University of Kansas Genome Sequencing Core. Each cohort was sequenced separately with a maximum of 96 samples per flow cell (a total of 4 flow cells and 281 individual samples).

To calculate read counts, we implemented the programs in Lexogen’s BlueBee pipeline. First, we trimmed reads with bbduk (k = 13, ktrim = r, useshortkmers = t, mink = 5, qtrim = r, trimq = 10, minlength = 20) from BBTools 38.86 (Bushnell 2014) and aligned reads to the M. guttatus V2.0 reference genome (Phytozome; Hellsten et al. 2013) with STAR 2.5.0a (Dobin et al. 2013) using Lexogen’s recommended parameters (outFilterMultimapNmax 20, alignSJoverhangMin 8, alignSJDBoverhangMin 1, outFilterMismatchNmax 999, outFilterMismatchNoverLmax 0.1, alignIntronMin 20, alignIntronMax 1000000, alignMatesGapMax 1000000). Finally, we counted transcript copies using htseq-count 0.11.2 and the genome annotation (Anders et al. 2015). The output is a table of read counts for each transcript. We then removed five samples that had fewer than 250k mapped reads (mean for remaining samples of 3,877,524 mapped reads) and normalized the counts for each sample (to account for variable library quality and sequencing depth) using the estimateSizeFactors function in DESeq2 1.28.1 (Love et al. 2014).

Predicting transcript levels from SNPs

Across all samples, 28,615 of 33,573 total annotated transcripts had at least one mapped read. We kept each gene isoform as a separate transcript. We first filtered out any transcripts which had mapped reads in fewer than 5% of samples, and which did not have 10 or more mapped reads in at least one sample. This left 20,463 transcripts for association mapping. The vast majority of genes (19,721 out of 20,463) had only one isoform with mapped reads. To account for the effect of cohort, we fit a linear model using lm in base R (R Core Team 2013) to each transcript with cohort as a categorical predictor and then subtracted the estimated effect from each read count. We then transformed each transcript’s expression in each sample by two methods: (1) log(expression + 1) and (2) Box–Cox transformation (Box and Cox 1964) using the boxcox() function in the R package EnvStats 2.3.1 with a range for λ between −5 and 5 (Millard 2014). Because some counts were negative after factoring out the effect of cohort, we shifted the distributions of all gene counts such that the minimum value was 0 for both types of transformation. Additionally, because Box–Cox transformation cannot accommodate zeros, we added a small value to each count that was equal to 10% of the minimum difference between any two samples (such that the difference between that value and zero was essentially undetectable in the original counts). Finally, we averaged every gene’s expression across plants with each inbred line.

We obtained a filtered set of polymorphisms of the sequenced lines by starting with sites called by Troth et al. (2018). We kept only biallelic SNPs with a minor allele frequency (MAF) above 2.5% that were called in at least half of the sequenced lines. We then pruned these sites for local LD using PLINK 1.90b3.38 (Purcell et al. 2007) with a window size of 50 SNPs, a step size of 10 SNPs, and an R2 threshold of 0.9. This left 2,952,894 SNPs for downstream analysis. We performed the GWAS using GEMMA 0.98.1 (Zhou and Stephens 2014) by first constructing a centered relatedness matrix using all filtered, but unpruned sites. Finally, we used the univariate linear mixed model (−lmm) in GEMMA, which in the case of no covariates takes the form: expression = SNP genotype effect + random effect of relatedness + error. As part of the model, GEMMA outputs the “chip heritability” for each gene; an estimate of the proportion of transcription variation that can be explained by all genetic causes. We used P-values taken from the likelihood ratio test to find associations between the levels of 20,463 transcripts and each of the 2,953,894 SNPs.

We classified the associations as cis-acting if the site was within 25 kb of any part of the transcribed gene and trans-acting otherwise. This distance-based approach for calling cis-effectors can be undermined by long-distance LD. A physically distant SNP (which we would classify as trans) might be associated with expression simply because it is in LD with a cis-acting SNP. For this reason, we excluded data from the meiotic drive locus on chromosome 11 (a known region of extended LD; Fishman and Willis 2005; Fishman and Saunders 2008; Fishman and Kelly 2015) from genome-wide summaries. Overall, the sequenced lines from IM show a rapid decay of LD as inter-SNP distances exceed 10 kb (Puzey et al. 2017) which makes our 25 kb cutoff conservative. Cis- and trans-acting mutations can be distinguished more directly using allele-specific expression data (Wittkopp et al. 2004; Springer and Stupar 2007; Tirosh et al. 2009; Shi et al. 2012; Osada et al. 2017; Signor and Nuzhdin 2018), but only in heterozygous individuals and here we are measuring expression in highly homozygous inbred lines.

To interpret our estimates for SNP effects on transcription, we permuted the vector of gene expression data (all genes) against line genotypes 100 times. For each replicate, we applied the GEMMA −lmm for all cis-tests (all cases where an SNP was within 25 kb of a gene). Across permutation replicates, we obtained ca. 1.2 billion tests to relate SNP allele frequency to significance levels under the null hypothesis of no SNP effect on expression (as in Josephs et al. 2015).

We next performed simulations allowing SNP effects on gene expression. As previously, we first permuted expression values against genotypes. This simulates the “environmental variance” in expression. We then added 2*β to all lines carrying the homozygous alternate (nonreference) genotype, where the genotypic effect (β) was determined separately for each gene. Given the large number of lines in our panel, the variance in expression attributable to SNP genotype, Vsnp, is

Vsnp=4 q1-qβ2, (1)

where q is the frequency of the reference base in the line panel. Based on results from our significant cis-effect tests (described below), we set Vsnp = Vresidual for the first set of simulations. In other words, SNP explains half the variance in expression. We calibrate the simulations with SNP effects in two different ways: (1) We consider the case where the proportion of variance due to SNP is held constant at 0.5.

This implies:

β=Vresidual4q(1-q) .  (2)

Since all variation is environmental after permutation, Vresidual is simply the variance of expression in the gene before adding effects to genotypes. (2) We set Vsnp for each gene assuming that q = 0.5. This implies β= Vresidual. For this case, Vsnp obtained after adding genotypic effects will vary with q (and be lower for SNPs with lower minor allele frequencies). These simulation schemes were considered by Tung et al. (2015) in analyzing expression data, although these authors simulated the residual variance from a normal distribution while we use permutation of the observed expression levels. For both schemes, we performed tests on a random selection of 20 of the cis-SNPs for each gene with a distinct permutation of expression values vs line for each test.

Predicting phenotypes from gene coexpression modules

To identify sets of coexpressed genes, we used WGCNA 1.69 in R (Langfelder and Horvath 2008) using the sample normalized and cohort factored, but untransformed, counts as input with a power of 3, max block size of 21,000, minimum module size of 30, dynamic tree cut method, correlation using dissimilarity, and merge cut height of 0.25. During coexpression analysis, one sample was removed as an outlier. WGCNA identified 37 modules of coexpressed transcripts (Supplementary Table S1 and Figure S1). Next, we extracted the line means for 13 traits from Troth et al. (2018) for the 151 lines used in this study (germination date, days to flower, corolla width, corolla length, floral tube length, throat width, stigma length, anther length, height at flowering, first flowering node, width of widest leaf, and the first two principal components (PCs) calculated from all floral dimensions). To look for associations between gene expression modules and measured traits, we Box–Cox transformed each module’s eigen expression value (which is the first PC of a PCA for expression of all member genes in a module), as well as every trait and fit a linear model (all in R, R Core Team 2013). We used the eigen gene expression for each module as a predictor in a simple regression, as well as fitting the multiple regression for each trait using all 37 modules simultaneously. We also used the program stepAIC from the R package MASS 7.3-52 (Venables and Ripley 2013) to choose a lowest AIC (Akaike Information Criterion) regression model including some but not all modules as predictors.

For each best-fit multiple regression model, we used permutation to test for significance. We treated the set of traits as one block and the set of modules as another block and permuted which line had which of each block. This preserved the correlations between modules and between traits, but changed which sets of trait values and module values went together. To elaborate, imagine a line has a set of trait values X and module values K, and another line has a set of trait values Y and module values L. Then, a permuted data set might combine traits X with modules L and traits Y with modules K. This is referred to as Permutation 1 in Results. We next used the coefficients from the best-fit model to predict trait variances and covariances. For each trait, we estimated the effect of each module included in the best fit model from a multiple linear regression and constructed an equation to predict trait value for each line:

 Yjz'=b+ imijxiz, (3)

where Yjz' is the predicted value of trait j for line z, mij is the estimated effect of module i on trait j, xiz is the eigen expression of line z for module i, and the sum is taken over all modules in the model. The covariance of predicted values for traits j and k is:

CovYj',Yk'= 1n-1zYjz'-µj*Ykz'-µk, (4)

where µj is the mean of trait j, µk is the mean of trait k, and the sum is taken over all n lines. Calculations were done using a custom python script (Supplementary File S1). We permuted traits against module values for testing. For each permuted set, we again found a best-fit model with a subset of gene expression modules and asked how much trait covariation we could predict (using the above method) to generate a distribution. We determined if the amount of trait covariation explained by the real gene expression data, as represented by modules, was significant using alpha levels calculated from the permuted distribution.

For each trait, we randomly sorted genes into modules with the same number of genes, calculated the eigen gene expression value (PC1) for each module for each line, Box–Cox transformed the module eigenvalues, and then included them in multiple regression. In each case, we fit two models, one with all permuted modules and one with the stepAIC chosen set. We permuted the module composition 1000 times to generate distributions of R2 for each trait. Reported P-values for Permutation 2 in Results are calculated from those distributions.

Results

Genetic effects on transcription levels of individual genes

The number of detected associations between genotype and expression, as well as the putative type of regulatory association (cis vstrans), are contingent on how we transform transcript counts. Before testing for genetic effects on gene expression, we transformed expression read count data using two methods, log(count + 1) and Box–Cox with λ estimated for each gene separately. Using a rounded P-value cutoff of 1e−12 (Bonferroni = 8.27e−13), we identified 106,585 significant SNP/transcript associations using log(counts + 1) transformed counts (10,087 cis associations and 96,498 trans, Figure 1A). Using Box–Cox transformed counts, over 90% of the significant trans-effects evaporate and we find only 8088 cis and 7685 trans associations (Figure 1B).

Figure 1.

Figure 1

Genome-wide associations of gene expression. Above: transcript levels were normalized using (A) log(count + 1) or (B) Box–Cox. Associations are designated as cis (pink) or trans (blue). Chromosomes are numbered along both axes. Points are larger in (B) to aid visualization. The putative misassembly is indicated with a red asterisk. The three genes with the most trans-associations are indicated with blue asterisks. Distributions of read counts for a representative gene, Migut.D00004, which had 3665 SNP associations with Log(counts + 1) transformation (C), but none with Box–Cox transformation (D).

A careful inspection of the differences between the two methods indicates that the Box–Cox results are more reliable. With log(counts + 1) transformation, individual genes often have skewed distributions with a small number of lines exhibiting atypically high or low expression (Figure 1C;  Supplementary Figure S2). The few outlier lines with extreme expression will harbor the same minor (and in most cases very rare) allele at many loci. In fact, the majority of the 96,498 trans-regulatory associations (Figure 1A) involve SNPs with an MAF between 2.5% and 5% (Figure 2A). When unlinked but rare alleles occur together in the same lines, LD is high owing to “rarity disequilibrium” (Lappalainen et al. 2013; Houle and Márquez 2015). If those same lines have extreme expression, all of the linked SNPs will show a strong association with expression.

Figure 2.

Figure 2

(A) Minor allele frequency distribution for all associations excepting Chromosome 11. (B) The subset of associations in the top quartile of effect sizes for each regulatory category. Cis-acting variants in pink and trans-acting variants in blue. Lowest MAF bin is 0.025 < x < 0.035.

The Box–Cox transformation provides a scale adjustment specific to each gene. In transcripts with the largest number of genetic associations in the log(counts + 1) analysis, Box–Cox more completely “normalizes” expression reducing the effect of outliers (Figure 1, C and D; Supplementary Figure S2). Estimates from the Box–Cox are less affected by the pull of extreme values and we will subsequently limit attention to these tests. The mean estimated heritability of gene expression was 0.31 (see Supplementary Figure S3 for the full distribution). For genome-wide analyses, we removed SNPs on chromosome 11 because the large block of apparent trans-effects on chromosome 11 are within the meiotic drive locus (Fishman and Willis 2005; Fishman and Saunders 2008; Fishman and Kelly 2015). The Drive allele is essentially a single DNA sequence over >5 Mb of DNA segregating in the inbred lines at ∼30%. As a consequence, it is impossible to distinguish trans-acting SNPs from those that are simply in linkage disequilibrium with cis-acting SNPs. This leaves 7832 cis and 4626 trans associations.

SNPs with significant cis-effects explain between 26% and 74% of the total variance in expression of a gene (Supplementary Table S2). Significant trans-effectors have a larger average effect size than cis-SNPs (cis = 0.54, trans = 1.16, F-value = 9.07, P-value = 2.6e−3), and effect size is negatively correlated with MAF (both log-transformed; estimated effect of cis-SNP effect size on MAF = −0.029, t = −8.347, P < 2e−16; estimated effect of trans-SNP effect size on MAF = −0.025, t = −4.351, P = 1.39e−05; Supplementary Figure S4). Since we find trans-acting SNPs have a distribution skewed toward low frequency, it follows that such mutations would also have larger effect sizes, as has been reported in other systems (Josephs et al. 2020).

Across the genome, the distribution of MAF differs greatly between cis- and trans-acting mutations (Figure 2A). We find many rare alleles responsible for trans-regulatory effects on gene expression, and an increasing number of cis effects at higher MAF. We find a mean MAF for cis and trans associations of 0.342 and 0.215, respectively (F-value = 2197, P-value < 2.2e−16). The MAF distributions for cis- and trans-acting sites are both different from the MAF distribution of the entire genome (Supplementary Figure S5, two-sample Kolmogorov–Smirnov tests: cis-to-all comparison: D = 0.51866, P < 2.2e−16, trans-to-all comparison: D = 0.1661, P < 2.2e−16), and are different from each other (D = 0.43651, P < 2.2e−16). The difference in MAF could be due to differential power to detect cis- vstrans-acting loci with different effect sizes, since we found larger effect sizes for trans effectors. To test whether differences in effect size were driving the MAF pattern, we took only the top quartile of effect sizes (after normalizing by mean expression level) in each regulatory class. The pattern remains the same (Figure 2B)—an excess of common cis-acting alleles and an excess of rare trans-acting alleles. Finally, we established that the pattern is insensitive to our distance cutoff (25 kb) for trans-effectors. If we limit trans to SNPs that affect expression on different chromosomes (Supplementary Figure S6), the cis/trans difference remains.

Permutation tests indicated that our significance threshold for cis-tests is quite stringent. Across 100 whole genome permutations, only three of out of 1.2 billion SNP tests passed our P < 10−12 threshold (Supplementary Table S3A). Following Josephs et al. (2015), we considered significance levels across allele frequency categories of SNPs to establish a null distribution for the AFS of tests. For a given MAF, the fraction of tests that yield P-values less than a specified threshold (say 10−5) are reported in Supplementary Table S3A. Because permutation reiterates the null hypothesis, we expect the fraction to equal the threshold, e.g., about one test in a million would have P < 10−6. In fact, we find that tests on intermediate allele frequency SNPs (minor allele >20%) tend to be conservative (low P-values under-represented) while rare-allele SNPs (minor allele <10%) tend to be anticonservative. These results imply a pull toward more minor frequencies in the null distribution for AFS. However, it is noteworthy that an extremely small number of tests approach our actual threshold.

The simulations allowing SNP effects on expression routinely yield significant results. The fraction of tests passing various thresholds for our two simulation schemes (constant Vsnp and constant β) are reported in Supplementary Table S3, B and C. With Vsnp held constant (effect size varies with allele frequency), about 90% of tests pass our P < 10−12 threshold regardless of allele frequency. Essentially all tests pass for lower thresholds. With fixed β (where the proportion of variance explained by an SNP varies with AF), there is lower power for rare alleles than intermediate frequency SNPs, which is expected given that rare alleles generate less variation. Most relevant to the results, we consider the relative proportion of significant tests that fall into each allele frequency class for each significance level, and how this compares to the observed AFS of significant tests. This is depicted for three thresholds in Figure 3. The AFS of significant tests with Vsnp held constant is unaffected by threshold and matches the AFS of all tested SNPs. With Vsnp constant, allele frequency has no effect on ascertainment. With fixed β, a smaller fraction of tests is significant for rare alleles (contrast orange to gray bars in Figure 3). However, this skew toward intermediacy is not sufficient to explain the data—the AFS of real tests is substantially more intermediate than predicted by ascertainment with fixed β (contrast orange to blue bars) at all significance thresholds.

Figure 3.

Figure 3

The proportion of significant cis-tests from the real data (blue), simulations with constant β (orange), and constant Vsnp (gray) are reported for 10 AFS categories (minor allele q = 0.0–0.05, 0.05–0.1, etc.). The panels indicate the proportions obtained by imposing different significance thresholds to call significance.

We find no evidence for “trans-eQTL hotspots,” single SNPs affecting many genes, similar to Populus tremula winter buds (Mähler et al. 2017). In fact, there are more genes with transcript levels that are affected by many trans-SNPs than SNPs with more than two trans-associations (Supplementary Figure S7). The three genes with the most trans-acting SNPs are Migut.D00926 (160 SNPs) annotated as a jasmonate ZIM domain-containing protein (JAZ); Migut.M00568 (114 SNPs) annotated as a chlorophyll A/B binding protein; and Migut.N01403 (105 SNPs) annotated as an auxin-responsive F-box transport inhibitor response protein. For genes M00568 and N01403, the trans-associations are concentrated on the same chromosome as the gene (Figure 1B). There is an apparent association between the Chr11 Drive Locus and one gene on chromosome 8 (Figure 1A). This gene (Migut.H01175) has three putative homologs in the Mimulus genome, only one of which was expressed in our samples (Migut.K01148). We found that all of our samples had high expression for only one of the two genes (Supplementary Figure S8), and low to no expression for the other, which could indicate misassembly. Indeed, when we mapped reads from two samples with expression of either gene to the newer reference genome build (M.guttatus TOL v5.0, DOE-JGI, http://phytozome.jgi.doe.gov/) they all mapped to the same region on chromosome 11, which supports that Migut.H01175 is a misassembled isoform of Migut.K01148. Finally, the number of cis-associations for a gene is positively correlated with gene size [effect estimate for log(number of associations + 1) ∼ log(gene length) = 0.1797, t-value = 6.04, P = 1.87e−9, Supplementary Figure S9].

Rare-allele load refers to the proportion of segregating loci at which an individual carries the minor allele, if the population frequency of that allele is very low. It is similar to the concept of deleterious mutation load, but assumes nothing about the fitness effect of individual rare alleles. Instead, it is usually used to test whether or not there is a cumulative fitness effect of harboring many rare variants. This load predicts dysregulation of gene expression in maize (Kremling et al. 2018) and the severity of inbreeding depression in M. guttatus (Brown and Kelly 2020). We tested whether lines with an excess of rare alleles exhibit differing patterns of expression, but found no correlation between load and the number of genes showing extreme expression (plus/minus two standard deviations from the mean; Supplementary Figure S10). We also find no clustering of lines by rare-allele load in gene expression PC space (Supplementary Figure S11). The many associations between gene expression and rare variants suggested by Figure 1A (and by the MAF of associations removed by Box–Cox transformation) is thus likely not a real cumulative effect of many rare alleles generating extreme gene expression genome-wide.

Construction of gene coexpression networks

We next sought to establish sets of genes that covary in expression across inbred lines. Using the cohort and individual normalized gene expression counts, WGCNA identified 37 modules of coexpressed transcripts. Each module includes between 37 and 5767 genes (mean 553, median 231) and each transcript (gene) belongs to only one module (Supplementary Table S1 and Figure S1). WGCNA groups genes with correlated expression and further collapses groups such that gene expression between modules should not be highly correlated (R2 > 0.8). However, eigengene expression (PC1 for the PCA of all genes in a module) of a few pairs of modules remain moderately correlated (20 of 666 pairwise comparisons with 0.74 > R2 > 0.5) (Supplementary Figure S12).

To determine if the apparent purifying selection on trans-effecting sites (Figure 2A) is due to their impact on regulatory networks, we calculated the “connectedness” of each gene by correlating the gene’s expression with the eigengene expression value of its module. This measures how predictive a gene’s expression is of the expression of all genes in the module. Note that we are not calculating the number of edges a gene has in a regulatory or interaction network, which is sometimes called connectivity. Hence, the use of a nonstandard term. The distribution of correlation coefficients (R2 with module PC1) is highly right-skewed (Figure 4A). For this reason, we grouped genes by “connectedness” quartile and then calculated the average MAF of sites affecting each gene either in cis or in trans. We find a consistent difference in MAF of cis- and trans-effectors, with trans having lower MAF, especially in the highest quartile for “connectedness,” for which MAF is significantly lower than in all other categories (effect estimate for quartile 4 on trans-MAF = −0.143, P = 1.53e−14, effect estimate for quartile 4 on cis-MAF = −0.022, P = 0.00872) (Figure 4B). We did not find enrichment for any GO terms in the set of genes in connectedness quartile 4, using the closest Arabidopsis thaliana putative homologs.

Figure 4.

Figure 4

Purifying selection on trans-effectors of highly connected genes. (A) The distribution of connectedness (as measured by R2 between a gene and its module expression) for genes with associated cis (pink) and trans (blue) acting variants. (B) The average minor allele frequency of sites affecting each gene in a given connectedness quartile, separated by cis- and trans-acting variants. Each data point in (B) is a gene, which is assigned a quartile and the MAF of sites affecting it is calculated and plotted on the Y-axis.

Predicting phenotypes from expression

We next tested whether floral bud gene expression affects quantitative traits (line means from Troth et al. 2018). We use modules of coexpressed genes as predictors of phenotype because this provides a tractable way to incorporate the whole transcriptome. We used multiple linear regression including all 37 modules as predictors of trait, and then chose the AIC-best model for each trait. This selected model included from 6 (widest leaf) to 20 (flower size PC1) of the 37 total expression modules (Table 1). Parameter estimates for the best-fit models, including effect sizes for each included module, are reported in Supplementary Table S4. The best-fitting model for each trait explained from 23% to 47% of trait variation, with the strongest prediction being for overall flower size (PC1 in Table 1). To establish statistical significance for prediction of trait variation, we permuted the data in two ways.

Table 1.

 Variation in traits explained by the AIC best-fit model of gene expression modules

Trait R 2 Modules H0: Gene expression does not predict trait variation H0: Modules do not predict traits better than random groups of genes
Days to germination 0.289 17 0.094 0.203
Days to flower 0.246 12 0.253 0.516
Corolla width 0.392 19 0.005** 0.040*
Corolla length 0.435 17 0.000*** 0.015*
Tube length 0.389 19 0.002** 0.078
Throat width 0.309 11 0.044* 0.190
Stigma length 0.382 18 0.005** 0.071
Anther length 0.400 19 0.001** 0.032*
Height 0.333 11 0.014* 0.341
Node 0.314 8 0.036* 0.639
Widest leaf 0.231 6 0.294 0.891
Flower size PC1 0.472 20 0.000*** 0.006**
Flower size PC2 0.270 9 0.149 0.501

Significance was established by permuting either the module eigen values across lines or the module gene composition. Bold indicates significance with *P < 0.05, **P < 0.01, ***P < 0.001.

Permutation 1

Does gene expression predict trait variation? To test the hypothesis that a model using gene expression explains no more trait variation than by chance, we permuted modules by line. Correlations among traits and among modules were preserved (see Methods), but randomly associated with each other across lines. This tests whether the transcriptome (as collapsed into coexpression modules) is a significant predictor of traits in a linear model. Using this method, trait variation predicted by the real gene expression modules is highly significant (P < 0.01) for seven of eight flower-size measurements (except flower size PC2), and marginally significant (0.01 < P < 0.05) for height and node (Table 1). These nine traits are significantly correlated with each other, except for throat width with node (Figure 5; Supplementary Table S5).

Figure 5.

Figure 5

Gene expression predicts trait covariances. The bottom triangle shows trait correlations. A line denotes a significant correlation at P < 0.05. The diagonal displays normalized trait histograms. Fraction covariance explained by gene expression, using the best-fit model coefficients for prediction, is shown in the top diagonal. Any displayed number is significant, asterisks denote levels of significance (determined by permutation, P < 0.05, *P < 0.025, **P < 0.01, ***P < 0.001).

Permutation 2

Do gene coexpression modules better predict traits than random groups of genes? The significant prediction of traits by modules does not imply that modules are necessarily the best summary of gene expression for trait prediction. In order to test the hypothesis that the predicted trait variation is just a function of including the whole transcriptome (by creating groups of genes as predictors), we permuted module membership by shuffling genes into random groups of the same size as the real modules. These groups contain the same amount of information in terms of fraction of transcriptome included, but eliminate clustering of genes based on coexpression that defines the real modules. Essentially, we are asking if coexpression networks are a better way of decreasing parameter space than grouping genes randomly when the goal is to predicts trait values. By permuting gene module membership, we find that only four traits (corolla width and length, anther length, and flower size PC1) are significantly better predicted by coexpression modules than by random assortment of genes (Table 1). For all other traits, the amount of trait variation explained is attributable to the inclusion of the whole transcriptome, not variation in coexpressed groups of genes. However, while most traits are not significantly better predicted by real modules than scrambled sets of genes, real modules better predict traits than the average permuted data set for all but four traits (days to flower, node, widest leaf, and flower size PC2). Quantiles for the distribution of permuted R2 for both permutations are presented in Supplementary Table S6.

Overlapping sets of gene expression modules are included in the best-fit model for the four traits where expression modules are significant by both permutation tests (corolla width and length, anther length, and flower size PC1). All four are significantly predicted (in their own best-fit regression models) by 14 common modules. These traits are all positively correlated (Figure 5). The 14 modules are not correlated (Supplementary Figure S12), but they affect all four traits in the same direction. As a consequence, trait correlations can emerge from the joint effects of uncorrelated modules.

Prior studies indicate that tradeoffs between fitness components (and associated traits) are central to the maintenance of variation in this population (Mojica and Kelly 2010; Scoville et al. 2011; Mojica et al. 2012; Monnahan and Kelly 2015, 2017; Brown and Kelly 2018). For this reason, we estimated the extent to which gene expression modules generate trait covariances. Among pairwise comparisons between the nine traits that are significantly predicted by gene expression (Table 1, column 3, and see Permutation 1 above), 35 of 36 pairs are significantly correlated (R2 between 0.07 and 0.91, P < 0.05 for all but node by throat width). We used estimates for the effect of each gene expression module on each trait from the best-fit multiple linear model (Table 1, column 1) to predict trait covariances using Equations (1) and (2). If a module affects two traits, some fraction of the covariance between the traits can be attributed to the shared effect of that module. We find that 26–54% of the covariance between traits is attributable to this module-predicted covariance (35 pairwise comparisons). Thirty-three of 36 covariances are significantly predicted by gene expression modules (same permuted datasets as above, see Equations (1) and (2) in Methods, P < 0.05, Figure 5, upper triangle). Modules are most strongly predictive of trait covariances among the four traits that are better predicted by modules than by the randomly grouped whole transcriptome (corolla width, corolla length, stigma length, anther length, and flower size PC1; see Table 1, column 4).

A large fraction of module variation is genetic. For each individual, we calculated the eigen expression (PC1) for each module and tested for an effect of inbred line using an ANOVA. Line explains 57–91% of variance in gene module expression (Supplementary Table S7). Of the 37 modules, 29 are significantly affected by line (P < 0.05). There is no correlation between the estimated genetic control of a module and the number of traits for which a module is a significant predictor. However, all modules that significantly predict at least half of our measured traits (save one, “brown”) are significantly affected by genotype (P from 0.046 to 8.18e−15, F from 1.34 to 4.04). That is to say, modules that significantly predict many traits exhibit genetic variation among lines.

Discussion

Natural selection on regulatory variants

Using a collection of sequenced inbred lines derived from a single natural population of yellow monkeyflower (M.guttatus), we have dissected the genetic variation in the floral bud transcriptome. We found 12,458 SNPs with genome-wide significant associations with expression, 62% of which act in cis. Striking differences in the AFS suggest differing selection regimes on cis- and trans-acting regulatory SNPs. Sites proximal to the affected gene are enriched for intermediate frequency variants. SNPs distant from target genes are enriched for rare variants. Hodgins-Davis et al. (2015) argue that gene expression should evolve according to a “house of cards” model, characterized by few mutations with large effects and moderate stabilizing selection (as opposed to a Gaussian model of evolution with many mutations of small effect and weak selection). Stabilizing selection on a quantitative trait with a fixed optimum predicts that minor alleles should be less common than under neutral evolution. Trans-acting mutations are more likely to be deleterious than cis-acting mutations if they have more pronounced effects (see Introduction). The results of this study suggest that different selective pressures operate on cis and trans variation, consistent with previous work in a natural population of Capsella grandiflora (Josephs et al. 2020). The distribution of MAF for trans-effecting SNPs in Capsella was similar to the Mimulus estimate (Figure 2). However, Capsella exhibits a nearly uniform distribution of MAF for cis-SNPs, while there is a definite inflation of intermediate frequency SNPs in Mimulus. The skew of cis-acting SNPs toward intermediate frequency, relative not only to trans-acting but also the genome as a whole, suggests balancing selection.

Ascertainment is a central concern for inference in QTL and association mapping studies (Beavis 1994). For loci with no effect on expression, our permutation study indicates that rare-allele SNPs are more likely to yield very low P-values than intermediate frequency SNPs. Thus, false positives are more likely to come from rare alleles than common, although permutation almost never produced P-values that pass the thresholds imposed on the real data (Supplementary Table S3A). Considering SNPs with effects on expression, ascertainment depends on how we measure the “importance” of an SNP. In simulated data where SNPs explain the same amount of expression variation as observed in the real data (constant Vsnp), we find no effect of allele frequency on the probability that an SNP is detected (Figure 3, gray bars). In contrast, if we hold the effect of alleles constant (constant β), then Vsnp is lower with extreme than intermediate allele frequencies. For fixed β, the simulations indicate that an SNP with equally frequent alleles (q = 0.5) is very likely to prove significant (90% of cases) while the detection probability falls to below 25% if q0.15 (Supplementary Table S3C). However, this sieve does not explain the intermediacy of q for significant cis-SNPs in the data. First, the number of significant tests in the highest MAF categories significantly exceed the predicted number under the constant β simulations (Figure 3C). Second, the constant β simulations predict that tests on SNPs with q ≪ 0.5 will “fill in” the lower portion of the distribution when the significance threshold is reduced (orange bars in Figure 3, A and B). In other words, SNPs that do not pass the stringent 10−12 threshold simply because the minor allele is present in fewer lines should still routinely yield P < 10−9 or P < 10−6. The real data provide no indication of these “almost significant” SNPs in the range if 0.05  q0.25. The distribution is skewed intermediate across significance thresholds.

Figures 2 and 4 support the hypothesis that trans-effectors are routinely subject to purifying selection, at least for mutations with large enough effects to be detected in this study. Loci influencing expression in trans can affect multiple components of finely tuned networks simultaneously. Here, we show that the MAF of SNPs affecting a gene’s expression is correlated with how well that gene predicts the expression of many other genes (those in the same coexpression module), what we call “connectedness.” Genes that are well-connected in this sense are likely to be the hub of a regulatory network, a role commonly filled by transcription factors (Babu et al. 2004), although we do not detect an enrichment for any particular type of gene in this set. We find that SNPs affecting well-connected genes tend to be lower in frequency and that the magnitude of decrease in MAF is stronger for trans-acting SNPs than cis-acting SNPs (Figure 4). This difference supports the idea that trans-effectors with broad pleiotropic effects on many genes are more likely to affect regulatory hubs and therefore be routinely subjected to purifying selection. Previous studies suggest that genes with high network connectivity are constrained by selection (Hahn and Kern 2005; Ramsay et al. 2009; Josephs et al. 2017), which could explain why their expression would also be stabilized.

Connectedness of genes affected by trans-SNPs might explain the pattern of purifying selection, but it does not explain why cis-acting variants exhibit an MAF distribution suggestive of balancing selection. Cis-acting variants did have smaller effect sizes, which would explain a difference in severity of purifying selection, but not that allele frequencies at cis-SNPs are more intermediate than the genome-wide average. One potential explanation is that cis-acting variants may evolve on a gene-by-gene basis to counter the pleiotropic effects that trans-acting loci have on many genes. The hypothesis that cis-acting variants might evolve to mitigate trans-pleiotropy is supported by many studies finding opposing cis- and trans effects on the same gene (Coolon et al. 2014; Wang et al. 2015; Mack et al. 2016; Metzger et al. 2017). In this study, we find no such preponderance of compensatory cis/trans pairs. Using a conservative set of 24 genes with both cis-SNPs and inter-chromosome trans-SNPs, we find only one example of cis/trans compensation.

Genetic effect on traits mediated through gene expression

Understanding selection requires that we look at how genetic effects on gene expression translate to effects on whole-organism phenotypes and, ultimately, to fitness in the natural environment. Table 1 shows that floral and plant height measures can be significantly predicted by the flower bud transcriptome when abstracted into coexpression modules. The most precise prediction is for overall flower size (flower size PC1) and for the component measurements that jointly determine flower size (corolla width and length, stigma and anther lengths). The strength of prediction (nearly 50% of variation explained) is notable given that flower traits are likely established early in development (Krizek and Anderson 2013). Accurate prediction of flower size from the RNAseq data does not imply that the bud mRNA from the exact time of sampling were causal to trait variation or covariation. Measured transcript levels might simply be strongly correlated through development time, which might suggest that trait variation is continually reinforced through development.

Prediction precision was likely reduced by the fact that modules were estimated from RNAseq performed on one set of plants, while the mean phenotypes were estimated from different plants of the same inbred lines. Plants from the two experiments almost certainly experienced subtle environmental differences (different greenhouses, growth at different times of the year, different years). The high R2 for flower size despite these limitations suggests that stable relationships between genotypes and traits are mediated through transcriptome variation. Additionally, the separation of experiments avoids a subtle but potentially important bias. When phenotypes and gene expression levels are measured on the same plants, the two can become associated owing to confounding factors, even if there is no effect of expression on phenotype. Imagine that plants differ randomly in receipt of a resource such as soil nitrogen. If nitrogen affects both gene expression and phenotype, expression and phenotype will be correlated even if there is no inherent relationship. Establishing the mean phenotype of each line prior to measuring expression eliminates this bias (Rausher 1992).

Gene expression modules predict not only trait variation but also the covariances between traits (Figure 5). Trait correlations emerge when the same module influences multiple traits [Equations (3) and (4)]. We find that module predictions can explain up to 54% of the observed covariance between traits (throat width and height). We further show that a substantial fraction of the variation in expression modules has a genetic basis (Supplementary Table S7), which suggests variation in gene expression as a potential cause of genetic correlations between whole-plant traits. Understanding trait covariances is essential when natural selection involves tradeoffs between traits. Such tradeoffs can provide the mechanistic basis of balancing selection (Mérot et al. 2020), which is suggested in our data by the intermediacy of the AFS for cis-acting variants.

In many annual plants, suites of correlated life-history traits related to rate of development (progression to flowering) are subject to a tradeoff between flowering time and fecundity. In Mimulus specifically, variation in life-history traits is maintained by opposing selective pressures on survival to flower and seed set (Kelly 2008; Mojica and Kelly 2010; Mojica et al. 2012; Monnahan and Kelly 2015; Troth et al. 2018; Monnahan et al. 2021). As with many other species, small, fast-growing plants survive to flower but make fewer seeds. Large, slow-growing plants have the capacity to make more seeds and perhaps disperse more pollen, but risk not reaching maturity before the end of the growing season. This type of tradeoff can maintain polymorphism through balancing selection on loci affecting the underlying traits such as days to flower or flower size (Austen et al. 2017; Brown and Kelly 2018; Exposito-Alonso et al. 2018). Our bud transcriptome modules predict floral dimensions, but not development rate under greenhouse conditions (days to germination or days to flower; Table 1). However, we suggest that future studies measuring gene modules from a range of tissues at different time points, coupled with the statistical methods that we employ here [Equations (3) and (4)], might determine whether the survival/fecundity tradeoff in Mimulus contributes to the intermediate allele frequency pattern evident for cis-acting transcriptional mutations (Figure 2).

Scale of measurement for gene expression

Figure 1 contrasts two different ways to normalize read counts, Box–Cox and log(count + 1). The latter is most similar to models typically applied in RNAseq studies, such as generalized linear models that use the log-link function (e.g., DESeq2; Love et al. 2014). When expression is normalized in the same way across all genes [such as with the log(count + 1) method], rare alleles occurring in lines with extreme expression produce many false positives as a result of “rarity disequilibrium” (Lappalainen et al. 2013; Houle and Márquez 2015). When counts are instead power transformed using an exponent (λ) estimated for each gene separately (Box–Cox), samples with extreme expression are pulled closer to the mean of the resulting distribution (compare Figure 1, C and D). This decreases the occurrence of spurious associations due to rare alleles. We retained the log(count + 1) analysis in Figure 1 as a caution for future studies. This issue is likely to emerge in any situation where the absolute count of individuals carrying the rare allele is small (say <5).

Conclusion

The two major findings from this study are connected through our summarization of the transcriptome in terms of gene expression modules. The first result is that cis-acting SNPs tend to have intermediate allele frequencies (relative to the genome as a whole), while trans-SNPs exhibit a rare-alleles model consistent with purifying selection. Trans-acting mutations are most rare if they have broad effects, with the latter measured by how strongly a trans-affected gene predicts the overall expression of its module. The second result is that expression modules predict flower size with a surprising degree of precision. As a consequence, we can attribute substantial fractions of the variance in flower size measures to variation in expression modules (R2 values in Table 1). Despite that expression levels of different modules are largely uncorrelated (across lines), they can generate covariances among traits because individual modules influence multiple traits. This “transcriptome-explained” covariance can be a substantial portion of the total covariance across lines (up to 54%, Figure 5). Our results do not provide a clear explanation for why cis-acting SNPs exhibit allele frequencies consistent with balancing selection, but the prediction of trait covariances suggests how future studies that may address this question. Specifically, experiments that determine the nature and extent of transcriptional control of development rate could provide a more mechanistic understanding of balancing selection.

Data availability

Gene expression data have been submitted to NCBI’s SRA (project number PRJNA736440). The Python scripts used to generate trait covariances as well as those used for permutation and simulations are available as Supplementary File S1.

Supplementary material is available at GENETICS online.

Supplementary Material

iyab189_Supplementary_Data

Acknowledgments

We thank S. Wright, S. Macdonald, P. Veltsos, L. Hileman, E. Everman, R. Unckless, C. Wessinger, C. Fiscus, and two anonymous reviewers for comment on the manuscript, and J. Willis, B. Sikes, the Koenig and Seymour labs at UCR for comments on early stages of analysis.

Funding

J.K.K. acknowledges support from National Science Foundation grant numbers DEB-1753630 and MCB-1940785. K.E.B. was supported by National Science Foundation PRFB IOS-1907061 and a KU Genome Sequencing Core Award Voucher. This work was supported by the Center for Research Computing at the University of Kansas and the KU Genome Sequencing Core.

Conflicts of interest

The authors declare that there is no conflict of interest.

Literature cited

  1. Acar M, Mettetal JT, van Oudenaarden A.  2008. Stochastic switching as a survival strategy in fluctuating environments. Nat Genet. 40:471–475. [DOI] [PubMed] [Google Scholar]
  2. Alonge M, Wang X, Benoit M, Soyk S, Pereira L, et al.  2020. Major impacts of widespread structural variation on gene expression and crop improvement in tomato. Cell. 182:145–161.e23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Anderson JT, Willis JH, Mitchell-Olds T.  2011. Evolutionary genetics of plant adaptation. Trends Genet. 27:258–266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Anders S, Pyl PT, Huber W.  2015. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics. 31:166–169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Arias AM, Hayward P.  2006. Filtering transcriptional noise during development: concepts and mechanisms. Nat Rev Genet. 7:34–44. [DOI] [PubMed] [Google Scholar]
  6. Austen EJ, Rowe L, Stinchcombe JR, Forrest JRK.  2017. Explaining the apparent paradox of persistent selection for early flowering. New Phytol. 215:929–934. [DOI] [PubMed] [Google Scholar]
  7. Babu MM, Luscombe NM, Aravind L, Gerstein M, Teichmann SA.  2004. Structure and evolution of transcriptional regulatory networks. Curr Opin Struct Biol. 14:283–291. [DOI] [PubMed] [Google Scholar]
  8. Beaumont MA, Balding DJ.  2004. Identifying adaptive genetic divergence among populations from genome scans. Mol Ecol. 13:969–980. [DOI] [PubMed] [Google Scholar]
  9. Beavis WD.  1994. The power and deceit of QTL experiments: lessons from comparative QTL studies. In: Proceedings of the Forty-Ninth Annual Corn and Sorghum Industry Research Conference, Washington, D.C, Vol. 250. p. 266. [Google Scholar]
  10. Bewick AJ, Schmitz RJ.  2017. Gene body DNA methylation in plants. Curr Opin Plant Biol. 36:103–110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bigham A, Bauchet M, Pinto D, Mao X, Akey JM, et al.  2010. Identifying signatures of natural selection in Tibetan and Andean populations using dense genome scan data. PLoS Genet. 6:e1001116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Box GEP, Cox DR.  1964. An analysis of transformations. J Roy Stat Soc. 26:211–243. [Google Scholar]
  13. Boyle EA, Li YI, Pritchard JK.  2017. An expanded view of complex traits: from polygenic to omnigenic. Cell. 169:1177–1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Brandis G, Wrande M, Liljas L, Hughes D.  2012. Fitness-compensatory mutations in rifampicin-resistant RNA polymerase. Mol Microbiol. 85:142–151. [DOI] [PubMed] [Google Scholar]
  15. Brown KE, Kelly JK.  2018. Antagonistic pleiotropy can maintain fitness variation in annual plants. J Evol Biol. 31:46–56. [DOI] [PubMed] [Google Scholar]
  16. Brown KE, Kelly JK.  2020. Severe inbreeding depression is predicted by the “rare allele load” in Mimulus guttatus. Evolution. 74:587–596. [DOI] [PubMed] [Google Scholar]
  17. Bushnell B.  2014. BBTools software package. http://sourceforge. Net/projects/bbmap.
  18. Campbell-Staton SC, Cheviron ZA, Rochette N, Catchen J, Losos JB, et al.  2017. Winter storms drive rapid phenotypic, regulatory, and genomic shifts in the green anole lizard. Science. 357:495–498. [DOI] [PubMed] [Google Scholar]
  19. Coolon JD, McManus CJ, Stevenson KR, Graveley BR, Wittkopp PJ.  2014. Tempo and mode of regulatory evolution in Drosophila. Genome Res. 24:797–808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Denver DR, Morris K, Streelman JT, Kim SK, Lynch M, et al.  2005. The transcriptional consequences of mutation and natural selection in Caenorhabditis elegans. Nat Genet. 37:544–548. [DOI] [PubMed] [Google Scholar]
  21. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, et al.  2013. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 29:15–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Duren Z, Chen X, Jiang R, Wang Y, Wong WH.  2017. Modeling gene regulation from paired expression and chromatin accessibility data. Proc Natl Acad Sci U S A. 114:E4914–E4923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Emerson JJ, Li W-H.  2010. The genetic basis of evolutionary change in gene expression levels. Philos Trans R Soc Lond B Biol Sci. 365:2581–2590. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Exposito-Alonso M,, BrennanAC, , Alonso-Blanco C,, PicóFX. 2018. Spatio-temporal variation in fitness responses to contrasting environments in Arabidopsis thaliana. Evolution; international journal of organic evolution. 72:1570–1586. 10.1111/evo.13508. 29947421. [DOI] [PubMed] [Google Scholar]
  25. Farh KK-H, Marson A, Zhu J, Kleinewietfeld M, Housley WJ, et al.  2015. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature. 518:337–343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Fear JM, León-Novelo LG, Morse AM, Gerken AR, Van Lehmann K, et al.  2016. Buffering of genetic regulatory networks in Drosophila melanogaster. Genetics. 203:1177–1190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Fisher RA.  1930. The Genetical Theory of Natural Selection. Oxford, UK: Clarendon Press. [Google Scholar]
  28. Fishman L, Kelly JK.  2015. Centromere-associated meiotic drive and female fitness variation in Mimulus. Evolution. 69:1208–1218. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Fishman L, Saunders A.  2008. Centromere-associated female meiotic drive entails male fitness costs in monkeyflowers. Science. 322:1559–1562. [DOI] [PubMed] [Google Scholar]
  30. Fishman L, Willis JH.  2005. A novel meiotic drive locus almost completely distorts segregation in Mimulus (monkeyflower) hybrids. Genetics. 169:347–353. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Ghalambor CK, Hoke KL, Ruell EW, Fischer EK, Reznick DN, et al.  2015. Non-adaptive plasticity potentiates rapid adaptive evolution of gene expression in nature. Nature. 525:372–375. [DOI] [PubMed] [Google Scholar]
  32. Gruber JD, Vogel K, Kalay G, Wittkopp PJ.  2012. Contrasting properties of gene-specific regulatory, coding, and copy number mutations in Saccharomyces cerevisiae: frequency, effects, and dominance. PLoS Genet. 8:e1002497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Hahn MW, Kern AD.  2005. Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. Mol Biol Evol. 22:803–806. [DOI] [PubMed] [Google Scholar]
  34. Hall MC, Lowry DB, Willis JH.  2010. Is local adaptation in Mimulus guttatus caused by trade-offs at individual loci? Trade-offs can contribute to adaptation in Mimulus. Mol Ecol. 19:2739–2753. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Hamann E, Pauli CS, Joly-Lopez Z, Groen SC, Rest JS, et al.  2021. Rapid evolutionary changes in gene expression in response to climate fluctuations. Mol Ecol. 30:193–206. doi:10.1111/mec.15583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Harding K, Hoey T, Warrior R, Levine M.  1989. Autoregulatory and gap gene response elements of the even-skipped promoter of Drosophila. EMBO J. 8:1205–1212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Hartl DL, Clark AG.  1997. Principles of Population Genetics, Vol. 116. Sunderland, MA: Sinauer Associates. [Google Scholar]
  38. Hellsten U, WrightKM, , Jenkins J, Shu S, Yuan Y, et al. 2013. Fine-scale variation in meiotic recombination in Mimulus inferred from population shotgun sequencing. Proc Natl Acad Sci USA. 110:19478–19482. 10.1073/pnas.1319032110. 24225854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Hodgins-Davis A, Rice DP, Townsend JP.  2015. Gene expression evolves under a house-of-cards model of stabilizing selection. Mol Biol Evol. 32:2130–2140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Hoekstra HE, Coyne JA.  2007. The locus of evolution: evo devo and the genetics of adaptation. Evolution. 61:995–1016. [DOI] [PubMed] [Google Scholar]
  41. Houle D, Márquez EJ.  2015. Linkage disequilibrium and inversion-typing of the Drosophila melanogaster genome reference panel. G3 (Bethesda).  5:1695–1701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Johnson NA, Porter AH.  2000. Rapid speciation via parallel, directional selection on regulatory genetic pathways. J Theor Biol. 205:527–542. [DOI] [PubMed] [Google Scholar]
  43. Josephs EB, Lee YW, Stinchcombe JR, Wright SI.  2015. Association mapping reveals the role of purifying selection in the maintenance of genomic variation in gene expression. Proc Natl Acad Sci USA. 112:15390–15395. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Josephs EB, Lee YW, Wood CW, Schoen DJ, Wright SI, et al.  2020. The evolutionary forces shaping Cis- and Trans-regulation of gene expression within a population of outcrossing plants. Mol Biol Evol. 37:2386–2393. [DOI] [PubMed] [Google Scholar]
  45. Josephs EB, Wright SI, Stinchcombe JR, Schoen DJ.  2017. The relationship between selection, network connectivity, and regulatory variation within a population of Capsella grandiflora. Genome Biol Evol. 9:1099–1109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Kelly JK.  2003. Deleterious mutations and the genetic variance of male fitness components in Mimulus guttatus. Genetics. 164:1071–1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Kelly JK.  2008. Testing the rare-alleles model of quantitative variation by artificial selection. Genetica. 132:187–198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Kremling KAG, Chen S-Y, Su M-H, Lepak NK, Romay MC, et al.  2018. Dysregulation of expression correlates with rare-allele burden and fitness loss in maize. Nature. 555:520–523. [DOI] [PubMed] [Google Scholar]
  49. Krizek BA, Anderson JT.  2013. Control of flower size. J Exp Bot. 64:1427–1437. [DOI] [PubMed] [Google Scholar]
  50. Langfelder P, Horvath S.  2008. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 9:559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Lappalainen T, Sammeth M, Friedländer MR, Ac’t Hoen P, Monlong J, et al.  2013. Transcriptome and genome sequencing uncovers functional variation in humans. Nature. 501:506–511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Lawrence M, Daujat S, Schneider R.  2016. Lateral thinking: how histone modifications regulate gene expression. Trends Genet. 32:42–56. [DOI] [PubMed] [Google Scholar]
  53. López-Maury L, Marguerat S, Bähler J.  2008. Tuning gene expression to changing environments: from rapid responses to evolutionary adaptation. Nat Rev Genet. 9:583–593. [DOI] [PubMed] [Google Scholar]
  54. Love MI, Huber W, Anders S.  2014. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15:550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Mack KL, Ballinger MA, Phifer-Rixey M, Nachman MW.  2018. Gene regulation underlies environmental adaptation in house mice. Genome Res. 28:1636–1645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Mack KL, Campbell P, Nachman MW.  2016. Gene regulation and speciation in house mice. Genome Res. 26:451–461. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Mack KL, Nachman MW.  2017. Gene regulation and speciation. Trends Genet. 33:68–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Mähler N, Wang J, Terebieniec BK, Ingvarsson PK, Street NR, et al.  2017. Gene co-expression network connectivity is an important determinant of selective constraint. PLoS Genet. 13:e1006402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Maisnier-Patin S, Andersson DI.  2004. Adaptation to the deleterious effects of antimicrobial drug resistance mutations by compensatory evolution. Res Microbiol. 155:360–369. [DOI] [PubMed] [Google Scholar]
  60. Margres MJ, Wray KP, Hassinger ATB, Ward MJ, McGivern JJ, et al.  2017. Quantity, not quality: rapid adaptation in a polygenic trait proceeded exclusively through expression differentiation. Mol Biol Evol. 34:3099–3110. [DOI] [PubMed] [Google Scholar]
  61. Maurano MT, Humbert R, Rynes E, Thurman RE, Haugen E, et al.  2012. Systematic localization of common disease-associated variation in regulatory DNA. Science. 337:1190–1195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. McGirr JA, Martin CH.  2020. Ecological divergence in sympatry causes gene misexpression in hybrids. Mol Ecol. 29:2707–2721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. McManus CJ, Coolon JD, Duff MO, Eipper-Mains J, Graveley BR, et al.  2010. Regulatory divergence in Drosophila revealed by mRNA-seq. Genome Res. 20:816–825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Meiklejohn CD, Coolon JD, Hartl DL, Wittkopp PJ.  2014. The roles of cis- and trans-regulation in the evolution of regulatory incompatibilities and sexually dimorphic gene expression. Genome Res. 24:84–95. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Mérot C, Llaurens V, Normandeau E, Bernatchez L, Wellenreuther M.  2020. Balancing selection via life-history trade-offs maintains an inversion polymorphism in a seaweed fly. Nat Commun. 11:670. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Metzger BPH, Duveau F, Yuan DC, Tryban S, Yang B, et al.  2016. Contrasting frequencies and effects of cis- and trans-regulatory mutations affecting gene expression. Mol Biol Evol. 33:1131–1146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Metzger BPH, Wittkopp PJ, Coolon JD.  2017. Evolutionary dynamics of regulatory changes underlying gene expression divergence among Saccharomyces species. Genome Biol Evol. 9:843–854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Millard SP.  2014. Env Stats, an R Package for Environmental Statistics. Wiley StatsRef: Statistics Reference. https://onlinelibrary.wiley.com/doi/abs/10.1002/9781118445112.stat07181.
  69. Mojica JP, Kelly JK.  2010. Viability selection prior to trait expression is an essential component of natural selection. Proc R Soc B. 277:2945–2950. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Mojica JP, Lee YW, Willis JH, Kelly JK.  2012. Spatially and temporally varying selection on intrapopulation quantitative trait loci for a life history trade-off in Mimulus guttatus. Mol Ecol. 21:3718–3728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Monnahan PJ, Colicchio J, Fishman L, Macdonald SJ, Kelly JK.  2021. Predicting evolutionary change at the DNA level in a natural Mimulus population. PLoS Genet. 17:e1008945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Monnahan PJ, Kelly JK.  2015. Naturally segregating loci exhibit epistasis for fitness. Biol Lett. 11:20150498. doi:10.1098/rsbl.2015.0498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Monnahan PJ, Kelly JK.  2017. The genomic architecture of flowering time varies across space and time in Mimulus guttatus. Genetics. 206:1621–1635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Morris MRJ, Richard R, Leder EH, Barrett RDH, Aubin-Horth N, et al.  2014. Gene expression plasticity evolves in response to colonization of freshwater lakes in threespine stickleback. Mol Ecol. 23:3226–3240. [DOI] [PubMed] [Google Scholar]
  75. Nicolae DL, Gamazon E, Zhang W, Duan S, Dolan ME, et al.  2010. Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS. PLoS Genet. 6:e1000888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Ning Y, Liu W, Wang G-L.  2017. Balancing immunity and yield in crop plants. Trends Plant Sci. 22:1069–1079. [DOI] [PubMed] [Google Scholar]
  77. Osada N, Miyagi R, Takahashi A.  2017. Cis- and trans-regulatory effects on gene expression in a natural population of Drosophila melanogaster. Genetics. 206:2139–2148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Podrabsky JE, Somero GN.  2004. Changes in gene expression associated with acclimation to constant temperatures and fluctuating daily temperatures in an annual killifish Austrofundulus limnaeus. J Exp Biol. 207(Pt 13):2237–2254. [DOI] [PubMed] [Google Scholar]
  79. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, et al.  2007. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 81:559–575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Puzey JR, Willis JH, Kelly JK.  2017. Population structure and local selection yield high genomic variation in Mimulus guttatus. Mol Ecol. 26:519–535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. R Core Team. 2013. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.R-project.org/.
  82. Raj A, van Oudenaarden A.  2008. Nature, nurture, or chance: stochastic gene expression and its consequences. Cell. 135:216–226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Ramsay H, Rieseberg LH, Ritland K.  2009. The correlation of evolutionary rate with pathway position in plant terpenoid biosynthesis. Mol Biol Evol. 26:1045–1053. [DOI] [PubMed] [Google Scholar]
  84. Rausher MD.  1992. The measurement of selection on quantitative traits: biases due to environmental covariances between traits and fitness. Evolution. 46:616–626. [DOI] [PubMed] [Google Scholar]
  85. Rifkin SA, Houle D, Kim J, White KP.  2005. A mutation accumulation assay reveals a broad capacity for rapid evolution of gene expression. Nature. 438:220–223. [DOI] [PubMed] [Google Scholar]
  86. Scoville AG, Lee YW, Willis JH, Kelly JK.  2011. Explaining the heritability of an ecologically significant trait in terms of individual quantitative trait loci. Biol Lett. 7:896–898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Shi X, Ng DW-K, Zhang C, Comai L, Ye W, et al.  2012. Cis- and trans-regulatory divergence between progenitor species determines gene-expression novelty in Arabidopsis allopolyploids. Nat Commun. 3:950. [DOI] [PubMed] [Google Scholar]
  88. Signor SA, Nuzhdin SV.  2018. The evolution of gene expression in cis and trans. Trends Genet. 34:532–544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Sobel JM, Streisfeld MA.  2013. Flower color as a model system for studies of plant evo-devo. Front Plant Sci. 4:321. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Springer NM, Stupar RM.  2007. Allele-specific expression patterns reveal biases and embryo-specific parent-of-origin effects in hybrid maize. Plant Cell. 19:2391–2402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Stern DL, Orgogozo V.  2008. The loci of evolution: how predictable is genetic evolution?  Evolution. 62:2155–2177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Stern S, Dror T, Stolovicki E, Brenner N, Braun E.  2007. Genome-wide transcriptional plasticity underlies cellular adaptation to novel challenge. Mol Syst Biol. 3:106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Streisfeld MA, Rausher MD.  2009. Altered trans-regulatory control of gene expression in multiple anthocyanin genes contributes to adaptive flower color evolution in Mimulus aurantiacus. Mol Biol Evol. 26:433–444. [DOI] [PubMed] [Google Scholar]
  94. Sweigart A, Karoly K, Jones A, Willis JH.  1999. The distribution of individual inbreeding coefficients and pairwise relatedness in a population of Mimulus guttatus. Heredity. 83:625–632. [DOI] [PubMed] [Google Scholar]
  95. Tajima F.  1989. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 123:585–595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Tirosh I, Reikhav S, Levy AA, Barkai N.  2009. A yeast hybrid provides insight into the evolution of gene expression regulation. Science. 324:659–662. [DOI] [PubMed] [Google Scholar]
  97. Torres JM, Gamazon ER, Parra EJ, Below JE, Valladares-Salgado A, et al.  2014. Cross-tissue and tissue-specific eQTLs: partitioning the heritability of a complex trait. Am J Hum Genet. 95:521–534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Troth A, Puzey JR, Kim RS, Willis JH, Kelly JK.  2018. Selective trade-offs maintain alleles underpinning complex trait variation in plants. Science. 361:475–478. [DOI] [PubMed] [Google Scholar]
  99. Tulchinsky AY, Johnson NA, Porter AH.  2014. Hybrid incompatibility despite pleiotropic constraint in a sequence-based bioenergetic model of transcription factor binding. Genetics. 198:1645–1654. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Tung J, Zhou X, Alberts SC, Stephens M, Gilad Y.  2015. The genetic architecture of gene expression levels in wild baboons. eLife. 4:e04729. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Venables WN, Ripley BD.  2013. Modern Applied Statistics with S-PLUS. Berlin: Springer Science & Business Media. [Google Scholar]
  102. Verta J-P, Landry CR, MacKay J.  2016. Dissection of expression-quantitative trait locus and allele specificity using a haploid/diploid plant system–insights into compensatory evolution of transcriptional regulation within populations. New Phytol. 211:159–171. [DOI] [PubMed] [Google Scholar]
  103. Wang Z, Sun X, Zhao Y, Guo X, Jiang H, et al.  2015. Evolution of gene regulation during transcription and translation. Genome Biol Evol. 7:1155–1167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Whitehead A, Crawford DL.  2006. Variation within and among species in gene expression: raw material for evolution. Mol Ecol. 15:1197–1211. [DOI] [PubMed] [Google Scholar]
  105. Willis JH.  1993. Partial self-fertilization and inbreeding depression in two populations of Mimulus guttatus. Heredity. 71:145–154. [Google Scholar]
  106. Willis JH.  1999. The role of genes of large effect on inbreeding depression in Mimulus guttatus. Evolution. 53:1678–1691. [DOI] [PubMed] [Google Scholar]
  107. Wittkopp PJ, Haerum BK, Clark AG.  2004. Evolutionary changes in cis and trans gene regulation. Nature. 430:85–88. [DOI] [PubMed] [Google Scholar]
  108. Zhou X, Stephens M.  2014. Efficient multivariate linear mixed model algorithms for genome-wide association studies. Nat Methods. 11:407–409. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

iyab189_Supplementary_Data

Data Availability Statement

Gene expression data have been submitted to NCBI’s SRA (project number PRJNA736440). The Python scripts used to generate trait covariances as well as those used for permutation and simulations are available as Supplementary File S1.

Supplementary material is available at GENETICS online.


Articles from Genetics are provided here courtesy of Oxford University Press

RESOURCES