Skip to main content
eLife logoLink to eLife
. 2025 Mar 12;13:RP95566. doi: 10.7554/eLife.95566

Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction

James Boocock 1,2,3, Noah Alexander 1,2,3, Leslie Alamo Tapia 1,2,3, Laura Walter-McNeill 1,2,3, Shivani Prashant Patel 1,2,3, Chetan Munugala 1,2,3, Joshua S Bloom 1,2,3,, Leonid Kruglyak 1,2,3,
Editors: Rachel Brem4, Alan M Moses5
PMCID: PMC11903034  PMID: 40073070

Abstract

Expression quantitative trait loci (eQTLs) provide a key bridge between noncoding DNA sequence variants and organismal traits. The effects of eQTLs can differ among tissues, cell types, and cellular states, but these differences are obscured by gene expression measurements in bulk populations. We developed a one-pot approach to map eQTLs in Saccharomyces cerevisiae by single-cell RNA sequencing (scRNA-seq) and applied it to over 100,000 single cells from three crosses. We used scRNA-seq data to genotype each cell, measure gene expression, and classify the cells by cell-cycle stage. We mapped thousands of local and distant eQTLs and identified interactions between eQTL effects and cell-cycle stages. We took advantage of single-cell expression information to identify hundreds of genes with allele-specific effects on expression noise. We used cell-cycle stage classification to map 20 loci that influence cell-cycle progression. One of these loci influenced the expression of genes involved in the mating response. We showed that the effects of this locus arise from a common variant (W82R) in the gene GPA1, which encodes a signaling protein that negatively regulates the mating pathway. The 82R allele increases mating efficiency at the cost of slower cell-cycle progression and is associated with a higher rate of outcrossing in nature. Our results provide a more granular picture of the effects of genetic variants on gene expression and downstream traits.

Research organism: S. cerevisiae

Introduction

Genome-wide studies have identified thousands of loci that influence gene expression; these loci are known as expression quantitative trait loci or eQTLs (Albert and Kruglyak, 2015; Kang et al., 2023). eQTLs serve as an important bridge between DNA sequence variation and organismal phenotypes and provide a mechanism by which noncoding variants can underlie complex traits (Finucane et al., 2015; Umans et al., 2021; Gusev et al., 2016). The vast majority of eQTL studies to date have relied on measurements of average gene expression levels in bulk populations of cells (Aguet et al., 2020; Albert et al., 2018). This approach, while experimentally tractable, can lose information about known differences in genetic effects among tissues (Aguet et al., 2020), cell types (Westra et al., 2015; Chen et al., 2016; Kim-Hellmuth et al., 2020; Ota et al., 2021), and cellular states (Strober et al., 2019). Recently, studies in humans have leveraged single-cell RNA sequencing (scRNA-seq) to more flexibly investigate how eQTL effects are altered in different contexts (Elorbany et al., 2022; Cuomo et al., 2020; Neavin et al., 2021; Jerber et al., 2021; Yazar et al., 2022; van der Wijst et al., 2018), including cellular states that are difficult to access with bulk approaches (Nathan et al., 2022). However, obtaining these more granular eQTL maps with either bulk or single-cell approaches comes at the cost of substantial increases in the numbers of samples that must be obtained and analyzed one at a time.

In model organisms, such as the nematode Caenorhabditis elegans and the budding yeast Saccharomyces cerevisiae, mapping populations of millions of recombinant progeny can be generated in a single flask (Ehrenreich et al., 2010; Burga et al., 2019). Such populations can be combined with scRNA-seq in a ‘one-pot’ eQTL mapping design in which the same single-cell data enables measurement of gene expression, cell type classification, and genotyping of transcribed variants in each cell (Ben-David et al., 2021). This design has two major advantages. First, it retains information about tissues, cell types, and cellular states. Second, by replacing expensive and labor-intensive genotyping and expression profiling of many samples with a single scRNA-seq experiment, it enables facile exploration of genetics of gene expression in many different genetic backgrounds and in response to many environmental perturbations. Here, we implement this design in yeast (Figure 1A), which presents additional challenges due to small cell size and the presence of a cell wall (Gasch et al., 2017; Jariani et al., 2020; Nadal-Ribelles et al., 2019; Jackson et al., 2020; Brettner et al., 2022; N’Guessan et al., 2023), and use it to identify eQTLs in different genetic backgrounds, study interactions between eQTL effects and stages of the cell cycle, search for allele-specific effects on gene expression noise, and uncover a connection between a common variant in the gene GPA1, gene expression, progression through the cell cycle, and mating efficiency.

Figure 1. One-pot eQTL mapping is feasible in yeast.

(A) One-pot eQTL mapping workflow. A large population of hybrid diploid cells is sporulated, and MATa haploid yeast progeny cells (segregants) are isolated by fluorescence-activated cell sorting. Cells are captured and processed with the 10× Chromium device. The resulting barcoded library of single-cell transcriptomes is sequenced by Illumina short-read sequencing. Unique molecular identifier (UMI) counts are tallied for each transcript in each segregant. The number of supporting molecules for each parental allele is identified at every transcribed sequence position that differs between the parental strains, and a hidden Markov model is used to infer the genotype of each segregant. In the cartoon example of an eQTL shown on the top right, segregants with the C allele have higher expression of the gene than those with the A allele. (B) Representative Uniform Manifold Approximation and Projection for Dimension Reduction (UMAP) plot of cells colored by their assigned cell-cycle stage. (C) Scatter plot of local eQTL effects from the one-pot experiment in the cross between BY and RM (x-axis) against local eQTL effects based on expression measurements from bulk RNA-seq in the same cross (y-axis) (Albert et al., 2018). Green dots denote one-pot eQTL effects that were significant at a false-discovery rate (FDR) of 0.05; yellow dots denote those that were not. The x- and y-axis were truncated at –1 and 1 for ease of visualization, which left out 67 of 4044 data points.

Figure 1.

Figure 1—figure supplement 1. Single-cell eQTL mapping of 393 previously genotyped segregants.

Figure 1—figure supplement 1.

393 segregants were pooled, grown in minimal medium, and processed using the 10× Chromium device. The resulting barcoded library is sequenced with Illumina short-read sequencing. The number of supporting molecules for each allele is inferred at every variant position between the parental strains and a hidden Markov model is used to infer the genotype of each segregant. A cartoon example of one eQTL is shown on the top right, cells with the C allele had higher expression than cells with A allele.

Figure 1—figure supplement 2. Cell-cycle classification of the single cells from the set of 393 previously genotyped segregants visualized on different combinations of principal components.

Figure 1—figure supplement 2.

These data are from the first 10× run of the 393 segregants (number of cells = 3454). The results look similar for the second run. The principal components (PCs) were calculated using Seurat with the cell-cycle genes. Cells are colored according to their assigned cell-cycle stage. (A) PC plot from the single-cell data comparing PC 1 and PC 2. (B) PC plot from the single-cell data comparing PC 3 and PC 4. (C) PC plot from the single-cell data comparing PC 1 and PC 5. (D) PC plot from the single-cell data comparing PC 1 and PC 6.

Figure 1—figure supplement 3. Marker gene expression of cell-cycle classified single cells from the set of 393 previously genotyped segregants.

Figure 1—figure supplement 3.

These data are from the first 10× run of the 393 segregants (number of cells = 3454). The results look similar for the second run. The heatmap shows the normalized expression of each of the 22 markers used for assigning clusters to each cell-cycle stage. For the M/G1 stage, we used the genes PIR1, EGT2, ASH1, DSE1, DSE2, and CTS1. For the G1 stage, we used the gene MFA1, which in our experiments reproducibly connected the M/G1 and G1/S transition stages. For the G1/S stage, we used the genes CSI1, TOS4, POL30, PRY2, AXL2, and CLN2. For the S stage, we used these genes HTB1 and HHF2. Finally, for the G2/M stage we used the genes HOF1, PHO3, MMR1, CLB2, WSC4, CDC5, and CHS2.

Figure 1—figure supplement 4. Cell-cycle classification of the single cells from the set of 393 previously genotyped segregants.

Figure 1—figure supplement 4.

(A) Proportion of cells assigned to each cell-cycle stage for the first of two 10× runs of single cells from the pool of 393 segregants (number of cells = 3454). (B) Proportion of cells assigned to each cell cycle for the second 10× run of single cells from the pool of 393 segregants (number of cells = 3708). (C) UMAP plot from the single-cell data in (A) with cells colored according to their assigned cell-cycle stage. (D) UMAP plot from the single-cell data in (B) with cells colored according to their assigned cell-cycle stage.

Figure 1—figure supplement 5. Cell-cycle marker gene expression of the single cells from the set of 393 previously genotyped segregants.

Figure 1—figure supplement 5.

(A) Gene expression levels of the 18 cell-cycle markers used for classification with panels ordered by their position in the cell cycle for the first of two 10× runs of single cells from the pool of 393 segregants (number of cells = 3454). (B) Same as (A) for the second 10× run of single cells from the pool of 393 segregants (number of cells = 3708).

Figure 1—figure supplement 6. Histogram of the number of single cells identified for each of the 393 segregants.

Figure 1—figure supplement 6.

The median number of cells per segregant (17 cells) is displayed as a vertical red line.

Figure 1—figure supplement 7. Single-cell hidden Markov model (HMM) genotyping accuracy compared to the number of unique molecular identifiers (UMIs) per cell.

Figure 1—figure supplement 7.

The horizontal red line shows the median genotyping agreement of 92.5%. The Spearman’s correlation coefficient and p-value comparing the genotyping accuracy to the number of UMIs is shown in the top left corner of the plot. A log10 transformation was applied to the x-axis.

Figure 1—figure supplement 8. Local eQTL effects estimated with different genotyping methods.

Figure 1—figure supplement 8.

Local eQTL effect sizes estimated using the hidden Markov model (HMM)-based genotypes from single-cell data (x-axis) compared to local eQTL effect sizes estimated using the lookup of genotypes obtained from whole-genome sequencing (y-axis). The single-cell sequencing data of the 393 segregants is used for quantifying gene expression for each eQTL analysis. The Spearman’s correlation coefficient of the local eQTL effect size estimates between these genotyping methods and p-value are shown in the top left corner of the plot.

Results

scRNA-seq enables simultaneous expression profiling, cell-cycle stage determination, and genotyping in a segregating yeast population

eQTL mapping requires tracking the inheritance of genetic variants and measuring gene expression in the same individuals. scRNA-seq captures the transcriptomes of individual cells, and genotypes of expressed single-nucleotide polymorphisms (SNPs) in transcribed sequences can be used to track inheritance in these same cells. We previously showed that this approach enables single-cell eQTL mapping in C. elegans (Ben-David et al., 2021). To test the feasibility of the approach in yeast, we pooled 393 previously genotyped haploid segregants from a cross between a lab strain (BY) and a wine strain (RM) (Albert et al., 2018; Bloom et al., 2013; Figure 1—figure supplement 1; Supplementary file 1, tables S1–S3) and used scRNA-seq to obtain the transcriptomes of 7124 cells (Methods, Supplementary file 1, table S4). We captured a median of 1514 unique RNA molecules (unique molecular identifiers; henceforth UMIs) and a median of 1091 expressed SNPs per cell (Supplementary file 1, table S4).

The expression of hundreds of yeast genes varies during progression through the stages of the cell cycle (Spellman et al., 1998). We classified individual haploid yeast cells into five different cell-cycle stages (M/G1, G1, G1/S, S, and G2/M) via unsupervised clustering of the expression of 787 cell-cycle-regulated genes (Spellman et al., 1998) in combination with 22 cell-cycle-informative marker genes (Figure 1B, Figure 1—figure supplements 25). Using this classification approach, we found that expression of 2139 genes displayed significant variation by cell-cycle stage (likelihood ratio test, false-discovery rate [FDR] <0.05; Supplementary file 1, table S5). To account for the observed widespread effects of the cell cycle on gene expression, we incorporated the cell-cycle stage into subsequent eQTL analyses, unless otherwise stated.

We used a hidden Markov model (HMM) to reconstruct the patterns of inheritance of parental alleles in each cell based on the observed genotypes at expressed SNPs. This allowed us to match each cell to one of the 393 segregants. We observed a median of 17 cells per segregant, with 277 of the segregants sampled more than ten times (Figure 1—figure supplement 6). The genotypes measured from scRNA-seq data were in high agreement with those obtained from whole-genome sequencing of the same strains (median genotype agreement 92.5%). The agreement was higher in cells with more UMIs (Figure 1—figure supplement 7), and we leveraged higher yields of UMIs per cell in subsequent experiments to ensure better genotyping accuracy.

We used the two sets of genotypes to map local eQTLs—those that influence the expression of nearby genes, most commonly in cis. We modeled the genetic effects of the closest marker to each transcript on single-cell gene expression with a count-based model that did not include a cell-cycle term. We mapped 770 local eQTLs at an FDR of 5% with the HMM-based genotypes, and 697 with the matched genotypes obtained from whole-genome sequencing of the segregants; 611 eQTLs were detected in both analyses (Supplementary file 1, table S6). We further compared the local eQTL effects for all 4901 tested transcripts, regardless of statistical significance, and found that they were highly correlated between the two sets of genotypes (Spearman’s = 0.93, p < 10–15; Figure 1—figure supplement 8). A single-cell eQTL study on a different set of previously genotyped segregants from the same cross reached a similar conclusion (N’Guessan et al., 2023), providing further evidence that genotypes obtained from scRNA-seq data at transcribed SNPs are of sufficient quality for eQTL mapping.

One-pot eQTL mapping in de novo yeast segregants

One-pot eQTL mapping is an attractive experimental design compared to bulk RNA sequencing and genotyping because it lowers cost, eliminates individual sample preparation, and reduces other sources of technical variation. To compare one-pot eQTL mapping with the traditional bulk design, we generated segregants de novo from a cross between BY and RM (Albert et al., 2018). We used scRNA-seq to measure the expression of 5435 transcripts in 27,744 single cells (Supplementary file 1, table S4). We mapped 1031 local eQTLs at an FDR of 5%. We compared these results to those from bulk RNA-seq and genotyping in the same cross (Albert et al., 2018) and found that 717 (69.5%) of the 1031 local eQTLs were also detected as statistically significant in that study, with an additional 108 local eQTLs showing effects in the same direction (Figures 1C and 2A; Supplementary file 2, table S1). Thus, 825 (80%) of the local eQTLs detected with the one-pot approach were supported by the bulk results, despite differences in growth conditions and experimental procedures between the two studies.

Figure 2. Single-cell eQTL map recapitulates bulk trans-eQTL hotspots and identifies new hotspots.

Figure 2.

(A) Map of local and distant eQTLs. Each point denotes an eQTL, with the genomic position of the peak marker on the x-axis and the genomic location of the gene with the expression difference on the y-axis. The high density of points on the diagonal line with a slope of one indicates that many genes have local eQTLs. The dense vertical bands correspond to trans-eQTL hotspots. (B) Histogram showing the number of distant eQTLs in 50 kb windows top: one-pot eQTL map; bottom: bulk eQTL map (Albert et al., 2018). Red lines show statistical eQTL enrichment thresholds for a window to be designated a hotspot. Text labels highlight known and putative causal genes underlying hotspots, as well as loci that meet hotspot criteria only in the current study.

We next broadened our analysis to the trans-acting (distant) eQTLs, here defined as those that influence the expression of genes on a different chromosome. We mapped 1562 distant eQTLs at an FDR of 5% (Figure 2A; Supplementary file 3, table S1). As in previous studies (Albert et al., 2018; Brem et al., 2002), distant eQTLs were not uniformly distributed throughout the genome, but rather clustered at a number of hotspot loci that influence the expression of many genes. We identified 12 distant eQTL hotspots in the one-pot eQTL experiment (Supplementary file 3, table S1). When we applied the same criteria for defining a hotspot, we identified 21 hotspots in the bulk eQTL experiment in the same cross (Figure 2B). Five regions met the hotspot criteria in both studies, including the well-characterized hotspots driven by variants in the genes MKT1 (Zhu et al., 2008), GPA1 (Yvert et al., 2003), IRA2 (Smith and Kruglyak, 2008), and HAP1 (Brem et al., 2002). One hotspot on chromosome XIV in the bulk experiment was not observed here because it is caused by a de novo variant in the gene KRE33 that arose in the RM parent used in the construction of the bulk eQTL mapping panel (Albert et al., 2018; Jerison et al., 2017). The other hotspots from the bulk experiment generally affected the expression of fewer genes, and the fact that they did not meet hotspot criteria here can be explained by a combination of statistical power and different experimental conditions.

To learn more about the seven regions that met hotspot criteria only in the single-cell experiment but not in the bulk experiment, we performed a functional enrichment analysis of the genes they influence (Figure 2B; Supplementary file 3, table S1). The hotspot on chromosome X at position 323,158 changed the expression of 36 genes that were enriched for gene ontology (GO) terms related to zinc ion transmembrane transporter activity and transition metal ion transmembrane transporter activity. The hotspot region contains the gene ZAP1, which encodes a zinc-regulated transcription factor (Zhao and Eide, 1997). This gene contains nine missense variants between BY and RM, and we predict that ZAP1 is the causal gene underlying this hotspot (see also Weith et al., 2023). The hotspot on chromosome XIII at position 24,326 changed the expression of 26 genes that were enriched for GO terms related to acid phosphatase activity. The hotspot region contains the gene PHO84, which encodes an inorganic phosphate transporter (Bun-Ya et al., 1991). BY harbors a rare coding variant P259L in PHO84 (L allele frequency = 0.3%) (Peter et al., 2018) that has been shown to affect resistance to polychlorinated phenols (Perlstein et al., 2007) and is the likely causal variant for this hotspot. The other five hotspots were enriched for GO terms broadly related to growth. We grew the yeast segregants for scRNA-seq in a medium containing sheath fluid, a phosphate-buffered saline (PBS) solution with a pH of 7.4, whereas unbuffered minimal medium was used in the bulk eQTL study. Gene–environment interactions in gene expression are common in yeast (Smith and Kruglyak, 2008), especially for distant eQTLs, and subtle differences in the growth conditions between the two studies may explain why these new loci met the hotspot criteria only in the single-cell study.

We took advantage of the convenience of one-pot eQTL mapping and applied it to two additional yeast crosses, one between a clinical strain (YJM145) and a soil strain (YPS163) both isolated in the United States (44,784 cells, 5556 transcripts; Supplementary file 1, table S4), and another between a soil strain isolated in South Africa (CBS2888) and a clinical strain isolated in Italy (YJM981) (6595 cells, 4696 transcripts; Supplementary file 1,table S4). Hereafter, we refer to the BY × RM cross as cross A and the two new crosses as crosses B and C, respectively. We mapped a total of 1914 local eQTLs in the new crosses (1193 in cross B and 721 in cross C; Supplementary file 2, tables S2 and S3), as well as 1626 distant eQTLs (550 in cross B and 1126 in cross C; Figure 3A, B; Supplementary file 3, tables S2 and S3). These distant eQTLs clustered into 13 hotspots (6 in cross B and 7 in cross C; Figure 3C, D). Of the 25 hotspots detected in the three crosses, 14 (56%) were unique to a single cross (7/12 in cross A, 2/6 in cross B, and 5/7 in cross C). This observation is consistent with prior work suggesting that variants with widespread effects on gene expression are likely to be deleterious, and that purifying selection should reduce their allele frequencies, making them more likely to be strain specific (Ronald and Akey, 2007). We used functional annotations to identify candidate genes for two of the new hotspots: GPA1 for the hotspot on chromosome VIII in cross B (Supplementary file 4—Cross B chrVIII:46887–140660) and CYR1 for the hotspot on chromosome X in cross C (Supplementary file 4—Cross C chrX:397734–497167); the biological effects of these hotspots are discussed below.

Figure 3. Single-cell eQTL maps in two new crosses.

Figure 3.

(A) eQTL map for the YJM145 × YPS163 cross (cross B). (B) eQTL map for the CBS2888 × YJM981 cross (cross C). (C) Histogram of distal eQTLs showing hotspots in cross B. (D) Histogram of distal eQTLs showing hotspots in cross C. The y-axis has been truncated to have a maximum value 100 for ease of visualization purposes. The hotspot on chromosome X near the gene CYR1 influences the expression of 175 genes, and the hotspot on chromosome XI influences the expression of 386 genes.

Distant eQTLs effects are more dependent on cell-cycle stage than local eQTLs effects

We asked whether the effects of eQTLs varied across the different stages of the cell cycle. Of the 2945 total local eQTLs detected in the three crosses, only 116 (4%) showed significant interactions between the eQTL effect and the cell-cycle stage at an FDR of 5%. In contrast, 790 (24.4%) of 3238 distant eQTLs showed significant interactions with the cell-cycle stage (OR = 7.8, Fisher’s exact test, p < 10–15), which suggests that the effects of distant eQTLs depend more on the state of a cell than those of local eQTLs (Supplementary files 2 and 3). This observation is consistent with prior work which showed that the effects of distant eQTLs are often dependent on the environment (Smith and Kruglyak, 2008), tissue (Aguet et al., 2020; Battle et al., 2017), and cell type (Ben-David et al., 2021), while those of local eQTLs tend to be less affected by these factors, perhaps because their effects on expression are more direct. Our results extend this notion beyond external environments, tissues and cell types to internal cellular states in a single-cell type.

Identification of hundreds of genetic effects on expression noise

An outstanding question in genetics is whether, and to what extent, genetic variation influences noise in gene expression—that is, do some genetic variants alter the variability in the expression level of specific genes, separately from their effects on the average expression levels? Measurement of expression in single cells with different genotypes is uniquely suited to exploring this question, but separating the effects on noise from those on average expression is not trivial, and previously identified genetic effects on expression noise in scRNA-seq data could be explained by their effects on average expression (Sarkar et al., 2019). In mapping panels, apparent allelic effects on intrinsic expression variability can instead reflect extrinsic sources of expression variability that differ between cells, such as cell-cycle stage and genetic differences in trans-acting factors. To overcome these issues of interpretation, we investigated the genetic contribution to intrinsic noise in gene expression in scRNA-seq data we generated for F1 diploid hybrids of the parental strains. The F1 diploid yeast cells are isogenic and share all trans-acting factors, allowing us to exclude extrinsic genetic sources of expression variability and focus on allele-specific contributions to gene expression noise.

We obtained a total of 13,973 single-cell transcriptomes from F1 diploids used to generate the segregants for the three crosses (5890 for cross A, 2864 for cross B, and 5219 for cross C; Supplementary file 1, table S4). We classified each cell into one of four cell-cycle stages relevant for diploids (M/G1, G1/S, S, and G2/M). We found 3406 genes with allele-specific effects on average expression levels (668 for cross A, 996 for cross B, and 1742 for cross C; Supplementary file 5). These allele-specific effects were well correlated with local eQTL effects from the eQTL mapping experiments described above (Figure 4—figure supplement 1). We observed 160 genes with significant interactions between allele-specific expression and cell-cycle stage (Supplementary file 5).

We next looked for allele-specific effects on gene expression noise. We used an approach that tests for significant differences in gene expression noise between the two alleles in the F1 diploid hybrids after accounting for average differences in gene expression due to genotype, cell-cycle stage, and their interactions (Figure 4A, B, Figure 4—figure supplement 2; Methods). Using this approach, we found a total of 874 genes with allele-specific effects on expression noise at an FDR of 5%, independent of any effects on average expression (Figure 4C; Supplementary file 6).

Figure 4. Genetic effects on expression noise.

(A) Cumulative distribution of simulated allele-specific counts for two alleles with different average expression but the same expression noise. (B) Cumulative distribution of simulated allele-specific counts for two alleles with different expression noise but the same average expression. These simulated distributions are shown to illustrate allele-specific effects on average expression and on expression noise, respectively. (C) Log–log scatter plot of change in expression noise between alleles (x-axis) against change in average expression between alleles (y-axis); points correspond to all 1487 genes with significant allele-specific effects on expression noise and/or average expression. Black line shows the predicted change in noise given a change in expression, with the 95% confidence interval for the trend shown in gray. The 377 genes with allele-specific effects on expression noise that cannot be accounted for by the overall trend are shown in red. The x and y axes have been truncated at –5 and 5 for ease of visualization purposes, which left out 30 of 1487 data points.

Figure 4.

Figure 4—figure supplement 1. The allele-specific expression effects compared to the local eQTL effects from each cross.

Figure 4—figure supplement 1.

The position of points on the y-axis represents the allele-specific expression effect of a transcript estimated from the three F1 diploid hybrids of crosses of each of the shown haploid parents and the position of points on the x-axis represents the local eQTL effect estimated for that gene estimated in our one-pot eQTL experiment. The points are colored depending on whether they were significant in the one-pot eQTL experiment at a false-discovery rate (FDR) of <5%. The x and y axes have been truncated at –2 and 2 for ease of visualization purposes, which left out 117 of 7942 data points.
Figure 4—figure supplement 2. Estimates of noise are biased downwards at low expression levels.

Figure 4—figure supplement 2.

We simulated allele-specific counts from 5000 cells across a range of differences in expression level and noise and fit a count-based negative binomial model with a genotype effect for both the mean and the noise (Methods). The estimated and simulated parameters are displayed across the panels. For (A, B), we assumed a baseline expression of 0.05 counts per cell, and for (C, D), we assumed a baseline expression of 0.5 counts per cell. 0.05 counts per cell represents the number of counts observed for an average yeast gene in our experiments, and 0.5 counts per cell represents the average number of counts observed for the top 10% of yeast genes in our single-cell data. In (A, C) the relationship between the simulated change in ln(noise) between alleles and estimated change in ln(noise) between alleles is shown. Individual lines were grouped and colored based on the amount of simulated change in ln(expression) between alleles. In (B, D), the relationship between the simulated change in ln(expression) between alleles and estimated change in ln(expression) between alleles is shown. Individual lines were grouped and colored based on the amount of simulated change in ln(noise). These simulations show that the estimates of noise are biased downwards when expression levels are low, but the p-values are well calibrated and not significant in such instances (data not shown). This behavior of the models runs opposite to the global trend we observe whereby increasing expression decreases noise.
Figure 4—figure supplement 3. Empirical global relationship between expression noise and average expression levels.

Figure 4—figure supplement 3.

(A) Log–log scatter plot of allelic average expression (x-axis) against allelic expression noise (y-axis); points correspond to all 22,294 alleles for which we were able to estimate allele-specific effects. Average allelic expression is negatively correlated with allelic noise (Spearman’s = −0.42, p < 10–15). (B) Log–log scatter plot of change in expression between alleles (x-axis) against change in expression noise between alleles (y-axis); points correspond to all 11,147 genes for which we were able to estimate allele-specific effects. Points in green highlight the 1487 genes with significant allele-specific effects at a false-discovery rate (FDR) of <5% on expression noise and/or average expression. Changes in noise are negatively correlated with expression level, across all genes (Spearman’s = −0.32, p < 10–15), genes without an allele-specific effect (Spearman’s = −0.23, p < 10–15), and genes with significant allele-specific effects (Spearman’s = −0.64, p < 10–15). The x and y axes have been truncated at –5 and 5 for ease of visualization purposes, which left out 142 of 11,147 data points.

An additional consideration for these analyses is that prior work has revealed an empirical negative correlation between gene expression noise and average gene expression, even when noise is estimated while accounting for average expression (Antolović et al., 2017; Love et al., 2014). We observed this global trend in our data—across all genes, noise was negatively correlated with expression level (Spearman’s = −0.42, p < 10–15; Figure 4—figure supplement 3). To test whether the observed allele-specific effects on expression noise could arise from this trend, we asked whether the confidence interval (CI) of each significant allele-specific effect on noise overlapped the CI of the global trend line (Methods). Of the 874 genes with allele-specific effects on noise, these CIs did not overlap for 377, suggesting that these allele-specific effects on noise cannot be explained by the empirical global relationship between expression noise and average expression (Figure 4C; Supplementary file 6).

An illustrative example of an allele-specific effect on expression noise was found in crosses A and C for the gene HSP12, which encodes an intrinsically unstructured protein that improves membrane stability (Supplementary file 6; Welker et al., 2010). HSP12 is a member of the general stress response pathway regulated by Msn2/4 (Kuang et al., 2017) and helps yeast cells survive high-temperature shocks (Welker et al., 2010). In cross A, the RM allele did not significantly change the expression of HSP12 compared to the BY allele, but it did significantly increase the noise, whereas in cross C, the YJM981 allele decreased the expression of HSP12 compared to the CBS2888 allele and increased the noise. Previous work found that HSP12 has high extrinsic expression noise relative to other genes (Stewart-Ornstein et al., 2012), and it was proposed that the high noise arises from variability in the activity of the Msn2/4 stress response pathway and subsequent activation of Msn2/4 targets such as HSP12 (Gasch et al., 2017). Our experiments in F1 hybrids control for sources of extrinsic noise, such as Msn2/4 activity, and our results suggest that the RM and YJM981 alleles of HSP12 are intrinsically more variable than the BY and CBS2888 alleles, and that genetic differences acting in cis are responsible. We hypothesize that the RM and YJM981 allele of HSP12 may provide a fitness advantage during periods of extreme stress via a bet-hedging strategy in which noisy expression of HSP12 creates a subpopulation of cells with very high HSP12 expression that can better survive environmental shocks.

Natural genetic variants affect cell-cycle occupancy

Because the single-cell expression data allowed us to assign each genotyped cell to a stage of the cell cycle, we next moved beyond gene expression and searched for genetic effects on cell-cycle progression. Specifically, we looked for loci at which one allele is overrepresented in cells assigned to a particular cell-cycle stage. Because cell-cycle occupancy represents the proportion of cells assigned to a given stage, changes in the proportion of cells in each stage are correlated, potentially leading to QTLs with effects on occupancy of multiple cell-cycle stages. We found a total of 20 unique cell-cycle occupancy QTLs in the three crosses (4 for cross A, 10 for cross B, and 6 for cross C; Figure 5A; Supplementary file 7). One of the QTLs identified in cross A contained the gene MKT1, variation in which is known to affect dozens of growth traits and thousands of molecular traits, including gene expression (Figure 5B; Zhu et al., 2008). Segregants inheriting the RM allele of MKT1 are overrepresented in the G1 stage of the cell cycle and underrepresented in the G2/M stage. This observation suggests that some of the previously described cellular impacts of MKT1 variation may arise as a consequence of its effect on progression of yeast cells through the cell cycle.

Figure 5. Natural genetic variants affect cell-cycle occupancy.

(A) Cell-cycle occupancy QTL map for three different crosses. LOD score for linkage with cell-cycle occupancy (y-axis) is plotted against the genomic location of genetic markers (x-axis). Colored lines show results for different cell-cycle stages as denoted in the legend. Horizontal line corresponds to a family-wise error rate (FWER) threshold of 0.05. Text labels highlight genes with QTL effects shown in panels B–D. Cell-cycle occupancy mapping was not performed on chromosome III. (B) Variation in MKT1 increases G1 occupancy and decreases G2/M occupancy in the BY × RM cross. (C) Variation in GPA1 decreases G1 occupancy and increases S and G2/M occupancy in the YJM145 × YPS163 cross. (D) Variation in CYR1 decreases G1 occupancy and increases S and G2/M occupancy in the CBS2888 × YJM981 cross. Error bars in B–D represent 95% confidence intervals.

Figure 5.

Figure 5—figure supplement 1. Single-cell expression profiling of allele-replacement strains comparing the cell-cycle distribution of strains with the 82R allele of GPA1 to strains with the WT allele of GPA1.

Figure 5—figure supplement 1.

Single-cell RNA sequencing (scRNA-seq) of 26,859 cells from 11,695 cells with the 82R allele and 11,695 cells with the WT (82W) allele. (A) Combined UMAP plot of the integrated single-cell dataset with cells colored according to their cell-cycle stage. (B) Combined UMAP plot of the integrated single-cell dataset with cells colored according to their genotype at position 82 of the GPA1 protein. (C) Allele frequency of the 82R allele across the cell cycle in our allele-replacement single-cell data. Error bars represent the 95% confidence intervals for the proportion of cells in each cell-cycle stage.

We identified a cell-cycle occupancy QTL on chromosome X in cross C whose location coincided with a distant eQTL hotspot (Figures 3D and 5D). This hotspot affected the expression of 224 genes that were enriched for GO terms related to oxidative phosphorylation and the citric acid cycle. We combined the eQTL mapping results with growth QTL (Bloom et al., 2019a) from the same cross and predicted that the likely causal gene underlying this hotspot is CYR1 (Supplementary file 4—Cross C chrX:397734–497167). CYR1 encodes adenylate cyclase, an enzyme which catalyzes the reaction that produces cyclic AMP (Matsumoto et al., 1982). Segregants which carry the CBS2888 allele of CYR1 more frequently occupy the G1 phase of the cell cycle and show improved growth in eight stressful conditions (Figure 5D). The CBS2888 allele of CYR1 contains multiple variants with predicted large deleterious effects on the gene, and these variants may act individually or together to compromise the function of CYR1, with the result that cells with this natural allele may mimic the G1 arrest phenotype observed in temperature-sensitive mutants of CYR1 (Matsumoto et al., 1983). Mutations in CYR1 are known to alter stress tolerance in yeast (Versele et al., 2004; Vanhalewyn et al., 1999; Vianna et al., 2010), providing additional support for our hypothesis that CYR1 is the causal gene underlying this hotspot.

The W82R variant of GPA1 alters gene expression and cell-cycle occupancy

We mapped a cell-cycle occupancy QTL in cross B to a region on chromosome VIII that contains the gene GPA1 (Figure 5C). GPA1 encodes the GTP-binding alpha subunit of a heterotrimeric G protein that mediates the response to mating pheromone (Nakafuku et al., 1987). Segregants carrying the YJM145 allele of GPA1 more frequently occupied G1, the cell-cycle stage during which the mating pathway is active (Figure 5A; Lang et al., 2009). This locus is also a distant eQTL hotspot that influenced the expression of 51 genes (Figure 2C). These genes are enriched for GO terms related to sexual reproduction and cellular response to pheromone (Supplementary file 4—Cross B chrVIII:46887–140660). Variants in the coding sequence of GPA1 are known to alter the expression of genes involved in mating (Yvert et al., 2003). The YJM145 allele of GPA1 contains a variant that changes a tryptophan to an arginine at position 82 of the Gpa1 protein. An evolutionary analysis revealed that this residue has been conserved as tryptophan for ~400 million years in the budding yeasts (Saccharomycotina) (Shen et al., 2018), and is commonly found as a aromatic amino acid (phenylalanine, tyrosine, or tryptophan) across the tree of life (Supplementary file 8). The conservation of this tryptophan is reflected in the prediction that the 82R allele is highly deleterious to the function of Gpa1 (Provean score of –13.915) (Choi and Chan, 2015). We thus hypothesized that this variant in GPA1 is responsible for the observed effects of the chromosome VIII locus on both gene expression and cell-cycle occupancy.

To test this hypothesis, we used CRISPR–Cas9 to engineer each allele of the W82R variant into a common genetic background (Supplementary file 1, table S1; Sadhu et al., 2018). We performed scRNA-seq on 26,859 cells from these isogenic strains that differed only in whether they carried the 82R (N = 11,695) or the 82W allele (N = 15,164) of GPA1. We observed that for 36 of the 50 genes affected by the hotspot and detected in our single-cell validation dataset, the sign of the expression difference was consistent between the eQTL effect and the W82R validation experiment (binomial test, p = 0.0026; Supplementary file 9). Importantly, the gene expression difference in the W82R experiment was statistically significant and concordant with the eQTL effect for all six mating-related genes affected by this hotspot (AGA1, AGA2, MFA1, STE2, FUS3, and PRM5), showing that the 82R allele isolated from other segregating genetic variation increases the expression of genes involved in the mating response. Consistent with the QTL effect, cells with the 82R allele were overrepresented in G1 (46.2% in G1 vs. 42.8% in other stages, logistic regression, p < 10–15; Figure 5—figure supplement 1). We conclude that the W82R variant is responsible for the observed effects of this chromosome VIII QTL on gene expression and cell-cycle occupancy.

The 82R allele of GPA1 increases mating efficiency at the cost of growth rate

One possible consequence of increasing the proportion of cells in G1 is slowing progression through the cell cycle, with a corresponding decrease in the cell doubling rate. Previous work has shown that strains which carry a different variant in GPA1 (a G to T substitution at position 1406 in the coding sequence of the gene, which results in a serine to isoleucine substitution at position 469 of the protein) have decreased growth rates (Lang et al., 2009). We measured growth rates of the engineered strains and found that cells with the 82R allele grew slower than those with the 82W allele (relative growth rate = 0.993, T = −2.592, p = 0.0268), but that this effect was smaller than that observed for the S469I variant (relative growth rate of the 469I allele = 0.984, T = −5.291, p < 0.001; Figure 6A). The 469I allele is known to improve the efficiency of mating, a difference we successfully replicated (relative 469I mating efficiency = 110%, T = 9.73, p < 0.001). We observed that the 82R allele also increased mating efficiency (relative mating efficiency = 107%, T = 6.75, p < 0.001), but to a lesser extent than the 469I allele (82R mating efficiency compared to 469I = 97.2%, T = −2.98, p < 0.001; Figure 6B, Figure 6—figure supplement 1). These results show that natural variants in GPA1 increase mating efficiency at the cost of slower growth. Both effects may be explained by the impact of these variants on the mating pathway—enhanced activity of the pathway facilitates mating in the presence of partners, while inappropriate pathway activation in the absence of partners slows down the G1 phase of the cell cycle, as we have shown for the 82R allele, thereby decreasing growth rate.

Figure 6. The 82R allele of GPA1 increases mating efficiency at the cost of growth rate and is associated with increased outbreeding in natural populations.

(A) Boxplots show growth of allele replacement strains grown in glucose. Points represent replicate measurements of the doublings per hour for each strain. Tukey’s HSD adjusted p-values of pairwise comparisons of allele replacement strains are shown. (B) Boxplots show mating efficiency of allele replacement strains; details as in A. (C) Genome-wide neighbor-joining tree of 1011 sequenced yeast isolates. Strains in which only the 82R allele is present are denoted in blue; strains with support for both 82R and 82W alleles are denoted in red; and strains in which only 82W allele is present are denoted in gray. We observed that the 82R allele is enriched in mosaic strains (allele frequency = 45.3%, permutation test p = 0.007). Other clades mentioned in the text are labeled on the tree.

Figure 6.

Figure 6—figure supplement 1. Mating efficiency of the 82R allele of GPA1 compared to the 469I and wild-type (82W 469S) alleles.

Figure 6—figure supplement 1.

The boxplots show the mating efficiency of different allele-replacement strains. Each point represents replicate measurements of the mating efficiency as estimated by flow cytometry. The plot is further subdivided into two panels depending on the color of the fluorescent protein used to estimate mating efficiency. Mating efficiency values were normalized to the average mating efficiency value of the WT strain. The unadjusted p-values of pairwise t-tests between the alleles are shown.
Figure 6—figure supplement 2. Allele frequency of the 82R allele of GPA1 in sequenced yeast strains.

Figure 6—figure supplement 2.

The allele frequency is further broken down by the clade assignments given in Peter et al., 2018. The number of strains in each clade is given in the brackets next to the clade name.
Figure 6—figure supplement 3. Genome-wide heterozygosity per strain, after classifying strains as having the 82W allele or 82R allele of GPA1.

Figure 6—figure supplement 3.

The violin plots display the distribution of the proportion of sites per strain called heterozygous from Peter et al., 2018. Strains were classified depending on whether it is homozygous for the 82W (N = 760) or 82R (N = 164) alleles. Strains with support for both 82R and 82W alleles (N = 87) were not analyzed here.

The W82R allele of GPA1 is common in the yeast population and is associated with increased outbreeding in natural populations

We searched for the 82R and 469I alleles in a worldwide collection of 1011 S. cerevisiae isolates (Peter et al., 2018) and found that the 469I allele is rare in the population (1.9%), whereas the 82R allele is common (20.5%) (Figure 6C). The 82R allele is fixed in a clade of strains isolated from Brazilian bioethanol (82R allele frequency = 100%) and is found at high frequency (58%) in a clade of strains isolated from Asian fermentation products such as rice wine (Figure 6—figure supplement 2). Peter et al. identified four groups of mosaic strains, which are characterized by admixture of two or more different lineages through outbreeding, and we observed that the 82R allele is enriched in these mosaic strains (allele frequency = 45.3%, permutation test p = 0.007). Strains derived from outbreeding events between genetically distinct parents are expected to show higher rates of heterozygosity than strains resulting from inbreeding or clonal propagation. We compared homozygous (<5% heterozygous sites) and heterozygous (>5% heterozygous sites) strains, as defined by Peter et al., and found that the 82R allele is enriched in heterozygous strains (OR = 3.3, Fisher’s exact test, p < 10–7) and associated with higher rates of heterozygosity (Wilcoxon rank sum test, p < 10–15; Figure 6—figure supplement 3). We have shown that the 82R allele increases mating efficiency in the lab, and these observations suggest that this increased mating efficiency may translate into higher outcrossing rates in nature.

Discussion

We used a one-pot single-cell eQTL mapping design, in which tens of thousands of cells from a segregating population are subjected to scRNA-seq, to map thousands of eQTLs in three different yeast crosses. We identified both local and distant eQTLs and showed that distant eQTLs in all three crosses cluster at hotspot loci that affect the expression of many genes, recapitulating and extending previous observations made with bulk eQTL mapping in the widely studied BY–RM cross. Notably, most of these hotspots are not shared between crosses, which suggests that they are caused by alleles unique to one of the six parent strains. This observation is consistent with the idea that alleles that alter the expression of many genes are likely to be selectively disfavored and therefore present at lower frequencies in the yeast population (Ronald and Akey, 2007; Bloom et al., 2019a).

Prior research has leveraged scRNA-seq data to detect genetic loci that alter gene expression noise, but their effects could not be separated from those on average expression levels and may reflect other sources of extrinsic cell-to-cell variability (Sarkar et al., 2019). To account for extrinsic factors, we obtained thousands of transcriptomes from single cells of hybrid diploid F1 yeast and tested for allele-specific differences in intrinsic gene expression noise. We employed an approach that accounts for average changes in gene expression and identified 874 genes with an allele-specific effect on gene expression noise. For 377 of these genes, the effects on noise could not be accounted for by the empirically observed negative correlation across genes between estimated gene expression noise and average gene expression (Love et al., 2014). We observed allele-specific effects on HSP12 expression noise in two separate crosses. HSP12 plays a role in protection against high-temperature shocks, and the high-noise alleles may provide a fitness advantage during high-temperature stress by creating a subpopulation of cells with very high HSP12 expression that can survive under these conditions. This observation adds to previous reports showing that noise mediated by promoter variants can provide a fitness advantage in times of environmental stress (Liu et al., 2015) and may constrain variation in promoter evolution (Metzger et al., 2015).

Single-cell RNA-seq data allowed us to assign each cell to a cell-cycle stage and explore genetic effects on expression during different stages of the cell cycle. We detected hundreds of eQTLs whose effects differed across cell-cycle stages. Distant eQTLs were more likely than local eQTLs to be cell cycle dependent, perhaps because the effects of distant eQTLs are more indirect and mediated by cellular regulatory networks that are affected by the cell cycle (Albert et al., 2018). Previous work has shown that effects of distant eQTLs are more sensitive than those of local eQTLs to tissue type (Battle et al., 2017) and external environment (Smith and Kruglyak, 2008), and our results extend these findings to show that they are more sensitive to internal cellular states within a single-cell type.

We used the ability to classify genotyped cells by their cell-cycle stage to identify 20 loci that altered the occupancy of different cell-cycle stages, one of which overlapped an eQTL hotspot. We used fine mapping and allele replacement with CRISPR–Cas9 to show that a common variant (W82R) in the gene GPA1 is responsible for the effects of this locus on cell-cycle occupancy and gene expression. We further showed that the 82R allele increases yeast mating efficiency at the cost of slower growth. Natural yeast isolates vary in their propensity to mate or enter the cell cycle upon germination (McClure et al., 2018), leading us to ask whether the 82R allele alters mating efficiency outside the lab. We searched for this allele in a collection of 1011 sequenced yeast isolates (Peter et al., 2018) and found that it is common (20.5%) and occurs more frequently in isolates that show evidence of recent outcrossing, suggesting that the observed increase in mating efficiency in the lab translates into more frequent mating in nature. Outcrossing rate has a major impact on the genetic structure of a population and its response to natural selection (Hartfield et al., 2017), and our results suggest that common variants can alter this key evolutionary parameter.

Studies of genetic effects on gene expression provide a molecular lens into the genetic basis of complex traits. One-pot single-cell eQTL mapping makes such studies cheaper, more efficient, and more flexible. This approach will power broader explorations of how genetic variants influence gene expression in different genetic backgrounds and under different experimental conditions. It also enables integration of information across multiple levels, as shown here for the case of gene expression, cell-cycle occupancy, and mating efficiency. The results of this study have the potential to inform the design, execution, and analysis of other one-pot studies of the effects of genetic variation on gene expression, such as human ‘cell villages’ (Wells et al., 2023; Neavin et al., 2023).

Materials and methods

Unless otherwise specified, computational analyses were performed in the R (v4.4.0) programming language (R Development Core Team, 2022) and visualizations were created using the ggplot2 package (v3.5.1) (Wickham, 2009).

Strains, plasmids, and primers used in this study are listed in Supplementary file 1, tables S1-S3.

Sporulation and single-cell sorting

To assess our ability to reconstruct genotypes and perform eQTL mapping with single-cell data, we performed a single-cell eQTL study using 480 MATa segregants from a cross between a lab (BY) and wine (RM) strain (YLK1993, BY × RM) that were genotyped by whole-genome sequencing and had their transcriptomes measured by bulk RNA-seq (Albert et al., 2018). These 480 segregants were grown to saturation in 96-well plates and pooled at equal volumes to create our study population. This pool of strains was then transferred to minimal YNB medium (6.7 g/l Difco Yeast Nitrogen Base w/o Amino Acids, 2% glucose) and grown overnight at 30°C in YPD (2% bacto peptone, 1% yeast extract, 2% glucose). This pool of segregants was diluted to an OD600 of ~0.05 and allowed to grow until they were in mid-log, defined as the culture having an OD600 of between 0.4 and 0.6. Cells were then harvested using a 125-ml vacumm filtration system (Sigma-Aldrich #Z290467) fitted with a 0.2-μM nylon membrane filters (Sterlitech, #NY0225100). The filter was transferred to a 50-ml conical tube and submerged in liquid nitrogen to flash freeze the yeast cells. The conical tubes were transferred to a –80°C freezer for later use. 393 of the 480 (81.9%) MATa segregants were present in the scRNA-seq data.

For de novo eQTL mapping, we used three parental diploids that were transformed with a fluorescent Magic marker that we have previously described (Treusch et al., 2015). These crosses were YLK3051 (BY × RM; cross A; PLK124), YLK3301 (YJM145 × YPS163; cross B; PLK124), YLK3004 (CBS2888 × YJM981; cross C; PLK73; Tables S3 and S4). Spores containing haploid recombinant progeny were obtained from these diploid strains by growing the parental diploid strains for 5–7 days in SPO++ sporulation medium at room temperature (https://dunham.gs.washington.edu/sporulationdissection.htm). We used a modified random spore prep protocol to obtain individual spores prior to fluorescence-activated cell sorting (FACS). We resuspended 1 ml of the sporulation cultures in 100 μl of water and added 20 μl of 2000 U/ml Zymolyase 20T (Amsbio, #120491-1). The spores were digested for 25 min at 37°C, and a light microscope was used to confirm that the ascii were open. We quenched the digestion with 900 μl of sterile water, and washed them two times with 1 ml of sterile water. 1 ml of 2× YPD was added to the spores and they were grown at 30°C for 7 hr. 250,000 Mata cells were sorted into 5 ml polystyrene round-bottom test tubes (Falcon #352058). 1 ml 2× YNB with 2% glucose was added to the sorted cells in sheath fluid (PBS, Bio-Rad #12012932) and they were transferred to a shaking incubator and grown at 30°C. We sought to minimize the outgrowth of our cultures, and attempted to capture the culture at mid log in 2× YNB with sheath fluid the next morning. However, if the culture grew too much, we diluted the yeast in 1× YNB with 2% glucose to an OD600 of ~0.1 in a 50-ml flask. The culture was grown until the density reached an OD600 of ~0.5 and the yeast were harvested with vacuum filtration. For our experiments with the BY and RM cross (YLK3051), the cells were harvested in 2× YNB medium containing sheath fluid. For our other de novo eQTL experiments, the cells were captured as early as practicable. FACS was performed using a Bio-Rad S3e cell sorter with a cooled sample block.

For our allele-specific expression experiments, parental diploids were grown overnight in YPD medium and diluted in the morning to an OD600 of ~0.05 in 1× YNB. The diploid yeast were pooled in some experiments and for every experiment they were grown to a OD600 of between 0.4 and 0.6. The yeast were then harvested using vacuum filtration, flash frozen, and transferred to the –80°C freezer.

Single-cell library preparation

Yeast cells stored on filters were removed from the –80°C freezer and the cells were fixed in 5 ml of 80% methanol. The yeast were placed in the –20°C freezer for 10 min. The fixed yeast were washed three times with 1 M Sorbitol. We partially digested the cell walls of the fixed yeast cells with Zymolyase 20T (Amsbio, #120491-1) to enable lysis in the Chromium device from 10× Genomics. In more detail, 200 μl of cells were combined with 0.5 µl of 500 U/ml of Zymolyase 20T and 1 μl of 10% Beta-Mercaptoethanol and digested for 20 min at 30°C with gentle shaking (250 rpm). For our diploid experiments, the zymolyase concentration was reduced to 250 U/ml. Phase microscopy was used to confirm that the digestion made partial spheroplasts. We washed digested yeast cells three times in 1 M Sorbitol with the centrifuge cooled to 4°C and after each wash step the yeast were spun at 500 × g for 5 min. The yeast were diluted to 1000 cells/μl and loaded onto the Chromium device using either the Single Cell 3′ Solution V2 or V3 kit. Library prep was carried out according to the manufacturer’s protocol. Prepared libraries were sequenced on the Illumina Novaseq 6000 or Nextseq 2500. Each sequencing library was treated as a different ‘batch’ for all downstream analyses.

scRNA-seq data processing

Sequencing reads were analyzed using Cellranger (version 5.01) using the S2888C reference genome (SGD, R64-2-1) (Engel et al., 2014). The transcriptome was amended to include the 3′ untranslated region (UTR) in the gene model using a custom python script (https://gist.github.com/theboocock/aacf72277a572ee3fe589c430bfd496e; Boocock, 2023). We obtained 3′ UTR lengths from an experimental dataset (Xu et al., 2009) and used the median 3′ UTR length for genes when this information was not available.

Cell-cycle stage classification

We used unsupervised clustering based on cell-cycle gene expression, along with well-described marker gene expression, to classify yeast single cells into five different stages of the cell cycle (M/G1, G1, G1/S, S, and G2/M). Filtered gene expression matrices were loaded into the Seurat package (v4.04) (Hao et al., 2021) of the R programming language (v4.4.0) (R Development Core Team, 2022). 799 cell-cycle genes were obtained from previous cell-cycle synchronization experiments where microarrays were used to measure RNA levels (Spellman et al., 1998). Of these 799 genes, 787 were reliably quantified in our single-cell datasets and were used in the subsequent analysis. For each 10× library, we extracted the 787 cell-cycle genes from our filtered gene expression matrices and performed normalization using SCtransform (Hafemeister and Satija, 2019). We constructed a shared nearest neighbor graph using 12 principal components and identified clusters using the louvain algorithm set to a resolution of 0.3. We performed UMAP with 12 principal components to visualize these clusters in two dimensions. The Wilcoxon rank sum test, as implemented in the FindAllMarkers function of Seurat, was used to identify markers between the clusters. The identified markers were filtered to remove those with a log2 fold change of less than 0.2 and an adjusted p-value of less than 0.05. These markers were annotated with their cell-cycle classification from Spellman et al., 1998.

To classify haploid yeast into their cell-cycle stage, we used a list of 22 cell-cycle genes that are highly expressed in a certain stage and well-represented across our datasets (Spellman et al., 1998). For the M/G1 stage, we used the genes PIR1, EGT2, ASH1, DSE1, DSE2, and CTS1. For the G1 stage, we used the gene MFA1, which in our experiments reproducibly connected the M/G1 and G1/S transition stages. For the G1/S stage, we used the genes CSI1, TOS4, POL30, PRY2, AXL2, and CLN2. For the S stage, we used these genes HTB1 and HHF2. Finally for the G2/M stage we used the genes HOF1, PHO3, MMR1, CLB2, WSC4, CDC5, and CHS2. We intersected these known markers with the differentially expressed transcripts identified with Seurat, as described above. Using this information, we manually classified every unsupervised cluster into one of the five stages of the cell cycle (M/G1, G1, G1/S, S, and G2/M). We could not identify a discrete G1 cell-cycle stage between M/G1 and G1/S in our diploid single cell. We therefore classified our diploid single-cell data into four cell-cycle stages (M/G1, G1/S, S, and G2/M) using the same markers as above excluding MFA1 which is not expressed in diploids.

Single-cell variant counting and genotype inference

We obtained deep (>100×) paired-end sequenced data for our parental strains from previous work (Bloom et al., 2019b) and generated a variant call file (VCF) using the standard Genome Analysis Toolkit (GATK, v3.8.0) variant calling pipeline (DePristo et al., 2011). For each cross, we extracted biallelic SNPs segregating in each cross and used Vartrix version 1.0 (https://github.com/10XGenomics/vartrix; 10XGenomics, 2018) to generate UMI counts for each parental allele from the cell ranger binary sequence alignment/map (BAM) file (Li et al., 2009). Doublet cells were identified by observing an excessive fraction of variant sites where we observed both parental alleles for a given cell barcode, and removed from downstream analyses. Counts at variants with extremely distorted allele frequencies (minor allele frequency <5%) were treated as missing and genotypes at those sites were imputed using the HMM described next.

We used an HMM to infer the genotypes of the recombinant progeny (Broman, 2005; Arends et al., 2010). The HMM is used to calculate the probability of underlying genotypes for each individual and requires three components: (1) prior probabilities for each of the possible genotypes, (2) emission probabilities for observing variant informative reads given each of the possible genotypes, and (3) transition probabilities – the probabilities of recombination occurring between adjacent genotype informative sites.

We defined prior genotype probabilities as 0.5 for each parental variant. Emission probabilities were calculated as previously described for low coverage sequencing data (Dodds et al., 2015; Bilton et al., 2018) under the assumption that the observed counts of reads for both possible variants (Y) at a genotype informative site (g) arise from a random binomial sampling of the alleles present at that site and that sequencing errors (e) occur independently between reads at a rate of 0.005:

p(Yg=A)=(Dr)(1e)r(1(1e))Drp(Yg=B)=(Dr)(e)r(1e)Dr

where D is the total read depth at a genotype informative site for a given individual, r is the total read depth for the A variant at that site, and A represents the variant from the haploid A parent and B represents the variant the haploid B parent. Transition probabilities were derived from existing genetic maps for the crosses (Bloom et al., 2019a). We linearly interpolated genetic map distances from the existing map to all genotype informative sites in our cross progeny.

For our combined single-cell datasets from each of our three crosses, we calculated the fraction of cells with unique genotypes by binarizing the genotype based on whether the genotype probability was greater than 50%. We then calculated the pairwise hamming distance between every pair of cells. Crosses with greater than 10% non-unique genotypes were further processed to retain the unique segregants with the highest UMI count. This filtering step was only needed for the cross of YJM981 and CBS2888, and the procedure reduced the number of cells from 14,823 to 6595.

Local eQTL mapping in previously genotyped segregants

After inferring genotype probabilities, as described in the above section, the genotype probabilities for each single cell were correlated with the genotypes of existing segregants that were previously determined by whole-genome sequencing (Bloom et al., 2013). Segregant identity was determined by picking the previously genotyped segregant with genotypes that were most correlated with the genotype probability vector for a given single cell. Next we fit a negative binomial regression model that included a fixed effect of the natural log of total UMIs per cell (to control for compositional effects), a fixed effect for batch, a fixed effect for the genotypic marker closest to the transcript, and a random effect of segregant identity. Model parameters were estimated using iteratively reweighted least squares as implemented in the ‘nebula’ function in the Nebula R package (He et al., 2021) with default arguments. p-values for the effect of the genotypic marker closest to the transcript were adjusted for multiple testing using the procedure of Benjamini and Hochberg, 1995.

One-pot eQTL mapping

Genotype probabilities were standardized, and markers in very high LD (r > 0.999) were pruned. This LD pruning is approximately equivalent to using markers spaced 4 centimorgans (cm) apart. For each transcript, we counted the number of cells for which at least one UMI count was detected. Transcripts with non-zero counts in at least 128 cells were used for downstream analyses.

For each expressed transcript we first fit the negative binomial generalized linear model:

E[Y]=μ (1)
Var(Y)=μ+1θμ2 (2)
μ=exp(βi+Xtβt+Xbβb+Xcβc) (3)

which has the following log-likelihood:

l(β,θ)=n=1N[(yn+θ)log(μn+θ)ynlog(μn)+log(|Γ(yn+1)|)θlog(θ)+log(|Γ(θ)|)log(|Γ(θ+yn)|)] (4)

And where Y is a vector of UMI counts per cell, Xt is a vector of the natural logarithm of the total UMIs per cell and controls for compositional effects, Xb is an indicator matrix assigning cells to batches, and Xc is the vector of standardized genotype probabilities across cells for the closest genotypic marker to each transcript from the pruned marker set. In addition, β is a vector of estimated coefficients from the model, μn is the expected value of Y for a given cell n, N is the total number of cells, and θ is a negative binomial overdispersion parameter. Model parameters were estimated using iteratively reweighted least squares as implemented in the ‘nebula’ function in the Nebula R package (He et al., 2021). Due to the computational burden of fitting so many GLMs in the context of sc-eQTL mapping, we chose to estimate θ once for each transcript and use that estimate of θ in the additional models for that transcript within the cell-cycle stages, as described below. This approach is conservative, as the effects of unmodeled factors (for example trans eQTLs) will be absorbed into the estimate of overdispersion, resulting in larger estimated overdispersion (1θ) and lower model likelihoods. Computational approaches that re-estimate θ for each model, that jointly model all additive genetic effects, or that regularize θ across models and transcripts (McCarthy et al., 2012), may further increase statistical power to identify linkages.

To evaluate the statistical significance of local eQTLs, a likelihood ratio statistic 2(lnclfc), was calculated, comparing the log-likelihood of this model described above (lfc) to the log-likelihood of the model where β is re-estimated while leaving out the covariate Xc for the closest marker to a transcript eQTL marker (lnc). A permutation procedure was used to calculate FDR-adjusted p-values, and is described further below.

For each expressed transcript in each cell-cycle stage we also scanned the entire genome for eQTLs, enabling detection of trans eQTLs. A similar procedure was used as for the local eQTL-only scan except that Equation 3 was replaced with:

μ=exp(βi+Xtβt+Xbβb+Xgβg) (5)

where Xg is a vector of the scaled genotype probabilities at the gth genotypic marker, and the model is fit separately, one at a time, for each marker across the genome for each transcript. A likelihood ratio statistic for each transcript, within each cell-cycle stage, for each genotypic marker is calculated by comparing this model to the model where β is re-estimated while leaving out the covariate. The likelihood ratio statistic was transformed into an LOD score, by dividing it by 2loge(10). We also used functions in the fastglm R package (Huling, 2022) for this scan, again re-using estimates of θ obtained as described above for each transcript across each cell-cycle stage. For each transcript and each Xg chromosome, QTL peak markers were identified as the marker with the highest LOD score. The 1.5 LOD-drop procedure was used to define approximate 95% CIs for QTL peaks (Dupuis and Siegmund, 1999).

FDR-adjusted p-values were calculated for QTL peaks. They were calculated as the ratio of the number of transcripts expected by chance to show a maximum LOD score greater than a particular LOD threshold versus the number of transcripts observed in the real data with a maximum LOD score greater than that threshold, for a series of LOD thresholds ranging from 0.1 to 0.1+ the maximum observed LOD for all transcripts within a cell-cycle stage assignment, with equal-sized steps of 0.01. The number of transcripts expected by chance at a given threshold was calculated by permuting the assignments of segregant identity within each batch relative to segregant genotypes, calculating LOD scores for all transcripts across the chromosome as described above, and recording the maximum LOD score for each transcript. In each permutation instance, the permutation ordering was the same across all transcripts. We repeated this permutation procedure five times. Then, for each of the LOD thresholds, we calculated the average number of transcripts with maximum LOD greater than the given threshold across the five permutations. We used the ‘approxfun’ function in R to interpolate the mapping between LOD thresholds and FDR and estimate an FDR-adjusted p-value for each QTL peak (Albert et al., 2018). To detect QTL affecting transcript levels across cell-cycle stages and increase power to detect such effects, the same procedure was performed after summing the LODs across cell-cycle stage assignments and summing the LODs from permutations within cell-cycle stage assignments.

Cell-cycle eQTL interactions

We tested for cell cycle by genotype interactions by calculating

Z=βiβj(SEi2+SEj2) (6)

where i and j indicate two cell-cycle stages being contrasted. β and SE correspond to the QTL effect size and standard error from the modeling described above. The Z-statistic was assumed to come from a standard normal distribution, converted to a p-value, and FDR adjusted using the procedure of Benjamini and Hochberg, 1995. The adjustment was performed jointly either across all local eQTL for all transcripts for the local eQTL by cell-cycle interaction test, or jointly across all eQTL hotspots for all transcripts linking to that hotspot for the distant eQTL by cell-cycle interaction test.

Allele-specific expression and overdispersion

Where multiple F1 hybrid diploids were assayed in the same experiment, we used a custom likelihood-based procedure to classify cells to one of the expected input diploids. For each F1 hybrid diploid, for transcripts where an allelic count was observed in at least 64 cells and for cells with less than 20,000 UMIs we tested for allele-specific effects on gene expression noise for each transcript separately using a negative binomial parameterization with the ‘glmmTMB’ function from the glmmTMB R package (Brooks et al., 2017).

Following the model description defined in the above section, ‘One-pot eQTL mapping’, with all terms not described here being the same as they are above, for each transcript Equation 2 was replaced with:

Var(Y)=μ+1θkXkμ2 (7)

And Equation 3 was replaced with:

μ=exp(βi+ZXtβt+ZXbβb+Xkβk+ZXlβl+ZXlXkβm) (8)

where here Y is a vector of all allelic counts from both alleles for a given transcript, stacked, Z is an indicator matrix mapping allelic counts to cells, Xk is an indicator matrix assigning counts to parental alleles, is a vector of the allele-specific dispersion effects, and Xl is an indicator matrix assigning cells to cell-cycle stages. The t-statistic for the allele-specific dispersion effect in the joint model was used to identify significant noise effects. Multiple testing corrected p-values were calculated for this statistic as described in the ‘One-pot eQTL mapping section’.

Noise terminology and overdispersion

Throughout the main text we refer to the overdispersion parameter estimate 1θ as ‘noise’.

Investigating bias in allele-specific expression and noise estimation

We investigated bias in the estimation of the overdispersion parameter by simulating counts from the above model, excluding all covariates except an intercept and allelic fold change from Equation 8. We kept the total number of cells fixed at 5000, and total expression per transcript was fixed at values equivalent to the median or the top 5% of expressed transcripts, reflecting observed values from the BY × RM F1 hybrid diploid. We simulated 250 instantiations each from a grid of allelic fold changes and dispersion fold changes that spanned our observed data, and refit the model. Results are shown in Figure S10. We note that p-values for the test for allelic effects on dispersion appear properly calibrated and non-significant despite the downward bias in overdispersion fold change estimates when alleles have very low counts.

Accounting for the globally observed negative correlation between allele-specific noise and allele-specific average expression

We investigated the relationship between all allele-specific average expression effects and allele-specific noise effects across our F1 hybrids. We observed a negative correlation between allele-specific noise and allele-specific average expression. To identify allele-specific noise effects not explained by this relationship, we fit a robust linear regression model to the observed trend with the ‘lmRob’ function from the robustbase package (v0.99-2) (Maechler, 2023) in data that was filtered to only include genes that had a significant allele-specific average expression effect and/or a significant allele-specific noise effect. We considered an allele-specific noise effect to violate the trend if the 95% CI of this noise effect did not overlap the 95% CI of the trend line.

Cell-cycle occupancy mapping

We treated the assignment of cells to a given cell-cycle stage as separate binary traits. We mapped each binary trait using a logistic regression using the ‘fastglmPure’ function from the fastglm package (Huling, 2022). The effect of ln(UMIs) per cell was added as an additional covariate to control for unwanted technical effects of variable UMIs per cell. A family-wise error rate significance threshold was computed using the procedure of Li and Ji, 2005. Cell-cycle occupancy mapping was not performed on chromosome III due to linkage between the markers on that chromosome and the mating locus, which is highly distorted in allele frequency toward the MATa parent in our experiments.

Hotspot identification, functional annotation, and localization

We sought to identify trans-acting eQTL hotspots in our one-pot eQTL experiments. To achieve this goal, we broke up the yeast genome into bins of 50 kilobases using the GenomicRanges package (v1.50.2) (Lawrence et al., 2013). We counted the number of distant eQTLs, defined here as transcripts physically not located on the same chromosome as the bin that had eQTL peaks overlapping the bin. We then asked whether under a Poisson model the number of linking transcripts could be expected by chance. We adjusted the p-value using a Bonferonni correction for the total number of bins. The mean of the Poisson distribution was taken to be the average number of transcripts linking to each bin. We merged significant bins if they were adjacent to each other giving us a final set of hotspots bins. We repeated this analysis on the previously obtained bulk RNA sequencing based eQTL mapping results (Albert et al., 2018) to enable us to compare our one-pot eQTL results from this cross to the results from bulk eQTL mapping, using consistent methodology.

We set out to annotate hotspots with functional information to identify candidate causal genes and variants. To approximate the CI of a hotspot, we extracted the 20 most significant eQTLs within each hotspot and took the 10th and 90th percentile left and right CI of these eQTL to be the hotspot CI. We conservatively extended the CI of each hotspot by two markers on either side. We intersected the CI of each hotspot with four different annotations: (1) causal genes from a previous QTL mapping study using segregants from the same crosses (Bloom et al., 2019a), (2) QTL CIs from the same study, (3) cell-cycle occupancy QTL from our study, and (4) genetic variants segregating in each cross. The functional consequences of these variants were generated with snpEFF (v5.1) (Cingolani et al., 2012). We added the allele frequency and PROVEAN (Choi and Chan, 2015) score of variants found in the 1,011 yeast genomes project (Peter et al., 2018). We performed GO (Ashburner et al., 2000) and Kyoto Encyclopedia of Genes and Genomics (Kanehisa and Goto, 2000) enrichment analyses of significant eQTLs within each hotspot using TopGO (v2.56.0) (Alexa and Rahnenfuhrer, 2022). These enrichments were calculated for the molecular function and biological process categories of the GO. The background for these enrichments were genes that were tested for eQTLs in each dataset and only genes that were not found on the same chromosome as the hotspot QTL. Fisher’s exact test was used to calculate a p-value, any enrichment with a log2 fold change of greater than 2 and a nominal p < 10–5 were output into a table for further inspection. These analyses were combined into a single worksheet, one for each hotspot, and we explored these worksheets to identify candidate causal genes and variants.

GPA1 allele-replacement strains

To generate allele replacement strains for the W82R and S469I variants of GPA1, we used a single-guide RNA CRISPR system to introduce double-strand breaks near our region of interest and provided coupled repair templates to replace the desired allele (Sadhu et al., 2018). The specifics of this procedure are described below.

We engineered a yeast strain derived from BY4741 (YLK3221; BY4741; Mata met15Δ his3Δ1 leu2Δ0 ura3Δ0 nej1Δ::KanMX) with all three natural combinations (82W 469S, 82W 469I, 82R 469S) of the W82R and S469I alleles. This strain contained a galactose inducible Cas9 (PLK77, p415-GalL-Cas9-CYC1t) (DiCarlo et al., 2013). Since BY4741 had the 469I allele, we first transformed a plasmid (PLK126) derived from PLK88 (SNR52p-gRNA(BstEII/SphI).CAN1.Y-SUP4t) (Schubert et al., 2022) designed to change the codon at that position to serine (S) – the variant commonly found in the population (chrVIII:113512_T/C). This plasmid was also designed to introduce a second edit in the 3′ UTR of GPA1 (chrVIII:113496_T/C), which was needed to break the PAM site of the guide RNA used for editing. We grew the cells in galactose medium to induce the expression of Cas9, and confirmed that the mutation was incorporated into the genome of individual yeast using colony PCR and sanger sequencing (YLK3302 and YLK3303). To create the 82R mutation in these strains, we cured the yeast of the guide RNA plasmid and transformed another plasmid designed to change the codon at position 82 from tryptophan (W) to arginine (R) (PLK125). This plasmid introduces two edits, one at position (chrVIII:114674_A/G), this is the common variant that is found in the population, and another edit (chrVIII:114672_C/T) that makes a synonymous codon change and was needed to break the PAM site used for editing. We confirmed that these edits were incorporated into the genome of each yeast with Sanger sequencing (YLK3304 and YLK3305). We cured our allele-replacement strains of their guide RNA and Cas9 plasmids and obtained two colonies of each strain for use in subsequent phenotyping (YLK3306–YLK3311). We cured our parent strain (YLK3221) of the Cas9 plasmid and obtained two colonies for use in subsequent phenotyping (YLK3312–YLK3313).

Single-cell sequencing of GPA1 allele-replacement strains

Four biological replicates of allele replacement strains with the two natural allelic combinations of GPA1 were individually pooled (YLK3306 and YLK3307 82W 469S, 420/421 82R 469S) and grown overnight in YPD at 30°C. These strains were then diluted to ~0.1 OD600 in a 250-ml flask containing 50 ml of YNB medium with a complete supplement mixture and allowed to grow for two doublings to an OD600 of ~0.5. The yeast were harvested using vacuum filtration. Samples were flash frozen in dry ice and ethanol and transferred to the –80°C freezer. Prior to loading the Chromium device, frozen cells were fixed in 80% methanol for 10 min, washed three times with sorbitol, and diluted to 1000 cells/μl. The 10× reagents were modified by removing 1 μl of beads and replacing it with 1 μl of zymolyase according to Vermeersch et al., 2022. Using these modified reagents, the Chromium device was loaded conventionally and all other aspects of library preparation and loading was done according to 10× and Illumina standard protocols.

Growth measurements

All allele replacement strain growth experiments were performed at 30° in YP medium (2% bacto-peptone, 1% yeast extract) supplemented with 2% glucose using the approach described in Boocock et al., 2021. Strains were incubated with fast shaking in a Biotek synergy 2 plate reader. Before each experiment, strains were grown to saturation in our plate reader in 96-well plates (Corning, Flat Bottom with Lid, #3370) in 2% glucose. Strains were then diluted 1:100 or transferred with a plastic 96-well pinner into new 96-well plates and transferred to a Bio-Tek Synergy plate reader, which automatically took optical density measurements (OD600) measurements every 15 min.

Growth rate calculations

Growth rate was quantified as the geometric mean rate of growth (GMR). Our procedure for calculating the GMR follows that described in Brem et al. (Roop et al., 2016). Briefly, we fit a spline in R using the ‘splinefun’ function, and the time spent (t) between OD 0.2 and 0.8 was calculated. The GMR was then estimated as the log(0.8/0.2)/t. Plates were manually inspected for outliers and for those plates we retained any GMR greater than 0.07 and less than 0.085. This filter removed 20 data points out of 360 from further analysis. We converted the GMR of each well into doublings per hour. To determine whether our allele replacement strains changed the growth rate, we fit a linear model with an additive effect for genotype and plate. We used the emmeans package (v1.10.1) (Lenth, 2023) to extract pairwise contrasts and performed multiple-hypothesis correction with the Tukey method. For visualization purposes, we extracted the residuals from a model with the additive plate effect and added the intercept from this model to these residuals. We created a boxplot of these residuals split by genotype with the ggplot2 (v3.5.1) package (Wickham, 2009).

Mating efficiency experiments

We performed competitive mating assays to test whether our GPA1 allele replacement strains (YLK3306 82W 469S, YLK3308 82R 469S, YLK3312 82W 469I) altered the efficiency of mating. Because our allele replacement strains were isogenic except for the engineered variants, we adapted a classic yeast mating assay to use fluorescence instead of selectable markers (Sprague, 1991). We transformed each of our MATa allele replacement strains with a constitutively expressed mTurquoise (blue) and mRuby2 (red) fluorescent marker on a HIS3 expressing 2 µm plasmid created with the Moclo yeast cloning toolkit (YLK3314–YLK3319) (Lee et al., 2015). Each strain was transformed independently with both plasmids; this was done to ensure that we could control for any possible effect of the fluorescent protein on mating efficiency.

We first grew up pairs of strains with combinations of fluorescent markers for 2 days in selective medium (YNB complete -his + 2% glucose). We grew up the mating tester strain (YLK3218, BY4742; MATα leu2Δ his3Δ ura3Δ lys2Δ) overnight in complete minimal medium (YPD +2% glucose). We mixed 2 ml containing 106 cells from both of the allele replacement strains. We plated 200 μl of the mix on two agar plates containing selective medium at a high density. These plates were used to estimate the ratio of strains in the mix before performing the mating assay. We combined 2 ml of the mix with 1 mLcontaining 107 cells from the mating tester strain. The 3 ml mix of all three strains was filtered through a 125-ml vacuum filtration system (Sigma-Aldrich #Z290467) fitted with a 0.2 μM nylon membrane filters (Sterlitech, #NY0225100). We placed the filter on a YPD agar plate facing upwards to facilitate mating between the MATa and MATɑ yeast strains. After 4 hr, we placed the cells in a 50-ml conical tube containing 1 ml of water and washed the yeast off the filter. We transferred the yeast onto 3 agar plates that select for the plasmid and diploids (YNB minimal + leu + ura + 2% glucose). These plates were used to estimate the ratio of strains after mating had occurred. The pre- and post-mating plates were then allowed to incubate for 3 days at 30°C.

We used flow cytometry to measure the ratio of fluorescence before and after mating with a Bio-Rad S3e cell sorter. The FSC, SSC, and fluorescence gates were calibrated using one of the pre-mating samples, and they remained fixed for other samples. 106 events were captured from each sample. The pre-mating values were averaged and subtracted from the post-mating values. To calculate the mating efficiency, we assumed that the wild-type strain had an efficiency of 100% and we normalized every sample to the average of the wild-type strain for each color. We used linear regression to determine whether the allele replacement strains altered mating efficiency. We used the emmeans package (v1.10.1) (Lenth, 2023) to extract pairwise contrasts and performed multiple-hypothesis correction with the Tukey method. This whole experimental procedure was repeated once more on a different day; and, in total, each strain was grown up four times, twice for each genotype and fluorescent plasmid combination.

Population genetics analysis

We downloaded the reads for the 1011 yeast strains from Peter et al. from the short-read archive and generated a VCF using the standard Genome Analysis Toolkit (GATK, v4.2.0) variant calling pipeline (DePristo et al., 2011). Variants were filtered using bcftools (v1.15) (Danecek et al., 2021) to include any sites with a mapping quality (MQ) greater than 40, mapping quality rank sum (MQRankSum) greater than –12.5, read position rank sum (ReadPosRankSum) greater than –8, quality by depth (QD) greater than 2, total depth less than 986,842.5, variant quality greater than 100, and fraction missing (F_MISS) less than 10%. We further filtered our VCF to only include biallelic SNP sites with a population frequency of greater than 5%. We made a dissimilarity matrix using SNPRelate (v1.38.0) (Zheng et al., 2012) and built a neighbor-joining tree with ape (v5.8) (Paradis and Schliep, 2019) and visualized the tree with the ggtree package (v3.12.0) (Yu et al., 2017).

To assess whether the W82R variant was enriched in mosaic clades of yeast as defined by Peter et al., 2018, we extracted all variants in the population with an allele frequency of between 18.5% and 22.5%, 2% plus or minus the W82R variant allele frequency of 20.5%. There are 8136 variants with this frequency in the population. We tested whether the allele frequency of the W82R in the mosaic clades was significantly enriched in mosaic clades of yeast by randomly sampling with replacement these variants 10,000 times and estimating their allele frequency in the mosaic clades. The number of times the allele frequency of these variants was greater than the W82R variant was used to generate a permutation p-value.

Code availability

Code for the HMM, eQTL mapping, and gene expression noise analysis can be found at https://github.com/joshsbloom/single_cell_eQTL/tree/master/yeast/code (copy archived at Bloom, 2025). Data and code to recreate the figures in the manuscript can be found at https://github.com/theboocock/yeast_single_cell_post_mapping_analysis (copy archived at Boocock, 2024b). This repository also contains the code that performs trans-eQTL hotspot analysis, cell-cycle stage assignment, raw data processing, and additional links to generated data.

Acknowledgements

We thank Frank Albert, Stefan Zdraljevic, and Giancarlo Bruni for helpful manuscript feedback and edits. We thank Eyal Ben-David and Longhua Guo for helpful discussions during the early stages of the project. This work was supported by funding from the Howard Hughes Medical Institute (to LK) and NIH grant 2RO1GM102308-06 (to LK).

Appendix 1

Appendix 1—key resources table.

Reagent type (species) or resource Designation Source or reference Identifiers Additional information
Strain, strain background (Saccharomyces cerevisiae) YLK3301 Bloom et al., 2019a YPS163 MatA ho∆::HphMX flo8∆::NatMX (YLK2438) x YJM145 MatAlpha ho∆::HphMX flo8∆::NatMX (YLK2436)
Strain, strain background (Saccharomyces cerevisiae) YLK3004 Bloom et al., 2019a YJM981 MatAlpha ho∆::HphMX (yST191) x CBS2888 MatA ho∆::KanMX (Box A1 C3)
Strain, strain background (Saccharomyces cerevisiae) YLK3051 Bloom et al., 2019a BY MatA (YLK1879) x RM MatAlpha AMN1-BY ho∆::HphMX flo8∆::NatMX (YLK1950)
Strain, strain background (Saccharomyces cerevisiae) YLK1993 Albert et al., 2018 BY MatA (YLK1879) x RM MatAlpha AMN1-BY ho∆::HphMX flo8∆::NatMX (YLK1950)
Strain, strain background (Saccharomyces cerevisiae) YLK3221 Sadhu et al., 2018 Mata met15Δ his3Δ1 leu2Δ0 ura3Δ0 nej1Δ::KanMX Gpa1-82W,469S [p415 GalL-Cas9-Cyc1t]
Strain, strain background (Saccharomyces cerevisiae) YLK3302 This paper Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S [p415 GalL-Cas9-Cyc1t]
Strain, strain background (Saccharomyces cerevisiae) YLK3303 This paper Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S [p415 GalL-Cas9-Cyc1t]
Strain, strain background (Saccharomyces cerevisiae) YLK3304 This paper Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W [p415 GalL-Cas9-Cyc1t]
Strain, strain background (Saccharomyces cerevisiae) YLK3305 This paper Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W [p415 GalL-Cas9-Cyc1t]
Strain, strain background (Saccharomyces cerevisiae) YLK3306 This paper Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S
Strain, strain background (Saccharomyces cerevisiae) YLK3307 This paper Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S
Strain, strain background (Saccharomyces cerevisiae) YLK3308 This paper Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W
Strain, strain background (Saccharomyces cerevisiae) YLK3309 This paper Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W
Strain, strain background (Saccharomyces cerevisiae) YLK3310 This paper Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W
Strain, strain background (Saccharomyces cerevisiae) YLK3311 This paper Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W
Strain, strain background (Saccharomyces cerevisiae) YLK3312 This paper Mata met15Δ his3Δ1 leu2Δ0 ura3Δ0 nej1Δ::KanMX Gpa1-82W, Gpa1-469I
Strain, strain background (Saccharomyces cerevisiae) YLK3313 This paper Mata met15Δ his3Δ1 leu2Δ0 ura3Δ0 nej1Δ::KanMX Gpa1-82W Gpa1-469I
Strain, strain background (Saccharomyces cerevisiae) YLK3314 This paper Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S [PLK127]
Strain, strain background (Saccharomyces cerevisiae) YLK3315 This paper Mata chrVII:113512_C chrVIII:113496_C Gpa1-469S [PLK128]
Strain, strain background (Saccharomyces cerevisiae) YLK3316 This paper Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W [PLK127]
Strain, strain background (Saccharomyces cerevisiae) YLK3317 This paper Mata chrVII:113512_C chrVIII:113496_C chrVIII:114674_G chrVIII:114672_T Gpa1-469S Gpa1-82W [PLK128]
Strain, strain background (Saccharomyces cerevisiae) YLK3318 This paper Mata met15Δ his3Δ1 leu2Δ0 ura3Δ0 nej1Δ::KanMX Gpa1-82W, Gpa1-469S [PLK127]
Strain, strain background (Saccharomyces cerevisiae) YLK3319 This paper Mata met15Δ his3Δ1 leu2Δ0 ura3Δ0 nej1Δ::KanMX Gpa1-82W, Gpa1-469S [PLK128]
Recombinant DNA reagent MF2 p41 neo (plasmid) Treusch et al., 2015 RRID:Addgene_58564 Flourescent magic marker plasmid with KanMX resistant cassette
Recombinant DNA reagent MF2 p41 nat (plasmid) Treusch et al., 2015 RRID:Addgene_58546 Flourescent magic marker plasmid with NatMX resistant cassette
Recombinant DNA reagent p415 GalL-Cas9-Cyc1t (plasmid) DiCarlo et al., 2013 RRID:Addgene_43804 Gal inducible CAS9 with LEU cassette
Recombinant DNA reagent SNR52p-gRNA(BstEII/SphI).
CAN1.Y-SUP4t (plasmid)
DiCarlo et al., 2013 RRID:Addgene_98814 Guide RNA expression plasmid with URA resistance
Recombinant DNA reagent plk88+GPA1 novel variant (plasmid) This paper Guide RNA and coupled repair template to change 82 W to 82 R in Gpa1
Recombinant DNA reagent plk88+GPA1 reversion (plasmid) This paper Guide RNA and coupled repair template to change 82I to 82 S in Gpa1
Recombinant DNA reagent HIS3 2 um
with ruby2 (plasmid)
This paper ConLS-pTef1-mRuby2-tEno1-ConR1-His3-2micron-AmpR
Recombinant DNA reagent HIS3 2 um with
mTurquoise (plasmid)
This paper ConLS-pTef1-mTurquoise-tEno1-ConR1-His3-2micron-AmpR
Commercial assay or kit Chromium Single Cell 3' v3 10 x Genomics 10 X:CG000201
Software, algorithm HMM, eQTL
mapping, and noise analysis code
This paper Avaliable at https://github.com/joshsbloom/single_cell_eQTL, archived at: https://doi.org/10.5281/zenodo.14834926
Software, algorithm 3' UTR extension
script for cell ranger
This paper Available at https://gist.github.com/theboocock/aacf72277a572ee3fe589c430bfd496e
Software, algorithm Figure creation
code
This paper Avaliable at https://github.com/theboocock/yeast_single_cell_post_mapping_analysis, archived at: https://doi.org/10.5281/zenodo.14834916

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Joshua S Bloom, Email: JBloom@mednet.ucla.edu.

Leonid Kruglyak, Email: LKruglyak@mednet.ucla.edu.

Rachel Brem, University of California, Berkeley, United States.

Alan M Moses, University of Toronto, Canada.

Funding Information

This paper was supported by the following grants:

  • National Institutes of Health 2RO1GM102308-06 to Leonid Kruglyak.

  • Howard Hughes Medical Institute to Leonid Kruglyak.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Resources, Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing.

Conceptualization, Software, Investigation, Methodology.

Validation, Investigation, Methodology.

Validation.

Validation, Investigation.

Validation.

Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing – original draft, Project administration, Writing – review and editing.

Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing – original draft, Project administration, Writing – review and editing.

Additional files

Supplementary file 1. Yeast strains (Table S1), plasmids (Table S2), and primers used in this study (Table S3); summary information for the single-cell expression data generated in this study (Table S4); variance explained by cell-cycle stage and the effect of segregant per transcript for the 393 previously generated segregants (Table S5); and Local eQTL summary statistics for the 393 previously generated segregants (Table S6).
elife-95566-supp1.xlsx (956.7KB, xlsx)
Supplementary file 2. Local eQTL summary statistics for our one-pot eQTL experiments.

Each sheet has the local eQTL summary statistics for the three crosses we examined (Table S1 cross A=BY and RM, Table S2 cross B=YJM145 and YPS163, Table S3 cross C=CBS2888xYJM981). The column ‘has cell-cycle interaction’ is set to 1 if a cell-cycle interaction was observed at a FDR of <5%. For cross A, the summary statistics from bulk eQTL are provided in additional columns. For cross A, the summary statistics from bulk eQTL7 are provided in additional columns. Missing values in the bulk eQTL columns indicate that a gene did not pass our filtering criteria, and a local eQTL test was not performed.

elife-95566-supp2.xlsx (1.2MB, xlsx)
Supplementary file 3. Distant eQTL summary statistics for our one-pot eQTL experiments.

Each sheet has the local eQTL summary statistics for the three crosses we examined (Table S1 cross A=BY and RM, Table S2 cross B=YJM145 and YPS163, Table S3 cross C=CBS2888xYJM981). The column ‘has cell-cycle interaction’ is set to 1 if a cell-cycle interaction was observed at a FDR of <5%. The column ‘eQTL in hotspot’ is set to 1 if a distal eQTL falls within a significant hotspot bin.

elife-95566-supp3.xlsx (344KB, xlsx)
Supplementary file 4. Hotspot annotation files for each cross (cross A = BY and RM, cross B = YJM145 and YPS163, cross C = CBS2888xYJM981, A_bulk = BY and RM with expression measured by bulk RNA-seq Albert et al., 2018).
elife-95566-supp4.zip (36.1MB, zip)
Supplementary file 5. Single-cell allele-specific expression (ASE) summary statistics.

Each sheet has the ASE summary statistics for the parental hybrids of the three crosses we examined (Table S1 cross A=BY and RM, Table S2 cross B=YJM145 and Table S3 YPS163, cross C=CBS2888xYJM981). The local eQTL summary statistics were included from the corresponding one-pot eQTL experiment for each transcript that was tested there. Missing values in the one-pot eQTL columns indicate that a gene did not pass our filtering criteria, and a local eQTL test was not performed.

elife-95566-supp5.xlsx (1.2MB, xlsx)
Supplementary file 6. Summary statistics for allele-specific effects on noise and average expression.

Each sheet has the summary statistics for the parental hybrids of the three crosses we examined (Table S1 cross A=BY and RM, Table S2 cross B=YJM145 and YPS163, Table S4 cross C=CBS2888xYJM981). The estimates and p-values are derived from the joint model that contains cell-cycle, cell-cycle interactions, and allelic effects on both the mean and noise. The column ‘Overlaps global trend line’ is set to 1 if the 95% confidence interval of the noise effect did not overlap the 95% confidence interval of the global trend line.

elife-95566-supp6.xlsx (187.7KB, xlsx)
Supplementary file 7. Cell-cycle occupancy QTL summary statistics for our one-pot eQTL experiments.

Each sheet has the cell-cycle occupancy QTL summary statistics for the three crosses we examined (Table S1 cross A=BY and RM, Table S2 cross B=YJM145 and YPS163, Table S3 cross C=CBS2888xYJM981).

elife-95566-supp7.xlsx (13.8KB, xlsx)
Supplementary file 8. Alignment and tree files from a clustered blast search of the yeast Gpa1 protein sequence (YHR005C).
elife-95566-supp8.zip (416KB, zip)
Supplementary file 9. Single-cell RNA sequencing of allele-replacement strains with the 82R and 82W alleles of GPA1.

The distant eQTL effects near GPA1 from our one-pot eQTL experiment are contrasted to a differential expression analysis of our allele replacement strains. The p-values from our one-pot eQTL study were adjusted using a permutation procedure, and a Bonferroni correction was used to adjust the single-cell validation p-values. Missing values in the single-cell validation columns indicate that a gene did not pass our filtering criteria, and a differential expression test was not performed.

elife-95566-supp9.xlsx (15.5KB, xlsx)
MDAR checklist

Data availability

Sequencing data is available under the NCBI BioProject PRJNA1049497. Raw and processed data can be found at the locations https://doi.org/10.5061/dryad.xgxd254qb and https://zenodo.org/doi/10.5281/zenodo.12695127, respectively. Finemapping spreadsheets can be found on GitHub at https://github.com/theboocock/finemapping_spreadsheets_single_cell (copy archived at Boocock, 2024a).

The following datasets were generated:

Bloom J. 2024. Data for: Raw count data, transcribed variant count data, and reference genomic annotation files for Boocock et al. 2024. Dryad Digital Repository.

Boocock J. 2024. Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction. Zenodo.

Boocock J. 2024. Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction. NCBI BioProject. PRJNA1049497

References

  1. 10XGenomics VarTrix. 1.0GitHub. 2018 https://github.com/10XGenomics/vartrix
  2. Aguet F, Anand S, Ardlie KG, Gabriel S, Getz GA, Graubert A, Hadley K, Handsaker RE, Huang KH, Kashin S, Li X, MacArthur DG, Meier SR, Nedzel JL, Nguyen DT, Segrè AV, Todres E, Balliu B, Barbeira AN, Battle A, Bonazzola R, Brown A, Brown CD, Castel SE, Conrad DF, Cotter DJ, Cox N, Das S, de Goede OM, Dermitzakis ET, Einson J, Engelhardt BE, Eskin E, Eulalio TY, Ferraro NM, Flynn ED, Fresard L, Gamazon ER, Garrido-Martín D, Gay NR, Gloudemans MJ, Guigó R, Hame AR, He Y, Hoffman PJ, Hormozdiari F, Hou L, Im HK, Jo B, Kasela S, Kellis M, Kim-Hellmuth S, Kwong A, Lappalainen T, Li X, Liang Y, Mangul S, Mohammadi P, Montgomery SB, Muñoz-Aguirre M, Nachun DC, Nobel AB, Oliva M, Park Y, Park Y, Parsana P, Rao AS, Reverter F, Rouhana JM, Sabatti C, Saha A, Stephens M, Stranger BE, Strober BJ, Teran NA, Viñuela A, Wang G, Wen X, Wright F, Wucher V, Zou Y, Ferreira PG, Li G, Melé M, Yeger-Lotem E, Barcus ME, Bradbury D, Krubit T, McLean JA, Qi L, Robinson K, Roche NV, Smith AM, Sobin L, Tabor DE, Undale A, Bridge J, Brigham LE, Foster BA, Gillard BM, Hasz R, Hunter M, Johns C, Johnson M, Karasik E, Kopen G, Leinweber WF, McDonald A, Moser MT, Myer K, Ramsey KD, Roe B, Shad S, Thomas JA, Walters G, Washington M, Wheeler J, Jewell SD, Rohrer DC, Valley DR, Davis DA, Mash DC, Branton PA, Barker LK, Gardiner HM, Mosavel M, Siminoff LA, Flicek P, Haeussler M, Juettemann T, Kent WJ, Lee CM, Powell CC, Rosenbloom KR, Ruffier M, Sheppard D, Taylor K, Trevanion SJ, Zerbino DR, Abell NS, Akey J, Chen L, Demanelis K, Doherty JA, Feinberg AP, Hansen KD, Hickey PF, Jasmine F, Jiang L, Kaul R, Kibriya MG, Li JB, Li Q, Lin S, Linder SE, Pierce BL, Rizzardi LF, Skol AD, Smith KS, Snyder M, Stamatoyannopoulos J, Tang H, Wang M, Carithers LJ, Guan P, Koester SE, Little AR, Moore HM, Nierras CR, Rao AK, Vaught JB, Volpi S, The GTEx Consortium The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369:1318–1330. doi: 10.1126/science.aaz1776. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Albert FW, Kruglyak L. The role of regulatory variation in complex traits and disease. Nature Reviews. Genetics. 2015;16:197–212. doi: 10.1038/nrg3891. [DOI] [PubMed] [Google Scholar]
  4. Albert FW, Bloom JS, Siegel J, Day L, Kruglyak L. Genetics of trans-regulatory variation in gene expression. eLife. 2018;7:e35471. doi: 10.7554/eLife.35471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Alexa A, Rahnenfuhrer J. Enrichment analysis for gene ontology. Bioconductor; 2022. [DOI] [Google Scholar]
  6. Antolović V, Miermont A, Corrigan AM, Chubb JR. Generation of single-cell transcript variability by repression. Current Biology. 2017;27:1811–1817. doi: 10.1016/j.cub.2017.05.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Arends D, Prins P, Jansen RC, Broman KW. R/QTL: high-throughput multiple QTL mapping. Bioinformatics. 2010;26:2990–2992. doi: 10.1093/bioinformatics/btq565. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G. Gene Ontology: tool for the unification of biology. Nature Genetics. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Battle A, Brown CD, Engelhardt BE, Montgomery SB, GTEx Consortium. Laboratory, Data Analysis and Coordinating Center—Analysis Working Group. Statistical Methods groups—Analysis Working Group. Enhancing GTEx groups. NIH Common Fund. NIH/NCI. NIH/NHGRI. NIH/NIMH. NIH/NIDA. Biospecimen Collection Source Site—NDRI. Biospecimen Collection Source Site—RPCI. Biospecimen Core Resource—VARI. Brain Bank Repository—University of Miami Brain Endowment Bank. Leidos Biomedical—Project Management. ELSI Study. Genome Browser Data Integration &Visualization—EBI. Genome Browser Data Integration &Visualization—UCSC Genomics Institute, University of California Santa Cruz. Lead analysts. Laboratory, Data Analysis and Coordinating Center. NIH program management. Biospecimen collection. Pathology. eQTL manuscript working group Genetic effects on gene expression across human tissues. Nature. 2017;550:204–213. doi: 10.1038/nature24277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Ben-David E, Boocock J, Guo L, Zdraljevic S, Bloom JS, Kruglyak L. Whole-organism eQTL mapping at cellular resolution with single-cell sequencing. eLife. 2021;10:e65857. doi: 10.7554/eLife.65857. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society Series B. 1995;57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x. [DOI] [Google Scholar]
  12. Bilton TP, Schofield MR, Black MA, Chagné D, Wilcox PL, Dodds KG. Accounting for errors in low coverage high-throughput sequencing data when constructing genetic maps using biparental outcrossed populations. Genetics. 2018;209:65–76. doi: 10.1534/genetics.117.300627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bloom JS, Ehrenreich IM, Loo WT, Lite TLV, Kruglyak L. Finding the sources of missing heritability in a yeast cross. Nature. 2013;494:234–237. doi: 10.1038/nature11867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bloom JS, Boocock J, Treusch S, Sadhu MJ, Day L, Oates Barker H, Kruglyak L. Rare variants contribute disproportionately to quantitative trait variation in yeast. eLife. 2019a;8:e49212. doi: 10.7554/eLife.49212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Bloom JS, Boocock J, Treusch S, Sadhu MJ, Day L, Oates Barker H, Kruglyak L. Rare Variants Contribute Disproportionately to Quantitative Trait Variation in Yeast. bioRxiv. 2019b doi: 10.1101/607291. [DOI] [PMC free article] [PubMed]
  16. Bloom JS. Some code for single cell eqtl analysis. swh:1:rev:9c5f66a3185486fe38437668a1b6e37b34108d3eSoftware Heritage. 2025 https://archive.softwareheritage.org/swh:1:dir:6866b0beab201fee41c1ff2d5e4b4c7721ceef14;origin=https://github.com/joshsbloom/single_cell_eQTL;visit=swh:1:snp:e9057b124351ba5c989f84c341ebaee330dfc525;anchor=swh:1:rev:9c5f66a3185486fe38437668a1b6e37b34108d3e
  17. Boocock J, Sadhu MJ, Durvasula A, Bloom JS, Kruglyak L. Ancient balancing selection maintains incompatible versions of the galactose pathway in yeast. Science. 2021;371:415–419. doi: 10.1126/science.aba0542. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Boocock J. Alignment_to_three_prime. GitHub Gist. 2023 https://gist.github.com/theboocock/aacf72277a572ee3fe589c430bfd496e
  19. Boocock J. Finemapping_spreadsheets_single_cell. swh:1:rev:91913be1e10e06b93f356b16cf2bfcd427729d5dSoftware Heritage. 2024a https://archive.softwareheritage.org/swh:1:dir:224513d0e5261c396f1c531a66f96dcca0d3384c;origin=https://github.com/theboocock/finemapping_spreadsheets_single_cell;visit=swh:1:snp:04623015ef4bfb728b162fa3881088788137e2c5;anchor=swh:1:rev:91913be1e10e06b93f356b16cf2bfcd427729d5d
  20. Boocock J. Yeast_single_cell_post_mapping_analysis. swh:1:rev:80dc233886035dffd74ebb91be3f61a8656fe53cSoftware Heritage. 2024b https://archive.softwareheritage.org/swh:1:dir:f1a038ec7dd523fc76415cd910e47475a0cb614b;origin=https://github.com/theboocock/yeast_single_cell_post_mapping_analysis;visit=swh:1:snp:f94d183a88f41c114cff7cb48f349959a7e2f801;anchor=swh:1:rev:80dc233886035dffd74ebb91be3f61a8656fe53c
  21. Brem RB, Yvert G, Clinton R, Kruglyak L. Genetic dissection of transcriptional regulation in budding yeast. Science. 2002;296:752–755. doi: 10.1126/science.1069516. [DOI] [PubMed] [Google Scholar]
  22. Brettner L, Eder R, Schmidlin K, Geiler Samerotte K. An ultra high-throughput, massively multiplexable, single-cell rna-seq platform in yeasts. bioRxiv. 2022 doi: 10.1101/2022.09.12.507686. [DOI] [PMC free article] [PubMed]
  23. Broman KW. The genomes of recombinant inbred lines. Genetics. 2005;169:1133–1146. doi: 10.1534/genetics.104.035212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Brooks M, Kristensen K, Benthem K, Magnusson A, Berg C, Nielsen A, Skaug H, Mächler M, Bolker B. glmmTMB balances speed and flexibility among packages for zero-inflated generalized linear mixed modeling. The R Journal. 2017;9:378. doi: 10.32614/RJ-2017-066. [DOI] [Google Scholar]
  25. Bun-Ya M, Nishimura M, Harashima S, Oshima Y. The PHO84 gene of Saccharomyces cerevisiae encodes an inorganic phosphate transporter. Molecular and Cellular Biology. 1991;11:3229–3238. doi: 10.1128/mcb.11.6.3229-3238.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Burga A, Ben-David E, Lemus Vergara T, Boocock J, Kruglyak L. Fast genetic mapping of complex traits in C. elegans using millions of individuals in bulk. Nature Communications. 2019;10:2680. doi: 10.1038/s41467-019-10636-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Chen L, Ge B, Casale FP, Vasquez L, Kwan T, Garrido-Martín D, Watt S, Yan Y, Kundu K, Ecker S, Datta A, Richardson D, Burden F, Mead D, Mann AL, Fernandez JM, Rowlston S, Wilder SP, Farrow S, Shao X, Lambourne JJ, Redensek A, Albers CA, Amstislavskiy V, Ashford S, Berentsen K, Bomba L, Bourque G, Bujold D, Busche S, Caron M, Chen S-H, Cheung W, Delaneau O, Dermitzakis ET, Elding H, Colgiu I, Bagger FO, Flicek P, Habibi E, Iotchkova V, Janssen-Megens E, Kim B, Lehrach H, Lowy E, Mandoli A, Matarese F, Maurano MT, Morris JA, Pancaldi V, Pourfarzad F, Rehnstrom K, Rendon A, Risch T, Sharifi N, Simon M-M, Sultan M, Valencia A, Walter K, Wang S-Y, Frontini M, Antonarakis SE, Clarke L, Yaspo M-L, Beck S, Guigo R, Rico D, Martens JHA, Ouwehand WH, Kuijpers TW, Paul DS, Stunnenberg HG, Stegle O, Downes K, Pastinen T, Soranzo N. Genetic drivers of epigenetic and transcriptional variation in human immune cells. Cell. 2016;167:1398–1414. doi: 10.1016/j.cell.2016.10.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Choi Y, Chan AP. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics. 2015;31:2745–2747. doi: 10.1093/bioinformatics/btv195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Cuomo ASE, Seaton DD, McCarthy DJ, Martinez I, Bonder MJ, Garcia-Bernardo J, Amatya S, Madrigal P, Isaacson A, Buettner F, Knights A, Natarajan KN, HipSci Consortium. Vallier L, Marioni JC, Chhatriwala M, Stegle O. Single-cell RNA-sequencing of differentiating iPS cells reveals dynamic genetic effects on gene expression. Nature Communications. 2020;11:810. doi: 10.1038/s41467-020-14457-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10:giab008. doi: 10.1093/gigascience/giab008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, del Angel G, Rivas MA, Hanna M, McKenna A, Fennell TJ, Kernytsky AM, Sivachenko AY, Cibulskis K, Gabriel SB, Altshuler D, Daly MJ. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature Genetics. 2011;43:491–498. doi: 10.1038/ng.806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. DiCarlo JE, Norville JE, Mali P, Rios X, Aach J, Church GM. Genome engineering in Saccharomyces cerevisiae using CRISPR-Cas systems. Nucleic Acids Research. 2013;41:4336–4343. doi: 10.1093/nar/gkt135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Dodds KG, McEwan JC, Brauning R, Anderson RM, van Stijn TC, Kristjánsson T, Clarke SM. Construction of relatedness matrices using genotyping-by-sequencing data. BMC Genomics. 2015;16:1047. doi: 10.1186/s12864-015-2252-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Dupuis J, Siegmund D. Statistical methods for mapping quantitative trait loci from a dense set of markers. Genetics. 1999;151:373–386. doi: 10.1093/genetics/151.1.373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Ehrenreich IM, Torabi N, Jia Y, Kent J, Martis S, Shapiro JA, Gresham D, Caudy AA, Kruglyak L. Dissection of genetically complex traits with extremely large pools of yeast segregants. Nature. 2010;464:1039–1042. doi: 10.1038/nature08923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Elorbany R, Popp JM, Rhodes K, Strober BJ, Barr K, Qi G, Gilad Y, Battle A. Single-cell sequencing reveals lineage-specific dynamic genetic regulation of gene expression during human cardiomyocyte differentiation. PLOS Genetics. 2022;18:e1009666. doi: 10.1371/journal.pgen.1009666. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Engel SR, Dietrich FS, Fisk DG, Binkley G, Balakrishnan R, Costanzo MC, Dwight SS, Hitz BC, Karra K, Nash RS, Weng S, Wong ED, Lloyd P, Skrzypek MS, Miyasato SR, Simison M, Cherry JM. The reference genome sequence of Saccharomyces cerevisiae: then and now. G3: Genes, Genomes, Genetics. 2014;4:389–398. doi: 10.1534/g3.113.008995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Finucane HK, Bulik-Sullivan B, Gusev A, Trynka G, Reshef Y, Loh P-R, Anttila V, Xu H, Zang C, Farh K, Ripke S, Day FR, ReproGen Consortium. Schizophrenia Working Group of the Psychiatric Genomics Consortium. RACI Consortium. Purcell S, Stahl E, Lindstrom S, Perry JRB, Okada Y, Raychaudhuri S, Daly MJ, Patterson N, Neale BM, Price AL. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nature Genetics. 2015;47:1228–1235. doi: 10.1038/ng.3404. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Gasch AP, Yu FB, Hose J, Escalante LE, Place M, Bacher R, Kanbar J, Ciobanu D, Sandor L, Grigoriev IV, Kendziorski C, Quake SR, McClean MN. Single-cell RNA sequencing reveals intrinsic and extrinsic regulatory heterogeneity in yeast responding to stress. PLOS Biology. 2017;15:e2004050. doi: 10.1371/journal.pbio.2004050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BWJH, Jansen R, de Geus EJC, Boomsma DI, Wright FA, Sullivan PF, Nikkola E, Alvarez M, Civelek M, Lusis AJ, Lehtimäki T, Raitoharju E, Kähönen M, Seppälä I, Raitakari OT, Kuusisto J, Laakso M, Price AL, Pajukanta P, Pasaniuc B. Integrative approaches for large-scale transcriptome-wide association studies. Nature Genetics. 2016;48:245–252. doi: 10.1038/ng.3506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Hafemeister C, Satija R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biology. 2019;20:296. doi: 10.1186/s13059-019-1874-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Hao Y, Hao S, AndersenNissen E, Mauck WM, Zheng S, Butler A, Lee MJ, Wilk AJ, Darby C, Zager M, Hoffman P, Stoeckius M, Papalexi E, Mimitou EP, Jain J, Srivastava A, Stuart T, Fleming LM, Yeung B, Rogers AJ, McElrath JM, Blish CA, Gottardo R, Smibert P, Satija R. Integrated analysis of multimodal single-cell data. Cell. 2021;184:3573–3587. doi: 10.1016/j.cell.2021.04.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Hartfield M, Bataillon T, Glémin S. The evolutionary interplay between adaptation and self-fertilization. Trends in Genetics. 2017;33:420–431. doi: 10.1016/j.tig.2017.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. He L, Davila-Velderrain J, Sumida TS, Hafler DA, Kellis M, Kulminski AM. NEBULA is a fast negative binomial mixed model for differential or co-expression analysis of large-scale multi-subject single-cell data. Communications Biology. 2021;4:629. doi: 10.1038/s42003-021-02146-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Huling J. Fast and stable fitting of generalized linear models using’Rcppeigen. 0.0.3Package 2022
  47. Jackson CA, Castro DM, Saldi GA, Bonneau R, Gresham D. Gene regulatory network reconstruction using single-cell RNA sequencing of barcoded genotypes in diverse environments. eLife. 2020;9:e51254. doi: 10.7554/eLife.51254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Jariani A, Vermeersch L, Cerulus B, Perez-Samper G, Voordeckers K, Van Brussel T, Thienpont B, Lambrechts D, Verstrepen KJ. A new protocol for single-cell RNA-seq reveals stochastic gene expression during lag phase in budding yeast. eLife. 2020;9:e55320. doi: 10.7554/eLife.55320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Jerber J, Seaton DD, Cuomo ASE, Kumasaka N, Haldane J, Steer J, Patel M, Pearce D, Andersson M, Bonder MJ, Mountjoy E, Ghoussaini M, Lancaster MA, HipSci Consortium. Marioni JC, Merkle FT, Gaffney DJ, Stegle O. Population-scale single-cell RNA-seq profiling across dopaminergic neuron differentiation. Nature Genetics. 2021;53:304–312. doi: 10.1038/s41588-021-00801-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Jerison ER, Kryazhimskiy S, Mitchell JK, Bloom JS, Kruglyak L, Desai MM. Genetic variation in adaptability and pleiotropy in budding yeast. eLife. 2017;6:e27167. doi: 10.7554/eLife.27167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Research. 2000;28:27–30. doi: 10.1093/nar/28.1.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Kang JB, Raveane A, Nathan A, Soranzo N, Raychaudhuri S. Methods and insights from single-cell expression quantitative trait loci. Annual Review of Genomics and Human Genetics. 2023;24:277–303. doi: 10.1146/annurev-genom-101422-100437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Kim-Hellmuth S, Aguet F, Oliva M, Muñoz-Aguirre M, Kasela S, Wucher V, Castel SE, Hamel AR, Viñuela A, Roberts AL, Mangul S, Wen X, Wang G, Barbeira AN, Garrido-Martín D, Nadel BB, Zou Y, Bonazzola R, Quan J, Brown A, Martinez-Perez A, Soria JM, GTEx Consortium. Getz G, Dermitzakis ET, Small KS, Stephens M, Xi HS, Im HK, Guigó R, Segrè AV, Stranger BE, Ardlie KG, Lappalainen T. Cell type-specific genetic regulation of gene expression across human tissues. Science. 2020;369:eaaz8528. doi: 10.1126/science.aaz8528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Kuang Z, Pinglay S, Ji H, Boeke JD. Msn2/4 regulate expression of glycolytic enzymes and control transition from quiescence to growth. eLife. 2017;6:e29938. doi: 10.7554/eLife.29938. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Lang GI, Murray AW, Botstein D. The cost of gene expression underlies a fitness trade-off in yeast. PNAS. 2009;106:5755–5760. doi: 10.1073/pnas.0901620106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Lawrence M, Huber W, Pagès H, Aboyoun P, Carlson M, Gentleman R, Morgan MT, Carey VJ. Software for computing and annotating genomic ranges. PLOS Computational Biology. 2013;9:e1003118. doi: 10.1371/journal.pcbi.1003118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Lee ME, DeLoache WC, Cervantes B, Dueber JE. A highly characterized yeast toolkit for modular, multipart assembly. ACS Synthetic Biology. 2015;4:975–986. doi: 10.1021/sb500366v. [DOI] [PubMed] [Google Scholar]
  58. Lenth R. Emmeans: estimated marginal means, aka least-squares means. 1.10.7Emmeans_1.10.7.Tar.Gz. 2023 https://CRAN.R-project.org/package=emmeans
  59. Li J, Ji L. Adjusting multiple testing in multilocus analyses using the eigenvalues of a correlation matrix. Heredity. 2005;95:221–227. doi: 10.1038/sj.hdy.6800717. [DOI] [PubMed] [Google Scholar]
  60. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Liu J, Martin-Yken H, Bigey F, Dequin S, François J-M, Capp J-P. Natural yeast promoter variants reveal epistasis in the generation of transcriptional-mediated noise and its potential benefit in stressful conditions. Genome Biology and Evolution. 2015;7:969–984. doi: 10.1093/gbe/evv047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Maechler M. Robustbase - basic robust statistics. 0.99-4-1Package ‘Robustbase.’. 2023 http://robustbase.r-forge.r-project.org/
  64. Matsumoto K, Uno I, Oshima Y, Ishikawa T. Isolation and characterization of yeast mutants deficient in adenylate cyclase and cAMP-dependent protein kinase. PNAS. 1982;79:2355–2359. doi: 10.1073/pnas.79.7.2355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Matsumoto K, Uno I, Ishikawa T. Control of cell division in Saccharomyces cerevisiae mutants defective in adenylate cyclase and cAMP-dependent protein kinase. Experimental Cell Research. 1983;146:151–161. doi: 10.1016/0014-4827(83)90333-6. [DOI] [PubMed] [Google Scholar]
  66. McCarthy DJ, Chen Y, Smyth GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Research. 2012;40:4288–4297. doi: 10.1093/nar/gks042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. McClure AW, Jacobs KC, Zyla TR, Lew DJ. Mating in wild yeast: delayed interest in sex after spore germination. Molecular Biology of the Cell. 2018;29:3119–3127. doi: 10.1091/mbc.E18-08-0528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Metzger BPH, Yuan DC, Gruber JD, Duveau F, Wittkopp PJ. Selection on noise constrains variation in a eukaryotic promoter. Nature. 2015;521:344–347. doi: 10.1038/nature14244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Nadal-Ribelles M, Islam S, Wei W, Latorre P, Nguyen M, de Nadal E, Posas F, Steinmetz LM. Sensitive high-throughput single-cell RNA-seq reveals within-clonal transcript correlations in yeast populations. Nature Microbiology. 2019;4:683–692. doi: 10.1038/s41564-018-0346-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Nakafuku M, Itoh H, Nakamura S, Kaziro Y. Occurrence in Saccharomyces cerevisiae of a gene homologous to the cDNA coding for the alpha subunit of mammalian G proteins. PNAS. 1987;84:2140–2144. doi: 10.1073/pnas.84.8.2140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Nathan A, Asgari S, Ishigaki K, Valencia C, Amariuta T, Luo Y, Beynor JI, Baglaenko Y, Suliman S, Price AL, Lecca L, Murray MB, Moody DB, Raychaudhuri S. Single-cell eQTL models reveal dynamic T cell state dependence of disease loci. Nature. 2022;606:120–128. doi: 10.1038/s41586-022-04713-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Neavin D, Nguyen Q, Daniszewski MS, Liang HH, Chiu HS, Wee YK, Senabouth A, Lukowski SW, Crombie DE, Lidgerwood GE, Hernández D, Vickers JC, Cook AL, Palpant NJ, Pébay A, Hewitt AW, Powell JE. Single cell eQTL analysis identifies cell type-specific genetic control of gene expression in fibroblasts and reprogrammed induced pluripotent stem cells. Genome Biology. 2021;22:76. doi: 10.1186/s13059-021-02293-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Neavin DR, Steinmann AM, Farbehi N, Chiu HS, Daniszewski MS, Arora H, Bermudez Y, Moutinho C, Chan C-L, Bax M, Tyebally M, Gnanasambandapillai V, Lam CE, Nguyen U, Hernández D, Lidgerwood GE, Graham RM, Hewitt AW, Pébay A, Palpant NJ, Powell JE. A village in A dish model system for population-scale hiPSC studies. Nature Communications. 2023;14:3240. doi: 10.1038/s41467-023-38704-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. N’Guessan A, Tong WY, Heydari H, Nguyen Ba AN. Refining the resolution of the yeast genotype-phenotype map using single-cell RNA-sequencing. bioRxiv. 2023 doi: 10.1101/2023.10.18.562953. [DOI]
  75. Ota M, Nagafuchi Y, Hatano H, Ishigaki K, Terao C, Takeshima Y, Yanaoka H, Kobayashi S, Okubo M, Shirai H, Sugimori Y, Maeda J, Nakano M, Yamada S, Yoshida R, Tsuchiya H, Tsuchida Y, Akizuki S, Yoshifuji H, Ohmura K, Mimori T, Yoshida K, Kurosaka D, Okada M, Setoguchi K, Kaneko H, Ban N, Yabuki N, Matsuki K, Mutoh H, Oyama S, Okazaki M, Tsunoda H, Iwasaki Y, Sumitomo S, Shoda H, Kochi Y, Okada Y, Yamamoto K, Okamura T, Fujio K. Dynamic landscape of immune cell-specific gene regulation in immune-mediated diseases. Cell. 2021;184:3006–3021. doi: 10.1016/j.cell.2021.03.056. [DOI] [PubMed] [Google Scholar]
  76. Paradis E, Schliep K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019;35:526–528. doi: 10.1093/bioinformatics/bty633. [DOI] [PubMed] [Google Scholar]
  77. Perlstein EO, Ruderfer DM, Roberts DC, Schreiber SL, Kruglyak L. Genetic basis of individual differences in the response to small-molecule drugs in yeast. Nature Genetics. 2007;39:496–502. doi: 10.1038/ng1991. [DOI] [PubMed] [Google Scholar]
  78. Peter J, De Chiara M, Friedrich A, Yue J-X, Pflieger D, Bergström A, Sigwalt A, Barre B, Freel K, Llored A, Cruaud C, Labadie K, Aury J-M, Istace B, Lebrigand K, Barbry P, Engelen S, Lemainque A, Wincker P, Liti G, Schacherer J. Genome evolution across 1,011 Saccharomyces cerevisiae isolates. Nature. 2018;556:339–344. doi: 10.1038/s41586-018-0030-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. R Development Core Team . Vienna, Austria; 2022. https://www.R-project.org/ [Google Scholar]
  80. Ronald J, Akey JM. The evolution of gene expression QTL in Saccharomyces cerevisiae. PLOS ONE. 2007;2:e678. doi: 10.1371/journal.pone.0000678. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Roop JI, Chang KC, Brem RB. Polygenic evolution of a sugar specialization trade-off in yeast. Nature. 2016;530:336–339. doi: 10.1038/nature16938. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Sadhu MJ, Bloom JS, Day L, Siegel JJ, Kosuri S, Kruglyak L. Highly parallel genome variant engineering with CRISPR-Cas9. Nature Genetics. 2018;50:510–514. doi: 10.1038/s41588-018-0087-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Sarkar AK, Tung P-Y, Blischak JD, Burnett JE, Li YI, Stephens M, Gilad Y. Discovery and characterization of variance QTLs in human induced pluripotent stem cells. PLOS Genetics. 2019;15:e1008045. doi: 10.1371/journal.pgen.1008045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Schubert OT, Bloom JS, Sadhu MJ, Kruglyak L. Genome-wide base editor screen identifies regulators of protein abundance in yeast. eLife. 2022;11:e79525. doi: 10.7554/eLife.79525. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Shen X-X, Opulente DA, Kominek J, Zhou X, Steenwyk JL, Buh KV, Haase MAB, Wisecaver JH, Wang M, Doering DT, Boudouris JT, Schneider RM, Langdon QK, Ohkuma M, Endoh R, Takashima M, Manabe R-I, Čadež N, Libkind D, Rosa CA, DeVirgilio J, Hulfachor AB, Groenewald M, Kurtzman CP, Hittinger CT, Rokas A. Tempo and mode of genome evolution in the budding yeast subphylum. Cell. 2018;175:1533–1545. doi: 10.1016/j.cell.2018.10.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Smith EN, Kruglyak L. Gene-environment interaction in yeast gene expression. PLOS Biology. 2008;6:e83. doi: 10.1371/journal.pbio.0060083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Spellman PT, Sherlock G, Zhang MQ, Iyer VR, Anders K, Eisen MB, Brown PO, Botstein D, Futcher B. Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. Molecular Biology of the Cell. 1998;9:3273–3297. doi: 10.1091/mbc.9.12.3273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Sprague GF. Assay of yeast mating reaction. Methods in Enzymology. 1991;194:77–93. doi: 10.1016/0076-6879(91)94008-z. [DOI] [PubMed] [Google Scholar]
  89. Stewart-Ornstein J, Weissman JS, El-Samad H. Cellular noise regulons underlie fluctuations in Saccharomyces cerevisiae. Molecular Cell. 2012;45:483–493. doi: 10.1016/j.molcel.2011.11.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Strober BJ, Elorbany R, Rhodes K, Krishnan N, Tayeb K, Battle A, Gilad Y. Dynamic genetic regulation of gene expression during cellular differentiation. Science. 2019;364:1287–1290. doi: 10.1126/science.aaw0040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Treusch S, Albert FW, Bloom JS, Kotenko IE, Kruglyak L. Genetic mapping of MAPK-mediated complex traits Across S. cerevisiae. PLOS Genetics. 2015;11:e1004913. doi: 10.1371/journal.pgen.1004913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Umans BD, Battle A, Gilad Y. Where are the disease-associated eQTLs? Trends in Genetics. 2021;37:109–124. doi: 10.1016/j.tig.2020.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. van der Wijst MGP, Brugge H, de Vries DH, Deelen P, Swertz MA, LifeLines Cohort Study. BIOS Consortium. Franke L. Single-cell RNA sequencing identifies celltype-specific cis-eQTLs and co-expression QTLs. Nature Genetics. 2018;50:493–497. doi: 10.1038/s41588-018-0089-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Vanhalewyn M, Dumortier F, Debast G, Colombo S, Ma P, Winderickx J, Van Dijck P, Thevelein JM. A mutation in Saccharomyces cerevisiae adenylate cyclase, Cyr1K1876M, specifically affects glucose- and acidification-induced cAMP signalling and not the basal cAMP level. Molecular Microbiology. 1999;33:363–376. doi: 10.1046/j.1365-2958.1999.01479.x. [DOI] [PubMed] [Google Scholar]
  95. Vermeersch L, Jariani A, Helsen J, Heineike BM, Verstrepen KJ. Single-cell rna sequencing in yeast using the 10× genomics chromium device. Methods in Molecular Biology. 2022;2477:3–20. doi: 10.1007/978-1-0716-2257-5_1. [DOI] [PubMed] [Google Scholar]
  96. Versele M, Thevelein JM, Van Dijck P. The high general stress resistance of the Saccharomyces cerevisiae fil1 adenylate cyclase mutant (Cyr1Lys1682) is only partially dependent on trehalose, Hsp104 and overexpression of Msn2/4-regulated genes. Yeast. 2004;21:75–86. doi: 10.1002/yea.1065. [DOI] [PubMed] [Google Scholar]
  97. Vianna CR, Ferreira MC, Silva CLC, Tanghe A, Neves MJ, Thevelein JM, Rosa CA, Van Dijck P. Stress tolerance of the Saccharomyces cerevisiae adenylate cyclase fil1 (CYR1) mutant depends on Hsp26. Journal of Molecular Microbiology and Biotechnology. 2010;19:140–146. doi: 10.1159/000321500. [DOI] [PubMed] [Google Scholar]
  98. Weith M, Großbach J, Clement-Ziza M, Gillet L, Rodríguez-López M, Marguerat S, Workman CT, Picotti P, Bähler J, Aebersold R, Beyer A. Genetic effects on molecular network states explain complex traits. Molecular Systems Biology. 2023;19:e11493. doi: 10.15252/msb.202211493. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Welker S, Rudolph B, Frenzel E, Hagn F, Liebisch G, Schmitz G, Scheuring J, Kerth A, Blume A, Weinkauf S, Haslbeck M, Kessler H, Buchner J. Hsp12 is an intrinsically unstructured stress protein that folds upon membrane association and modulates membrane function. Molecular Cell. 2010;39:507–520. doi: 10.1016/j.molcel.2010.08.001. [DOI] [PubMed] [Google Scholar]
  100. Wells MF, Nemesh J, Ghosh S, Mitchell JM, Salick MR, Mello CJ, Meyer D, Pietilainen O, Piccioni F, Guss EJ, Raghunathan K, Tegtmeyer M, Hawes D, Neumann A, Worringer KA, Ho D, Kommineni S, Chan K, Peterson BK, Raymond JJ, Gold JT, Siekmann MT, Zuccaro E, Nehme R, Kaykas A, Eggan K, McCarroll SA. Natural variation in gene expression and viral susceptibility revealed by neural progenitor cell villages. Cell Stem Cell. 2023;30:312–332. doi: 10.1016/j.stem.2023.01.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Westra H-J, Arends D, Esko T, Peters MJ, Schurmann C, Schramm K, Kettunen J, Yaghootkar H, Fairfax BP, Andiappan AK, Li Y, Fu J, Karjalainen J, Platteel M, Visschedijk M, Weersma RK, Kasela S, Milani L, Tserel L, Peterson P, Reinmaa E, Hofman A, Uitterlinden AG, Rivadeneira F, Homuth G, Petersmann A, Lorbeer R, Prokisch H, Meitinger T, Herder C, Roden M, Grallert H, Ripatti S, Perola M, Wood AR, Melzer D, Ferrucci L, Singleton AB, Hernandez DG, Knight JC, Melchiotti R, Lee B, Poidinger M, Zolezzi F, Larbi A, Wang DY, van den Berg LH, Veldink JH, Rotzschke O, Makino S, Salomaa V, Strauch K, Völker U, van Meurs JBJ, Metspalu A, Wijmenga C, Jansen RC, Franke L. Cell specific eqtl analysis without sorting cells. PLOS Genetics. 2015;11:e1005223. doi: 10.1371/journal.pgen.1005223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Wickham H. Ggplot2: Elegant Graphics for Data Analysis. Springer; 2009. [Google Scholar]
  103. Xu Z, Wei W, Gagneur J, Perocchi F, Clauder-Münster S, Camblong J, Guffanti E, Stutz F, Huber W, Steinmetz LM. Bidirectional promoters generate pervasive transcription in yeast. Nature. 2009;457:1033–1037. doi: 10.1038/nature07728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Yazar S, Alquicira-Hernandez J, Wing K, Senabouth A, Gordon MG, Andersen S, Lu Q, Rowson A, Taylor TRP, Clarke L, Maccora K, Chen C, Cook AL, Ye CJ, Fairfax KA, Hewitt AW, Powell JE. Single-cell eQTL mapping identifies cell type-specific genetic control of autoimmune disease. Science. 2022;376:eabf3041. doi: 10.1126/science.abf3041. [DOI] [PubMed] [Google Scholar]
  105. Yu G, Smith DK, Zhu H, Guan Y, Lam TTY. ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution. 2017;8:28–36. doi: 10.1111/2041-210X.12628. [DOI] [Google Scholar]
  106. Yvert G, Brem RB, Whittle J, Akey JM, Foss E, Smith EN, Mackelprang R, Kruglyak L. Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nature Genetics. 2003;35:57–64. doi: 10.1038/ng1222. [DOI] [PubMed] [Google Scholar]
  107. Zhao H, Eide DJ. Zap1p, a metalloregulatory protein involved in zinc-responsive transcriptional regulation in Saccharomyces cerevisiae. Molecular and Cellular Biology. 1997;17:5044–5052. doi: 10.1128/MCB.17.9.5044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Zheng X, Levine D, Shen J, Gogarten SM, Laurie C, Weir BS. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics. 2012;28:3326–3328. doi: 10.1093/bioinformatics/bts606. [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Zhu J, Zhang B, Smith EN, Drees B, Brem RB, Kruglyak L, Bumgarner RE, Schadt EE. Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks. Nature Genetics. 2008;40:854–861. doi: 10.1038/ng.167. [DOI] [PMC free article] [PubMed] [Google Scholar]

eLife assessment

Rachel Brem 1

This manuscript describes the mapping of natural DNA sequence variants that affect gene expression and its noise, as well as cell cycle timing, using as input single-cell RNA-sequencing of progeny from crosses between wild yeast strains. The method represents an important advance in the study of natural genetic variation. The findings, especially given the follow-up validation of the phenotypic impact of a mapped locus of major effect, provide convincing support for the rigor and utility of the method.

Reviewer #1 (Public review):

Anonymous

The authors demonstrate that it is possible to carry out eQTL experiments for the model eukaryote S. cerevisiae, in "one pot" preparations, by using single-cell sequencing technologies to simultaneously genotype and measure expression. This is a very appealing approach for investigators studying genetic variation in single-celled and other microbial systems, and will likely inspire similar approaches in non-microbial systems where comparable cell mixtures of genetically heterogeneous individuals could be achieved.

While eQTL experiments have been done for nearly two decades (the corresponding author's lab are pioneers in this field), this single-cell approach creates the possibility for new insights about cell biology that would be extremely challenging to infer using bulk sequencing approaches. The major motivating application shown here is to discover cell occupancy QTL, i.e. loci where genetic variation contributes to differences in the relative occupancy of different cell cycle stages. The authors dissect and validate one such cell cycle occupancy QTL, involving the gene GPA1, a G-protein subunit that plays a role in regulating the mating response MAPK pathway. They show that variation at GPA1 is associated with proportional differences in the fraction of cells in the G1 stage of the cell cycle. Furthermore, they show that this bias is associated with differences in mating efficiency.

Reviewer #2 (Public review):

Anonymous

Boocock and colleagues present an approach whereby eQTL analysis can be carried out by scRNA-Seq alone, in a one-pot-shot experiment, due to genotypes being able to be inferred from SNPs identified in RNA-Seq reads. This approach obviates the need to isolate individual spores, genotype them separately by low-coverage sequencing, and then perform RNA-Seq on each spore separately. This is a substantial advance and opens up the possibility to straightforwardly identify eQTLs over many conditions in a cost-efficient manner. Overall, I found the paper to be well-written and well-motivated, and have no issues with either the methodological/analytical approach (though eQTL analysis is not my expertise), or with the manuscript's conclusions.

eLife. 2025 Mar 12;13:RP95566. doi: 10.7554/eLife.95566.3.sa3

Author response

James Boocock 1, Noah Alexander 2, Leslie Alamo Tapia 3, Laura Walter-McNeill 4, Shivani Prashant Patel 5, Chetan Munugala 6, Joshua S Bloom 7, Leonid Kruglyak 8

The following is the authors’ response to the original reviews.

Public Reviews:

Reviewer #1 (Public Review):

Summary:

The authors demonstrate that it is possible to carry out eQTL experiments for the model eukaryote S. cerevisiae, in "one pot" preparations, by using single-cell sequencing technologies to simultaneously genotype and measure expression. This is a very appealing approach for investigators studying genetic variation in single-celled and other microbial systems, and will likely inspire similar approaches in non-microbial systems where comparable cell mixtures of genetically heterogeneous individuals could be achieved.

Strengths:

While eQTL experiments have been done for nearly two decades (the corresponding author's lab are pioneers in this field), this single-cell approach creates the possibility for new insights about cell biology that would be extremely challenging to infer using bulk sequencing approaches. The major motivating application shown here is to discover cell occupancy QTL, i.e. loci where genetic variation contributes to differences in the relative occupancy of different cell cycle stages. The authors dissect and validate one such cell cycle occupancy QTL, involving the gene GPA1, a G-protein subunit that plays a role in regulating the mating response MAPK pathway. They show that variation at GPA1 is associated with proportional differences in the fraction of cells in the G1 stage of the cell cycle. Furthermore, they show that this bias is associated with differences in mating efficiency.

Weaknesses:

While the experimental validation of the role of GPA1 variation is well done, the novel cell cycle occupancy QTL aspect of the study is somewhat underexploited. The cell occupancy QTLs that are mentioned all involve loci that the authors have identified in prior studies that involved the same yeast crosses used here. It would be interesting to know what new insights, besides the "usual suspects", the analysis reveals. For example, in Cross B there is another large effect cell occupancy QTL on Chr XI that affects the G1/S stage. What candidate genes and alleles are at this locus? And since cell cycle stages are not biologically independent (a delay in G1, could have a knock-on effect on the frequency of cells with that genotype in G1/S), it would seem important to consider the set of QTLs in concert.

We thank the reviewer for this suggested clarification. We have modified the text to make it clear that cell cycle occupancy is a compositional phenotype. Like the reviewer, we also noticed the distal trans eQTL hotspot on Chr XI in Cross B, but we were not able to identify compelling candidate gene(s) or variant(s) despite extensive effort.

Reviewer #2 (Public Review):

Boocock and colleagues present an approach whereby eQTL analysis can be carried out by scRNA-Seq alone, in a one-pot-shot experiment, due to genotypes being able to be inferred from SNPs identified in RNA-Seq reads. This approach obviates the need to isolate individual spores, genotype them separately by low-coverage sequencing, and then perform RNA-Seq on each spore separately. This is a substantial advance and opens up the possibility to straightforwardly identify eQTLs over many conditions in a cost-efficient manner. Overall, I found the paper to be well-written and well-motivated, and have no issues with either the methodological/analytical approach (though eQTL analysis is not my expertise), or with the manuscript's conclusions.

I do have several questions/comments.

393 segregant experiment:

For the experiment with the 393 previously genotyped segregants, did the authors examine whether averaging the expression by genotype for single cells gave expression profiles similar to the bulk RNA-Seq data generated from those genotypes? Also, is it possible (and maybe not, due to the asynchronous nature of the cell culture) to use the expression data to aid in genotyping for those cells whose genotypes are ambiguous? I presume it might be if one has a sufficient number of cells for each genotype, though, for the subsequent one-pot experiments, this is a moot point.

As mentioned in our preliminary response, while it is possible to expand the analysis along these lines, this is not relevant for the subsequent one-pot experiments. We have made all the data available so that anyone interested can try these analyses.

Figure 1B:

Is UMAP necessary to observe an ellipse/circle - I wouldn't be surprised if a simple PCA would have sufficed, and given the current discussion about whether UMAP is ever appropriate for interpreting scRNA-Seq (or ancestry) data, it seems the PCA would be a preferable approach. I would expect that the periodic elements are contained in 2 of the first 3 principal components. Also, it would be nice if there were a supplementary figure similar to Figure 4 of Macosko et al (PMID 26000488) to indeed show the cell cycle dependent expression.

We have added two new figures (S2 and S3) that represent alternative visualizations of the cell-cycle that are not dependent on UMAP. Figure S2 shows plots of different pairs of principal components, with each cell colored by its assigned cell-cycle stage. We do not observe a periodic pattern in the first 3 principal components as the reviewer expected, but when we explore the first 6 principal components, we see combinations of components that clearly separate the cell cycle clusters. We emphasize that the clusters were generated using the Louvain algorithm and assigned to cell-cycle stages using marker genes, and that UMAP was used only for visualization.

We could not create a figure similar to Macosko et al. because of differences between the cell cycle categories we used and those of Spellman et al (PMID 9843569). We instead created Figure S3 to address the reviewer's comment. This figure uses a heatmap in a style similar to that of Macosko et al. to display cell-cycle-dependent expression of the 22 genes we used as cell cycle markers across each of the five cell cycle stages (M/G1, G1, G1/S, S, G2/M).

We have renumbered the supplementary figures after incorporating these two additional supplementary figures into the manuscript.

Aging, growth rate, and bet-hedging:

The mention of bet-hedging reminded me of Levy et al (PMID 22589700), where they saw that Tsl1 expression changed as cells aged and that this impacted a cell's ability to survive heat stress. This bet-hedging strategy meant that the older, slower-growing cells were more likely to survive, so I wondered a couple of things. It is possible from single-cell data to identify either an aging, or a growth rate signature? A number of papers from David Botstein's group culminated in a paper that showed that they could use a gene expression signature to predict instantaneous growth rate (PMID 19119411) and I wondered if (a) this is possible from single-cell data, and (b) whether in the slower growing cells, they see markers of aging, whether these two signatures might impact the ability to detect eQTLs, and if they are detected, whether they could in some way be accounted for to improve detection.

As mentioned in our preliminary response, we are not sure how to look for gene expression signatures of aging in yeast scRNA-seq data. We believe that the proposed analyses are beyond the scope of the current paper. As noted above, we have made all the data available so that anyone interested can explore these hypotheses.

AIL vs. F2 segregants:

I'm curious if the authors have given thought to the trade-offs of developing advanced intercross lines for scRNA-Seq eQTL analysis. My impression is that AIL provides better mapping resolution, but at the expense of having to generate the lines. It might be useful to see some discussion on that.

We thank the reviewer for the comments. We believe that a discussion of trade-offs between different approaches for constructing mapping populations, such as AIL and F2 segregants, is beyond the scope of this paper.

10x vs SPLit-Seq

10x is a well established, but fairly expensive approach for scRNA-Seq - I wondered how the cost of the 10x approach compares to the previously used approach of genotyping segregants and performing bulk RNA-Seq, and how those costs would change if one used SPLiT-Seq (see PMID 38282330).

We thank the reviewer for the comments. We believe that a discussion of cost trade-offs between 10x and other approaches is beyond the scope of this paper, especially given the rapidly evolving costs of different technologies.

Recommendations for the authors:

Reviewer #1 (Recommendations For The Authors):

Throughout the results section the authors point to File S1 for additional information. This file is a tarball with about 20 Excel documents in it, each with several sheets embedded. The authors should provide a detailed README describing how to understand the organizations of the files in File S1 and the many embedded sheets in each file. Statements made in the manuscript about File S1 should explicitly direct the reader to a specific spreadsheet and table to refer to.

We have added an additional README file to the tarball that explains the organization of File S1 and describes the data contained in each sheet. Throughout the text, we now reference specific spreadsheets to assist the reader. In addition, these spreadsheets have been added to a github repository https://github.com/theboocock/finemapping_spreadsheets_single_cell

Neither of the two GitHub repositories referenced under "Code availability" has adequate documentation that would allow a reader to try and reproduce the analyses presented here. The one entitled https://github.com/joshsbloom/single_cell_eQTL has no functional README, while https://github.com/theboocock/yeast_single_cell_post_analysis is somewhat better but still hard to navigate. Basic information on expected inputs, file formats, file organization, output types, and formats, etc. is required to get any of these pipelines to run and should be provided at a minimum.

We thank the reviewer for the comment. In response, we have refactored both GitHub repositories and added extensive documentation to improve usability. We updated the versions of software and packages, this has been reflected in the methods section.

S. cerevisiae strains are preferentially diploid in nature and many genes involved in the mating pathway are differentially regulated in diploids vs haploids. Have the authors explored the fitness effects of the GPA1 82R allele in diploids? What is the dominance relationship between 82W and 82R?

We thank the reviewer for the comment. In diploid yeast, the mating pathway is repressed, and thus we would not expect there to be any fitness consequences due to the presence of different alleles of GPA1.

The diploid expression profiling (page 5 and Table S9) doesn't implicate GPA1; can you the authors comment on this in light of their finding in haploids?

The mating pathway, including GPA1, is repressed in diploids, and hence the expression of GPA1 cannot be studied in these strains (PMID: 3113739). In addition, allele-specific expression differences only identify cis-regulatory effects. We know that the GPA1 variant results in a protein-coding change, which may or may not influence the levels of mRNA in cis, so that even if GPA1 were expressed in diploids, there would be no expectation of an allele-specific difference in expression.

With respect to the candidate CYR1 QTL -- note that strains with compromised Cyr1 function also generally show increased sporulation rates and/or sporulation in rich media conditions (cAMP-PKA signaling represses sporulation). Is this the case in diploids with the CBS2888 allele at CYR1? If the CBS2888 allele is a CYR1 defect one might expect reduced cAMP levels. It is possible to estimate adenylate cyclase levels using a fairly straightforward ELISA assay. This would provide more convincing evidence of the causal mechanism of the alleles identified.

We thank the reviewer for the comment, and we agree that a functional study of the CYR1 alleles would provide more convincing evidence for the causal mechanism of the connection between cell cycle occupancy, cAMP levels, and growth. However, we believe that the proposed experiments are beyond the scope of our current study. The evidence we provide is sufficient to establish that CYR1 is a strong candidate gene for the eQTL hotspot.

Re: CYR1 candidate QTL -- The authors should reference the work of Patrick Van Dijck and Johan M Thevelein on CYR1 allelic variation, and other papers besides the Matsumoto/ Ishikawa papers, as the effects of cAMP-PKA signaling on stress can be quite variable. cAMP pathway variants, including in CYR1, have popped up in quite a few other yeast QTL mapping and experimental evolution papers. These should be referenced as well.

We thank the reviewer for these references; we have added a comment about the relationship between stress tolerance and CYR1 variation, and cited the relevant references accordingly.

Figure S10 - the subfigure showing the frequency of the GPA 82R compared to 82W suggests a fairly large and deleterious fitness effect of this allele; on the order of 7-8% fewer cells per cell cycle stage than the 82W allele. Can the authors reconcile this with the more modest growth rate effect they report on page 8?

Figure S12C displays the allele frequency of the 82R allele across the cell cycle in the single-cell data from allele-replacement strains. These strains were grown separately and processed using two individual 10x chromium runs. The resulting sequenced library had 11,695 cells with the 82R allele and 14,894 cells with the 82W allele. The 7-8% difference in the number of cells is due to slight differences in the number of captured cells per run, not due to growth differences, because we attempted to pool cells in equal numbers from separate mid-log cultures.

The proportion of cells in G1 increases by ~3% in strains with the 82R allele relative to the baseline proportion of cells in the experiment, which, to the reviewers point, is still larger than the ~1% growth difference we observed. Cell cycle occupancy is a compositional phenotype. As shown in figure S12C, the 82R variant increases the fraction of cells in G1 and slightly decreases the fraction of cells in M/G1. There is no obvious expectation for quantitatively translating a change in cell cycle occupancy to a change in growth rate.

The authors refer to the Lang et al. 2009 paper w/respect to GPA1 variant S469I but that paper seems to have explored a different GPA1 allele, GPA1-G1406T, with respect to growth rates.

We thank the reviewer for their comment. The S469I variant is the same as the G1406T variant, one denoting the amino acid change at position 469 in the protein and the other denoting the corresponding nucleotide change at position 1406 in the DNA coding sequence. We have altered the text to make this clear to the reader.

Reviewer #2 (Recommendations For The Authors):

I make no recommendations as to additional work for the authors. The manuscript is complete. I suggested some things I would like to see in my review, but it's up to them to decide whether they think any of those would further enhance the manuscript.

However, I do have I have some pedantic formatting notes:

- Microliters are variously presented as uL, ul, and µl - it should be µL

- Similarly, milliliters are presented as ml and ML - it should be mL

- Also, there should be a space between the number and the unit, e.g. 10 µL

- Some gene names in the manuscript are not italicized in all instances, e.g., GPA1

We thank the reviewer for these formatting suggestions, we have made these changes throughout the text.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Bloom J. 2024. Data for: Raw count data, transcribed variant count data, and reference genomic annotation files for Boocock et al. 2024. Dryad Digital Repository. [DOI]
    2. Boocock J. 2024. Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction. Zenodo. [DOI] [PMC free article] [PubMed]
    3. Boocock J. 2024. Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction. NCBI BioProject. PRJNA1049497 [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    Supplementary file 1. Yeast strains (Table S1), plasmids (Table S2), and primers used in this study (Table S3); summary information for the single-cell expression data generated in this study (Table S4); variance explained by cell-cycle stage and the effect of segregant per transcript for the 393 previously generated segregants (Table S5); and Local eQTL summary statistics for the 393 previously generated segregants (Table S6).
    elife-95566-supp1.xlsx (956.7KB, xlsx)
    Supplementary file 2. Local eQTL summary statistics for our one-pot eQTL experiments.

    Each sheet has the local eQTL summary statistics for the three crosses we examined (Table S1 cross A=BY and RM, Table S2 cross B=YJM145 and YPS163, Table S3 cross C=CBS2888xYJM981). The column ‘has cell-cycle interaction’ is set to 1 if a cell-cycle interaction was observed at a FDR of <5%. For cross A, the summary statistics from bulk eQTL are provided in additional columns. For cross A, the summary statistics from bulk eQTL7 are provided in additional columns. Missing values in the bulk eQTL columns indicate that a gene did not pass our filtering criteria, and a local eQTL test was not performed.

    elife-95566-supp2.xlsx (1.2MB, xlsx)
    Supplementary file 3. Distant eQTL summary statistics for our one-pot eQTL experiments.

    Each sheet has the local eQTL summary statistics for the three crosses we examined (Table S1 cross A=BY and RM, Table S2 cross B=YJM145 and YPS163, Table S3 cross C=CBS2888xYJM981). The column ‘has cell-cycle interaction’ is set to 1 if a cell-cycle interaction was observed at a FDR of <5%. The column ‘eQTL in hotspot’ is set to 1 if a distal eQTL falls within a significant hotspot bin.

    elife-95566-supp3.xlsx (344KB, xlsx)
    Supplementary file 4. Hotspot annotation files for each cross (cross A = BY and RM, cross B = YJM145 and YPS163, cross C = CBS2888xYJM981, A_bulk = BY and RM with expression measured by bulk RNA-seq Albert et al., 2018).
    elife-95566-supp4.zip (36.1MB, zip)
    Supplementary file 5. Single-cell allele-specific expression (ASE) summary statistics.

    Each sheet has the ASE summary statistics for the parental hybrids of the three crosses we examined (Table S1 cross A=BY and RM, Table S2 cross B=YJM145 and Table S3 YPS163, cross C=CBS2888xYJM981). The local eQTL summary statistics were included from the corresponding one-pot eQTL experiment for each transcript that was tested there. Missing values in the one-pot eQTL columns indicate that a gene did not pass our filtering criteria, and a local eQTL test was not performed.

    elife-95566-supp5.xlsx (1.2MB, xlsx)
    Supplementary file 6. Summary statistics for allele-specific effects on noise and average expression.

    Each sheet has the summary statistics for the parental hybrids of the three crosses we examined (Table S1 cross A=BY and RM, Table S2 cross B=YJM145 and YPS163, Table S4 cross C=CBS2888xYJM981). The estimates and p-values are derived from the joint model that contains cell-cycle, cell-cycle interactions, and allelic effects on both the mean and noise. The column ‘Overlaps global trend line’ is set to 1 if the 95% confidence interval of the noise effect did not overlap the 95% confidence interval of the global trend line.

    elife-95566-supp6.xlsx (187.7KB, xlsx)
    Supplementary file 7. Cell-cycle occupancy QTL summary statistics for our one-pot eQTL experiments.

    Each sheet has the cell-cycle occupancy QTL summary statistics for the three crosses we examined (Table S1 cross A=BY and RM, Table S2 cross B=YJM145 and YPS163, Table S3 cross C=CBS2888xYJM981).

    elife-95566-supp7.xlsx (13.8KB, xlsx)
    Supplementary file 8. Alignment and tree files from a clustered blast search of the yeast Gpa1 protein sequence (YHR005C).
    elife-95566-supp8.zip (416KB, zip)
    Supplementary file 9. Single-cell RNA sequencing of allele-replacement strains with the 82R and 82W alleles of GPA1.

    The distant eQTL effects near GPA1 from our one-pot eQTL experiment are contrasted to a differential expression analysis of our allele replacement strains. The p-values from our one-pot eQTL study were adjusted using a permutation procedure, and a Bonferroni correction was used to adjust the single-cell validation p-values. Missing values in the single-cell validation columns indicate that a gene did not pass our filtering criteria, and a differential expression test was not performed.

    elife-95566-supp9.xlsx (15.5KB, xlsx)
    MDAR checklist

    Data Availability Statement

    Sequencing data is available under the NCBI BioProject PRJNA1049497. Raw and processed data can be found at the locations https://doi.org/10.5061/dryad.xgxd254qb and https://zenodo.org/doi/10.5281/zenodo.12695127, respectively. Finemapping spreadsheets can be found on GitHub at https://github.com/theboocock/finemapping_spreadsheets_single_cell (copy archived at Boocock, 2024a).

    The following datasets were generated:

    Bloom J. 2024. Data for: Raw count data, transcribed variant count data, and reference genomic annotation files for Boocock et al. 2024. Dryad Digital Repository.

    Boocock J. 2024. Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction. Zenodo.

    Boocock J. 2024. Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction. NCBI BioProject. PRJNA1049497


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES