Skip to main content
eLife logoLink to eLife
. 2018 Apr 11;7:e32472. doi: 10.7554/eLife.32472

The genetic landscape of a physical interaction

Guillaume Diss 1,2, Ben Lehner 1,2,3,
Editor: Naama Barkai4
PMCID: PMC5896888  PMID: 29638215

Abstract

A key question in human genetics and evolutionary biology is how mutations in different genes combine to alter phenotypes. Efforts to systematically map genetic interactions have mostly made use of gene deletions. However, most genetic variation consists of point mutations of diverse and difficult to predict effects. Here, by developing a new sequencing-based protein interaction assay – deepPCA – we quantified the effects of >120,000 pairs of point mutations on the formation of the AP-1 transcription factor complex between the products of the FOS and JUN proto-oncogenes. Genetic interactions are abundant both in cis (within one protein) and trans (between the two molecules) and consist of two classes – interactions driven by thermodynamics that can be predicted using a three-parameter global model, and structural interactions between proximally located residues. These results reveal how physical interactions generate quantitatively predictable genetic interactions.

Research organism: Human, S. cerevisiae

eLife digest

Proteins, the molecular workhorses of the cell, are made of small units called amino acids attached together like the links of a chain. Each protein is composed of a unique combination of amino acids, which is determined by a specific sequence of DNA called a gene. A change in a gene – a mutation – can create a variation in the protein it codes for, for instance by swapping a type of amino acid for another. Different mutations in the same gene can alter a protein in different ways. Some of these changes are harmless, but other can hinder how the protein performs its role. For example, a small change in the structure of a protein could affect how it will bind to other molecules.

It is possible for people to have identical mutations in the same genes, but experience different consequences. For instance, two persons could carry the same disease-inducing mutation, but one has a severe version of the condition and the other only mild symptoms. One reason is that changes in other genes cancel out or enhance the effect of a mutation. This phenomenon is known as a genetic interaction and it remains poorly understood, especially at the molecular level.

Here, Diss and Lehner developed a method, called deepPCA, to study the consequences of mutations in proteins in the laboratory. The experiments focused on two human genes which code for two proteins that normally attach to each other. Two mutations were artificially created, either one in each gene, or two in one of them. Diss and Lehner then examined how strongly the two mutated proteins could still attach to each other. By repeating this process with over 120,000 different pairs of mutations, it became possible to study how one mutation can have different effects depending on the presence of other mutations in the same protein or in the binding partner.

Overall, Diss and Lehner found that genetic interactions are the result of two mechanisms. In the first one, the two mutations together cause specific structural changes that modify how proteins bind to each other. In the second one, the changes solely depend on the magnitude of the initial, thermodynamic effects of individual mutations, but not on their specific physical and chemical properties. To predict the consequences of this second type of genetic interactions, knowing the identity or the exact effects of the two mutations is not necessary.

Understanding and predicting genetic interactions is important to develop personalized medicine, where treatments are tailored based on the genetic make up of an individual. This knowledge will also help to study how genes have evolved together.

Introduction

Mutations often have outcomes that change depending upon additional genetic variation carried by an individual, making their effects difficult to predict (Lehner, 2011). The unexpected outcomes obtained when two or more mutations are combined are referred to as genetic interactions or epistasis (Phillips, 2008).

One approach that has been taken to better understand how mutations interact to alter phenotypes has been to systematically combine together gene deletions or representative hypomorphic alleles (Baryshnikova et al., 2013). In budding yeast, this has been undertaken on a genomic scale, with the resulting network of interactions referred to as the ‘genetic landscape’ of a cell (Costanzo et al., 2010, 2016; Tong et al., 2004).

However, gene deletions are rare in nature – most genetic variation consists of point mutations not deletions or null alleles. Point mutations can have very diverse and difficult to predict effects (Shendure and Akey, 2015). These range from no consequence, through partial loss-of-function, to very strong effects or the creation of new functions. To date, however, there has been no systematic effort to map how point mutations in two genes combine together to alter biological functions.

Protein-protein interactions (PPIs) represent the backbone of a cell’s functional organization. Mutations affecting PPIs lead to disease, to functional innovations, and hence are subject to selection (Diss et al., 2013). It has long been appreciated that pairs of mutations in two physically interacting proteins can have non-additive outcomes (Horovitz, 1996; Lehner, 2011). However, to date, the effects of mutations on PPIs have only been quantified for deep mutant libraries of one protein in combination with a small number of targeted mutants in a physical interaction partner (Aakre et al., 2015; Araya et al., 2012; Raman et al., 2016). A thorough understanding of the patterns of mutation outcome between interacting proteins requires a non-biased, systematic mutagenesis of both interacting proteins.

Here, we present a high-throughput technique based on the protein fragment complementation assay (Tarassov et al., 2008; Diss et al., 2017) (PCA) called deepPCA that quantifies how mutations of diverse individual effect combine to alter protein interactions. We used the assay to systematically and comprehensively determine the effects of combinations of mutations in the proto-oncogenes FOS and JUN on the formation of the AP-1 transcription factor complex (Shaulian and Karin, 2002). Fos and Jun interact through their leucine zipper domains that consist of five heptad repeats (Figure 1A); this interaction has been previously extensively investigated (Mason et al., 2006; Ransone et al., 1989). We first quantified the consequences of combining thousands of pairs of mutations in trans between the two proteins. We then compared these results to the effects of thousands of pairs of mutations in cis within one of the proteins (Fos).

Figure 1. deepPCA.

(A) Leucine zipper domains (colored) and heptad positions of human Fos and Jun. (B) Overview of the assay. Single amino acid variants of Fos and Jun were constructed by overlap-extension PCR using NNS primers and cloned in a head-tail orientation. In PCA, interacting proteins expressed in yeast lead to complementation between their fused DHFR fragments, which is resistant to methotrexate and produces tetrahydrofolate (THF) from dihydrofolate (DHF) to promote growth. Paired-end deep sequencing then allows the frequencies of each variant in the input and output populations to be measured and a PPI score that represents the number of generation of each variant relative to the wild-type interaction to be computed. (C) Scatter plot of PPI scores between biological replicates 1 and 2. (D) Confirmation of single mutants by individual PCA growth assays. Single mutants were reconstructed, sequence confirmed and their PPI scores were derived from their growth curves measured in a plate reader (see Materials and methods). Error bars represent 95% confidence intervals.

Figure 1.

Figure 1—figure supplement 1. Quality control of the PPI scores measured by deepPCA.

Figure 1—figure supplement 1.

(A) Density plots showing the high correlation between PPI scores calculated from sequencing data filtered at different quality cut-offs. (B) Density plot showing the relationship between PPI scores and input read counts (right) and histogram of PPI scores (left). The data corresponds to replicate 1, including variants with stop codons and prior to filtering based on read count. This plot highlights the PPI score boundary (oblique boundary on the left side of the plot) due to low output read count (see Materials and methods). The vertical line represents the input read count filter. (C) Correlations between deepPCA PPI scores from three biological replicate experiments. Contains both single (n = 1,215) and double mutations (n = 107,625) from the trans Fos and Jun library. Variants with less than 10 UMIs in any of the three replicate input samples or without any UMI in any of the replicate output samples were filtered out.

The resulting dataset presents a global view of how hundreds of mutations of diverse individual effects in different genes combine to alter a biological function through two major mechanisms related to the thermodynamics of a PPI and the structural interactions between proximal residues.

Results

Quantifying thousands of protein interactions in parallel using deepPCA

To quantify how mutations of diverse individual effect combine to alter protein interactions, we developed deepPCA, a protein-protein interaction assay that uses PCA and deep sequencing to quantify thousands of protein-protein interactions in parallel in a single assay (Figure 1B, see Materials and methods). deepPCA uses deep sequencing to quantify the effects on a PPI of thousands of combinations of point mutations within one or both physically interacting proteins. The method is inspired by deep mutation scanning experiments on individual proteins (Fowler et al., 2010; Fowler and Fields, 2014) and uses physical linkage on a plasmid to read out the frequency of each pair of mutations after a competitive selection for growth dependent on the physical interaction between two proteins (Figure 1B, see Materials and methods). Briefly, the two proteins of interest are fused to complementary halves of a methotrexate-resistant variant of murine dihydrofolate reductase and expressed in yeast. If the two proteins interact, the two fragments complement each other and reform an active enzyme, allowing growth in the presence of methotrexate. PCA is highly quantitative because the growth rate is correlated to the abundance of the complementation complex (Freschi et al., 2013; Levy et al., 2014; Schlecht et al., 2012) so cells expressing strongly interacting variants of the two proteins will hence grow faster and be enriched in the population while cells expressing weakly interacting variants and variants that don’t interact will be depleted. These changes in frequency between the pre- and post-selection populations (input and output, respectively) are then quantified by paired-end deep sequencing. The final PPI score quantifies the strength of interaction relative to the wild-type protein (Figure 1B).

We used deepPCA to quantify the effects of systematically mutating the leucine zipper domains of FOS and JUN. We obtained reliable (input reads > 10 and output reads > 0; Figure 1—figure supplement 1A,B; see Materials and methods) measurements for 607 and 608 of the 608 (32 positions x 19 substitutions) possible single amino acid (aa) changes within the targeted regions of Fos and Jun, respectively. PPI scores measured by deepPCA are highly reproducible between biological replicates (mean Pearson correlation R = 0.95 between the three pairs of replicates, n = 108,840 mutation combinations, Figure 1C, Figure 1—figure supplement 1C and Supplementary file 1) and also with mutation effects tested individually (R = 0.95 for 14 variants chosen randomly, Figure 1D).

The PPI scores for single amino acid changes in both proteins show a bimodal distribution (Figure 2—figure supplement 1A), with ~20% and 15% of substitutions severely detrimental for the interaction and significantly different from the wild-type (PPI score ≤ 0.64, FDR < 0.05, one sample t-test against a mean of 1; Figure 2—figure supplement 1B). However, the individual substitutions altered the interaction across the entire dynamic range, with 25 and 10 aa changes in each protein strengthening the interaction (PPI score > 1.04, FDR < 0.05, average SEM of these 35 variants = 0.0054; Figure 2—figure supplement 1C).

Determinants of single mutant outcome

Mutations in the hydrophobic core of the interaction interface (heptad positions a and d) are most detrimental, followed by mutations at salt-bridge positions (positions e and g, Figure 2A–B). Mutations in the hydrophilic far side of the zipper (positions b, c and f) were generally of small effect (Figure 2A–C). Changes in the physico-chemical properties of the amino acids (hydrophobicity, charge, α-helical stability etc, see Supplementary file 2) provide good prediction of the mutation effects (percentage of variance explained from 35% to 98% across Fos and Jun positions), with properties related to α-helical stability most informative for predicting single mutation outcomes (Figure 2—figure supplement 1D and Supplementary file 2).

Figure 2. Effects of single mutants.

(A) Heatmap of single mutant PPI scores averaged between the three replicates. Letters inside the heatmap represent the wild-type amino acid. White represents missing data. (B) Distribution of PPI scores per position types. p-Values from Welch t-test. (C) Average PPI score per position overlaid on the crystal structure (pdb: 1fos). Black and gray represent positions not mutated in Fos and Jun, respectively. (D) Scatter plot between PPI scores of corresponding single mutations at the same positions in Fos and Jun. Error bars represent 95% confidence intervals. (E) Scatter plot between average PPI score per corresponding positions in Fos and Jun. The number represents the heptad and the letter the position inside the heptad.

Figure 2.

Figure 2—figure supplement 1. Single mutant effects.

Figure 2—figure supplement 1.

(A) Distribution of single mutants effects. (B) Proportion of single mutants that strongly decrease PPI strength (FDR < 0.05, one sample t-test with mu = 1) at different PPI score thresholds. (C) Proportion of single mutants that significantly increase PPI strength (FDR < 0.05, one sample t-test with mu = 1) at different PPI score thresholds. (D) Maximal proportion of variance explained for each residue by a single amino acid feature (see Supplementary file 2 for predictive features).

Identical substitutions in the same positions in Fos and Jun often had similar effects. For instance, mutations in one protein that disrupted the interaction (PPI scores < 0.64) were also very likely to disrupt the interaction when made in the other protein (odds ratio = 31.6, p<2.2×10−16, Fisher’s exact test; Figure 2D). Near neutral or strengthening mutations (PPI scores > 0.96) in one protein were also more likely to have a similar effect in the other one (odds ratio = 7.3, p < 2.2×10−16, Fisher’s exact test). However, a substantial number of substitutions had effects that differed between the two proteins (n = 381 out of 581, FDR < 0.05, paired t-test between the three replicate measurements in Fos and Jun), underlining the importance the structural context in which they occur. These mutations are enriched in intermediate effects in one or both proteins (odds ratio = 8.1, p < 2.2×10−16, Fisher’s exact test). The average PPI score per position was also generally conserved between the two proteins, but revealed positions asymmetrically involved in the interaction such as the salt bridge positions (Figure 2E).

trans genetic interactions between mutations in Fos and Jun

Considering pairs of substitutions in the two proteins, we obtained data for 107,625 of the 369,664 possible double mutants (input read count above 10 and output read count above 0, Supplementary file 1, see Materials and methods). The double mutant PPI scores also show a bimodal distribution, but with proportionally more severely detrimental (~26%) and fewer near-neutral outcomes (~21%) than for the single mutants (Figure 3A).

Figure 3. A thermodynamic model predicts double mutant outcomes.

(A) Distribution of double mutants PPI scores compared to Fos and Jun single mutants. (B) Observed double mutants PPI scores against the scores predicted by a multiplicative model. (C) Pie chart array of genetic interaction score bins by Fos (x-axis) and Jun (y-axis) single mutant PPI score bins for genetic interactions calculated from the multiplicative model. (D) Fitted thermodynamic model (see Materials and methods). Red arrows illustrate how the sigmoidal function can lead to a different prediction than the multiplicative model. The three fitted free parameters are shown (AT/ABwt and BT/ABwt represent the total concentration of the two proteins relative to the concentration of the wild-type complex and b/ABwt represents the background growth relative to the concentration of the wild-type complex, see Materials and methods). ΔΔGs1, ΔΔGs2 and ΔΔGd represent the change in free energy relative to the wild-type complex of the two single mutants and the double mutant, repsectively. (E) 3D scatter plot of double mutants PPI scores (z-axis) as a function of the corresponding Fos (x-axis) and Jun (y-axis) single mutants PPI scores with the fitted surface from the thermodynamic model. Dot color represents genetic interaction scores according to the color scale in (C). (F) Pie chart array of genetic interaction score bins by Fos (x-axis) and Jun (y-axis) single mutants PPI score bins for genetic interactions calculated from the thermodynamic model. (G) Average genetic interaction score across all double mutants involving a given Fos (left) or Jun (right) single mutant in function of its PPI score for the multiplicative (top) and thermodynamic (bottom) models. Data in all panels is for replicate 1.

Figure 3.

Figure 3—figure supplement 1. Correlations between genetic interaction scores from the three biological replicates of the trans library selection.

Figure 3—figure supplement 1.

Genetic interactions calculated using the multiplicative model.
Figure 3—figure supplement 2. Fitting the thermodynamic model on the trans library data.

Figure 3—figure supplement 2.

(A) Parameter search. Proportion of variance explained as a function of the three parameters of the thermodynamic model. The third parameter (b/ABWT) is represented as surfaces of different colors. The black dot represents the highest percentage of variance explained. (B) Result of Monte-Carlo cross-validation to ensure that the model is not over-fitted. The procedure is described in the Materials and methods section. The median of the free parameters over 1000 sub-samplings is identical, confirming that the model is not over-fitted. (C) Correlations between genetic interaction scores from the three biological replicates calculated from the thermodynamic model. (D) Distribution of the p-values from the one-sample t-tests performed on the genetic interaction scores (i.e. residuals from the thermodynamic model) from three replicates of each double mutant. (E) FDR calculated by permutation of genetic interaction scores (see method) as a function of the p-value threshold. (F) Volcano plot of the p-values of each double mutant as a function of the corresponding average genetic interaction scores. Vertical red lines represent the magnitude threshold used to define strong genetic interactions. The horizontal red line represents the p-value threshold that corresponds to a FDR of 0.2. (G) Ratio of significantly negative to positive genetic interactions at different magnitude thresholds showing that negative genetic interactions are more prevalent than positive genetic interactions.

The outcome of the double mutations was well predicted by multiplying the PPI scores of the constitutive single mutants (percentage of variance explained of 85–86% in all three replicates, Figure 3B), that is, by assuming no genetic interaction between mutations. We calculated a genetic interaction score for each double mutant as the difference between the observed and predicted PPI scores (Supplementary file 1). Negative and positive genetic interactions (16,394 and 11,653 cases, respectively, at a 20% FDR, one-sample t-test) thus represent double mutants with lower or higher interaction strength than expected, respectively. The genetic interaction scores are well correlated between replicates with a distribution centered on zero and long tails of positive and negative scores (Figure 3—figure supplement 1). Thus, as observed in other systems (Araya et al., 2012; Olson et al., 2014), genetic interactions make an important contribution to the outcome of double mutations.

Global dependencies in the genetic interaction landscape

The genetic interaction scores are, however, strongly dependent on the single mutant PPI scores (Figure 3C). Combining two mutants that both moderately reduce PPI strength is likely to result in a negative genetic interaction (Figure 3C). Positive genetic interactions are, however, generally detected between two mutations that greatly weaken the interaction and also often when combining strength-increasing and strength-decreasing mutations (Figure 3C).

A thermodynamic model accurately predicts double mutant outcome

To account for these trends, we considered the thermodynamics of a PPI, relating the concentration of the bound and total subunits to the free energy of a dimeric complex (equation 9 in the Materials and methods). This model has only three free parameters that need to be fitted from the data, representing the total concentration of each protein and the background growth in the PCA selection (see Materials and methods). In the model the changes in free energy (ΔΔG, expressed in arbitrary units) for the mutations are additive but there is a sigmoidal relationship between PPI scores and ΔΔGs (Figure 3D).

Fitting the three parameters from the data (Figure 3—figure supplement 2A–B) reveals that the model provides very good prediction of how mutations in the two proteins combine together (percentage of variance explained of 89–90% in all three replicates, n = 107,618 mutation combinations, Figure 3E). The model also removes the systematic trend in the genetic interaction scores across mutation pairs with different individual effects (Figure 3F–G). Indeed, because of the sigmoidal nature of the model, a single mutant that decreases ΔΔG will increase PPI scores to a lower extent in the wild-type context than when combined with a mutation that destabilized the complex because of the saturation effect caused by the plateau of the sigmoid.

Specific interactions between structurally-related mutations

To investigate the remaining genetic interactions not accounted for by the thermodynamic model, we calculated residual genetic interaction scores as the difference between the observed double mutant PPI score and the thermodynamic model prediction. These new genetic interaction scores also correlate well amongst the three replicates, with a narrow peak centered on zero interaction and long tails of rare strong positive and negative genetic interactions (Figure 3—figure supplement 2C).

We observed more cases of strong negative (1711, 1.6%) than positive (883, 0.82%) genetic interactions (absolute score > 0.1, FDR = 0.2, Figure 3—figure supplement 2D–G). These strong interactions are enriched between particular Fos and Jun residues (Figure 4—figure supplement 1), with positive genetic interactions concentrated between positions close in the sequence of heptad positions (along the diagonal of the matrix in Figure 4A) and negative genetic interactions more spread-out in the structure and less enriched between specific pairs of positions (Figure 4A–B and Figure 4—figure supplement 2). Both directions of interaction are enriched between residues at the interface of the PPI and between residues close in space (Figure 4C–D and Figure 4—figure supplement 3), with this stronger for positive than for negative interactions. Positive interactions are therefore particularly enriched between contacting residues, identifying ‘lock and key’ specificity residues (Horovitz, 1996). We refer to these interactions beyond the interactions predictable from the global thermodynamic model as structural genetic interaction. For instance, in the wild-type PPI, the Glu residue in position 3g of Fos establishes a salt-bridge with the Arg residue in position 4e of Jun (Figure 4E). The individual mutations Glu3gLys and Arg4eGlu both destabilize the PPI by replacing the salt-bridge by repulsive electro-static interactions (Glu-Arg replaced by Lys-Arg and Glu-Glu with average PPI scores of 0.84 and 0.71, respectively). However, the two mutations compensate each other by recreating the salt-bridge (Glu-Arg replaced by Lys-Glu) and restoring a neutral PPI score of 0.98. Additional examples are shown in Figure 4E.

Figure 4. Structural genetic interactions.

(A) Heatmap (top) and distribution (bottom) of percentage of significantly (absolute genetic interaction score > 0.1, FDR < 0.2) positive (left) or negative (right) genetic interactions per pairs of position. Pairs of positions without any significant interactions were excluded from the distribution (bottom). (B) Pairs of position significantly enriched in positive (green) or negative (purple) genetic interactions (Fisher’s exact test, FDR = 10%). Each pair of position was classified according to its heptad position type and the distance between the two position (bottom matrix, yellow cells). (C–D) Enrichment for significantly positive (green) or negative (purple) genetic interactions between different position types (C) and at different distance threshold between the two positions (D). *, FDR < 0.1. **, FDR < 0.01. ***, FDR < 0.001. n.s., non-significant. (E) Example of structural interactions in the Fos-Jun complex (pdb: 1fos). Dashed yellow lines represent polar interactions predicted by pymol and the mutant structures were drawn using the pymol mutagenesis function.

Figure 4.

Figure 4—figure supplement 1. Distribution of number of significant genetic interactions for Fos or Jun single mutants at different magnitude thresholds.

Figure 4—figure supplement 1.

Each row of the heatmap is the distribution of the number of interactions at that magnitude threshold. FDR < 0.2.
Figure 4—figure supplement 2. Robustness of enrichments in position pairs for significant (FDR < 0.2) genetic interactions at different magnitude thresholds.

Figure 4—figure supplement 2.

Pairs of positions presented as significantly enriched in Figure 4C (magnitude threshold of 0.1). Enrichments and their corresponding FDRs were calculated at all 1024 pairs of positions, for each magnitude threshold independently.
Figure 4—figure supplement 3. Robustness of enrichments in structural features for significant (FDR < 0.2) genetic interactions at different magnitude thresholds.

Figure 4—figure supplement 3.

(A) Enrichments for different position types. Enrichments and their corresponding FDRs were calculated for each magnitude threshold independently. (B) Enrichments for pairs of mutations at difference distance threshold in the structure (pdb: 1fos). (C) Enrichments for position type (left) and for pairs of mutations at different distance thresholds with a more stringent input read count filter (> 100 input reads in any of three replicates), showing that the trends are the same even though the significance is lower with this stringent filter due to the smaller size of the dataset (n = 17,758 when filtering at 100 input reads compared to 107,618 when filtering at 10 input reads). Enrichments and their corresponding FDRs were calculated for each magnitude threshold independently.

Comparing genetic interactions in cis and trans

In addition to combining pairs of mutations in the two different proteins, we also quantified the effects of 17,688 double amino acid changes within Fos alone (99% of cis double mutant combinations reachable through combinations of single nucleotide changes; for all comparisons between the trans and cis libraries below, we only consider mutants reachable by single nucleotide changes in both libraries; Figure 5A, Figure 5—figure supplement 1A and Supplementary file 3 and 4). The PPI scores for single mutants correlate very well between the two libraries (R = 0.96, Figure 5—figure supplement 1B), further validating the reproducibility of the deepPCA method.

Figure 5. Comparison of double mutant mutation outcome and genetic interactions in cis and trans.

(A) Cartoon illustrating how the cis library differs from the trans one. NNS, whole codon substitution. Asterisk, point mutation. (B) Scatter plot between Average PPI scores of identical pairs of mutations at the same positions in cis and trans. (C) Distribution of double mutants PPI scores in the original cis and trans libraries or after matching their single mutant effects distributions. Error bars represents 95% confidence intervals around the mean of 1000 sub-samplings when matching the two libraries. (D) Stacked bar-plot showing the proportion of the non-random variance in double mutant PPI scores that is not accounted for by the multiplicative model, explained by the thermodynamic model and the residual structural genetic interactions. Error bars represent the standard error of the mean. (E) Proportion of significant positive and negative genetic interactions in the two original libraries. See Figure 5—figure supplements 2, 3, 7 and 9 for sub-sampled libraries with matched single mutant effect distributions.

Figure 5.

Figure 5—figure supplement 1. PPI scores for the cis library.

Figure 5—figure supplement 1.

(A) Correlation between PPI scores from the three biological replicates of the cis mutation library selection. Contains both single (n = 190) and double (n = 17,290) mutations. Variants with less than 10 UMIs in any of the three replicate input samples or without any UMI in any of the replicate output samples were filtered out. (B) Correlation between single amino acid change PPI scores in the trans and cis libraries for all mutations reachable through a single nucleotide substitution.
Figure 5—figure supplement 2. Proportion of cis and trans double mutants classified as strengthening, intermediate effect or severely detrimental using different score thresholds.

Figure 5—figure supplement 2.

The libraries where sub-sampled to match the distribution of the single mutant PPI scores as described in the method section. (A) Distribution of single mutants PPI scores before (solid lines) after (bars) sub-sampling. Mutants 1 and 2 correspond respectively to Fos and Jun in the trans library or to Fos single mutants randomly attributed to mutant 1 or 2 in each sub-sample of the cis library. For the original cis library, the distribution corresponds to pooling mutants 1 and 2 together dividing the counts by 2. (B) Double mutants stronger than WT. (C) Double mutants severely detrimental. (D–E) Double mutants of intermediate effect, varying the lower and upper thresholds, respectively.
Figure 5—figure supplement 3. Fitting the thermodynamic model on both the trans and cis data.

Figure 5—figure supplement 3.

(A) Proportion of total variance explained in both datasets when fitting the model on either the trans or the cis data only. The models were fitted on the pooled replicates. Percentage of variance explained were then calculated for each replicate independently. (B) Distributions of the proportion of variance explained in 1000 sub-samplings to match the distribution of single mutant effects between the two libraries, showing that the model fitting does not depend on the differences in effect size distributions between the original libraries. (C) Average proportion of the non-random variance in double mutants PPI scores not accounted for by the multiplicative model and explained by the thermodynamic model in the 1000 sub-sampled libraries. The model was fitted on the two original libraries pooled together. The percentage of variance explained was then calculated for each replicate independently. 95% confidence intervals around the mean of the 1000 sub-samplings were not plotted because they are negligible (0.0028 for trans and 0.00058 for cis).
Figure 5—figure supplement 4. Significant structural genetic interactions in cis and trans.

Figure 5—figure supplement 4.

(A) Correlation of structural genetic interaction scores between the three replicates of each library. Genetic interaction scores are from the thermodynamic model fitted on the two original libraries pooled together. (B) Volcano plot of the p-values of each double mutant as a function of the corresponding average genetic interaction scores. Data from the original libraries (not sub-sampled). Red lines represent the thresholds that were used to define the strong and significant genetic interactions used to compute the enrichment in structural features. Vertical lines, magnitude thresholds. Horizontal line, p-value threshold that corresponds to a FDR of 0.2.
Figure 5—figure supplement 5. Comparisons of the patterns of structural genetic interactions in cis and trans.

Figure 5—figure supplement 5.

(A–B) Enrichment (Fisher’s exact test) for strong positive (green) or negative (purple) genetic interactions at different distance thresholds between the two mutated positions (A) and between different position types (B). These enrichments were also calculated after filtering out variants with 100 or less reads in any of the replicates input (bottom barplots on both panels), showing that the trends are the same even though the significance is lower with the stringent filter due to the smaller size of the dataset, especially for the trans library. The trans data analyzed here is restricted to amino acid substitutions reachable through single nucleotide changes. n = 17,290 and 14,118 for cis and trans, respectively, when filtering at 10 input reads, and 15,198 and 2,807, respectively, when filtering at 100 input reads. *, FDR < 0.1. **, FDR < 0.01. ***, FDR < 0.001. n.s., non significant.
Figure 5—figure supplement 6. Comparisons of the pairs of positions enriched in cis and trans.

Figure 5—figure supplement 6.

(A) Pairs of positions significantly enriched in positive (green) or negative (purple) genetic interactions (Fisher’s exact test, FDR = 10%). Each pair of positions is also classified according to its heptad position type and the distance between the two positions (bottom matrix, yellow cells). (B) Heatmap (top) and distribution (bottom) of percentage of significantly positive (left) or negative (right) genetic interactions per pairs of position. The trans data analyzed here is restricted to amino acid substitutions reachable through single nucleotide changes.
Figure 5—figure supplement 7. Robustness of the enrichments in structural features when sub-sampling to match single mutant effects.

Figure 5—figure supplement 7.

Enrichments calculated from one of the sub-samples picked randomly. (A–B) Enrichment (Fisher’s exact test) for strong positive (green) or negative (purple) genetic interactions at different distance thresholds between the two mutated positions (A) and between different position types (B). *, FDR < 0.1. **, FDR < 0.01. ***, FDR < 0.001. n.s., non significant.
Figure 5—figure supplement 8. Robustness of enrichments in significant (FDR < 0.2) genetic interactions at different magnitude thresholds in the cis and trans libraries.

Figure 5—figure supplement 8.

(A) Pairs of positions presented as significantly enriched in Figure 5—figure supplement 6A (magnitude threshold of 0.1). Enrichments and their corresponding FDRs were calculated at all 1024 pairs of positions, for each magnitude threshold independently. (B) Enrichments for different position types. Enrichments and their corresponding FDRs were calculated for each magnitude threshold independently. (C) Enrichments for pairs of mutations at difference distance threshold in the structure (pdb: 1fos). Enrichments and their corresponding FDRs were calculated for each magnitude threshold independently.
Figure 5—figure supplement 9. Comparisons of the extent of structural genetic interactions in cis and trans.

Figure 5—figure supplement 9.

(A–B) Proportion of true positive (A) or negative (B) genetic interactions in the original (solid lines) or sub-sampled (open circles) libraries at different p-value cut-offs, showing that the higher prevalence of both positive and negative interactions in the cis library is not the consequence of the different distribution of single mutants effects. Error bars represent 95% confidence intervals around the mean of the 1000 sub-samplings. True interactions are calculated as the total number of interactions called significant at a given p-value threshold multiplied by the true discovery rate (1-FDR) corresponding to that p-value threshold in that library.

There is a good correlation between the PPI scores of cis and trans double mutants consisting of exactly the same pairs of substitutions at the same positions (R = 0.77, n = 5451 identical double mutants quantified in cis and trans, Figure 5B). This correlation indicates that the structural determinants of mutation effects in FOS and JUN remain well conserved despite sequence divergence over long evolutionary timescales. However, the distributions of double mutant effects are quite different for the cis and trans combinations (Figure 5C). This could be either due to different levels of genetic interactions or merely to the combination of different distribution of single mutant effects (FOS x FOS in cis and FOS x JUN in trans). To control for differences in the distributions of single mutant effects in the two libraries, we sub-sampled the libraries to match their single mutant effect distributions (Figure 5—figure supplement 2A, see Materials and methods). This revealed that, even when controlling for single mutant effect sizes, two mutations within Fos are more likely to increase the strength of the PPI than one mutation in Fos combined with a second mutation in Jun (p < 10−3 over 1000 sub-samplings, 5.2% vs 3.4%, respectively, for PPI scores > 1.04, Figure 5—figure supplement 2B). Two mutations in Fos are slightly less likely to destroy the PPI than a trans mutation combination (25.5% vs 27.8%, respectively, p < 10−3 for PPI scores < 0.64, Figure 5—figure supplement 2C) but have slightly more intermediate negative effects (p < 10−3, 39.4% vs 35.3%, respectively, for PPI scores between 0.64 and 0.92, Figure 5—figure supplement 2D–E). Whether mutations of the same individual effect sizes combine together in cis or in trans therefore influences the double mutant outcome.

The thermodynamic model also accurately predicts interactions in cis

Because leucine zippers, including Fos and Jun, fold upon binding (Patel et al., 1990; Thompson et al., 1993), the same thermodynamic model based on a two-state equilibrium between the two unfolded proteins and the complex can describe how mutations combine in cis as well as in trans. We tested how well the thermodynamic model with parameters fitted on the trans double mutants predicted the cis library data and found that it gave very good prediction (percentage of variance explained of 82–83% for cis vs. 90–91% for trans combinations, Figure 5—figure supplement 3A–B). Similarly, fitting the thermodynamic model on the cis double mutants gave very good prediction of the trans library data (percentage of variance explained of 84% for cis and 90% for trans, Figure 5—figure supplement 3A–B). A common thermodynamic model therefore accounts very well for how mutations combine in both cis and trans to change the PPI (Figure 5D and Figure 5—figure supplement 3C). Therefore, just as in trans-, cis-genetic interactions have a component that results from the non-linear relationship between free energy and protein complex concentration.

Structural interactions in the cis interaction landscape

We then tested whether the residual component of cis-genetic interactions are also enriched for structural interactions. The strongest cases of structural cis interactions (absolute genetic interaction score > 0.1 and FDR < 0.2 in both libraries, Figure 5—figure supplement 4) are indeed also enriched between proximally located residues but are less restricted to pairs of positions that are both at the PPI interface and involve more far side positions compared to trans-genetic interactions (Figure 5—figure supplement 5). cis-genetic interactions are also less enriched at specific positions and more dispersed throughout the structure (Figure 5—figure supplement 6). These results are robust to the magnitude threshold used to call strong genetic interactions and to the differences in single mutant effects between the two libraries (Figure 5—figure supplement 7 and Figure 5—figure supplement 8). Thus, cis-genetic interactions can be subdivided into the same two components as trans-genetic interactions, genetic interactions that results from the non-linearity of the general relationship between protein complex concentration and free energy and specific structural interactions.

Structural interactions are more abundant in cis than in trans

Interestingly, structural genetic interactions explain more of the variance in double mutant PPI scores when mutations are combined in cis than in trans (Figure 5D). Indeed, both positive and negative structural genetic interactions (genetic interactions not accounted for by the thermodynamic model; Supplementary files 3 and 4) are more abundant in cis than in trans (1493 vs. 835 true cases of positive and 1319 vs. 1128 true cases of negative genetic interactions in cis and trans, respectively, at p < 0.031, one sample t-test, FDR < 0.15 and 0.2; Figure 5E and Figure 5—figure supplement 9). This higher prevalence of genetic interactions in cis can potentially be explained by a higher number of contacts between positions within FOS than across the interface (1040 cis vs. 518 trans mutants pairs at positions within 5 Å of each other), supporting our previous result that proximity is a major determinant of genetic interactions both in cis and trans in a PPI interface.

Discussion

Here, we have presented a protein-protein interaction assay – deepPCA – that allowed us to quantify the effects of >120,000 combinations of mutations in both cis and trans on the physical interaction between the products of the FOS and JUN proto-oncogenes. This provided a comprehensive and systematic data for how a very large number of different mutations in two genes combine to alter a biological function, allowing us to investigate the causes of genetic interactions between mutated genes and the extent to which genetic interactions can be quantitatively predicted. In its current form, deepPCA is limited to small domains because of the limit in the amplicon size for paired-end sequencing. However, the use of barcodes (Hiatt et al., 2010) would allow the assay to be applied to longer proteins.

Our data reveal that physical interactions in the cell generate two distinct types of genetic interaction: interactions due to the sigmoidal relationship between the concentration of a protein complex and the free energy of an interaction (Figure 3) and specific, structural interactions (Figure 4).

The general genetic interactions that arise in the physical interaction between molecules is one of several non-linear mappings that can occur between changes in genotype and changes in phenotype. Additional non-linearities occur in the folding of individual proteins or RNAs (‘threshold robustness’) (Bershtein et al., 2006; Olson et al., 2014; Tokuriki and Tawfik, 2009), in saturating enzyme flux (Kacser and Burns, 1973; Stiffler et al., 2015), and in regulatory dynamics (Gjuvsland et al., 2007; Omholt et al., 2000).

This type of genetic interaction is cumulative and easily predictable, for example a three parameter thermodynamic model accounts for ~90% of the variance in our dataset of >120,000 genotypes. The magnitude of this type of genetic interaction can also be predicted when combining three or more mutations together. A better knowledge of all the sources of non-linearities between the genotype and the phenotype is therefore critical to model how genotypic variation translates into phenotypic changes.

The second type of genetic interactions generated by molecular interactions is thermodynamically non-additive. These interactions are enriched between physically contacting and proximal residues, but can also involve some long-range indirect interactions (Halabi et al., 2009). Structural genetic interactions have a more complex basis and are therefore more difficult to predict. Gathering comprehensive data similar to that described here for additional PPIs will help to further elucidate the structural determinants of genetic interactions and the rules for predicting them.

This second type of genetic interactions generated by a protein-protein interaction was more important when combining mutations in cis within the same protein than when combining mutations in trans between the two molecules. This results in a different distribution of double mutant outcomes when combining mutations in cis and trans. Whether a second mutation happens in cis or in trans can therefore impact an evolutionary outcome.

A substantial fraction of genetic interactions could however not be explained by structural contacts. Some other mechanisms not accounted for by the model could be at play. For instance, non-linearities between the growth rate and complementation complex in the protein-complementation assay could artificially produce genetic interactions. However, such saturation effects are unlikely in the range of expression and binding affinities in this study because they would lead to diminishing returns when combining two strength-increasing mutations, which is not observed (Figure 3C). Moreover, Levy et al. have shown that growth is correlated to complementation complex concentration over a wide-range of concentrations (Levy et al., 2014). A more likely source of actual genetic interactions could come from JUN’s ability to form homodimers, which are however less stable than the Fos-JunN heterodimer (Chinenov and Kerppola, 2001). Mutations affecting the equilibrium between the Jun-Jun homodimer and the Fos-Jun heterodimer could indeed have effects that would not be predicted by our thermodynamic model. Elucidating the remaining mechanisms of genetic interactions will thus require further studies that take these effects into account.

Our approach complements the large-scale efforts to comprehensively map genetic interactions between gene deletions or representative alleles of yeast genes (Tong et al., 2004; Costanzo et al., 2010; Costanzo et al., 2016). Gene deletions are, however, rare in nature, with most genetic variation consisting of point mutations of diverse and difficult to predict effects. Our data provides a comprehensive view of how point mutations within two genes interact to affect a biological function. It will be interesting to extend this strategy to quantifying the effects of point mutation combinations on additional phenotypes beyond PPIs, including applying it to gene pairs that do not encode directly physically interacting proteins but instead participate in regulatory interactions or the same biological process.

Materials and methods

All experiments were performed in triplicates starting from the transformation of the variant libraries into yeast.

Deep sequencing data is available at GEO with accession number GSE102901 (reviewer token: yvgbkwqajvypvah).

All perl and R scripts used to analyze the data are available at https://github.com/gdiss/Diss_et_al_eLife_2018 (Diss, 2018; copy archived at https://github.com/elifesciences-publications/Diss_et_al_eLife_2018)

Yeast strain

All experiments were performed in BY4742 (MATα his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0).

Media and buffer recipes

  • LB: 10 g/L Bacto-tryptone, 5 g/L Yeast extract, 10 g/L NaCl. Autoclaved 20 min at 120˚C.

  • YPD: 20 g/L glucose, 20 g/L Peptone, 10 g/L Yeast extract. Autoclaved 20 min at 120˚C.

  • YPDA: YPD +40 mg/L adenine sulfate.

  • SORB: 1 M sorbitol, 100 mM LiOAc, 10 mM Tris pH 8.0, 1 mM EDTA pH 8.0. Filter sterilized (0.2 μm Nylon membrane, ThermoScientific, Waltham, Massachusetts).

  • Plate mixture: 40% PEG3350, 100 mM LiOAc, 10 mM Tris-HCl pH 8.0, 1 mM EDTA pH 8.0. Filter sterilized (0.2 μm Nylon membrane, ThermoScientific, Waltham, Massachusetts).

  • Recovery medium: YPD +0.5 M sorbitol. Filter sterilized (0.2 μm Nylon membrane, ThermoScientific, Waltham, Massachusetts).

  • SC -ura: 6.7 g/L Yeast Nitrogen base without amino acid, 20 g/L glucose, 0.77 g/L complete supplement mixture drop-out without uracile. Filter sterilized (0.2 μm Nylon membrane, ThermoScientific, Waltham, Massachusetts).

  • SC –ura/ade/met: 6.7 g/L Yeast Nitrogen base without amino acid, 20 g/L glucose, 0.74 g/L complete supplement mixture drop-out without uracile, adenine and methionine. Filter sterilized (0.2 μm Nylon membrane, ThermoScientific, Waltham, Massachusetts).

  • Competition medium: SC –ura/ade/met + 200 μg/mL methotrexate (BioShop Canada Inc., Canada), 2% DMSO.

  • DNA extraction buffer: 2% Triton-X, 1% SDS, 100 mM NaCl, 10 mM Tris-HCl pH 8.0, 1 mM EDTA pH 8.0.

Plasmid backbone construction

Two intermediate plasmids were constructed each carrying the CYC promoter, a DHFR F[1,2] (GGGGS)4 linker fusion or DHFR F[3] (GGGGS)3-(GGGAS) linker fusion, respectively, a CYC terminator and a LEU2 or URA3 selection cassette, respectively. The backbone fragments containing the CYC terminator, selection cassette and propagation elements were amplified from pAG415GPD-ccdB-EGFP and pAG416GPD-ccdB-EGFP (Addgene plasmid # 14196), respectively, by Polymerase Chain Reaction (PCR) using primer pairs OGD075-OGD077 and OGD076-OGD077, respectively. The CYC promoter fragment (identical for both constructs) was amplified by PCR from pYM-N10 (Janke et al., 2004) using primer pair OGD041-OGD042. The DHFR F[1,2] and DHFR F[3] fragments were amplified by PCR from pAG25-D1,2 and pAG32-D3 (Tarassov et al., 2008), respectively, using primer pairs OGD051-OGD080 and OGD053-OGD081, respectively. The linkers were encoded as overhangs on primers used to amplify DHFR fragments (3’ side) and plasmid backbones (CYC terminator 5’ side). All PCR fragments were purified using a QIAquick PCR purification kit (Qiagen, Netherlands). Plasmids were constructed by Gibson assembly using 15 fmol of backbone fragment, 45 fmol of DHFR fragment and 45 fmol of promoter fragment in 20 μL reactions. The reactions were incubated at 50˚C for 60 min and 1 μL was transformed into homemade DH5α chemo-competent cells. Correct assembly was checked by colony PCR and the absence of mutations in the coding region by Sanger sequencing. The resulting sequence-confirmed plasmids were named pGD002 and pGD006, respectively.

The complete empty vector was then assembled by inserting the CYC promoter – DHFR F[1,2] – Linker fragment (amplified by PCR from pGD002 using primer pair OGD089-OGD101) into pGD006 (linearized by PCR using primer pair OGD087-OGD099) at the 3’ end of the CYC terminator. All fragments were purified using a QIAquick PCR purification kit (Qiagen, Netherlands). Gibson assembly was performed as described above with 30 fmol of backbone and 90 fmol of fragment and transformed in homemade DH5α chemo-competent cells. Correct assembly was checked by colony PCR and the absence of mutations in the coding region by Sanger sequencing. The resulting sequence-confirmed plasmid was named pGD009.

Cloning of FOS and JUN into the plasmid backbone

FOS and JUN basic leucine zipper domains, as defined by SMART (Letunic et al., 2015), were amplified from human cDNA provided by Juan Valcarcel using primer pairs OGD098-OGD110 and OGD113-OGD125, respectively. These fragments correspond to residues 135 to 199 and 250 to 314, respectively. The CYC terminator was amplified by PCR using primer pair OGD099-OGD135. The backbone was prepared by digestion with BamHI and NheI. All fragments were purified using a QIAquick PCR purification kit (Qiagen, Netherlands). Gibson assembly was performed as described above with 10 fmol of backbone and 50 fmol of each of the CYC terminator FOS and JUN fragments. Correct assembly was checked by colony PCR and the absence of mutations in the coding region by Sanger sequencing. The resulting sequence-confirmed plasmid was named pGD012.

trans library

Experimental design

Thirty-two codons in FOS and JUN were mutated using NNS primers (S = G or C), that is replacing the wild-type codon by one of 31 or 32 mutant codons (if the wild-type codon ends in A or T, all 32 codons on the primer are mutagenic) able to encode all 20 amino acids plus one stop. Assuming equal representation of all variants, replacing 32 positions by one of 32 codons leads to 1024 combinations. However, because 17/32 codons in wild-type FOS end in G or C, there are 1007 unique variant sequences each present at a frequency of 1/1024 and one wild-type sequence present at a frequency of 17/1024. Similarly, there are 24 codons in wild-type JUN ending in G or C. There are thus 1000 unique variants each present at a frequency of 1/1024 and one wild-type sequence present at a frequency of 24/1024. Consequently, when combined, the resulting library design encodes 1,007,000 unique double codon mutants each present at a frequency of 1/10242, 1007 unique single FOS codon mutants each present at a frequency of 24/10242, 1000 unique single JUN codon mutants each present at a frequency of 17/10242 and one wild-type sequence present at a frequency of 17 × 24/10242. The frequency of double amino acids mutants where both amino acids can be encoded by a single possible codon substitution each is thus equal to the frequency of double codon mutants, that is flow = 1/10242 = 9.5×10−7. All the volumes for all reaction and culturing conditions, as well as the number of reads obtained through deep sequencing, were determined to ensure that no amino acid variant would be lost throughout the library construction process, the competition assay, the sequencing libraries preparation and the sequencing itself. Since some amino acids are encoded by a single codon, these frequencies where calculated using the codon as the unit. Additionally, the competition had to be run for a number of generations that had to be not too low, to ensure correct enrichment for strongly interacting variants, but also not too high, to ensure that the weakly interacting variants could still be quantified.

The mutagenic PCR steps caused no bottlenecks because the picomoles of mutagenic primers used in the reactions represented ~1×1012 molecules. The Gibson assembly was also performed with picomoles of FOS and JUN variants to ensure that inefficient assembly would not create a bottleneck. Following the recommendations of Fowler et al. (2014), we targeted at least 10/flow ≈ 107 bacterial and yeast transformants. Post-transformation selections (i.e. selection of transformants, before the actual competition) and competitions were performed in 1 L so that there were ~500 cells per variant when inoculating at an OD600nm of 0.05 (cell density of 107 cells / OD/ mL x 0.05 OD x 1000 mL x flow ≈ 500). Two cycles of competition from an OD of 0.05 to 1.6 would thus lead to 10 population generations. An estimation of three to four generations of background growth (i.e. where weakly interacting variant grow as fast as strongly interacting variant at the beginning of the competition because of non-depleted cellular resources), leads to six to seven generations of effective competition. Under these settings, a non-functional variant (i.e. a variant that interact so weakly that the cell cannot grow) would decrease in frequency by 1/26 to 1/27 ≈ 64 to 128 fold. A sequencing depth of 100 × 1/flow ≈ 108 reads per replicate input or output would thus lead to an average of 100 reads per variant in the input and ~1 read per dead variant in the output. A high number of reads is important to limit noise from stochastic sampling during cluster formation on the sequencing flow cell. However, as described below, the six samples libraries were sequenced on a single lane of an Illumina HiSeq 2500 for an average of ~4.1×107 reads per sample, which was found to be enough for good quality data. The culture volume of 1 L also ensures that no bottleneck is created during DNA extraction (cell density of 107 cells / OD/ mL * minimum of 1 OD x 1000 mL x flow ≈ 104 cells per variants). Finally, we needed to run a large number of PCR reactions per sample for the preparation of sequencing libraries to ensure that the libraries were amplified from enough template molecules. A lower number of template molecules would increase the probability of several reads coming from the same initial PCR template molecule hence biasing the measurement of variant frequency in the population. Targeting 108 reads per sample, we decided to use 20 times more template molecules in the PCR reactions, that is a total of 2 × 109 per sample.

Construction of the trans library by mutagenic overlap-extension PCR and Gibson assembly

Mutagenic NNS primers were ordered from Sigma-Aldrich. One forward and one reverse primer were designed per codon, with 15 nt upstream and 15 nt downstream of the target codon.

PCR1 template DNA fragments encompassing the whole coding sequences but lacking one or the other universal/flanking primer binding sites were prepared by restriction digestion in order to limit the amplification of wild-type sequences from the template DNA carried over from PCR1 to PCR2 by the two universal primers. All templates were prepared by digesting pGD012 with BsrGI and Cfr10I (FOS 5’ template), BspHI (JUN 5’ template) or BamHI and NheI (FOS and JUN 3’ templates) and gel purified.

Since carried-over templates can still anneal and overlap-extend during PCR2, PCR1 was performed with a low template concentration. Specifically, two PCR reactions were set up per target codon, one using the 5’ template, the reverse mutagenic primer corresponding to the target codon and the 5’ universal primer (OGD145 and OGD138 for FOS and JUN mutagenesis, respectively) and another one using the 3’ template, the forward mutagenic primer corresponding to the target codon and the 3’ universal primer (OGD137 and OGD148, respectively). PCR reactions were performed with Kapa HiFi HotStart DNA polymerase (Kapa Biosystems, Wilmington, Massachusetts) according to manufacturer’s protocol in 25 μL with 0.15 fmol of template DNA and 7.5 pmol of each primer and using a melting temperature of 58˚C and an extension time of 1 min for 15 cycles. This resulted in two PCR fragments per target codon that overlap over 33 bp. The 5’ fragment started 169 or 142 bp upstream of the CYC promoter (for FOS and JUN, respectively) and encompassed the CYC promoter, DHFR fragment, linker and first half of the gene until 15 bp downstream of the target codon. The 3’ fragments started 15 bp upstream of the target codon and encompassed the rest of gene and a fragment of the CYC terminator (the 3’ and 5’ fragments for FOS and JUN, respectively, overlapping over 40 bp for Gibson assembly).

The two overlapping PCR products of each target codon were then pooled together (20 μL each) and purified using QIAEX II beads as follows. 120 μL of QX1 buffer, 1.5 μL of beads and 10 μL of 3 M NaOAc were added to the 40 μL of PCR mix. The mixtures were vortexed, incubated 10 min at room temperature and spun for 2 min at 3184 g (max speed) in an Eppendorf Centrifuge 5810 R. The beads were washed twice with 100 μL of buffer PE and the DNA fragments were eluted in 30 μL buffer EB after a 45 min room temperature incubation.

The second PCRs were performed with Kapa HiFi HotStart DNA polymerase (Kapa Biosystems, Wilmington, Massachusetts) according to manufacturer’s protocol in 25 μL with 10 μL of the eluted DNA fragments, 7.5 pmol of each universal primer (OGD137-OGD145 or OGD148-OGD138 for FOS and JUN, respectively) and using a melting temperature of 53˚C and an extension time of 1 min 30 s for 15 cycles. 3 μL of the PCR product of each target codon were analyzed on an agarose gel to ensure the correct overlap extension. The concentrations of the PCR products of correct size were estimated by four replicate measurements for each target codon using Quant-iT PicoGreen dsDNA Assay Kit (Life Technologies, Carlsbad, California) according to manufacturer’s protocol and corrected by the intensity ratio of the correct band over the whole well measured on the agarose gel. The 32 PCR products corresponding to the same gene were then pooled at equimolar ratio, resulting in two libraries of mutated PCR products at target codon positions, one for FOS and one for JUN.

The backbone was amplified by PCR using Q5 Hot Start High-Fidelity DNA Polymerase (New England Biolabs, IpswichMassachusetts) according to the manufacturer’s protocol in 8 × 50 μL with 2 ng of pGD012 template DNA and 22.5 pmol of each primer (OGD144 and OGD153) per 50 μL reaction and using a melting temperature of 58˚C and an extension time of 2 min 30 s for 24 cycles.

The two mutated PCR product libraries overlapped each other over 40 bp in the CYC terminator. They also each overlapped upstream of the promoter with the backbone over 40 bp. These Gibson assembly junctions were chosen to minimize the effect of indels on protein activity that can result from incorrect assembly. All three DNA fragments were gel purified using the QIAquick gel extraction kit (Qiagen, Netherlands) and dialyzed to remove extra salt using nitrocellulose membranes with 0.025 μm pores (Merck Millipore). The three fragments were assembled by Gibson assembly in 800 μL with 1.2 pmol of backbone fragment and 3 pmol of each of the mutated gene fragments. The master mix was split into 16 × 50 μL reactions and incubated 17 hr at 50˚C. The 16 reactions were then pooled back, purified on column using QIAquick PCR purification kit (Qiagen, Netherlands), eluted in 2 × 30 μL of EB buffer, concentrated down to 29 μL in a speedvac and stored at −20˚C until use.

Test transformations revealed that a 2X dilution gave the highest number of transformants. 5 μL of concentrated DNA assembly were thus diluted with 5 μL of distilled water and transformed in two batches into NEB 10-beta Electrocompetent E. coli as follows. For one batch, four transformations were set up by mixing 1 μL of DNA to 25 μL of competent cells in a 0.1 cm Gene Pulser Cuvette (Bio-Rad, Hercules, California) chilled on ice. The four mixtures were electroporated using a Gene Pulser Xcell (Bio-Rad, Hercules, California) at 2.0 kV, 200 Ω and 25 μF. After each electroporation, the cells were immediately re-suspended in 1 mL SOC medium (provided by the manufacturer) warmed to 37˚C and the cuvettes were rinsed with another 1 mL of the same medium. The four electroporations were then pooled together and incubated for 30 min at 37˚C. After this recovery period, 10 μL were diluted 100 X in the same medium and 10 μL were plated on pre-warmed LB +Amp Petri dishes and incubated for ~16 hr at 37˚C. The remaining ~7.7 mL of cells were added to 200 mL of pre-warmed LB +400 μg/mL in 1L flasks and incubated for ~16 hr at 37˚C under constant agitation at 200 rpm. This resulted in a total of 11.6e6 transformant per batch. Each batch of 200 mL was then split in four 50 mL Falcon tubes, cells were harvested by centrifugation for 15 min at 3184 g (max speed) in an Eppendorf Centrifuge 5810 R and pellets were stored at −20˚C for later use. One pellet of each batch was then thawed and plasmid DNA was extracted using Qiagen Plasmid Midi Kit. The two batches were then pooled at equimolar ratio to form the plasmid DNA library.

Large-scale yeast transformation

Yeast transformation was performed in triplicates in parallel following a modified version of the protocol found in Melamed et al. (2013). First, a glycerol stock of the BY4742 strain was streaked to single colonies on YPD Petri dishes and incubated at 30˚C for 2 days. For each replicate, a whole colony was resuspended in 25 mL YPDA and grown overnight to saturation at 30˚C under constant agitation at 200 rpm. The next day, three flasks of 700 mL of pre-warmed YPDA were inoculated with each of the pre-culture at a starting OD600nm of 0.125 and incubated at 30˚C under constant agitation at 200 rpm. After 6 hr 30 min of incubation, the cells were harvested by centrifuging 5 min at 3000 g in a Avanti J-26 XP Centrifuge using a JLA 8.1000 rotor (Beckman-Coulter, Brea, California). Cells were washed once with 250 mL sterile distilled water, re-suspended in 25 mL SORB, transferred to 50 mL Falcon tubes and completed to 50 mL with SORB. The cells were harvested by centrifugation 5 min at 3000 g in an Eppendorf Centrifuge 5810 R, re-suspended in 34.3 mL SORB and incubated at room temperature on a wheel for 30 min. Meanwhile, 2.2 mL of 10 mg/mL salmon sperm DNA (Agilent Technologies, Santa Clara, California) were thawed, boiled for 5 min in a heating block and cooled on ice for 2 min. When the cells were ready, 56 μg of plasmid DNA library and 700 μL salmon sperm DNA were added to each replicate and mixed by inverting the tubes six times. Each trans replicate was then split in 4 × 8.75 mL in 50 mL Falcon tubes for a total of 12 tubes, to which 35 mL of plate mixture were added. The Falcon tubes were incubated at room temperature for 30 min on a SSM4 mini seesaw rocker (Stuart Equipment, United Kingdom) at 30 oscillations/min. Then, 3.5 mL of DMSO were added to each tube and mixed in by inverting the tubes six times. The cells were heat-shocked for 20 min at 42˚C in a water bath. To homogenize the temperature faster, the tubes were inverted after 30 s, 1 min, 2 min 30 s, 5 min, 7 min 30 s, 10 min and 15 min. The cells were then harvested by centrifugation for 5 min at 1791 g (3000 rpm) in an Eppendorf Centrifuge 5810 R (Eppendorf, Hamburg, Germany). The supernatant was first poured out of the tubes, the tubes were then quick-spun for 10 s to bring the remaining liquid down, which was then removed by aspiration with a vacuum pump. The cells from each tube were re-suspended in 10 mL of pre-warmed recovery medium. The four tubes of the same replicate were pooled together and added to 1.4 L of pre-warmed recovery medium in 5 L flasks. Recovery medium were incubated for 1 hr at 30˚C under constant agitation at 200 rpm. The cells were then harvested by centrifugation at 3000 g for 5 min in a Avanti J-26 XP Centrifuge using a JLA 8.1000 rotor (Beckman-Coulter, Brea, California), washed once with 200 mL of SC –ura and added to 1.4 L of SC –ura in 5 L flasks. After homogenization by stirring, 10 μL were platted on SC –ura Petri dishes and incubated for ~48 hr at 30˚C to measure transformation efficiency. The liquid cultures were grown to saturation for ~48 hr at 30˚C under constant agitation at 200 rpm. This transformation resulted in 19.88 × 106, 19.32 × 106 and 16.52 × 106 transformants for replicates 1, 2 and 3.

Competition assay

After the first cycle of post-transformation selection, a second selection cycle was performed for each replicate by inoculating 1 L of SC –ura/ade/met at a starting OD600nm of 0.1 with the saturated culture and growing it to exponential phase (OD600nm of ~1–1.2) for 12 hr 30 min at 30˚C in 5 L flasks under constant agitation at 200 rpm. To start the first cycle of the competition, the volume of culture necessary to inoculate 1 L of competition medium at an OD600nm of 0.05 was centrifuged for 5 min at 3,000 g in an Eppendorf Centrifuge 5810 R. Cells were re-suspended in 10 mL of competition medium containing methotrexate and added to 1 L of the same medium and incubated at 30˚C in 5 L flasks under constant agitation at 200 rpm. Meanwhile, the remainder of the cultures were harvested by centrifugation for 5 min at 3,000 g in a Avanti J-26 XP Centrifuge using a JLA 8.1000 rotor (Beckman-Coulter, Brea, California), washed with distilled water, transferred to 50 mL Falcon tubes and the pellets were stored at −20 for later use. After ~20 hr 50 min of growth, the three replicates reached an OD600nm of 1.57, 1.60 and 1.53 respectively. A second cycle of competition was performed in the same conditions by diluting back the cultures to an OD600nm of 0.05 in 1L of competition medium. The remainder of the cultures were harvested and stored as described above. After 16 hr of growth, the cultures reached an OD600nm of 2.66, 2.49 and 2.55, respectively. The whole cultures were harvested and stored as described above.

DNA extraction

The cell pellets of the second post-transformation selection cycle (i.e. input population) and second competition cycle (i.e. output population) were first thawed at room temperature, re-suspended in 10 mL of DNA extraction buffer, frozen for 10 min in an ethanol/dry ice bath and thawed for 10 min in a 62 water bath. After an additional freeze/thaw cycle, 10 g of acid-washed, 425–600 μm glass beads (Sigma-Aldrich, St. Louis, Missouri) and 10 mL of Phenol/Chloroform/Isoamyl alcohol 25:24:1 equilibrated in 10 mM Tris-HCl, 1 mM EDTA, pH 8.0 were added to each tube. Cells were lysed by vortexing for 10 min and the aqueous phase containing the total DNA was separated by centrifuging for 20 min at 3,184 g (max speed) in an Eppendorf Centrifuge 5810 R (Eppendorf, Hamburg, Germany). Another extraction was performed by adding 11 mL of Phenol/Chloroform/Isoamyl alcohol 25:24:1 equilibrated in 10 mM Tris-HCl, 1 mM EDTA, pH 8.0, vortexing 2 min and centrifuging 20 min at 3,184 g (max speed) in an Eppendorf Centrifuge 5810 R (Eppendorf, Germany). The aqueous phase was transferred to a new tube and nucleic acids were precipitated by adding 1/10 vol of 2M NaCl and 2.2 volumes of pre-chilled absolute ethanol and incubating for 30 min at −20˚C. NaCl was used instead of NaOAc to limit SDS precipitation. Nucleic acids were pelleted by centrifugation for 30 min at 3,184 g (max speed) in an Eppendorf Centrifuge 5810 R (Eppendorf, Germany) pre-chilled at 4˚C. The supernatants were removed and the pellets were put at 37˚C until dry and then re-dissolved in 6 mL of 10 mM Tris-HCl, 1 mM EDTA, pH 8.0. The dissolution was extremely slow, and after an overnight incubation at room temperature, the pellets were still not completely dissolved. The solution was thus centrifuged for 15 min at 3184 g (max speed) in an Eppendorf Centrifuge 5810 R (Eppendorf, Germany) to pellet the non-dissolvable material and the supernatants were transferred to new 14 mL Falcon tubes. RNAs were digested by adding 50 μL of RNAseA 10 mg/ml and incubating for 30 min at 37˚C followed by a 2 hr incubation at room temperature. DNA was then extracted by adding 6 mL of Phenol/Chloroform/Isoamyl alcohol 25:24:1 equilibrated in 10 mM Tris-HCl, 1 mM EDTA, pH 8.0. After centrifugation 10 min at 3184 g (max speed) in an Eppendorf Centrifuge 5810 R, the interphases were still thick and two other Phenol/Chloroform/Isoamyl alcohol 25:24:1 extractions were performed in the same way followed by an extraction with chloroform alone. DNA was precipitated by adding 600 μL of 2M NaCl and 13.2 mL of pre-chilled absolute ethanol and incubating for 20 min at −20˚C and then pelleted by centrifugation 30 min at 3184 g (max speed) in an Eppendorf Centrifuge 5810 R (Eppendorf, Germany) pre-chilled at 4˚C. To remove excess salt, 8 mL of 70% ethanol was poured on top of the pellets and incubated for 30 min at room temperature. After centrifugation 30 min at 3184 g (max speed) in an Eppendorf Centrifuge 5810 R (Eppendorf, Germany), the supernatant was removed, the tubes were sealed with an air-breathable seal and the pellets were let to dry overnight at room temperature. The next day, the dried DNA pellets were dissolved in 1 mL of 10 mM Tris-HCl, pH 8.0. The three replicates from competition cycle two appeared to be cloudy. To remove the non-dissoluble material, the six samples were centrifuged 5 min at 3184 g (max speed) in an Eppendorf Centrifuge 5810 R (Eppendorf, Germany) and the supernatants were transferred to 1.5 mL Eppendorf tubes (Eppendorf, Germany).

Sequencing library preparation

The sequencing libraries were constructed by two consecutive PCR reactions using a method adapted from Levy et al. (2015). The first PCR was designed to amplify the region of interest, that is from directly upstream of the first mutated codon in FOS to directly upstream of the first mutated codon in JUN (the two genes are in head-to-tail orientation). The first PCR also added Unique Molecular Barcodes (UMIs) and the first half of the illumina adapter sequences. A small number of cycles of the first PCR would limit the number of differently barcoded molecules that derive from the same template molecule. The second PCR would then add the remainder of the Illumina adapter sequences.

Plasmid concentrations in the total DNA extractions were first quantified by qPCR using primer pair OGD241-OGD242 that bind in ori region of the plasmid. For each six samples of each of the cis and trans libraries, respectively, 4 and 48 PCRs were performed using Q5 Hot Start High-Fidelity DNA Polymerase (New England Biolabs, Ipswich, Massachusetts) according to manufacturer’s protocol in 50 μL reactions with 4.2 × 107 molecules of plasmid from the DNA extraction, 25 pmol of primers OGD237 and OGD238, and a melting temperature of 66˚C (previously determined by temperature gradient), an extension time of 30 s or 1 min, respectively, for four cycles. A total of 2 × 109 molecules of plasmids were then used to prepare the sequencing libraries from each sample. Excess primers were removed by adding 2 μL of ExoSAP-IT (Affymetrix, Santa Clara, California) and incubating for 20 min at 37˚C followed by an inactivation for 15 min at 80˚C. This step was necessary because these 60 nt primers are not efficiently removed by the following column purification step. The 48 PCRs of each sample of the trans library were then pooled and purified using eight Qiagen PCR purification kit (Qiagen, Netherlands) columns per sample. According to manufacturer’s protocol, one column is able to bind 10 μg of DNA, which corresponds to ~8×108 genomes. Eight columns were then used to ensure that they are not saturated by the genomic DNA carried over from the DNA extraction. The DNA was eluted in 2 × 50 μL of EB buffer (provided by the manufacturer) and pooled for each sample. The eluted DNA was then split into 24 PCR reactions per, which were performed using Kapa HiFi HotStart DNA polymerase (Kapa Biosystems, , Wilmington, Massachusetts) according to manufacturer’s protocol in 50 μL with 15 pmol of illumina adapter primers. The reverse primers carried a different index for each of the six samples of the same library. For this PCR step, Kapa was chosen over Q5 because it was less efficient in the first PCR reactions (higher optimal melting temperature) and thus would lead to a lower re-barcoding of amplicons with new UMIs present on primers from the first PCR reaction that would have been carried over. Each sample was loaded on an agarose gel to check for correct amplification. A strong non-specific band of lower size was observed, which seemed to gradually disappear as the number of cycles in the first PCR was increased. However, increasing the number of cycles would increase the probability of producing amplicons with different UMIs that derived from the same original template molecule. The number of cycles in the first PCR was thus kept to four and the band of correct size was extracted by gel purification from each sample. To this end, the 24 PCRs of each sample were pooled, concentrated on four Qiagen PCR purification kit (Qiagen, Netherlands) columns per sample and each eluted with 2 × 50 μL of EB buffer (provided by the manufacturer). The bands of correct size were then purified on 2% agarose gel starting from 100 μL of each sample using 10 μL QIAEX II beads (Qiagen, Netherlands) according to manufacturer’s protocol and eluted in 20 μL EB buffer. DNA concentration was determined by picogreen in triplicates and the six samples were pooled at equimolar ratio. The pooled sample was sequenced in a single lane of an Illumina HiSeq2500 with 125 bp paired-end reads at the EMBL Genomics Core Facilities in Heidelberg, Germany.

Sequencing data analysis and filtering

A total of 245,676,608 read pairs were obtained from the lane. However, ~30% corresponded to adaptor sequences and had to be discarded. The high adaptor sequence reads level was probably due to the non-specific PCR amplification during the sequencing library preparation despite the gel purification. Sequencing data was filtered using homemade Perl scripts. Paired reads for which one of the two variable regions had an average Phred score below or equal to 20 were also discarded. Additionally, paired reads were also filtered-out if they had i) one or more non-resolved bases (Ns) in the variable or UMI regions, ii) more than one mutated codon in each gene’s variable region or iii) if the mutated codon ended in an A or T (since these were not encoded by the NNS mutagenic primers). These non-designed substitutions could have occurred during the library construction process before the competition and thus lead to accurate PPI score. However, their low frequency suggested they were more likely to be the results of amplification errors during sequencing library preparations and/or sequencing errors and were thus filtered-out in the interest of stringency and higher quality. These filtering steps resulted in between 1.08 and 1.42 × 107 usable reads per replicate input and output.

To estimate sequencing errors, we measured the per base error probability in the constant region upstream of the variable regions of FOS and JUN that were used as primer annealing sites for library amplification. These sequences should be 100% wild type and any mutations called should be the result of a sequencing error. We estimate the probability of a base to be wrongly called to be 0.0019 for the forward read and 0.0058 for the reverse read, averaged across the six samples (three replicates inputs and outputs). Sequencing errors do not seem to affect the downstream estimate of PPI scores as filtering reads at a Phred score of 30, 35 or additionally filtering out reads where at least one of the called mutations had a Phred score below 30 (>90% of the wrongly called mutations in the constant region had a Phred score below 30 compared to <25% of the correctly called bases) led to PPI scores that were highly correlated to the ones calculated using the above filters (Figure 1—figure supplement 1A).

PPI score calculation and correction for background growth

The frequency of each variant in each sample was then calculated by counting the number of unique UMIs associated to each unique amino acid variant (wild-type (WT), single or double mutant) divided by the total number of unique UMI per variant in the corresponding sample. First, nucleotide sequences were converted to amino acid sequences and the UMIs of different nucleotide variants coding for the same amino acid sequences were summed up. PPI scores were then calculated for each variant according to the formula provided in Figure 1B.

Variants with zero UMI in any of the three outputs were filtered out. Even though variants with lethal mutations could be expected to go to 0, they cannot go further down while their actual frequency in the population is still decreasing (e.g. they could still not be detected even if sequencing 10X or 100X deeper). Their PPI scores thus reach a boundary that is a function of the input UMI count as illustrated in Figure 1—figure supplement 1B. These variants could be misidentified as having intermediate effects or even neutral effects while they are actually lethal. On the contrary, variants with at least one UMI in each replicate output have been consistently detected. Hence, even though the estimate of their PPI score might be associated to a high error due to the stochasticity of low read counts, their actual PPI score should lie within a range around the estimates (e.g. sequencing 10X or 100X deeper would lead to read counts of ~10 or ~100, respectively).

Variants with low input UMIs are also typically associated with a higher error due to stochastic effects. Moreover, these variants are biased toward intermediate or neutral effect due to the PPI score boundary (Figure 1—figure supplement 1B). We thus filtered out variants with less than 10 UMIs in any of the three input replicates. The WT and variants containing stop codons were also excluded from downstream analyses.

In PCA, cells that do not express the DHFR fragments or express protein fusions that do not interact can initially divide for a few generations. This background growth is probably the result of intracellular stores of folate and other metabolites that can be used for growth before they are depleted. To correct for batch effects in this background growth, PPI scores were corrected to align the mode of the detrimental variants (left most mode in the bimodal distribution of PPI scores), which is assumed to correspond to the background growth. This mode was identified in each replicate using the R density function. PPI scores were then aligned to the maximal value of this mode across replicates to avoid the possibility of getting negative values for variants with PPI scores in the left-hand tail of the distribution. This was done according to the following formula, which keeps the WT PPI score as one in all replicates:

correctedPPIscore=PPIscore-m-M1-MPPIscorewt-m-M1-M

where m is the mode in the corresponding replicate and M is the maximal value of the modes across all replicates.

Small-scale confirmation of single mutant effects

To ensure that growth rate measured by deep sequencing in our assay agree with actual growth rate, we reconstructed 14 variants (one wild-type interaction and 13 single mutants) chosen randomly across the range of PPI scores and measured their growth rate individually in a TECAN plate reader (TECAN, Switzerland). The 14 mutants were constructed individually by OE-PCR in the same way as the large-scale library but using fixed sequence mutagenic primers instead of NNS primers (Supplementary file 5). Each clone was sequence-confirmed and transformed in yeast using a scaled-down version of the transformation protocol used for the large-scale assay.

Two colonies of each variant were grown overnight in 100 μL of SC –ura/ade/met medium at 30C in microtiter plate with constant shaking. The next day, precultures were diluted to an OD600nm of 1 and 5 uL was added to a final volume of 100 μL of MTX medium in a microtiter plate. Growth curve were measured in a TECAN infinite M200 (TECAN, Switzerland) with OD600nm measurements every 15 min for 50 hr.

PPI scores were calculated in a similar way than the ones from the large-scale assay. For each variant, the number of doubling was calculated between the beginning of the assay and the time point at which the wild-type interaction reached its maximal growth rate (approximating the point at which cells were harvested in the large-scale assay). The PPI score was then calculated as the number of doublings of a variant relative to the number of doubling of the wild-type.

Classification of mutant effects

Thresholds based on the distributions of PPI scores averaged across replicates were used to classify variants as detrimental (PPI score <= 0.64), intermediate (0.64 < PPI scores<=0.96), neutral (0.96 < PPI scores<=1.04) or strengthening (PPI scores > 1.04). Additionally, one sample t-tests were performed to test the difference of the mean PPI score of each variant across replicates from the WT PPI score of 1. False discovery rates were controlled using the Benjamini-Hochberg method as implemented in the R function p.adjust. The p-value threshold to call variants significantly different from WT was set so that the FDR was inferior to 0.05. The robustness of all downstream analyses was analyzed by varying these PPI score thresholds.

Prediction of single mutant effects from amino acid physic-chemical features

Fos and Jun single mutant effects were predicted at each of the 32 mutated positions from amino acid features retrieved from the amino acid index database (Kawashima et al., 2008). Only features without any missing value for the 20 amino acids were considered. PPI score of mutations at each position were predicted with each feature independently using R’s built-in lm function.

Comparisons of identical substitutions in Fos and Jun

The differences in PPI scores between identical mutations at the same position in Fos and Jun were assessed using a paired t-test. False discovery rates were controlled using the Benjamini-Hochberg method as implemented in the R function p.adjust. The enrichment for mutations that had an intermediate effect in either protein was tested using Fisher’s exact test.

Multiplicative genetic interactions

Multiplicative genetic interactions were computed as the difference between the PPI scores measured for the double mutant and the multiplicative expectation, that is the product of the PPI scores of the two corresponding single mutants. This was done independently in each of the three replicates, and an average genetic interaction score was calculated for each variant. A one-sample t-test was performed to test whether this average was significantly different from 0. FDRs were computed as follows. In each of 10,000 permutations, the genetic interaction scores were randomized within replicates independently. The one-sample t-test was then performed again. At any given p-value threshold, hits in the permutated data can be considered as false discoveries that are significant just by chance by randomly drawing values from this distribution of genetic interaction scores. At each p-value observed in the original data, the FDR was hence computed in each permutated data set as the number of false discoveries divided by the total number of discoveries in the real data. The standard error of the average FDR was then computed in the same way at each p-value threshold, but was negligible (2.5e−5 at a FDR of 0.2).

Thermodynamic model

The thermodynamic model was built by fitting the quadratic solution for the dissociation constant, KD, of the interaction:

eΔGRT=KD= AF × BFAB= (ATAB) × (BTAB)AB (1)

with AB being the concentration of the dimeric complex, AF the concentration of the free form of protein A, BF the concentration of the free form of protein B, AT the total concentration of protein A and BT the total concentration of protein B.

Assuming that the growth rate of a cell expressing a variant is directly proportional to the concentration of protein complex so that

n=AB (2)

with n being the number of generations during competition. Cells with variants harboring lethal mutations such as stops are still able to grow for a few generations at the beginning of the competition, even in the presence of methotrexate, as highlighted by the mode of the peak of detrimental variants in Figure 3A. This background growth might be allowed before the depletion of intracellular stores of tetrahydrofolate or downstream metabolites. We hence introduced a parameter b to account for background growth. PPI score (PPI) can then be expressed as

PPI= n+bnwt+b= AB+bABwt+b  AB=PPI × ABwt+b×(PPI1) (3)

Replacing (3) in (1),

eΔGRT= (ATPPI × ABwtb×(PPI1)) × (BTPPI × ABwtb×(PPI1))PPI × ABwt+b×(PPI1)
= ABwt×(ATABwtPPIbABwt×(PPI1)) × (BTABwtPPIbABwt×(PPI1))PPI+bABwt×(PPI1)
= ABwt×(ATABwt+bABwtPPI×(bABwt+1)) × (BTABwt+bABwtPPI×(bABwt+1))PPI×(bABwt+1) bABwt
=ABwt×(bABwt+ATABwtbABwt+1  PPI) × (bABwt+BTABwtbABwt+1  PPI)PPI×(bABwt+1) bABwt
ΔG=RT × log(ABwt×(bABwt+ATABwtbABwt+1  PPI) × (bABwt+BTABwtbABwt+1  PPI)PPI×(bABwt+1) bABwt   )
ΔG=RT × log(ABwt×(X  PPI) × (Y  PPI)PPI×(K+1) K   ) (4)

With K=bABwt, X=bABwt+ATABwtbABwt+1 and Y=bABwt+BTABwtbABwt+1

Equation (4) expresses ΔG (or, more accurately, its approximation in arbitrary units) as a function of PPI scores with the terms ABwt, X, Y and K. The smallest of the two terms X and Y represents the maximal PPI score, including background growth, that a variant can achieve when all the molecules of the limiting factor are in complex and assuming no change in total abundance (multiplying both side of the fraction by ABwt returns back to equation 3 with AT or BT replacing AB).

It is more practical to express PPI score as a function of ΔΔG rather than ΔG:

ΔΔG= ΔG ΔGwt
=RT × log(ABwt×(X  PPI) × (Y  PPI)PPI×(K+1) K   ABwt×(X  PPIwt) × (Y  PPIwt)PPIwt×(K+1) K   )

The PPI score of the wild-type is one by definition, and ABwt cancels out from the numerator and the denominator, leading to

ΔΔG= RT × log(1(PPI×(K+1) K)×(X  PPI) × (Y  PPI)(X  1) × (Y  1) ) (5)

Rearranging Equation (5),

(XPPI )×(YPPI ) (PPI×(K+1)K)×(X1 )×(Y1 )× eΔΔGRT=0
PPI2 PPI×((X1 )×(Y1 )×(K+1)×eΔΔGRT+X+Y)+K×(X1 )×(Y1 )×eΔΔGRT+XY=0 (6)

Equation (6) is a quadratic equation with two solutions but a single relevant solution (the other root would lead to PPI scores higher than the maximum dictated by the free parameters):

PPI=12×((X1)×(Y1)×(K+1)×eΔΔGRT+X+Y((X1 )×(Y1 )×(K+1 )×eΔΔGRT+X+Y)24 × (K×(X1 )×(Y1 )×eΔΔGRT+XY) ) (7)

According to the null expectation, the ΔΔG of a double mutant d is the sum of the ΔΔG of the two corresponding single mutants s1 and s2 (Horovitz, 1996), such that

ΔΔGd=ΔΔGs1+ΔΔGs2= RT × log((XPPIs1 ) × (YPPIs1 )(PPIs1×(K+1) K)  ×(XPPIs2 ) × (YPPIs2 )(PPIs2×(K+1) K)  × 1((X1 ) × (Y1 ))2) (8)

Replacing Equation (8) in Equation (7) leads to:

PPId= 12×((K+1 )×(XPPIs1 )×(YPPIs1 )×(XPPIs2 )×(YPPIs2 )(PPIs1×(K+1) K)×(PPIs2×(K+1) K)×(X1 )×(Y1 ) +X+Y((K+1 )×(XPPIs1 )×(YPPIs1 )×(XPPIs2 )×(YPPIs2 )(PPIs1×(K+1) K)×(PPIs2×(K+1) K)×(X1 )×(Y1 ) +X+Y)24×( K×(XPPIs1 )×(YPPIs1 )×(XPPIs2 )×(YPPIs2 )(PPIs1×(K+1) K)×(PPIs2×(K+1) K)×(X1 )×(Y1 )+XY) ) (9)

Equation (9) represents the thermodynamic model that predicts the PPI score of the double mutant as a function of the PPI scores of the two corresponding single mutants and has three free parametersATABwt, BTABwt andbABwt.

The model was fitted to the data with an exhaustive parameter search from 1.1 to 2 with a step of 0.01 forATABwt and BTABwt and from 0.3 to 1.2 with a step of 0.001 for bABwt. These intervals were chosen based on the distribution of PPI scores, for example a background growth of > 1.2 would not make sense since it would mean that everything is background growth. As can be seen in Figure 3—figure supplement 2A, it is fair to assume that the optimal fit lies within these intervals. In the cases where one or both single mutants had a PPI score, respectively, below or above the bottom or top asymptotes defined by equation (7) and illustrated on Figure 3D, equation (9) would give infinite or aberrant (negative) values. The corresponding predicted double mutant PPI score was hence set to the minimal and maximal values, respectively, allowed by the function ((bABwt)/(bABwt+1) and min(ATABwt, BTABwt)/(bABwt+1), respectively).

For the trans library alone (first part of the paper), the model was fitted on the three replicates pooled together and the parameter set leading to the lowest percentage of variance explained between the observed and predicted double mutants PPI scores was then chosen. To ensure that the model was not over-fitting, a Monte-Carlo cross-validation was performed as follow. One half of the Fos and Jun single mutants were randomly sampled and the corresponding double mutants were used as a fitting set. The double mutants corresponding to the other halves of the Fos and Jun single mutants constituted the test set, to ensure independency of the two sets. The model was then fitted with an exhaustive parameter search from 1.1 to 1.6 with a step of 0.01 forATABwt and BTABwt and from 0.3 to 1 with a step of 0.01 for bABwt. The procedure was repeated 1000 times and the median value of the three fitted free parameters matched the values obtained from fitting the model on the whole dataset, indicating no over-fitting. The median of the proportion of variance explained for the 1000 test sets differed by only 0.001 with a standard error of 1.2 × 10−4 across the 1000 random samples (Figure 3—figure supplement 2B).

Calculation of structural genetic interaction scores

Genetic interaction scores for the three replicate of each double mutant were calculated as the difference between the observed and the predicted PPI scores and averaged across the three replicates. To test whether the genetic interaction scores were significantly different from 0, a one-sample t-test was performed. p-Values were then corrected using a permutation procedure as described above for genetic interactions calculated from the multiplicative model.

Enrichment in strong structural genetic interactions per pair of positions, between position types and at different distance thresholds

Enrichments for cases of either strong positive or strong negative genetic interactions (absolute genetic interaction score > 0.1 and FDR < 0.2) were tested using Fisher’s exact test. FDRs were controlled independently for enrichment in positive and negative genetic interactions and for each feature tested. Significant enrichments or depletions were determined at an FDR of 10%. Distance between mutated positions were measured from the crystal structure (pdb 1fos, [Glover and Harrison, 1995]). The smallest distance between side chain atoms was used. The robustness of these enrichments was tested by repeating this analysis using different absolute genetic interaction score cut-offs.

cis library

Experimental design

For the cis library, FOS was mutated by doped oligonucleotide synthesis instead of using NNS mutagenic primers, which would have required an iterative mutagenic process. A doping rate of 2.1% was chosen to maximize the frequency of double nucleotide mutants at 0.274, or 6.66 × 10−6 for each of the 41,040 double mutants. Accounting for ~ 25% of non-workable, indel-containing sequences, flow was estimated at ~ 5×10−6. Using the same rule of thumb than for the trans library, we targeted 10/flow = 2 × 106 transformants in bacteria and yeast. Selections and competitions were performed in 200 mL in order to have a bottleneck size and a number of generations similar to the trans library during the competition assay (cell density of 107 cells / OD/ mL * 0.05 OD * 200 mL * flow ≈ 500 cells per variant). However, the six samples (three replicates input and output) were still sequenced on a whole lane, leading to expected increase in coverage per double mutant (~30M reads per library * flow ≈ 200 reads per sample). A total of 2 × 108 template molecules per sample were targeted for the sequencing library preparation.

Construction of the cis library by Gibson assembly of a doped oligonucleotide

The doped oligonucleotide was purchased from TriLink and cloned into pGD012 by Gibson assembly. The backbone fragment was prepared as for the trans library. The upstream FOS fragment and terminator + JUN fragment were amplified by PCR using using Kapa HiFi HotStart DNA polymerase (Kapa Biosystems, Wilmington, Massachusetts) according to the manufacturer’s protocol, each in 4 × 50 μL with 1 ng of pGD012 template DNA and 15 pmol of each primer (OGD145-OGD271 and OGD138-OGD272) per 50 μL reaction and using a melting temperature of 63˚C and 58˚C, respectively, and an extension time of 1 min for 24 cycles. The PCR products of correct sizes were purified on agarose gel using QIAEX II (Qiagen, Netherlands) beads according to manufacturer’s protocol, using 10 μL beads and eluted in 2 × 20 μL of EB buffer. The doped oligonucleotide was prepared by complementary strand synthesis using Q5 Hot Start High-Fidelity DNA Polymerase (New England Biolabs, Ipswich, Massachusetts) according to manufacturer’s protocol with 12 pmol of template doped oligonucleotide and 12.5 pmol of primer OGD270 per 25 μL reaction. A temperature gradient of 12 reactions between 50˚C and 72˚C was first performed to determine the optimal hybdrization temperature, with an extension time of 30 s and 32 cycles to make sure all complementary strands are synthesized. Melting temperature didn’t seem to affect specificity and the double stranded DNA from the first eight reactions were each purified on a lane of an E-gel SizeSelect 2% (Invitrogen, Carlsbad, California), pooled and then concentrated on minElute column (Qiagen, Netherlands) according to manufacturer’s protocol and eluted in 2 × 10 μL EB buffer.

The four fragments were assembled by Gibson assembly in 50 μL with 137.5 fmol of backbone fragment, 343.75 fmol (2.5X) of each of the upstream and downstream fragments and 1.375 pmol (10X) of the double stranded doped oligonucleotide. After a 24 hr incubation at 50˚C, the reaction was dialyzed for 1 hr on nitrocellulose membranes with 0.025 μm pores (Merck Millipore, Burlington, Massachusetts). A single batch of transformation was realized as described for the trans library and pooled for 25 min recovery (to account for the time it takes to do all eight electroporations and ensure cells from the first electroporation didn’t divide yet). After this recovery period, 2 μL were diluted 100 X in the same medium and 100 μL were plated on pre-warmed LB + Amp Petri dishes and incubated for ~16 hr at 37˚C. The remaining ~ 16 mL of cells were added to 400 mL of pre-warmed LB + 400 μg/mL in 1L flasks and incubated for ~16 hr at 37˚C under constant agitation at 200 rpm. This resulted in a total of ~1.75×106 transformants. The cells were harvested and the plasmids were extracted as described for the trans library.

Large-scale yeast transformation

The transformation protocol was further optimized between the transformation of the trans and cis libraries and scaled down for the latter since it is of lower complexity. First, a glycerol stock of the BY4742 strain was streaked to single colonies on YPD Petri dishes and incubated at 30˚C for 2 days. For each replicate, a whole colony was re-suspended in 20 YPDA for the cis and trans libraries, respectively, and grown overnight to saturation at 30˚C under constant agitation at 200 rpm. The next day, three flasks of 175 mL of pre-warmed YPDA were inoculated with each of the pre-culture at a starting OD600nm of 0.3 and incubated at 30˚C under constant agitation at 200 rpm. After 4 hr of incubation, the cells were harvested by centrifuging for 5 min at 3000 g in an Avanti J-26 XP Centrifuge using a JLA 8.1000 rotor (Beckman-Coulter, Brea, California). Cells were washed once with 50 mL sterile distilled water, re-suspended in 10 mL SORB, transferred to 50 mL Falcon tubes and completed to 25 mL with SORB. The cells were harvested by centrifugation 5 min at 3000 g in an Eppendorf Centrifuge 5810 R (Eppendorf, Germany), re-suspended in 8.6 mL SORB and incubated at room temperature on a wheel for 30 min. Meanwhile, 1.1 mL of 10 mg/mL salmon sperm DNA (Agilent Technologies, Santa Clara, California) were thawed, boiled for 5 min in a heating block and cooled on ice for 2 min. When the cells were ready, 3.5 μg of plasmid DNA library, 175 μL salmon sperm DNA and 35 mL of plate mixture were added to each replicate and mixed by inverting the tubes six times. The Falcon tubes were incubated at room temperature for 30 min on a SSM4 mini seesaw rocker (Stuart Equipment, United Kingdom) at 30 oscillations/min. Then, 3.5 mL of DMSO were added to each tube and mixed in by inverting the tubes six times. The heat-shock procedure was identical as for the trans library. The cells from each tube were re-suspended in 10 mL of pre-warmed recovery medium and added to 350 mL of pre-warmed recovery medium in 1 L flasks. Recovery medium were incubated for 1 hr at 30˚C under constant agitation at 200 rpm. The cells were then harvested by centrifugation at 3000 g for 5 min in a Avanti J-26 XP Centrifuge using a JLA 8.1000 rotor (Beckman-Coulter, Brea, Californi), washed once with 25 mL of SC –ura, respectively, and added to 350 mL of SC –ura in 2 L flasks. After homogenization by stirring, 10 μL were platted on SC –ura Petri dishes and incubated for ~48 hr at 30˚C to measure transformation efficiency. The liquid cultures were grown to saturation for ~48 hr at 30˚C under constant agitation at 200 rpm. This transformation resulted in 1.17 × 107, 9.6 × 106 and 1.09 × 107 transformants for replicates 1, 2 and 3, respectively.

Competition assay

The competition was performed in the same way as for the trans library, except for the following minor changes. The post-transformation selection cycle and the two cycles of competition were performed in 200 mL in 500 mL flasks (the cis library is of lower complexity hence smaller volumes are sufficient). The first cycles of competition were grown to an OD600nm of 1.585, 1.615 and 1.65 for the three replicates, respectively. The second cycle of competition was inoculated at an OD600nm of 0.1 and the three replicates were harvested at an OD600nm of 1.475, 1.59 and 1.49, respectively. Finally, 150 mL of the post-transformation selection and competition cultures were harvested at the end of the cycles for DNA extraction.

DNA extraction

The cis library was processed in the same way as the trans library, except for the following minor changes. Because the DNA was extracted from 150 mL instead of 1 L cultures, all the volumes were scaled down by 0.15 X. Cell lysis and the two phenolic extractions were performed the same way. In between the processing of the two different libraries, the protocol was simplified and improved to get rid of the residual SDA and impurities more efficiently. DNA was first precipitated in the same way as previously but using 0.1 volumes of 3 M NaOAc instead of 2 M NaCl. After pelleting and drying overnight, the pellets were re-suspended in 900 μL TE buffer. As for the trans samples, the pellets didn’t re-suspend well and a lot of insoluble material remained. The samples were treated with 7.5 μL RNAseA 10 mg/ml and incubated 30 min at 37˚C. DNA was then purified using 75 μL QIAEX II beads (Qiagen, Netherlands) per sample following the manufacturer’s protocol. DNA was eluted in 2 × 187.5 75 μL after 5-min incubation at 60˚C.

Sequencing library preparation

The preparation of the cis library for sequencing was performed the same way as for the trans library, except for the following minor changes. First, only the Fos region was amplified, from directly upstream of the first mutated codon to directly downstream of the last mutated codon. For the first round of PCR, only 4 × 50 μL reactions per sample were performed, with 50 × 106 molecules of plasmid in each reaction and primers OGD277 and OGD278 at a melting temperature of 56˚C and an extension time of 30 s. The four PCRs from the same sample were pooled and purified using 12 μL (6 μg) QIAEX II beads (Qiagen, Netherlands). The eluted DNA was then split into two PCR reactions per sample, which were performed in the same way as for the trans library. The two PCRs of each sample were then pooled but not further concentrated. Relative DNA concentrations were measured as for the trans library but directly from the PCR reactions. The six samples were then pooled at equimolar ratio and the correct band was purified from 100 μL on a 1% agarose gel using 4 μL of QIAEX II beads (Qiagen, Netherlands). The pooled library was sequenced in a single lane of an Illumina HiSeq2500 with 125 bp paired-end reads at the EMBL Genomics Core Facilities in Heidelberg, Germany.

Sequencing data analysis and filtering

A total of 262,393,254 read pairs were obtained from the lane, with ~ 12% adapter sequences. However, these were not filtered at this step. Paired-end reads, which were overlapping over the full length of the variable region, were assembled using PEAR version 0.9.6 (Zhang et al., 2014) with a p-value threshold for correct assembly of 0.05, a maximum and minimum fragment length of 150 nt, and a minimum overlap size of 100 nt. These parameters force the assembly of sequences of the unique expected length. An average of 30% and 20% of the paired end reads from the three input and output libraries, respectively, were not assembled and filtered out. These included the adapter sequences and indels (resulting from the <100% coupling efficiency inherent to DNA synthesis). Additionally, assembled reads with Ns in any part of the sequence were filtered out. These filtering steps resulted in between 3.0 and 3.7 × 107 usable reads per replicate input and output.

PPI score calculation

PPI scores were calculated in the same way as for the trans library. However, all amino acid variants with more than two substitutions were filtered out along with amino acid substitution that require more than one nucleotide substitution from the WT sequence. This was required because double amino acids mutants for which one or both of the two mutated codons were more than one nucleotide away from the WT codon had a lower sequencing coverage and were hence less reliable. However, frequencies were calculated accounting for the total number of different UMIs of the corresponding replicate, that is accounting for all the other variants present in the sequenced population including those with more than two mutations. As for the trans library, variants with less than 10 UMIs in any of the three input replicates or without any UMI in any of the three output replicates were filtered out at this step. The WT and variants containing stop codons were also excluded from downstream analyses.

Restricting the trans library for direct comparison to the cis library

For a fair comparison of mutation effects and genetic interaction patterns between the two libraries, all analyses for the trans library were repeated after filtering out all variants for which the amino acid substitutions require more than one nucleotide change from the WT amino acid.

Correction for background growth

The mode of detrimental variants was adjusted in the same way as for trans library to correct for batch effects in background growth. The two libraries, that is the cis and the restricted trans, were adjusted to the same value, that is the maximal value of the mode across the three replicates of the two libraries.

Classification of mutant effects

Classification was performed using the same cut-offs as for the non-restricted trans library. However, FDR was re-computed for the restricted trans library to take into account the ~8 x decrease in the number of tests.

Matching of the single mutant distributions between the cis and trans libraries

The differences in double mutants effects and pattern of genetic interactions in the cis and trans libraries can be the result of i) biases in double mutant compositions of the two libraries even after restricting the trans library to amino acid substitutions reachable through single nucleotide changes; ii) differences in single mutant effect distributions, that is combining two Fos single mutants has a different multiplicative and thermodynamic expectation than combining one Fos and one Jun; and iii) differences in structural genetic interactions in cis and trans. To control for the two effects, the two libraries were sub-sampled as follows to match the effects of their single mutant distributions. For each of 1000 sub-samplings, the two libraries were each binned in two dimensions according to the PPI scores of their constituting single mutants. Each dimension was binned into 20 evenly spaced bins from the lowest to the highest single mutants PPI score measured in the two libraries (0.477 and 1.082, respectively). For the cis library, the two single mutants constituting each double mutant were assigned to one or the other dimension randomly in each sub-sampling. For each two-way bin, the double mutants belonging to the library that was the most represented in the bin were randomly sub-sampled to match the number of double mutants belonging to the same bin in the other library. This resulted in identical distributions of single mutants PPI scores according to these binning breaks.

Comparisons of proportions of double mutants of different effect classes between the cis and trans libraries

Double mutants from each of the 1000 sub-sampled libraries of matched single mutant effects were classified using the same class definitions as above. The proportions of each class in each library averaged over the 1000 matched sub-samples were reported. The p-value corresponds to the proportion of matched sub-samples where the differences between the two libraries were in the opposite direction than the averages.

Thermodynamic model

The same thermodynamic model as above (Equation 9) was fitted. All fitting procedures were performed after pooling the three replicates in the same model. To test whether the same model with the same parameter would fit both the cis and trans libraries, the model was first fitted on both libraries independently. These libraries were the original ones, that is before matching single mutant distributions (but the restricted trans library). The two fitting procedures converged to similar parameters that had an almost identical predictive power when tested on the other library than the one it was fitted on. To test whether the differences in the distribution of single mutant effects between the two libraries would affect the fitting, this was repeated on the 1000 matched, sub-sampled libraries. This confirmed that the model was robust to and independent of these differences. The parameters and genetic interaction scores used in subsequent analyses were thus the ones from the model fitted on the two libraries pooled together.

Genetic interaction scores were calculated as the difference between the observed double mutants PPI scores and the ones predicted by the thermodynamic model. The significance of genetic interactions was tested using a one-sample t-test. The FDR at each p-value threshold was then calculated independently for the two libraries to account for the differences in the p-value distributions that might be partly the consequences of different levels of noise or technical artifacts in the two libraries. This was performed as explained above using 10,000 permutations.

The proportion of the non-random variance in the double mutants PPI scores explained by the model for each library was calculated first for the original libraries. The proportion of total non-random variance was calculated as the average of the squared Pearson’s correlation coefficients between the double mutant PPI scores of each pair of the three replicates of a given library. The proportion of non-random variance explained by the model was then calculated as the percentage ovariance explained by the model divided by the proportion of total non-random variance.

To test whether this proportion of non-random variance explained by the model is independent of the differences in single mutant effects, this was repeated on 1000 libraries sub-sampled to match their distribution of single mutants effects. The proportion of total non-random variance was recalculated in each sub-sample because they might differ since the double mutant compositions differ. The model was not re-fitted on each sub-sample since it was demonstrated above that this does not vary significantly. The proportion of non-random variance explained by the model was then recalculated and averaged across the 1000 matched sub-samples.

Comparisons of the proportion of structural genetic interaction in cis and trans

To compare the proportion of structural positive and negative genetic interactions in cis and trans, the same p-value threshold was chosen in the two libraries instead of the same FDR because identical variants with identical effect sizes and identical levels of noise, that is variants that would have identical p-values after the t-test, could otherwise be significant in one but not in the other library due to different FDR adjustments. The average proportion of true cases of genetic interactions was then estimated in each library as the total number of significant cases of either positive or negative genetic interactions multiplied by one minus the average FDR calculated by permutation at any given p-value threshold.

To ensure that the higher prevalence of both positive and negative genetic interactions is not the consequence of different single mutant effect sizes in the two libraries, this analyses was repeated on the 1000 libraries randomly sub-sampled to match their distributions of single mutants effects. In each sub-sample, FDRs were re-computed using 10,000 permutations as explained above. The proportion of both positive and negative genetic interactions was then computed in each sub-sample using the FDR averaged over the 1000 randomizations.

Enrichment in strong structural genetic interactions per pair of positions, between position types and at different distance thresholds

Enrichments for cases of either strong positive or strong negative genetic interactions (absolute genetic interaction score > 0.1 and FDR < 0.2) were tested by Fisher’s exact test as described above for the non-restricted trans library alone. For this analysis, different p-value thresholds leading to the same FDR in the two libraries were used to minimize the differences in noise levels between the two libraries.

Acknowledgements

This work was supported by a European Research Council (ERC) Consolidator grant (616434), the Spanish Ministry of Economy and Competitiveness (BFU2011-26206 and ‘Centro de Excelencia Severo Ochoa' SEV-2012–0208), the AXA Research Fund, the Bettencourt Schueller Foundation, Agencia de Gestio d’Ajuts Universitaris i de Recerca (AGAUR, SGR-831), the EMBL-CRG Systems Biology Program, and the CERCA Program/Generalitat de Catalunya. GD is a non-stipendiary EMBO Fellow and a Marie-Curie Fellow under grant agreement 608959.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Ben Lehner, Email: lehner.ben@gmail.com.

Naama Barkai, Weizmann Institute of Science, Israel.

Funding Information

This paper was supported by the following grants:

  • European Research Council 616434 to Ben Lehner.

  • AXA Research Fund Professorship to Ben Lehner.

  • Ministerio de Economía y Competitividad BFU2011-26206 and SEV-2012-0208 to Ben Lehner.

  • Agència de Gestió d’Ajuts Universitaris i de Recerca SGR-831 to Ben Lehner.

  • Fondation Bettencourt Schueller Bettencourt Prize to Ben Lehner.

  • European Commission Marrie-Currie co-fund fellowship 608959 to Ben Lehner.

  • EMBL-CRG Systems Biology Program to Ben Lehner.

  • Generalitat de Catalunya CERCA Program to Ben Lehner.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Data curation, Software, Formal analysis, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing—original draft, Writing—review and editing.

Conceptualization, Supervision, Funding acquisition, Writing—original draft, Writing—review and editing.

Additional files

Supplementary file 1. Data from the full trans library.
elife-32472-supp1.xlsx (55.2MB, xlsx)
DOI: 10.7554/eLife.32472.024
Supplementary file 2. Variance explained by physico-chemical amino acid features.
elife-32472-supp2.xlsx (554.1KB, xlsx)
DOI: 10.7554/eLife.32472.025
Supplementary file 3. Data from the restricted trans library.
elife-32472-supp3.xlsx (9.9MB, xlsx)
DOI: 10.7554/eLife.32472.026
Supplementary file 4. Data from the cis library.
DOI: 10.7554/eLife.32472.027
Supplementary file 5. Primer sequences.
elife-32472-supp5.xlsx (33.1KB, xlsx)
DOI: 10.7554/eLife.32472.028
Transparent reporting form
DOI: 10.7554/eLife.32472.029

Major datasets

The following dataset was generated:

Diss G, author; Lehner B, author. The genetic landscape of a physical interaction. 2017 http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE102901 Publicly available at the NCBI Gene Expression Omnibus (accession no: GSE102901).

References

  1. Aakre CD, Herrou J, Phung TN, Perchuk BS, Crosson S, Laub MT. Evolving new protein-protein interaction specificity through promiscuous intermediates. Cell. 2015;163:594–606. doi: 10.1016/j.cell.2015.09.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Araya CL, Fowler DM, Chen W, Muniez I, Kelly JW, Fields S. A fundamental protein property, thermodynamic stability, revealed solely from large-scale measurements of protein function. PNAS. 2012;109:16858–16863. doi: 10.1073/pnas.1209751109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Baryshnikova A, Costanzo M, Myers CL, Andrews B, Boone C. Genetic interaction networks: toward an understanding of heritability. Annual Review of Genomics and Human Genetics. 2013;14:111–133. doi: 10.1146/annurev-genom-082509-141730. [DOI] [PubMed] [Google Scholar]
  4. Bershtein S, Segal M, Bekerman R, Tokuriki N, Tawfik DS. Robustness-epistasis link shapes the fitness landscape of a randomly drifting protein. Nature. 2006;444:929–932. doi: 10.1038/nature05385. [DOI] [PubMed] [Google Scholar]
  5. Chinenov Y, Kerppola TK. Close encounters of many kinds: Fos-Jun interactions that mediate transcription regulatory specificity. Oncogene. 2001;20:2438–2452. doi: 10.1038/sj.onc.1204385. [DOI] [PubMed] [Google Scholar]
  6. Costanzo M, Baryshnikova A, Bellay J, Kim Y, Spear ED, Sevier CS, Ding H, Koh JL, Toufighi K, Mostafavi S, Prinz J, St Onge RP, VanderSluis B, Makhnevych T, Vizeacoumar FJ, Alizadeh S, Bahr S, Brost RL, Chen Y, Cokol M, Deshpande R, Li Z, Lin ZY, Liang W, Marback M, Paw J, San Luis BJ, Shuteriqi E, Tong AH, van Dyk N, Wallace IM, Whitney JA, Weirauch MT, Zhong G, Zhu H, Houry WA, Brudno M, Ragibizadeh S, Papp B, Pál C, Roth FP, Giaever G, Nislow C, Troyanskaya OG, Bussey H, Bader GD, Gingras AC, Morris QD, Kim PM, Kaiser CA, Myers CL, Andrews BJ, Boone C. The genetic landscape of a cell. Science. 2010;327:425–431. doi: 10.1126/science.1180823. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Costanzo M, VanderSluis B, Koch EN, Baryshnikova A, Pons C, Tan G, Wang W, Usaj M, Hanchard J, Lee SD, Pelechano V, Styles EB, Billmann M, van Leeuwen J, van Dyk N, Lin ZY, Kuzmin E, Nelson J, Piotrowski JS, Srikumar T, Bahr S, Chen Y, Deshpande R, Kurat CF, Li SC, Li Z, Usaj MM, Okada H, Pascoe N, San Luis BJ, Sharifpoor S, Shuteriqi E, Simpkins SW, Snider J, Suresh HG, Tan Y, Zhu H, Malod-Dognin N, Janjic V, Przulj N, Troyanskaya OG, Stagljar I, Xia T, Ohya Y, Gingras AC, Raught B, Boutros M, Steinmetz LM, Moore CL, Rosebrock AP, Caudy AA, Myers CL, Andrews B, Boone C. A global genetic interaction network maps a wiring diagram of cellular function. Science. 2016;353:aaf1420. doi: 10.1126/science.aaf1420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Diss G, Filteau M, Freschi L, Leducq JB, Rochette S, Torres-Quiroz F, Landry CR. Integrative avenues for exploring the dynamics and evolution of protein interaction networks. Current Opinion in Biotechnology. 2013;24:775–783. doi: 10.1016/j.copbio.2013.02.023. [DOI] [PubMed] [Google Scholar]
  9. Diss G, Gagnon-Arsenault I, Dion-Coté AM, Vignaud H, Ascencio DI, Berger CM, Landry CR. Gene duplication can impart fragility, not robustness, in the yeast protein interaction network. Science. 2017;355:630–634. doi: 10.1126/science.aai7685. [DOI] [PubMed] [Google Scholar]
  10. Diss G. Diss_et_al_eLife_2018. 6e172bdGithub. 2018 https://github.com/gdiss/Diss_et_al_eLife_2018
  11. Fowler DM, Araya CL, Fleishman SJ, Kellogg EH, Stephany JJ, Baker D, Fields S. High-resolution mapping of protein sequence-function relationships. Nature Methods. 2010;7:741–746. doi: 10.1038/nmeth.1492. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Fowler DM, Fields S. Deep mutational scanning: a new style of protein science. Nature Methods. 2014;11:801–807. doi: 10.1038/nmeth.3027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Fowler DM, Stephany JJ, Fields S. Measuring the activity of protein variants on a large scale using deep mutational scanning. Nature Protocols. 2014;9:2267–2284. doi: 10.1038/nprot.2014.153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Freschi L, Torres-Quiroz F, Dubé AK, Landry CR. qPCA: a scalable assay to measure the perturbation of protein-protein interactions in living cells. Mol. BioSyst. 2013;9:36–43. doi: 10.1039/C2MB25265A. [DOI] [PubMed] [Google Scholar]
  15. Gjuvsland AB, Hayes BJ, Omholt SW, Carlborg O. Statistical epistasis is a generic feature of gene regulatory networks. Genetics. 2007;175:411–420. doi: 10.1534/genetics.106.058859. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Glover JN, Harrison SC. Crystal structure of the heterodimeric bZIP transcription factor c-Fos-c-Jun bound to DNA. Nature. 1995;373:257–261. doi: 10.1038/373257a0. [DOI] [PubMed] [Google Scholar]
  17. Halabi N, Rivoire O, Leibler S, Ranganathan R. Protein sectors: evolutionary units of three-dimensional structure. Cell. 2009;138:774–786. doi: 10.1016/j.cell.2009.07.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hiatt JB, Patwardhan RP, Turner EH, Lee C, Shendure J. Parallel, tag-directed assembly of locally derived short sequence reads. Nature Methods. 2010;7:119–122. doi: 10.1038/nmeth.1416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Horovitz A. Double-mutant cycles: a powerful tool for analyzing protein structure and function. Folding and Design. 1996;1:R121–R126. doi: 10.1016/S1359-0278(96)00056-9. [DOI] [PubMed] [Google Scholar]
  20. Janke C, Magiera MM, Rathfelder N, Taxis C, Reber S, Maekawa H, Moreno-Borchart A, Doenges G, Schwob E, Schiebel E, Knop M. A versatile toolbox for PCR-based tagging of yeast genes: new fluorescent proteins, more markers and promoter substitution cassettes. Yeast. 2004;21:947–962. doi: 10.1002/yea.1142. [DOI] [PubMed] [Google Scholar]
  21. Kacser H, Burns JA. The control of flux. Symposia of the Society for Experimental Biology. 1973;27:65–104. [PubMed] [Google Scholar]
  22. Kawashima S, Pokarowski P, Pokarowska M, Kolinski A, Katayama T, Kanehisa M. AAindex: amino acid index database, progress report 2008. Nucleic Acids Research. 2008;36:D202–D205. doi: 10.1093/nar/gkm998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Lehner B. Molecular mechanisms of epistasis within and between genes. Trends in Genetics. 2011;27:323–331. doi: 10.1016/j.tig.2011.05.007. [DOI] [PubMed] [Google Scholar]
  24. Letunic I, Doerks T, Bork P. SMART: recent updates, new developments and status in 2015. Nucleic Acids Research. 2015;43:D257–D260. doi: 10.1093/nar/gku949. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Levy ED, Kowarzyk J, Michnick SW. High-resolution mapping of protein concentration reveals principles of proteome architecture and adaptation. Cell Reports. 2014;7:1333–1340. doi: 10.1016/j.celrep.2014.04.009. [DOI] [PubMed] [Google Scholar]
  26. Levy SF, Blundell JR, Venkataram S, Petrov DA, Fisher DS, Sherlock G. Quantitative evolutionary dynamics using high-resolution lineage tracking. Nature. 2015;519:181–186. doi: 10.1038/nature14279. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Mason JM, Schmitz MA, Müller KM, Arndt KM. Semirational design of Jun-Fos coiled coils with increased affinity: Universal implications for leucine zipper prediction and design. PNAS. 2006;103:8989–8994. doi: 10.1073/pnas.0509880103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Melamed D, Young DL, Gamble CE, Miller CR, Fields S. Deep mutational scanning of an RRM domain of the Saccharomyces cerevisiae poly(A)-binding protein. RNA. 2013;19:1537–1551. doi: 10.1261/rna.040709.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Olson CA, Wu NC, Sun R. A comprehensive biophysical description of pairwise epistasis throughout an entire protein domain. Current Biology. 2014;24:2643–2651. doi: 10.1016/j.cub.2014.09.072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Omholt SW, Plahte E, Oyehaug L, Xiang K. Gene regulatory networks generating the phenomena of additivity, dominance and epistasis. Genetics. 2000;155:969–980. doi: 10.1093/genetics/155.2.969. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Patel L, Abate C, Curran T. Altered protein conformation on DNA binding by Fos and Jun. Nature. 1990;347:572–575. doi: 10.1038/347572a0. [DOI] [PubMed] [Google Scholar]
  32. Phillips PC. Epistasis--the essential role of gene interactions in the structure and evolution of genetic systems. Nature Reviews Genetics. 2008;9:855–867. doi: 10.1038/nrg2452. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Raman AS, White KI, Ranganathan R. Origins of allostery and evolvability in proteins: a case study. Cell. 2016;166:468–480. doi: 10.1016/j.cell.2016.05.047. [DOI] [PubMed] [Google Scholar]
  34. Ransone LJ, Visvader J, Sassone-Corsi P, Verma IM. Fos-Jun interaction: mutational analysis of the leucine zipper domain of both proteins. Genes & Development. 1989;3:770–781. doi: 10.1101/gad.3.6.770. [DOI] [PubMed] [Google Scholar]
  35. Schlecht U, Miranda M, Suresh S, Davis RW, St Onge RP. Multiplex assay for condition-dependent changes in protein-protein interactions. PNAS. 2012;109:9213–9218. doi: 10.1073/pnas.1204952109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Shaulian E, Karin M. AP-1 as a regulator of cell life and death. Nature Cell Biology. 2002;4:E131–E136. doi: 10.1038/ncb0502-e131. [DOI] [PubMed] [Google Scholar]
  37. Shendure J, Akey JM. The origins, determinants, and consequences of human mutations. Science. 2015;349:1478–1483. doi: 10.1126/science.aaa9119. [DOI] [PubMed] [Google Scholar]
  38. Stiffler MA, Hekstra DR, Ranganathan R. Evolvability as a function of purifying selection in TEM-1 β-lactamase. Cell. 2015;160:882–892. doi: 10.1016/j.cell.2015.01.035. [DOI] [PubMed] [Google Scholar]
  39. Tarassov K, Messier V, Landry CR, Radinovic S, Serna Molina MM, Shames I, Malitskaya Y, Vogel J, Bussey H, Michnick SW. An in vivo map of the yeast protein interactome. Science. 2008;320:1465–1470. doi: 10.1126/science.1153878. [DOI] [PubMed] [Google Scholar]
  40. Thompson KS, Vinson CR, Freire E. Thermodynamic characterization of the structural stability of the coiled-coil region of the bZIP transcription factor GCN4. Biochemistry. 1993;32:5491–5496. doi: 10.1021/bi00072a001. [DOI] [PubMed] [Google Scholar]
  41. Tokuriki N, Tawfik DS. Stability effects of mutations and protein evolvability. Current Opinion in Structural Biology. 2009;19:596–604. doi: 10.1016/j.sbi.2009.08.003. [DOI] [PubMed] [Google Scholar]
  42. Tong AH, Lesage G, Bader GD, Ding H, Xu H, Xin X, Young J, Berriz GF, Brost RL, Chang M, Chen Y, Cheng X, Chua G, Friesen H, Goldberg DS, Haynes J, Humphries C, He G, Hussein S, Ke L, Krogan N, Li Z, Levinson JN, Lu H, Ménard P, Munyana C, Parsons AB, Ryan O, Tonikian R, Roberts T, Sdicu AM, Shapiro J, Sheikh B, Suter B, Wong SL, Zhang LV, Zhu H, Burd CG, Munro S, Sander C, Rine J, Greenblatt J, Peter M, Bretscher A, Bell G, Roth FP, Brown GW, Andrews B, Bussey H, Boone C. Global mapping of the yeast genetic interaction network. Science. 2004;303:808–813. doi: 10.1126/science.1091317. [DOI] [PubMed] [Google Scholar]
  43. Zhang J, Kobert K, Flouri T, Stamatakis A. PEAR: a fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics. 2014;30:614–620. doi: 10.1093/bioinformatics/btt593. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Naama Barkai1

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "The genetic landscape of a physical interaction" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by Naama Barkai as the Senior and Reviewing Editor. The following individual involved in review of your submission has agreed to reveal their identity: Jesse D Bloom (Reviewer #3).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

As you will see below, the reviewers greatly appreciated the approach, but were worried about technical details. Please address all of the concerns that were raised. Most important are the concerns regarding sequencing errors, controls, and the use of the same thermodynamic model to two different aspects of the problem.

Here are the comments that I drew out of the reviews that I think should be noted. I also agree with many of the other listed points, but these are the ones that most stand out to me:

1) I agree with reviewer #1 that the authors should justify the cutoff of >10 reads. I don't really have a problem with this cutoff, but some justification would be nice. I strongly agree with #1 that the filtering to require >0 counts in selected library doesn't make any sense. A deleterious mutation would be expected to go to zero. So as #1 says, the authors should either remove this filter or come up with a really good reason why it is justified.

2) Reviewers 2 and 3 both think that it doesn't make sense to apply the same thermodynamic model to cis interactions (which presumably relates to protein stability) and trans interactions (which presumably related to binding affinity).

3) Several noted that the availability of the computer code is clearly inadequate.

4) Several noted any clear description of how they dealt with sequencing error, and the lack of controls to quantify this.

5) I agree with reviewer #2's point that the correlations should be reported in terms of the actual quantity on which inferences are based, which is PPI.

6) I agree with reviewers #1 and #2 that they need to justify the 0.64 / 0.96 / 1.04 cutoffs.

Reviewer #1:

The manuscript "The genetic landscape of a physical interaction by Lehner and Diss presents methods for detecting genetic interactions between combinations of point mutations in two genes encoding physically interacting proteins. The combination of Deep Mutational Scanning (DMS) with a Protein Fragment Complementation assay is innovative. They first define interactions in the usual multiplicative way, and then (most interestingly) present a thermodynamic model which explains a large fraction of the observed interactions (stemming from the idea that function depends non-linearly on folding energy). They then essentially redefine genetic interaction to look for combinatorial effects that are not explained by this null model. The combinatorial library construction method that directly establishes a genetic link between the interacting variants is an elegant experimental design. The paper is well written and provides novel insights into physical underpinnings of genetic interactions.

Despite overall enthusiasm, there are some points that should be addressed:

- The sequencing results were filtered to only consider clones with > 10 reads in the input library. How was this cutoff chosen? A plot of #reads vs. variance of technical replicates might be helpful to show demonstrate the effect of this cutoff. The rule of thumb for counts data is that the standard deviation is the square root of the count, so that a 10-count measurement will have a CV of at least 33%. Is this acceptable?

- The results were also filtered to only consider clones with >0 counts in the selected library. This was a bad idea. The mutational combinations that cause interactions to disappear altogether might correspond to some of the strongest and most interesting negative genetic interactions. Fixing this mistake may or may not have much impact on the results, but this analysis error should be corrected.

- R2 can be a misleading indicator of data quality. In measuring agreement between thermodynamic model and results, it would be useful to examine RMSD for held-out data not used in fitting the model.

- The expression "positive (compensatory)" is used, as though these were synonyms. In genetic interaction studies with sufficient resolution to capture different subtypes of positive interactions (see St Onge et al., Nat Genetics 2007), the bulk of positive interactions have been "masking" or "diminishing returns" interactions, where the first mutation disrupted a process, and the second disruptive mutation could do no further harm because the damage was already done. To be suppressive or compensatory means that not only must the combination of mutations be above multiplicative (or thermodynamic?) expectation, but the combination must cause less harm to the phenotypes than the most harmful of the two mutations. Adding specific separate analyses of masking and suppressive/compensatory positive interactions could greatly improve the impact of this study, and this would not be hard to do.

- It is said that two mutations within Fos are more likely to increase the strength of the PPI than one mutation in Fos combined with a second mutation in Jun. To complete the argument, the authors should also examine whether two mutations within Jun are more likely to increase the strength of the PPI than one mutation in Jun combined with a second mutation in Fos.

- In Figure 4E, the last example (bottom right panel) does not seem to make sense biochemically. While the WT and double mutant both can be expected to have a hydrophobic interaction, the individual single mutants would also be expected to be able to fulfill that role.

- "Sequencing data was filtered using homemade Perl scripts" is hardly a reproducible description, especially given that there is no statement about code availability. The code written for this study (e.g. perl scripts for sequence analysis, R scripts for statistical analyses) would ideally be publicly posted (e.g., at Github) and not simply "available on request".

- Some thresholds used for classifying mutant effects seem to come from nowhere. While the 1.04 threshold is justified in terms of its control of FDR at 0.05, no similar justification is provided for 0.64 or 0.96 thresholds.

- Subsection “Comparisons of the proportion of structural epistasis in cis and trans”. It is not at all clear that using the same P-value threshold allows apples-apples comparison between different libraries of different sizes. Would be better to computationally down-sample the larger of the two libraries.

Reviewer #2:

This paper uses deep mutational scanning (DMS) on two proteins to study the genetic determinants of binding across a protein-protein interaction interface; further, it interprets these observations in a structural context. It is the first study to analyze large-scale epistasis across an interface, so it will be of considerable interest to those interested in sequence-structure-function relationships and the determinants of PPIs. I find the conceptual framework and design to be solid. The experiments appear to have been well executed and the data have been carefully analyzed, so I think the paper is a good candidate for eLife.

A few comments follow. I am mostly concerned that better attention to analysis of error and reproducibility is necessary. I expect that it should be possible to address my comments with a handful of additional statistical analyses and more precise description in the text.

1) Stochastic error in inferring growth rate from frequencies has not been adequate attention. The authors use all sequences with >10 input reads and >0 post-selection reads. Estimates of PPI scores (and, in turn, of epistasis) for genotypes with low read numbers will have considerable stochastic error associated with them, particularly if that error is propagated into products or ratios (as they are for epistatic effects). Although reproducibility may be relatively high overall, the authors should incorporate uncertainty caused by stochastic error into their classifications and quantifications of mutational and epistatic effects.

2) The authors claim that reproducibility is high, and refer to a high rank-order correlation coefficient among replicates. The paper's major inferences, however, are not based on rank-order but on PPI score and statistics derived from it. What matters, then, is the reproducibility of the PPI score. The authors should instead report R2 for PPI among reps and between DMS and small-scale experiment. If the R2 is low, discussion of the implications for the authors' claims will be required.

3) Because the library was created using doped degenerate codons rather than targeted synthesis, there are a great variety of genotypes in the library with different numbers of mutations. Paired-end sequencing is used (rather than bar-coding) to identify genotypes. Sequencing error may lead to mis-identification of genotypes, and this error may be biased towards genotypes with certain numbers of mutations. This concern does not appear to have been addressed. Reporting the degree of sequencing error and showing that it does not strongly affect the claims is necessary.

4) I find the PPI score unintuitive. In theory, it might range asymmetrically around the value of 1 (wildtype) from 0 for a totally deleterious genotype to infinity. This alone bothers me (and could be addressed with a log-scale relative to wildtype). The assay itself puts practical limits on how much a mutant genotype might possibly increase the PPI, because there must be saturation point at which occupancy of the complex, given the dose of methotrexate in the assay, no longer improves fitness. The PPI numbers are therefore somewhat hard to interpret. I would prefer a transform that expresses the PPI on a log scale relative to wild type, rescaled so the intervals between the minimum and wild-type and that between wild-type and maximum are the same. I don't view the authors' method as fatally flawed, and I have no reason to think the conclusions would change if my method were used. I'd like the authors to consider the possibility that such an approach might make the numbers in the paper easier for the reader to interpret.

5) I like the authors' efforts to quantitatively distinguish specific structural epistatic interactions from general thermodynamic epistasis caused by the nonlinear relationship between the proteins' affinity for each other and occupancy of the complex. Although the model has a nice theoretical basis and accounts for a substantial amount of epistasis, there may be additional causes of nonspecific epistasis remaining in the system – for example, if there are nonlinearities in the relationship between occupancy of the jun-fos complex and growth rate. I would therefore like to see the authors explore using Sailer and Harms' Genetics 2017 method, which identifies general epistasis in a way that is more agnostic to the specific biochemical causes or quantitative relationships, to determine if this is the case. If general epistasis can be identified using Sailer's approach even after the thermodynamic model is applied, this would be important, because it would further reduce (and probably sharpen) the cases of specific structural epistasis, and it would be an interesting finding, as well.

6) I have some concern that the thermodynamic correction may affect cis and trans interactions differently. Stated a little too simply, the model is built around the expectation that mutations in trans that additively affect the energy of complex formation will have an exponential effect on occupancy of the complex and therefore on PPI; epistasis not accounted for by this relationship represent specific structural epistasis. It is not apparent to me that the same model appropriately corrects for general thermodynamic nonlinearity among cis- acting mutations. Suppose that two mutations within a protein independently affect the dG of the native structure; they will have an exponential effect on occupancy of the native structure. It is not obvious to me that the model will account for such nonlinear effects on the effective concentration of each molecule. If so, then pairs of cis-mutations that in fact act additively on the protein might be classified as structural specific interactions even after the thermodynamic model is applied. This could contribute to the apparent excess of classified epistasis in cis compared to in trans after removal of thermodynamic epistasis (Figure 5). If I am misinterpreting the model, it would be helpful for the authors to better explain in the main text what the model does and doesn't account for.

7) The authors classify the magnitude of PPI effects as strongly deleterious, weakly deleterious, neutral, and beneficial based on thresholds of 0.64, 0.96, and 1.04. The authors should provide a justification these numbers, which are presented as if they are arbitrary. (They do resemble some relevant numbers from the normal distribution, but the authors should make this explicit and justify their use.)

8) The interface of jun and fos appears as if it might be isologous, such that a mutation in one partner would be expected to have precisely the same structural and functional effect as the same mutation in the other (although this prediction would be compromised by divergence between the paralogs over time). The authors might discuss this, particularly as an explanation for the correlation between the effects of a mutation in one partner and the same mutation in the other. Isology would explain correlation; divergence would explain imperfect correlation.

9) Some bZip proteins, including fos and jun family members, can homodimerize. In the current assay, such events would interfere with growth, assuming that homodimers would compete with the heterodimer for the relevant subunits. Mutations that specifically affect the affinity of homodimerization – including those at the interface – would thus affect PPI, but not through trans-interactions. The authors should consider this possibility and note how it might contribute to their observations.

10) The authors state that combining two mutations that positively affect growth rate leads to positive epistasis and cite Figure 3C. I believe that should be apparent as pies colored mostly green in the upper-right quadrant of the array. I don't see that in the data at all. That quadrant looks mostly yellow, suggesting little epistasis. What I believe is apparent in Figure 3C is that positive epistasis is concentrated in the rows where a weak beneficial mutation plus a deleterious mutation yield an outcome that is less deleterious than expected; this effect goes away when the thermodynamic model is applied (compare 3F). This is worth noting and, I hope, providing an explanation for.

Reviewer #3:

This study uses deep mutational scanning to assess the effects of essentially all single amino-acid mutations to the JUN and FOS proteins. It then uses an interesting strategy to analyze the effects of combinations of mutations to the two.

This is the first deep mutational scanning study to look at both sides of a protein-protein interface. In general, the experiments appear to be well done with adequate replicates – at least I could identify no obvious technical flaws. They also have a nice and detailed Materials and methods section.

The analysis in terms of thermodynamic effects was quite nice, and I like the result about a mix of thermodynamic and structural epistasis.

In general, unless the other reviewers identify significant problems that I have overlooked, I support publication of this paper.

* The only thing that I see lacking experimentally is deep sequencing of the wildtype sequences to estimate the error rate. Right now there is no such estimate of how many of the observed mutations are actually due to library preparation of sequencing errors. This could easily be determined by sequencing wild type.

* The computer code used for the data analysis should be included as a supplementary file and/or a GitHub repository (eLife now allows this).

* In various places (first in the subsection “Determinants of single mutant outcome”), the authors refer to the "identical substitutions" in both proteins. But they never clearly explain what this means. I'm assuming that the proteins are homologous, and they mean the homologous positions? This needs to be described much better.

* Unless I am misunderstanding, the thermodynamic model is in terms of mutational effects on the protein-protein binding. This makes sense for analyzing the trans mutational effects. But for analyzing the cis mutational effects, wouldn't we expect to instead be concerned with ddG values for the stabilities of the individual proteins?

eLife. 2018 Apr 11;7:e32472. doi: 10.7554/eLife.32472.034

Author response


Here are the comments that I drew out of the reviews that I think should be noted. I also agree with many of the other listed points, but these are the ones that most stand out to me:

1) I agree with reviewer #1 that the authors should justify the cutoff of >10 reads. I don't really have a problem with this cutoff, but some justification would be nice. I strongly agree with #1 that the filtering to require >0 counts in selected library doesn't make any sense. A deleterious mutation would be expected to go to zero. So as #1 says, the authors should either remove this filter or come up with a really good reason why it is justified.

We used these read cut-offs because the low read counts lead to a lower boundary for PPI scores when the output reads equal one. Output read counts of 0 thus lead to a large uncertainty in PPI score. With 10 reads or below in the input, the PPI score can never be lower than ~0.5 (when output read count is 1), which biases those variants with low input read counts. For more details, please refer to the answers to reviewers’ comments.

2) Reviewers 2 and 3 both think that it doesn't make sense to apply the same thermodynamic model to cis interactions (which presumably relates to protein stability) and trans interactions (which presumably related to binding affinity).

Most leucine zippers, including FOS and JUN, have a coupled folding-binding equilibrium. The system can thus be modeled as a two-state equilibrium with a single ΔΔG that fits both the cis and trans double mutants.

3) Several noted that the availability of the computer code is clearly inadequate.

We now uploaded the code on GitHub at https://github.com/gdiss/Diss_et_al_eLife_2018 and added the link at the beginning of the Materials and methods section.

4) Several noted any clear description of how they dealt with sequencing error, and the lack of controls to quantify this.

We now estimate sequencing error and show that more stringent quality filters only affect PPI scores marginally.

5) I agree with reviewer #2's point that the correlations should be reported in terms of the actual quantity on which inferences are based, which is PPI.

We now report Pearson’s correlation coefficient instead of Spearman’s where appropriate.

6) I agree with reviewers #1 and #2 that they need to justify the 0.64 / 0.96 / 1.04 cutoffs.

We show in Figure 2—figure supplement 1B-C and Figure 5—figure supplement 2B-E that our results are robust to the choice of the cutoffs within ranges relevant to each mutant class according to the distribution of mutant effects.

Reviewer #1:

[…] Despite overall enthusiasm, there are some points that should be addressed:

- The sequencing results were filtered to only consider clones with > 10 reads in the input library. How was this cutoff chosen? A plot of #reads vs. variance of technical replicates might be helpful to show demonstrate the effect of this cutoff. The rule of thumb for counts data is that the standard deviation is the square root of the count, so that a 10-count measurement will have a CV of at least 33%. Is this acceptable?

We added panel B in Figure 1—figure supplement 1 and a paragraph in the Materials and methods (section”Sequencing data analysis and filtering”) to explain why we chose both the 10 input reads and 0 output read cut-offs:

“Variants with zero UMI in any of the three outputs were filtered out. Even though variants with lethal mutations could be expected to go to 0, they cannot go further down while their actual frequency in the population is still decreasing (e.g. they could still not be detected even if sequencing 10X or 100X deeper). […] We thus filtered out variants with less than 10 UMIs in any of the three input replicates”.

We now also report that our results regarding the enrichment of epistatic interactions between proximal and interfacial residues hold when increasing this threshold to 100 (Figure 4—figure supplement 3C and Figure 5—figure supplement 5)

- The results were also filtered to only consider clones with >0 counts in the selected library. This was a bad idea. The mutational combinations that cause interactions to disappear altogether might correspond to some of the strongest and most interesting negative genetic interactions. Fixing this mistake may or may not have much impact on the results, but this analysis error should be corrected.

Please refer to the answer to the previous comment.

- R2 can be a misleading indicator of data quality. In measuring agreement between thermodynamic model and results, it would be useful to examine RMSD for held-out data not used in fitting the model.

We agree and apologize for using the wrong term. What we report is actually the percentage of variance explained as calculated by 1- RSS/TSS, where RSS and TSS are the residual and total sum of squares, respectively. While it is also sometimes called R2, we understand it can be confusing as it doesn’t not equate to the actual square of the Pearson correlation coefficient. We hence replaced this term by “% var. explained” where appropriate.

We also now report the RMSD as we agree that it is relevant. As explained in the method, we use a Monte-Carlo cross-validation to fit the thermodynamic model. Briefly, double mutants are split in two independent sets representing each 25% of the data. The identity of the mutants compositing these two sets do not overlap, ensuring their independence (i.e. double mutants corresponding to 50% of the FOS and JUN single mutants, chosen randomly, were attributed to the training set and the double mutants corresponding to the other 50% of FOS and JUN single mutants were attributed to the test set). The thermodynamic model was then fitted on the training set and evaluated on the test set. This was repeated 1,000 times and the median of each of the three parameters corresponded exactly to when fitting the data on the whole dataset. The median% of variance explained was also within 0.001 of that when the model was fitted on the whole dataset.

We report in Figure 3E the% of variance explained and the RMSD for the whole dataset and in Figure 3—figure supplement 2B the median of these two values across the 1,000 Monte-Carlo cross-validations.

- The expression "positive (compensatory)" is used, as though these were synonyms. In genetic interaction studies with sufficient resolution to capture different subtypes of positive interactions (see St Onge et al., Nat Genetics 2007), the bulk of positive interactions have been "masking" or "diminishing returns" interactions, where the first mutation disrupted a process, and the second disruptive mutation could do no further harm because the damage was already done. To be suppressive or compensatory means that not only must the combination of mutations be above multiplicative (or thermodynamic?) expectation, but the combination must cause less harm to the phenotypes than the most harmful of the two mutations. Adding specific separate analyses of masking and suppressive/compensatory positive interactions could greatly improve the impact of this study, and this would not be hard to do.

We agree that all positive interactions are not necessarily compensatory and apologize for the imprecision. We have removed this term. As for the separate analyses of masking vs. compensatory interactions, we believe these go beyond the scope of the present study.

- It is said that two mutations within Fos are more likely to increase the strength of the PPI than one mutation in Fos combined with a second mutation in Jun. To complete the argument, the authors should also examine whether two mutations within Jun are more likely to increase the strength of the PPI than one mutation in Jun combined with a second mutation in Fos.

We agree that this comparison would be interesting. However, we make this statement based on the actual comparison between the trans and cis FOS double mutant data. We did not assay the effects of cis double mutants in JUN and can thus unfortunately not do the same comparison.

- In Figure 4E, the last example (bottom right panel) does not seem to make sense biochemically. While the WT and double mutant both can be expected to have a hydrophobic interaction, the individual single mutants would also be expected to be able to fulfill that role.

This example can indeed be counter-intuitive. However, a speculative explanation could be the following. As can be seen on the figure, the two facing Leucines side chains extend outward of the interface and in opposite directions, so that the bulky, branched end of the side chain do not clash with each other. Valine has one less carbon atom in the side chain than leucine but the same branched carbon at the end of the side chain. A mutation from Leucine to Valine in FOS might therefore bring the branched end of the side chain closer to the interface, which could clash with the facing Leucine in JUN. We indeed see a PPI score of 0.71 for this single mutant. On the contrary, the less bulky Alanine mutation in JUN appears to be near neutral (PPI score = 0.95). At the same time, it makes room so that the Leu-to-Val mutation in FOS does not clash anymore, such that the double mutant is more stable than the Val single mutant (PPI score = 0.91).

- "Sequencing data was filtered using homemade Perl scripts" is hardly a reproducible description, especially given that there is no statement about code availability. The code written for this study (e.g. perl scripts for sequence analysis, R scripts for statistical analyses) would ideally be publicly posted (e.g., at Github) and not simply "available on request".

We now uploaded the code on GitHub at https://github.com/gdiss/Diss_et_al_eLife_2018 and added the link at the beginning of the Materials and methods section.

- Some thresholds used for classifying mutant effects seem to come from nowhere. While the 1.04 threshold is justified in terms of its control of FDR at.0.05, no similar justification is provided for 0.64 or 0.96 thresholds.

We agree that these thresholds are arbitrary. However we show in Figure 2—figure supplement 1B-C and Figure 5—figure supplement 2B-E that our conclusions hold when modifying the thresholds within relevant ranges for the class they delineate based on the distribution of PPI scores.

- Subsection “Comparisons of the proportion of structural epistasis in cis and trans”. It is not at all clear that using the same P-value threshold allows apples-apples comparison between different libraries of different sizes. Would be better to computationally down-sample the larger of the two libraries.

We understand that this can be unintuitive (we actually had to think about how to make these comparisons for quite a while), which is why we also reported comparisons after 1000 random down-samplings of the larger library (Figure 5—figure supplement 2, 3C and 9).

However, we would like to keep the comparison at the same p-value threshold because we believe it is valid, as explained in the Materials and methods. Indeed, two identical mutations in the two libraries will have the exact same effect on the PPI and therefore ought to be identically classified as significantly detrimental or beneficial. If the technical noise is the same in the two libraries, these two variants should have the same measured mean effect and the same standard error and hence the same p-value, independently of all the other variants in the library. They will thus be classified the same way if the classification is based on the p-value. However, they will have different q-values (i.e. FDR), because the distributions of p-values differ between the two libraries, and would hence not be necessarily classified in the same way. By using a p-value threshold, we thus ensure that variants with similar mean effect and standard error are classified the same way. We use the FDRs associated to that p-value threshold (these FDRs are different in the two libraries) to subtract, in each library, the number of expected false discoveries from the total number of discoveries at that p-value threshold and hence report the number of true discoveries.

Reviewer #2:

[…] A few comments follow. I am mostly concerned that better attention to analysis of error and reproducibility is necessary. I expect that it should be possible to address my comments with a handful of additional statistical analyses and more precise description in the text.

1) Stochastic error in inferring growth rate from frequencies has not been adequate attention. The authors use all sequences with >10 input reads and >0 post-selection reads. Estimates of PPI scores (and, in turn, of epistasis) for genotypes with low read numbers will have considerable stochastic error associated with them, particularly if that error is propagated into products or ratios (as they are for epistatic effects). Although reproducibility may be relatively high overall, the authors should incorporate uncertainty caused by stochastic error into their classifications and quantifications of mutational and epistatic effects.

We agree and are well aware of this issue. We note however that since single mutants have a much higher coverage than double mutants (~20x more, see Materials and methods), the error on epistasis scores is mostly from the double mutant, had we used an error model, despite the product propagation of single mutants errors.

However, previous studies have been able to make new and significant discoveries without the use of such models. The fact that we find non-random enrichments for structurally related pairs of position shows that the increased noise added by those epistatic interactions derived from variants with low read numbers does not impede our ability to detect general trends.

To ensure that these results are not biased by these stochastic effects, we now show that they hold even when using only variants with a larger input read count (>100 in all three replicates; Figure 4—figure supplement 3C and Figure 5—figure supplement 5), which should decrease the stochasticity associated with low read counts.

2) The authors claim that reproducibility is high, and refer to a high rank-order correlation coefficient among replicates. The paper's major inferences, however, are not based on rank-order but on PPI score and statistics derived from it. What matters, then, is the reproducibility of the PPI score. The authors should instead report R2 for PPI among reps and between DMS and small-scale experiment. If the R2 is low, discussion of the implications for the authors' claims will be required.

We understand the reviewer’s concern. We initially used Spearman’s rank correlation because the distributions of PPI scores that are compared are not normal. We however now report Pearson’s correlation coefficient R as suggested. In all cases, the differences are negligible.

3) Because the library was created using doped degenerate codons rather than targeted synthesis, there are a great variety of genotypes in the library with different numbers of mutations. Paired-end sequencing is used (rather than bar-coding) to identify genotypes. Sequencing error may lead to mis-identification of genotypes, and this error may be biased towards genotypes with certain numbers of mutations. This concern does not appear to have been addressed. Reporting the degree of sequencing error and showing that it does not strongly affect the claims is necessary.

We agree and now report in the Materials and methods section “Sequencing data analysis and filtering” an estimate of the degree of sequencing errors based on the constant regions upstream of each leucine zipper (where the primer anneals to amplify the library) in the trans library:

“To estimate sequencing errors, we measured the per base error probability in the constant region upstream of the variable regions of FOS and JUN that were used as primer annealing sites for library amplification. […] Sequencing errors do not seem to affect the downstream estimate of PPI scores as filtering reads at a Phred score of 30, 35 or additionally filtering out reads where at least one of the called mutations had a Phred score below 30 (>90% of the wrongly called mutations in the constant region had a Phred score below 30 compared to < 25% of the correctly called bases) led to PPI scores that were highly correlated to the ones calculated using the above filters (Figure 1—figure supplement 1A).”

We also added Figure 1—figure supplement 1A that shows that filtering reads at a higher average Phred score or additionally filtering reads where the called mutations had a Phred score below 30 does not strongly affect the PPI scores (Pearson correlation of ~0.99).

For the cis library, the paired end reads overlap along the whole variable region making sequencing errors much less likely than in the trans library. However, this error cannot be accurately estimated as the paired end reads do not overlap over the constant region. We assume sequencing errors will not affect PPI scores in the cis library since the higher rate in the trans library does not.

4) I find the PPI score unintuitive. In theory, it might range asymmetrically around the value of 1 (wildtype) from 0 for a totally deleterious genotype to infinity. This alone bothers me (and could be addressed with a log-scale relative to wildtype). The assay itself puts practical limits on how much a mutant genotype might possibly increase the PPI, because there must be saturation point at which occupancy of the complex, given the dose of methotrexate in the assay, no longer improves fitness. The PPI numbers are therefore somewhat hard to interpret. I would prefer a transform that expresses the PPI on a log scale relative to wild type, rescaled so the intervals between the minimum and wild-type and that between wild-type and maximum are the same. I don't view the authors' method as fatally flawed, and I have no reason to think the conclusions would change if my method were used. I'd like the authors to consider the possibility that such an approach might make the numbers in the paper easier for the reader to interpret.

We understand the reviewer’s concerns and actually initially tried different ways of scoring variants, including rescaling. However, we believe that the present score is the most intuitive as it corresponds to the number of doublings each variant underwent and so relates to the selection coefficient (the selection coefficient would be equal to PPI score minus 1). This measure is widely used. Moreover, we believe, as does reviewer 1, that our score makes it intuitive to relate the growth rate in our assay to the thermodynamic model. Rescaling the data so that the difference between the wild-type and the minimum is the same as the difference between the wild-type and the maximum could potentially bias the data because there is no reason to think that the absolute change in complex concentration should be the same for the minimum and maximum. The position of wild-type on the sigmoidal function between PPI score and ΔΔG supports this view. This also shows that in practice, the PPI score cannot reach infinity, not because of saturation but because it is not possible to have more than 100% complex bound. The y-axis is indeed equivalent to the fraction of the limiting partner that is bound.

Regarding the saturation effects, we added the following sentences in the Discussion, to suggest that these might have little effects: “A substantial fraction of genetic interaction could however not been explained by structural contacts. […] Moreover, Levy et al. have shown using protein-complementation assay that the growth is correlated to complementation complex concentration over a wide-range of concentrations (Levy et al., 2014).”

To summarize, we believe that our score is relevant and intuitive because it relates the population dynamics of the competition assay to the biochemical/thermodynamic basis of the PPI.

5) I like the authors' efforts to quantitatively distinguish specific structural epistatic interactions from general thermodynamic epistasis caused by the nonlinear relationship between the proteins' affinity for each other and occupancy of the complex. Although the model has a nice theoretical basis and accounts for a substantial amount of epistasis, there may be additional causes of nonspecific epistasis remaining in the system – for example, if there are nonlinearities in the relationship between occupancy of the jun-fos complex and growth rate. I would therefore like to see the authors explore using Sailer and Harms' Genetics 2017 method, which identifies general epistasis in a way that is more agnostic to the specific biochemical causes or quantitative relationships, to determine if this is the case. If general epistasis can be identified using Sailer's approach even after the thermodynamic model is applied, this would be important, because it would further reduce (and probably sharpen) the cases of specific structural epistasis, and it would be an interesting finding, as well.

We agree with the reviewer and initially used a loess regression as an agnostic model instead of the thermodynamic model. The increase in percentage of variance explained is only of 0.4% on average across the three replicates of the whole trans library, 0.2% for the trans libraries restrained to single nucleotides changes and 0.5% for the cis library (and this using a pretty relaxed bandwidth parameter making the regression surface quite rugged, but however fitted by cross-validation). However, the thermodynamic model is both simpler and more interesting because it gives a biophysical explanation to the non-specific epistasis compared to an agnostic model. Also, since there were no major differences between the downstream results when using the two models, we decided to report only one model to save space. We chose the thermodynamic model for the reasons stated above.

6) I have some concern that the thermodynamic correction may affect cis and trans interactions differently. Stated a little too simply, the model is built around the expectation that mutations in trans that additively affect the energy of complex formation will have an exponential effect on occupancy of the complex and therefore on PPI; epistasis not accounted for by this relationship represent specific structural epistasis. It is not apparent to me that the same model appropriately corrects for general thermodynamic nonlinearity among cis- acting mutations. Suppose that two mutations within a protein independently affect the dG of the native structure; they will have an exponential effect on occupancy of the native structure. It is not obvious to me that the model will account for such nonlinear effects on the effective concentration of each molecule. If so, then pairs of cis-mutations that in fact act additively on the protein might be classified as structural specific interactions even after the thermodynamic model is applied. This could contribute to the apparent excess of classified epistasis in cis compared to in trans after removal of thermodynamic epistasis (Figure 5). If I am misinterpreting the model, it would be helpful for the authors to better explain in the main text what the model does and doesn't account for.

FOS and JUN, like most bZIP proteins, fold upon binding. This means that the whole reaction can be modeled as a two-state equilibrium between the unfolded proteins on one side and the complex on the other, with a single ΔΔG value for the whole process.

We added the following sentence: “Because leucine zippers, including Fos and Jun, fold upon binding (Patel et al., 1990, Thompson et al., 1993), the same thermodynamic model based on a two-state equilibrium between the two unfolded proteins and the complex can describe how mutations combine in cis as well as in trans.”

7) The authors classify the magnitude of PPI effects as strongly deleterious, weakly deleterious, neutral, and beneficial based on thresholds of 0.64, 0.96, and 1.04. The authors should provide a justification these numbers, which are presented as if they are arbitrary. (They do resemble some relevant numbers from the normal distribution, but the authors should make this explicit and justify their use.)

We agree that these thresholds are arbitrary. However we show in Figure 2—figure supplement 1 and Figure 5—figure supplement 2 that our conclusions hold when modifying the thresholds within relevant ranges for the class they delineate based on the distribution of PPI scores.

8) The interface of jun and fos appears as if it might be isologous, such that a mutation in one partner would be expected to have precisely the same structural and functional effect as the same mutation in the other (although this prediction would be compromised by divergence between the paralogs over time). The authors might discuss this, particularly as an explanation for the correlation between the effects of a mutation in one partner and the same mutation in the other. Isology would explain correlation; divergence would explain imperfect correlation.

That is indeed the point we tried to make. We now make this point more explicit by adding the following sentence: “This correlation indicates that the structural determinants of mutation effects in FOS and JUN remain well conserved despite sequence divergence over long evolutionary timescales.”

We did not employ the word “isolog” because we found only 15 references in PubMed and hence thought it could be considered as jargon to the wide readership of eLife.

9) Some bZip proteins, including fos and jun family members, can homodimerize. In the current assay, such events would interfere with growth, assuming that homodimers would compete with the heterodimer for the relevant subunits. Mutations that specifically affect the affinity of homodimerization – including those at the interface – would thus affect PPI, but not through trans-interactions. The authors should consider this possibility and note how it might contribute to their observations.

We now add the following paragraph in the Discussion to state how homodimerization could mitigate some of the results: “A substantial fraction of genetic interaction could however not be explained by structural contacts. […] Elucidating the remaining mechanisms of genetic interactions will thus require further studies that take these effects into account.”

10) The authors state that combining two mutations that positively affect growth rate leads to positive epistasis and cite Figure 3C. I believe that should be apparent as pies colored mostly green in the upper-right quadrant of the array. I don't see that in the data at all. That quadrant looks mostly yellow, suggesting little epistasis. What I believe is apparent in Figure 3C is that positive epistasis is concentrated in the rows where a weak beneficial mutation plus a deleterious mutation yield an outcome that is less deleterious than expected; this effect goes away when the thermodynamic model is applied (compare 3F). This is worth noting and, I hope, providing an explanation for.

We agree with the reviewer that two mutations that increase strength do not exhibit epistasis and deleted this statement. However, regarding the combination of strength-increasing and –decreasing mutations, we already state that “Positive genetic interactions are, however, generally detected between two mutations that greatly weaken the interaction and also often when combining strength-increasing and strength-decreasing mutations (Figure 3C).”

We now added the following sentence to explain how this effect is the result of the sigmoidal relationship between PPI score and ΔΔG and thus regressed out by the thermodynamic model: “Indeed, because of the sigmoidal nature of the model, a single mutant that decreases ΔΔG will increase PPI scores to a lower extent in the wild-type context rather than when combined with a mutation that destabilized the complex because of the saturation effect caused by the plateau of the sigmoid.”

Reviewer #3:

[…] * The only thing that I see lacking experimentally is deep sequencing of the wildtype sequences to estimate the error rate. Right now there is no such estimate of how many of the observed mutations are actually due to library preparation of sequencing errors. This could easily be determined by sequencing wild type.

We agree that an estimate of sequencing error was lacking and now provide in the Materials and methods section “Sequencing data analysis and filtering” an estimate of the degree of sequencing errors based on the constant regions upstream of each leucine zipper (where the primer anneals to amplify the library) in the trans library:

“To estimate sequencing errors, we measured the per base error probability in the constant region upstream of the variable regions of FOS and JUN that were used as primer annealing sites for library amplification. […] Sequencing errors do not seem to affect the downstream estimate of PPI scores as filtering reads at a Phred score of 30, 35 or additionally filtering out reads where at least one of the called mutations had a Phred score below 30 (>90% of the wrongly called mutations in the constant region had a Phred score below 30 compared to < 25% of the correctly called bases) led to PPI scores that were highly correlated to the ones calculated using the above filters (Figure 1—figure supplement 1A).”

We also added Figure 1—figure supplement 1A that shows that filtering reads at a higher average Phred score or additionally filtering reads where the called mutations had a Phred score below 30 does not strongly affect the PPI scores (Pearson correlation of ~0.99).

For the cis library, the paired end reads overlap along the whole variable region making sequencing errors much less likely than in the trans library. However, this error cannot be accurately estimated as the paired end reads do not overlap over the constant region. We thus assume sequencing errors will not affect PPI scores in the cis library since the higher rate in the trans library does not.

* The computer code used for the data analysis should be included as a supplementary file and/or a GitHub repository (eLife now allows this).

We now uploaded the code on GitHub at https://github.com/gdiss/Diss_et_al_eLife_2018 and added the link at the beginning of the Materials and methods section.

* In various places (first in the subsection “Determinants of single mutant outcome”), the authors refer to the "identical substitutions" in both proteins. But they never clearly explain what this means. I'm assuming that the proteins are homologous, and they mean the homologous positions? This needs to be described much better.

We added the following sentence: “This correlation indicates that the structural determinants of mutation effects in FOS and JUN remain well conserved despite sequence divergence over long evolutionary timescales.”

* Unless I am misunderstanding, the thermodynamic model is in terms of mutational effects on the protein-protein binding. This makes sense for analyzing the trans mutational effects. But for analyzing the cis mutational effects, wouldn't we expect to instead be concerned with ddG values for the stabilities of the individual proteins?

FOS and JUN, like most bZIP, fold upon binding. This means that the whole reaction can be modeled as a two-state equilibrium between the unfolded proteins on one side and the complex on the other, with a single ΔΔG value for the whole process.

We added the following sentence: “Because leucine zippers, including Fos and Jun, fold upon binding (Patel et al., 1990, Thompson et al., 1993), the same thermodynamic model based on a two-state equilibrium between the two unfolded proteins and the complex can describe how mutations combine in cis as well as in trans.”

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Supplementary file 1. Data from the full trans library.
    elife-32472-supp1.xlsx (55.2MB, xlsx)
    DOI: 10.7554/eLife.32472.024
    Supplementary file 2. Variance explained by physico-chemical amino acid features.
    elife-32472-supp2.xlsx (554.1KB, xlsx)
    DOI: 10.7554/eLife.32472.025
    Supplementary file 3. Data from the restricted trans library.
    elife-32472-supp3.xlsx (9.9MB, xlsx)
    DOI: 10.7554/eLife.32472.026
    Supplementary file 4. Data from the cis library.
    DOI: 10.7554/eLife.32472.027
    Supplementary file 5. Primer sequences.
    elife-32472-supp5.xlsx (33.1KB, xlsx)
    DOI: 10.7554/eLife.32472.028
    Transparent reporting form
    DOI: 10.7554/eLife.32472.029

    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES