Abstract
Recent studies of genotype-phenotype (GP) maps have reported universally enhanced phenotypic robustness to genotype mutations, a feature essential to evolution. Virtually all of these studies make a simplifying assumption that each genotype—represented as a sequence—maps deterministically to a single phenotype, such as a discrete structure. Here, we introduce probabilistic genotype-phenotype (PrGP) maps, where each genotype maps to a vector of phenotype probabilities, as a more realistic and universal language for investigating robustness in a variety of physical, biological, and computational systems. We study three model systems to show that PrGP maps offer a generalized framework which can handle uncertainty emerging from various physical sources: (1) thermal fluctuation in RNA folding, (2) external field disorder in spin glass ground state finding, and (3) superposition and entanglement in quantum circuits, which are realized experimentally on IBM quantum computers. In all three cases, we observe a novel biphasic robustness scaling which is enhanced relative to random expectation for more frequent phenotypes and approaches random expectation for less frequent phenotypes. We derive an analytical theory for the behavior of PrGP robustness, and we demonstrate that the theory is highly predictive of empirical robustness.
Introduction.—
Systems which take a sequence as input and nontrivially produce a structure, function, or behavior as output are ubiquitous throughout the sciences and engineering. In biological systems such as RNA folding [1–11], lattice protein folding [4], protein self-assembly [12, 13], and gene regulatory networks [14, 15], the relationship between genotype (stored biological information) and phenotype (observable or functional properties) can be structured as genotype-phenotype (GP) maps, which have a rich history of computational and analytical investigation [1–34]. Systems from physics and computer science have also been analyzed as GP maps, including the spin glass ground state problem [30], linear genetic programming [26], and digital circuits [31].
Despite being completely disparate systems, all of the GP maps above share a number of common structural features, most notably an enhanced robustness of the phenotypes to genotype mutations. Phenotypic robustness of a phenotype is the average probability that a single character mutation of a genotype which maps to does not change the resultant phenotype , averaged over all genotypes mapping to . Random assignment of genotype to phenotype predicts that [4], where is the fraction of genotypes that map to phenotype . However, the systems mentioned above display substantially enhanced robustness, exhibiting the relationship with system-dependent constants and . It has been shown that, in evolution, this enhanced robustness facilitates discovery of new phenotypes [11, 19, 20, 35] and is crucial for navigating fitness landscapes [5]. As a result, it is important to accurately quantify robustness and its relationship with phenotype frequency.
All GP map studies referenced make the assumption that a genotype maps deterministically to a single phenotype. However, we argue that for most of the above systems, this is a major simplification. For instance, by mapping an RNA genotype to only the ground state energy structure, previous studies [1–11] make an implicit zero temperature approximation for the ensemble of molecules, even if the Gibbs free energy of an individual molecule itself is calculated within the folding software at finite temperature. Similarly, in studies of gene regulatory networks, spin glasses, linear genetic programs, and digital circuits, the systems investigated do not interact with external networks or variables. These investigations assume that the environmental effect on the GP mapping of the subsystem of interest is static.
In this Letter, we introduce probabilistic genotype-phenotype (PrGP) maps, in contrast to the above systems which we call deterministic genotype-phenotype (DGP) maps, which emerge as a limiting case of PrGP maps. The definitions of phenotypic robustness and transition probabilities retain the same physical meaning in PrGP maps as in DGP maps, and we emphasize that PrGP maps can handle disorder and uncertainty emerging from a variety of sources. To address the implicit zero temperature approximation in sequence-to-structure mappings (RNA, lattice protein folding, protein self-assembly), we study the folding of RNA primary sequences to a canonical ensemble of secondary structures corresponding to low-lying local free energy minima. To address external variable disorder with a known distribution, we study the zero temperature mapping of a spin glass bond configuration to its ground state with quenched external field disorder, building a phenotype probability vector using many replicas of the disordered field. This has implications for viral fitness landscape inference [36–40], where external fields, in part, model host immune pressure [39]. Lastly, to investigate inherent uncertainty in phenotypes, we introduce quantum circuit GP maps where uncertainty emerges from superposition and entanglement of classically measurable basis states. Our experimental realization of these quantum circuits on a 7-qubit IBM quantum computer also introduces measurement noise, which has a clear and unique effect on robustness. The PrGP map properties of the three model systems are summarized in Table I and visually in the Supplemental Material (SM) [41]. We observe that PrGP maps exhibit a novel biphasic scaling of robustness versus phenotype frequency which, for higher frequency phenotypes, resembles the seen in DGP maps but is suppressed, and, for lower frequency phenotypes, settles closer to a linear relationship between and or a more complex for .
TABLE I.
Overview of the genotypes and phenotypes of each PrGP system, as well as their respective sources of uncertainty.
| System | Genotype Alphabet | Alphabet size | Phenotype | Source of Uncertainty |
|---|---|---|---|---|
| RNA folding | {A, U, G, C} (or {G, C}) | 4 (or 2) | Folded dot-bracket structure | Thermal fluctuation, |
| Spin glass ground state | {−1, +1} | 2 | Ground state spin configuration | Disordered external field |
| Quantum circuit | {Z, X, Y, H, S, S†, T, T†} | 8 | Classical measurement of circuit output | Superposition and entanglement |
Theory.—
In this study, we focus on mappings of discrete genotypes, which can be written as sequences from a fixed alphabet, onto a discrete set of phenotypes. Let represent the mapping of genotype to phenotype , where is an element of , the set of all sequences of length drawn from an alphabet of characters. A generalization of robustness is the transition probability , the average probability that a single character mutation of a genotype mapping to phenotype will change the phenotype to , with the average taken over all genotypes mapping to . For DGP maps, is given by
| (1) |
where is the single character mutational neighborhood of sequence . For PrGP maps, we show in the SM [41] that the transition probability takes the form
| (2) |
where with , the probability that genotype maps to phenotype . In the above equation, is the set of all unordered pairs of sequences in which differ by exactly one character. The phenotype probability vector obeys the normalization conditions and for all , and phenotype robustnesses are given by the diagonal of the transition probability matrix, . We also are interested in the phenotype entropy , which quantifies the spread of a genotype’s mappings onto multiple phenotypes, and the genotype entropy , which quantifies the spread of a phenotype across all genotypes.
In DGP maps, a random null model [4] for robustness can be built by randomly assigning genotype-phenotype pairings while keeping the frequencies constant. As a result, the probability of a single character mutation leading to a change from phenotype to phenotype is approximately for all . For PrGP maps, a naive expectation can be built by letting all phenotype probability vectors equal the frequency vector, for all genotypes . From eq. (2), one finds that ; thus, the two random expectations are the same, even though they physically represent different scenarios.
A fundamental difference between PrGP maps and DGP maps is that DGP maps can have no frequencies lower than , but PrGP phenotypes in principle could have arbitrarily small frequencies. Thus, existing theory on DGP robustness [4, 12, 34] cannot be extended to PrGP phenotypes with sufficiently small frequencies. We thus develop a theory for PrGP robustness based on two key assumptions: (1) a phenotype with frequency has probability mass evenly across genotypes, and (2) the genotypes would be a robustness-maximizing set in the DGP sense (i.e. maximizing eq. (1)). We discuss the validity of these assumptions later in the paper and extensively in the SM [41].
Two central results of this paper which follow (see SM [41] for a detailed derivation) from the above assumptions are the approximate PrGP robustness:
| (3) |
and approximate upper bounds on the PrGP robustness given by the piecewise continuous function
| (4) |
The upper bound illustrates two distinct scaling laws—namely, a DGP-like scaling for sufficiently large frequencies, and a null model-like linear scaling for small frequencies. Since empirical DGP robustness often scales like a “suppressed” downscaling of the DGP maximum , the biphasic scaling of the PrGP upper bound suggests that empirical PrGP robustness may also appear biphasic and suppressed relative to the upper bound. In the Results section we show that eq. (3), which is highly successful at recapitulating empirical robustness in 3 systems (RNA, spin glasses, quantum circuits), is amenable to further analytical approximation given system-specific information about , yielding such biphasic scaling in different frequency regimes.
Methods:
RNA.—
In RNA folding DGP map studies [1–11], the global free energy minimum secondary structure (reported as a “dot-bracket” string indicating polymer connectivity) was calculated for every RNA sequence of fixed length drawn from the alphabet of the four canonical nucleotides {A,C,G,U} (alphabet size ). Here, we are interested in not only the global free energy minimum structures but also the low-lying local minima, and we additionally investigate the temperature-dependent behavior of the robustness. We use the RNAsubopt program from the ViennaRNA package (version 2.4.17) [42] to calculate the secondary structures and associated Gibbs free energies for the local free energy minima within 6 kcal/mol of the global free energy minimum (or all the nonpositive free energy local minima, if the global minimum is greater than −6 kcal/mol). Because of the increased computational time required to discover all the local minima within an energy range, we use a reduced alphabet of {C,G} for our main simulations with sequence length . A validation study with and the full alphabet is reported in the SM [41]. Simulations for the , trials were conducted at 20 °C, 37 °C (human body temperature), and 70 °C. We take the low-lying local free energy minima structures to comprise a canonical ensemble at the simulation temperature, so the probability of RNA sequence mapping to secondary structure is determined from , where normalizes the vector.
Spin Glasses.—
In a previous spin glass [43, 44] DGP map study [30], a zero temperature ± spin glass on a random graph with Hamiltonian was considered. The genotype is the bond configuration where each , and the phenotype is the ground state configuration where each . Degeneracies of the ground state were broken by the uniformly drawn, i.i.d. random external fields which were fixed for each simulation. In our spin glass PrGP map, we use a similar setup, but we are interested in the effect of external field disorder on robustness. We therefore incorporate the effects of Gaussian-distributed external fields , where the uniformly distributed means are fixed across all realizations of the disorder for each simulation. To obtain accurate robustness measurements, we exactly calculate every ground state for spin glasses with , and by exhaustive enumeration. We examine the effect of external field disorder by simulating 450 replicas of with variances , , and 0.1 and fixed means . Phenotype probability vectors for each genotype were constructed by tallying and normalizing the number of appearances of each ground state across each replica. Graph topology corresponding to data presented here, as well as validation trial data, are in the SM [41].
Quantum Circuits.—
Although methods to evolve quantum circuits have been suggested [45], to our knowledge this work is the first to analyze the structural properties of quantum circuit GP maps. We generate perform 7 trials in which we generate random quantum circuits (see SM for algorithm) with 7 qubits and 4 layers of gates; we also conduct an additional trial with 11 qubits and 4 layers of gates. Circuits are randomly seeded with CNOT gates which cannot participate in the genotype, and the remaining spaces are filled with single-qubit gates drawn from the alphabet {, , , , , , , }. We choose ( for the 11 qubit trial) of these gates to be variable gates which comprise the genotype. The input to the circuit is the prepared state , and the exact probability of classically measuring the basis state is , where is the -th qubit, and is the total circuit operation. We realize these quantum circuits on the ibm_lagos v1.2.0 quantum computer [41], one of the 7-qubit IBM Quantum Falcon r5.11H processors. Experimental phenotype probability vectors are constructed from tallying classical measurements from 1000 shots for each genotype. The 11-qubit trial is conducted on a Qiskit Aer simulator instead of an experimental quantum computer, using the ibm_brisbane noise profile to simulate noise. The circuits from our experimental trials are depicted in the SM [41].
Results and Discussion.—
After computing the PrGP map data from the RNA, spin glass, and quantum circuit numerical experiments, we computed robustness, transition probabilities, phenotype entropy distributions, genotype entropy for each phenotype, as well as versus for each phenotype, all of which are plotted in the SM [41]. In Figure 1 we plot robustness versus frequency and versus log frequency for each of the 3 main systems studied (additional RNA, spin glass, and quantum circuit trials are in the SM [41]). Notable common features across all systems include robustness much higher than predicted by the null model for sufficiently large frequencies and a convergence toward the null model for sufficiently small frequencies. The RNA PrGP maps, all show suppressed robustness relative to their DGP counterparts, and this scaling is further suppressed as temperature increases. Similarly, in spin glasses, the DGP robustness is highest and closest to the log-linear relationship; the PrGP maps show increasingly suppressed scaling as the disorder variance is tuned higher. In quantum circuit PrGP maps, the trials with experimental or simulated noise show the appearance of a long tail of many new small-frequency phenotypes with, leading to the supression of the robustness of the large-frequency phenotypes with a maintenance of the approximate scaling.
FIG. 1.
Plots of robustness versus (a,c,e) frequency and versus (b,d,f) log10(frequency) for (a,b) RNA folding in, (c,d) spin glass ground state, and (e,f) quantum circuit PrGP maps. The dashed line is the random null expectation .
From the phenotype entropy distributions in the SM [41], we see that as disorder parameters are increased (temperature, field variance, measurement noise), phenotype entropy distributions widen, meaning a genotype is more likely to have a broader distribution of phenotypes to which it maps. Similarly, genotype entropy, which is , for DGP maps, maintains similar scaling but is suppressed in PrGP maps, meaning that a phenotype with fixed frequency is likely to be spread out over more genotypes in the PrGP case than in the DGP case.
Notably, we also compute for all systems and predict robustness by directly plugging in and into eq. (3). We show an example plot of the thoeretical robustness, empirical robustness, null model, and upper bound for spin glasses with in Figure 2(a). Not only does the the theoretical robustness, given only and , recapitulate the salient scaling behavior of the empirical robustness, as shown in Figure 2(b), but the Pearson correlation between the predicted and empirical robustness is ; in the SM [41], we show that the Pearson correlations from robustness obtained from eq. (3) for all systems studied here ranged from 0.859–0.9996 and outperformed the null model and DGP maximum robustness formulas across all systems, illustrating the success of eq. (3). While the Pearson correlations are high, the prediction from eq. (3) varies by additive or multiplicative constant factors likely due to violation of one or both assumptions mentioned in the Theory section. As disorder parameters increase, these violations become more prominent and eq. (3) and the null model converge toward similar performance (see Table S1 in SM [41]), meaning that biphasic scaling starts to fade away in favor of null model-like linear scaling when there is too much disorder. In all cases though, the theory remains highly predictive.
FIG. 2.
(a) Plot of versus , where has either been computed empirically from the experimental data or theoretically from eq. (3) for the spin glass system . Includes upper bounds from eq. (4) and null model. (b) Scatter plot of theoretical versus empirical for the spin glass system with Pearson .
Interestingly, we find that tends to obey a power law with (generally with ) over many orders of magnitude, for all 3 systems, though with slightly differing behavior. This leads to a robustness expression
| (5) |
Notably, when (e.g. for small frequencies in the spin glass case and for all frequencies in the case), eq. (5) becomes . In RNA and quantum circuits, a sublinear power law is observed, resulting in . Lastly, for sufficiently large frequencies, substituting into eq. (5), the leading order behavior for small becomes . The theory suggests that the biphasic behavior is characteristic of the form of eq. (3), as there are other functional forms for which produce the same or similar behaviors in different regimes (see SM [41]).
Compared to existing DGP maps, PrGP maps not only allow for the inclusion of realistic, physical sources of disorder like thermal fluctuation and external variables, but they also permit the analysis of new systems like quantum circuits with inherent uncertainty. We emphasize the broad applicability of this framework to a vast array of systems across biology, physics, and computer science, and other disciplines for the analysis of robustness and stability. The analytical theory introduced here indicates a link between a phenotype and how it is spread over the genotypes. Given the empirical observation of a power law relationship between the number of unique genotypes over which the phenotype is spread and the frequency of the phenotype, we can show that for high frequencies loglinear (DGP-like) robustness is expected, while for small frequencies linear or a more complex is expected, depending on system specific information. Moreover, as disorder in a system is increased, phenotypes spread over a larger number of genotypes, leading to increasingly suppressed robustness and more null model-like behavior. Most notably, our theory in eq. (3) is highly successful, measured by Pearson correlation, in predicting empirical robustness across all systems.
The scaling we observe empircally and justify theoretically in this letter is observed in all three studied systems, despite being disparate, hinting at its universality. How this robustness trend affects navigability of (probabilistic) fitness landscapes is an important direction for further investigation. We also suggest that the mapping of genotypes to probability vectors instead of discrete phenotypes may facilitate the taking of gradients of, for instance, a negative loss-likelihood loss function in the process of learning PrGP or even DGP maps using statistical learning methods. Specifically, one might model a GP map using a graph neural network [46] and predict the phenotype or related properties of neighboring nodes. Such a model may ultimately aid in inferring fitness landscapes from limited initial GP data [47–49].
Supplementary Material
Acknowledgements.—
We acknowledge the use of IBM Quantum services and the MIT Engaging Cluster for this work. This work was supported by awards T32GM007753 and T32GM144273 from the National Institute of General Medical Sciences, a Hertz Foundation Fellowship (to VM), and a PD Soros Fellowship (to VM). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of General Medical Sciences, the National Institutes of Health, IBM, or the IBM Quantum Team.
Footnotes
The authors declare no known conflict of interest.
References
- [1].Weiß M. and Ahnert S. E., Neutral components show a hierarchical community structure in the genotype–phenotype map of RNA secondary structure, Journal of The Royal Society Interface 17, 20200608 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Weiß M. and Ahnert S. E., Using small samples to estimate neutral component size and robustness in the genotype–phenotype map of RNA secondary structure, Journal of The Royal Society Interface 17, 20190784 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [3].Aguirre J., Buldú J. M., Stich M., and Manrubia S. C., Topological Structure of the Space of Phenotypes: The Case of RNA Neutral Networks, PLoS ONE 6, e26324 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].Greenbury S. F., Schaper S., Ahnert S. E., and Louis A. A., Genetic Correlations Greatly Increase Mutational Robustness and Can Both Reduce and Enhance Evolvability, PLOS Computational Biology 12, e1004773 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].Greenbury S. F., Louis A. A., and Ahnert S. E., The structure of genotype-phenotype maps makes fitness landscapes navigable, Nature Ecology & Evolution 6, 1742 (2022). [DOI] [PubMed] [Google Scholar]
- [6].Dingle K., Schaper S., and Louis A. A., The structure of the genotype–phenotype map strongly constrains the evolution of non-coding RNA, Interface Focus 5, 20150053 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Dingle K., Camargo C. Q., and Louis A. A., Input–output maps are strongly biased towards simple outputs, Nature Communications 9, 761 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [8].Dingle K., Ghaddar F., Šulc P., and Louis A. A., Phenotype Bias Determines How Natural RNA Structures Occupy the Morphospace of All Possible Shapes, Molecular Biology and Evolution 39, msab280 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Dingle K., Pérez G. V., and Louis A. A., Generic predictions of output probability based on complexities of inputs and outputs, Scientific Reports 10, 4415 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Jörg T., Martin O. C., and Wagner A., Neutral network sizes of biological RNA molecules can be computed and are not atypically small, BMC Bioinformatics 9, 464 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Wagner A., Robustness and evolvability: a paradox resolved, Proceedings of the Royal Society B: Biological Sciences 275, 91 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [12].Greenbury S. F., Johnston I. G., Louis A. A., and Ahnert S. E., A tractable genotype–phenotype map modelling the self-assembly of protein quaternary structure, Journal of The Royal Society Interface 11, 20140249 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Tesoro S. and Ahnert S. E., Non-deterministic genotype-phenotype maps of biological self-assembly, EPL (Europhysics Letters) 123, 38002 (2018). [Google Scholar]
- [14].Camargo C. Q. and Louis A. A., Boolean Threshold Networks as Models of Genotype-Phenotype Maps, Complex Networks XI, 143 (2020). [Google Scholar]
- [15].Kauffman S., Homeostasis and Differentiation in Random Genetic Control Networks, Nature 224, 177 (1969). [DOI] [PubMed] [Google Scholar]
- [16].Wagner A., Distributed robustness versus redundancy as causes of mutational robustness, BioEssays 27, 176 (2005). [DOI] [PubMed] [Google Scholar]
- [17].Wagner A., Robustness and evolvability in living systems, 3rd ed., Princeton studies in complexity (Princeton Univ. Press, Princeton, NJ, 2007) oCLC: 845177181. [Google Scholar]
- [18].Payne J. L. and Wagner A., Constraint and Contingency in Multifunctional Gene Regulatory Circuits, PLoS Computational Biology 9, e1003071 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Payne J. L., Moore J. H., and Wagner A., Robustness, evolvability, and the logic of genetic regulation, Artificial Life 20, 111 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Payne J. L. and Wagner A., The Robustness and Evolvability of Transcription Factor Binding Sites, Science 343, 875 (2014). [DOI] [PubMed] [Google Scholar]
- [21].Schaper S. and Louis A. A., The Arrival of the Frequent: How Bias in Genotype-Phenotype Maps Can Steer Populations to Local Optima, PLoS ONE 9, e86635 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].Greenbury S. F. and Ahnert S. E., The organization of biological sequences into constrained and unconstrained parts determines fundamental properties of genotype–phenotype maps, Journal of The Royal Society Interface 12, 20150724 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [23].Ahnert S. E., Structural properties of genotype–phenotype maps, Journal of The Royal Society Interface 14, 20170275 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24].Weiß M. and Ahnert S. E., Phenotypes can be robust and evolvable if mutations have non-local effects on sequence constraints, Journal of The Royal Society Interface 15, 20170618 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Nichol D., Robertson-Tessi M., Anderson A. R. A., and Jeavons P., Model genotype–phenotype mappings and the algorithmic structure of evolution, Journal of The Royal Society Interface 16, 20190332 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Hu T., Tomassini M., and Banzhaf W., A network perspective on genotype–phenotype mapping in genetic programming, Genetic Programming and Evolvable Machines 10.1007/s10710-020-09379-0 (2020). [DOI]
- [27].Manrubia S., Cuesta J. A., Aguirre J., Ahnert S. E., Altenberg L., Cano A. V., Catalán P., Diaz-Uriarte R., Elena S. F., García-Martín J. A., Hogeweg P., Khatri B. S., Krug J., Louis A. A., Martin N. S., Payne J. L., Tarnowski M. J., and Weiß M., From genotypes to organisms: State-of-the-art and perspectives of a cornerstone in evolutionary dynamics, Physics of Life Reviews 38, 55 (2021). [DOI] [PubMed] [Google Scholar]
- [28].Payne J. L. and Wagner A., The causes of evolvability and their evolution, Nature Reviews Genetics 20, 24 (2019). [DOI] [PubMed] [Google Scholar]
- [29].Schaper S., Johnston I. G., and Louis A. A., Epistasis can lead to fragmented neutral spaces and contingency in evolution, Proceedings of the Royal Society B: Biological Sciences 279, 1777 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [30].Mohanty V. and Louis A. A., Robustness and stability of spin-glass ground states to perturbed interactions, Physical Review E 107, 014126 (2023), publisher: American Physical Society. [DOI] [PubMed] [Google Scholar]
- [31].Wright A. H. and Laue C. L., Evolving Complexity is Hard (2022), arXiv:2209.13013 [cs].
- [32].Johnston I. G., Dingle K., Greenbury S. F., Camargo C. Q., Doye J. P. K., Ahnert S. E., and Louis A. A., Symmetry and simplicity spontaneously emerge from the algorithmic nature of evolution, Proceedings of the National Academy of Sciences 119, e2113883119 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33].Mohanty V., Robustness of evolutionary and glassy systems, Ph.D. thesis, University of Oxford (2021). [Google Scholar]
- [34].Mohanty V., Greenbury S. F., Sarkany T., Narayanan S., Dingle K., Ahnert S. E., and Louis A. A., Maximum mutational robustness in genotype–phenotype maps follows a self-similar blancmange-like curve, Journal of The Royal Society Interface 20, 20230169 (2023), publisher: Royal Society. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Draghi J. A., Parsons T. L., Wagner G. P., and Plotkin J. B., Mutational robustness can facilitate adaptation, Nature 463, 353 (2010), number: 7279 Publisher: Nature Publishing Group. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [36].Louie R. H. Y., Kaczorowski K. J., Barton J. P., Chakraborty A. K., and McKay M. R., Fitness landscape of the human immunodeficiency virus envelope protein that is targeted by antibodies, Proceedings of the National Academy of Sciences 115, E564 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [37].Butler T. C., Barton J. P., Kardar M., and Chakraborty A. K., Identification of drug resistance mutations in HIV from constraints on natural evolution, Physical Review E 93, 022412 (2016). [DOI] [PubMed] [Google Scholar]
- [38].Barton J. P., Goonetilleke N., Butler T. C., Walker B. D., McMichael A. J., and Chakraborty A. K., Relative rate and location of intra-host HIV evolution to evade cellular immunity are predictable, Nature Communications 7, 11660 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [39].Shekhar K., Ruberman C. F., Ferguson A. L., Barton J. P., Kardar M., and Chakraborty A. K., Spin models inferred from patient-derived viral sequence data faithfully describe HIV fitness landscapes, Physical Review E 88, 062705 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [40].Hopf T. A., Ingraham J. B., Poelwijk F. J., Schärfe C. P. I., Springer M., Sander C., and Marks D. S., Mutation effects predicted from sequence co-variation, Nature Biotechnology 35, 128 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [41].See Supplemental Material for this work.
- [42].Lorenz R., Bernhart S. H., Höner zu Siederdissen C., Tafer H., Flamm C., Stadler P. F., and Hofacker I. L., ViennaRNA Package 2.0, Algorithms for Molecular Biology 6, 26 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [43].Edwards S. F. and Anderson P. W., Theory of spin glasses, Journal of Physics F: Metal Physics 5, 965 (1975). [Google Scholar]
- [44].Sherrington D. and Kirkpatrick S., Solvable Model of a Spin-Glass, Physical Review Letters 35, 1792 (1975). [Google Scholar]
- [45].Tandeitnik D. and Guerreiro T., Evolving Quantum Circuits (2022), arXiv:2210.05058 [quant-ph].
- [46].Kipf T. N. and Welling M., enSemi-Supervised Classification with Graph Convolutional Networks (2017), arXiv:1609.02907 [cs, stat].
- [47].Shaw R. G. and Geyer C. J., Inferring Fitness Landscapes, Evolution 64, 2510 (2010), publisher: [Society for the Study of Evolution, Wiley]. [DOI] [PubMed] [Google Scholar]
- [48].Nozoe T., Kussell E., and Wakamoto Y., enInferring fitness landscapes and selection on phenotypic states from single-cell genealogical data, PLOS Genetics 13, e1006653 (2017), publisher: Public Library of Science. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [49].D’Costa S., Hinds E. C., Freschlin C. R., Song H., and Romero P. A., Inferring protein fitness landscapes from laboratory evolution experiments, PLOS Computational Biology 19, e1010956 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


