Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2024 May 13;121(21):e2318905121. doi: 10.1073/pnas.2318905121

Solvent constraints for biopolymer folding and evolution in extraterrestrial environments

Ignacio E Sánchez a,b,1, Ezequiel A Galpern a,b, Diego U Ferreiro a,b,1
PMCID: PMC11127021  PMID: 38739787

Significance

It is generally believed that life must occur in liquid solvents, as this is the regime in which relevant biochemistry can happen. For biochemistry to evolve, we propose that spontaneous molecular folding and the ability to encode function in biopolymer sequences is also required. We explore the limits to life emergence set by folding and evolution theories by relating them to the physical characteristics of different solvents that may occur in extraterrestrial environments. We find that along water, many solvents are compatible with biopolymer folding and evolution.

Keywords: glass transition, folding temperature, exoplanet

Abstract

We propose that spontaneous folding and molecular evolution of biopolymers are two universal aspects that must concur for life to happen. These aspects are fundamentally related to the chemical composition of biopolymers and crucially depend on the solvent in which they are embedded. We show that molecular information theory and energy landscape theory allow us to explore the limits that solvents impose on biopolymer existence. We consider 54 solvents, including water, alcohols, hydrocarbons, halogenated solvents, aromatic solvents, and low molecular weight substances made up of elements abundant in the universe, which may potentially take part in alternative biochemistries. We find that along with water, there are many solvents for which the liquid regime is compatible with biopolymer folding and evolution. We present a ranking of the solvents in terms of biopolymer compatibility. Many of these solvents have been found in molecular clouds or may be expected to occur in extrasolar planets.


Biology is based on biochemistry and biochemistry as we know it requires biopolymers. There are two fundamental characteristics that can be expected to be universal for any biopolymer: their potential for folding and evolution. It has been argued that biopolymers allow processes of folding and assembly to be detached from the required investment of free energy (1). For small molecules, by contrast, assembly and investment are directly coupled. Therefore, small molecules cannot achieve the elaborate folds and assemblies that we find on terrestrial biological relics. Spontaneous folding to specific structures is one of the fundamental aspects that come naturally to biopolymers. Specific, fast and robust folding can only occur in polymers that follow the “Principle of Minimal Frustration” (2), for which there is a fundamental correlation between similarity in structure and energetic distribution. In these polymers, the folded structure can be encoded in the sequence of monomers given an environment (3). The application of statistical mechanics to polymer models defines the parameters relevant to describe the characteristic structural transitions of these systems (4). Quantitative molecular interpretation of these theories provide the basis for the exploration of the ranges of the constraints that are expected to be found for any biological polymer.

Besides (but not independently of) folding, biopolymers must be able to change over time, they must explore new structural and functional forms that allow for biochemistry to evolve. Biopolymer evolution can be fundamentally described as explorations on the sequence space that encodes the structural forms (5). Changes in the sequences relate to changes in the structures, giving rise to genotype to phenotype mappings. For terrestrial biopolymers, this mapping is certainly complex and depends on the details of the environment in which the information is decoded. Recently, molecular information theory was conjured with energy landscape theory to find the evolutionary informational footprint in foldable polymers (6). It was found that the average information contained in the sequences of evolved terrestrial proteins is very close to the average information needed to specify a fold. Moreover, it was shown that it is possible to compute the efficiency for conversion of folding free energy into sequence information, that for terrestrial proteins is around 50%.

We propose here that spontaneous folding and molecular evolution are two universal aspects that must concur whether life is happening. These aspects are fundamentally related to the chemical composition of the biopolymers and crucially depend on the solvent in which these are embedded. It is generally accepted that life must occur in liquid solvents, as this is the phase in which relevant biochemistry can happen (7, 8). Some characteristics that are expected for possible alternative solvents are the ability to form hydrogen bonds, the presence of hydrophobic phase separation allowing for membrane systems, acid/base properties, chemical stability of known biomolecules, feasibility of a wide range of metabolic reactions, temperature range of liquid, etc. (13). There is a wide range of proposed life-sustaining solvents that includes water, ammonia, sulfuric acid, formamide, hydrocarbons, dihydrogen, dinitrogen, carbon dioxide, carbon disulfide, hydrazine, hydrogen cyanide, hydrogen sulfide, nitric oxide, and silicon dioxide to name some (13), many of which have been found in molecular clouds or may be expected to occur in extrasolar planets (8). The likelihood of the occurrence of seas of different candidate solvents in exoplanets has been evaluated as a function of stellar type (8). Here, we present a different but complementary analysis, by abstracting out the detailed chemistry of the polymer and concentrating on the solvent characteristics that constrain biopolymer folding and evolution in extraterrestrial environments.

Results

Characteristic Temperatures of Biopolymer Folding and Evolution.

Energy landscape theory and molecular information theory deduce in an abstract manner the conditions that are necessary for the evolution of folded biopolymers. Since the deductions do not depend on the chemical nature of the biopolymer or the solvent, the results can be used to assess whether alternative biochemistries support biopolymer existence.

Energy landscape theory and molecular information theory identify at least four characteristic temperatures of biopolymer folding and evolution (6). First, the physiological temperature Tphys is the one at which the molecule functions in a biological setting. Second, the folding temperature Tf is the one at which 50% of the molecules remain folded. Empirically, on earth, Tf is on average around 15 degrees higher than Tphys (6). Given the broad temperature ranges considered in this work, we will approximate Tphys=Tf for simplicity. Relaxing this assumption to allow for a Tf that is up to 30 K higher than Tphys does not change the main conclusions of our work. Third, the glass temperature Tg is related to the thermodynamics of trapping the polymer in the configurational space. Below Tg, the system runs out of entropy and its kinetics exhibit unfoldable glass-like behavior (2). Fourth, the selection temperature Tsel is related to the strength of selection for folding stability during evolution of the biopolymer sequence in the sequence space. If Tsel is lower than Tg, a biopolymer sequence can evolve for folding.

We propose that evolved biopolymer sequences must correspond with polymers that are folded and active at the physiological temperature. From energy landscape theory considerations, the relationship between these four characteristic temperatures is (6)

Tsel<Tg<Tphys<Tf. [1]

Eq. 1 implies that the ratios Tg/Tsel and Tf /Tg are both higher than one. Both conditions need to be met for evolution of folded biopolymers. The condition for Tg/Tsel ensures that foldable biopolymers’ sequences can evolve in a relevant timescale, while the condition for Tf /Tg ensures that evolved biopolymer structure can fold in a relevant timescale.

We can refine these theoretical limits given by Eq. 1 as follows. Energy landscape theory provides the following relationship between Tsel, Tg, and Tf (6, 9, 10):

2TfTsel=1Tg2+1Tf2. [2]

According to molecular information theory, the folded structures can be encoded in sequences of monomers, and the ratio of Tsel to Tphys defines the efficiency for the conversion of folding free energy into sequence information. This ratio has an upper limit of ln(2) (6, 11):

Eff=TselTphysTselTfln(2). [3]

If we solve Eq. 2 for Tg and substitute Tsel=Eff·Tf, we obtain

TfTg=(2-Eff)Eff1.373. [4]

If we solve Eq. 2 for Tf and combine the solution with Eq. 3 we obtain Eff=(Tsel/Tg)2/(1+1-(Tsel/Tg)2). Since Effln(2), it follows that

TgTsel1.048. [5]

Finally, putting together [4] and [5] we get

TfTsel1.439. [6]

Since Eq. 2 comes from a simplified random energy model for the energetics of folding and evolution, the precise values obtained should be considered as first-order approximations. The combination of energy landscape theory with molecular information theory provides theoretical limits for the ratios between Tsel, Tg, and Tf. An evolved, coded, biopolymer able to fold into a specific conformation should fulfill conditions [4]–[6]. A general diagram of these relations is presented in Fig. 1. The Tf /Tg ratio indicates the funneling of the folding energy landscape, which directly relates to the foldability of the biopolymer (4). The Tg/Tsel ratio is related to the selection strength of the sequences, as the ratio increases the possibility of finding codable sequences by mere chance decreases, as the sequence entropy of the foldable ensemble must decrease as Tsel lowers. Notice that there is a fundamental compromise between foldability and evolvability: The better the folding, the more difficult it is to find sequences that will code for it (12). This result is independent of the chemistry of the polymer and the physics of the solvent, and we propose that this is a universal characteristic expected to occur in biological polymers in any extraterrestrial environment.

Fig. 1.

Fig. 1.

Relationship between the characteristic temperatures for folding and evolution of biopolymers. The Tf/Tg ratio indicates the funneling of the folding energy landscape, which directly relates to the foldability of a biopolymer. The Tg/Tsel ratio relates to the selection strength of the sequences that code for a biopolymer. The limits imposed by energy landscape theory are shown in brown and the limits set by molecular information theory in blue.

Biopolymer Folding in Water.

In this section, we examine the properties of water to provide values for the Tf and Tg of biopolymers. As a starting point, we focus on a pressure of 1 atm. The potential influence of changes in environmental pressure is discussed in a latter section. Under these conditions, water is a liquid from 273.15 to 373.13 K. Since terrestrial life mainly exists in liquid water, we assume that Tf may take values in this range of temperatures.

First, we consider the value of the glass temperature Tg for water. Water is a complex substance presenting multiple transitions (13) (Fig. 2, Left). Liquid water can be superheated to about 553 K, and small droplets with 1 to 10 microns in diameter can be supercooled to about 231 K (13). Water, like any other liquid, can be vitrified when cooled fast enough to avoid crystallization. Most experimental observations report that rapidly cooled water at atmospheric pressure has a glass transition temperature of about 136 K (13). The dynamic and transport properties of supercooled water show critical behavior as a function of temperature, as indicated by various techniques such as viscosity, dielectric relaxation and NMR measurements, neutron and light scattering spectroscopies and molecular dynamics simulations (1315). The estimated values for the critical temperature of supercooled water range from 212 to 228 K, with an average of 220 ± 7 K (Fig. 1, red horizontal line).

Fig. 2.

Fig. 2.

Temperature domains of water and terrestrial biopolymers at atmospheric pressure. Left: temperature domains of stability and metastability for liquid and glassy water (black) and dynamic transition (red) (1315). Equilibrium transitions are shown as full lines, kinetically controlled transitions as dashed lines. Right: temperature domains and glass transition for folding of terrestrial biopolymers (blue). Circles indicate proteins (1620), squares indicate RNA (17, 2123), and triangles indicate DNA (2427). Above the glass transition temperature (dashed line), a biopolymer can search for its folded form in an efficient manner. Below the biopolymer glass transition temperature a biopolymer is trapped in a rough energy landscape and will not fold in a biologically relevant timescale. The theoretical limits for Tsel and Tf calculated from the average glass transition temperature and Eqs. 46 are shown as continuous lines. Adapted from ref. 13.

Second, we consider the glass transition of three terrestrial biopolymers undergoing structural transitions in aqueous media: proteins, RNA and DNA, which were studied using molecular dynamics simulations and neutron scattering (Fig. 2, Right). The glass transition of model proteins takes place between 180 and 210 K, with an average value of 196 ± 14 K (1620) (Fig. 2, blue circles). The glass transition of different RNA samples takes place between 190 and 230 K, with an average value of 212 ± 17 K (17, 2123) (Fig. 2, blue squares). Finally, the glass transition of different DNA samples was observed at temperatures between 205 and 223 K, with an average value of 215 ± 9 K (2427) (Fig. 2, blue triangles). In sum, three terrestrial biopolymers present a glass transition at an average temperature close to 207 ± 16 K (Fig. 2, blue horizontal line).

The glass transition temperature of terrestrial biopolymers at 207 ± 16 K is very close to the critical temperature of supercooled water at approximately 220 ± 7 K (Fig. 2, red horizontal line). This result is in line with the general idea that biopolymer folding and solvent dynamics are intimately linked. Large-scale protein motions do follow the solvent fluctuations but can be slower by a large factor. Slowing takes place because large-scale motions consist of many small steps, each determined by the rate constant for solvent fluctuations. This phenomenon is called slaving (28) and suggests that the glass transition of biopolymers may be coupled to a water transition. As discussed above, this temperature may be 136 K or close to 220 K, depending on which solvent transition is most relevant for folding. From the comparison of the Left and Right sides of Fig. 2, we propose that the limiting factor for the Tg of a biopolymer in any liquid can be approximated as the critical temperature of the supercooled liquid.

This empirical result is supported by the views of polymer evolution stemming from energy landscape theory (2, 12). Foldability of a polymer can be characterized by the ratio Tf /Tg and may be expected to increase during evolution. Tf is linked to the temperature at which the biopolymer functions, Tphys (6). Increasing Tf beyond the typical value of Tphys + 15 would generally compromise polymer function due excessive stability and damping of function-related dynamics (3). We thus expect the Tg of a polymer to decrease during evolution toward the lower bound dictated by solvent motions, namely the temperature for the dynamical transition of the solvent. In sum, a well-evolved biopolymer can be expected to present solvent slaving and a Tg corresponding to the dynamical transition of the solvent. Relaxing this assumption to allow for a glass transition temperature for the polymer that is up to 20 K higher than that of the solvent does not change the main conclusions of our work.

We can put together Eqs. 46 relating Tf, Tg, and Tsel for biopolymer folding with our empirical estimation for the glass transition temperature at 207 K. This yields an upper limit for the selection temperature Tsel at approximately 197 K (Fig. 2, Right). This upper limit is compatible with previous estimates of Tsel close to 108 K (10). It also yields a lower limit for the folding temperature Tf close to 284 K (Fig. 2, Right). The Tf of most proteins from psychrophilic organisms being above 298 K (29), is compatible with the lower limit of 284 K. Further, experimental and/or theoretical estimation of the characteristic temperatures Tf , Tg, and Tsel for multiple biomolecules yields values for the Tf /Tg ratio higher than the lower theoretical limits of Eqs. 4 and 6 in 25 out of 28 cases (6, 10, 1618, 20, 21, 26, 3036). We conclude that these theoretical limits for Tf, Tg, and Tsel in water are in reasonable agreement with the available empirical data and can therefore be used as a starting point to analyze biopolymer folding in nonaqueous solvents.

Characteristic Temperatures of Solvents Linked to Alternative Biochemistries.

We include in this study a total of 56 solvents from four broad categories (SI Appendix, Table S1). The first category is water and other solvents for which the dynamic transition temperature has been characterized. The second category is hydrocarbons, which have been linked to alternative biochemistries (37, 38). The third category is low molecular weight (up to approximately 100 Da) substances made out of elements abundant in the universe such as H, C, N, O, and S (39), which may potentially partake in alternative biochemistries. The fourth category is other substances for which we know the melting point, boiling point, and the glass transition temperature, such as halogenated substances and aromatic substances. These are included in order to explore a wide range of solvent physical properties.

The present approach requires knowledge of the melting point, boiling point, and dynamic transition temperature of each solvent. The values for the melting point and boiling point are taken from the literature. Since the dynamic transition temperature of most relevant solvents remains uncharacterized, we calculate the dynamic transition temperature for the remaining solvents from literature values for the glass transition temperature and the empirical linear relationships shown in SI Appendix, Fig. S1. We restrict ourselves to substances with a known glass transition temperature in the range of 40 to 400 K to avoid large extrapolations in the calculations. These requirements leave out of consideration liquids without a documented glass transition temperature in the 40 to 400 K range, such as ammonia, formamide, hydrogen sulfide, dinitrogen, and dihydrogen. Last, we exclude supercritical fluids, where applicability of the theory is unclear since their properties lie somewhere between a liquid and a gas.

Biopolymer Folding in Alternative Solvents.

We use Eqs. 3 and 4 and SI Appendix, Eq. S2 to calculate the Tf /Tg ratio, the efficiency for conversion of free energy into information, and the Tg/Tsel ratio as a function of Tf , for 54 solvents, for the temperature range in which each substance is a liquid at 1 atm. The results are shown in Fig. 3. Water is shown as a thick black line, hydrocarbons as green lines, small substances from C, H, O, N, and S as magenta lines, and the remaining solvents in SI Appendix, Table S1 as thin black lines. The horizontal lines highlight the theoretical limits for each parameter, with those from energy landscape theory shown in brown and those from molecular information theory in blue.

Fig. 3.

Fig. 3.

Temperature dependence of parameters relevant to biopolymer foldability, codability, and evolvability, for 54 solvents at 1 atm. Water is shown as a thick black line, hydrocarbons as green lines, small substances from C, H, O, N, and S as magenta lines and the remaining solvents in SI Appendix, Table S1 as gray lines. The horizontal lines highlight the theoretical limits for each parameter, with those from energy landscape theory shown as blue lines and those from molecular information theory in brown lines. (A) Tf/Tg ratio. (B) Efficiency for conversion of folding free energy into sequence information. (C) Tg/Tsel ratio.

The ratio Tf /Tg increases linearly with Tf , the efficiency decreases monotonically with Tf and the ratio Tg/Tsel increases monotonically with Tf in the biologically relevant range (Fig. 3). The values of Tf for which the Tf /Tg ratio and the efficiency cross their theoretical limits of approximately 1.373 and 0.69 are the same in all panels due to the linkage between Eqs. 2 and 3 and coincide with the highest value of Tf for which the Tg/Tsel ratio crosses its theoretical limit of 1.048. Above this value of Tf , foldable biopolymer sequences can be coded and evolve in a relevant timescale, and evolved biopolymer sequences can fold in a relevant timescale. All solvents considered have Tf /Tg and Tg/Tsel ratios above the theoretical minima and an efficiency below the theoretical maximum for at least a fraction of the temperature range in which they are liquid (Fig. 3). Thus, all substances considered here are in principle compatible with the existence of some biopolymer at 1 atm.

Is Water Particularly Well-Suited for Biopolymer Folding?

We present a quantitative comparison between the substances in SI Appendix, Table S1 with regard to their support for biopolymer folding and evolution, with particular attention to water. Fig. 3 shows that all solvents considered have values of Tf, Tg, Tsel, and efficiency compatible with biopolymer folding and evolution for at least a fraction of the temperature range in which they are liquid (Fig. 3). Here, we would like to focus on the following question: “If solvent X is present in the liquid state in a given celestial body, how likely is it to support biopolymer folding and evolution?”. We propose a Biopolymer Solvent score (BSscore) for each solvent that addresses this question by evaluating the geometric average of two contributions:

BSscore=Biopolymer rangeLiquid range·Average efficiencyMaximal efficiency. [7]

The difference between the highest and the lowest temperature for which the theoretical conditions are met is the liquid range that is compatible with folding. The upper limit of this range is the boiling point for all solvents considered here, while the lower limit depends on the specific solvent (SI Appendix, Table S1). In principle, solvents for which the temperature compatible with folding and evolution matches with the liquid range are more likely to yield a positive answer to the question above. We evaluate this using a normalized measure, the fraction of the temperature range for the liquid state that is compatible with folding, (Biopolymer range)/(Liquid range). This measure takes values from 0.14 to 1 for the solvents in our database. We also reason that solvents with a higher average efficiency in the liquid range that is compatible with folding are more likely to yield a positive answer to our question. We thus consider the efficiency for conversion of folding free energy into sequence information, averaged over the liquid range that is compatible with folding and evolution and normalized by its theoretical maximum ln(2). This measure takes values from 0.36 to 1 for the solvents in our database. As indicated by Eq. 7, the BSscore for each solvent is the geometric average of the fraction of the liquid range that is compatible with folding and evolution and the normalized average efficiency.

Fig. 4, Left panel, shows the relationship between the two variables for 54 substances in SI Appendix, Table S1 at 1 atm. Water is shown as a black circle, hydrocarbons as green circles, small substances from C, H, O, N, and S as magenta circles and the remaining solvents in SI Appendix, Table S1 as gray circles. The gray lines indicate constant values of the BSscore. Since Fig. 2 shows considerable uncertainty in the dynamic transition temperature of water, with values ranging from 180 to 230 K, we recalculated the BSscore for water using values within this range. This yields the black line in Fig. 4, Left panel, with BSscore values from 0.71 to 0.89 (average 0.82 ± 0.06), confirming that water belongs to the group of solvents with both a high average efficiency for conversion of folding free energy into sequence information and a large fraction of the liquid range compatible with folding. As suspected, water appears to be an exceptionally good solvent to support biopolymer folding and evolution. The remaining solvents can be ranked by the BSscore and the distribution is shown in Fig. 4, Right panel.

Fig. 4.

Fig. 4.

Biopolymer solvent scoring. (Left) Relationship between the fraction of the biopolymer compatible range and the liquid range of solvents and the average fractional efficiency for conversion of folding free energy to sequence information, for 54 solvents at 1 atm. Large black circle: water. Green circles: hydrocarbons. Magenta circles: small substances from C, H, O, N, and S. Gray circles: other solvents in SI Appendix, Table S1. Black line: range for water calculated from the uncertainty in the dynamic transition temperature presented in Fig. 2 (values range from 180 to 230 K). The gray lines indicate constant values of the BSscore (Eq. 7). (Right) Ranking of the solvents according to the BSscore.

Role of Pressure.

The presented results suggest that all 54 solvents analyzed may support biopolymer folding and evolution at 1 atm, at least in some range of temperatures. However, solvents may be subject to very different pressures in extraterrestrial environments. For example, the atmospheric pressures in the solar system may range from tens of microbars to nearly 100 bar (40). Liquid water can exist in a large fraction of this pressure range, from approximately 6 × 10−3 to 220 bar. It seems relevant to ask how the results presented in Figs. 3 and 4 change at different pressures. The temperatures for the dynamic transition of water and for its melting point vary little with pressure (15, 41). On the other hand, the boiling point of water increases sharply with increasing pressure (41). We expect other solvents to behave similarly due to the small changes in molar volume associated with the glass transition and melting, relative to the change in molar volume upon boiling (42).

We may now reconsider our calculations with Tg and the melting point as constants and a boiling point that increases with pressure. The main effect would then be an increase of the liquid range from the right extreme of all lines in Fig. 3. The effect of this would be twofold: the average efficiency for conversion of folding free energy into sequence information would decrease and the fraction of the liquid range where folding is possible would increase. The outcome in terms of BSscore would depend on the interplay between these two factors. In the case of water, Fig. 5 shows the pressure dependence of the BSscore (black), the normalized average efficiency (blue) and the fraction of the liquid range where folding is possible (orange). In this case, the efficiency is maximal at lower pressures, the fraction of the liquid range where folding is possible is maximal at higher pressures and the BSscore varies within a relatively narrow range (0.45 to 0.86), is above 0.7 for pressures above 0.04 atm, and is close to the maximum value (0.85) at 1 atm. Overall, the influence of pressure on the ability of water to support biopolymer folding is modest. Since the overall shape of the phase diagram is expected to be the same for other substances, we propose that this conclusion holds for the other solvents considered here.

Fig. 5.

Fig. 5.

Pressure dependence for biopolymer folding and evolution in water. Pressure dependence of the BSscore (black), the normalized average efficiency (blue), and the fraction of the liquid range where folding and evolution are possible for water (orange).

Discussion

We are still uncertain whether Life happens in other parts of the universe besides our beloved planet Earth. Much research and lots of speculations are being put forward yet few consensus on how Life may look like, and thus how to detect it have been reached (43). At the molecular level, is life the same everywhere? Are there alternative biochemistries? To gain insight into these questions, we propose to abstract out the many known details of terrestrial biochemistry and ask: What are physicochemical characteristics of the solvents that may sustain life that constrain biopolymer folding and evolution?

Energy landscape and molecular information theories allow one to put the fundamentals of biopolymer folding, coding, and evolution under scrutiny given the assumption that these must happen in liquid solvents and some sequence space. We have shown that the physical characteristics of solvents constrain the ranges where folding and evolution are possible. Folding is constrained by the characteristic temperatures of the solvents, and coding is constrained by the efficiency of the energy-to-information conversion. We believe that both must play a significant role in determining the emergence of biopolymers, regardless of their detailed chemical composition.

The ability of most nonaqueous solvents to support biopolymer folding in general is in line with previous experimental results with specific terrestrial biopolymers. For example, several DNA duplexes maintain their structure in glycerol and ethylene glycol (44), and the proteins subtilisin and lysozyme remain folded in glycerol, dioxane, and acetonitrile (32, 4547). Lysozyme can undergo reversible folding in glycerol (48). Moreover, many enzymes retain their catalytic activities in nonaqueous organic solvents (49, 50). It seems likely that most mixtures of the solvents considered here also support biopolymer folding. This opens the door to considering the existence of life in mixed solvents.

We can speculate on whether terrestrial biopolymers (proteins, RNA, and DNA) can evolve, fold, and function in any nonaqueous solvent. Our results suggest that the solvent requirements of evolvability and foldability can be fulfilled in most cases. However, it should also be kept in mind that biopolymer folding and function also require that the folded state is more stable than the unfolded state and soluble in the new solvent (51). Terrestrial proteins are likely to be less stable in polar solvents such as ethanol, and markedly insoluble in nonpolar solvents such as cyclohexane. This is clearly due to the physicochemical properties of the genetically coded terrestrial amino acids in relation to the properties of the solvent (51). Thus, it seems unlikely that the current set of terrestrial amino acids can support life in solvents other than water (51).

Life in nonaqueous solvents would likely be supported by biopolymers that are chemically different from the terrestrial ones (52). There are three main points relevant to the suitability of a protein-like biopolymer to evolve, fold, and sustain life in a given solvent. We can first consider that biopolymer sequences must evolve to the point in which they contain enough information to specify a given fold (6). This minimal amount of information is determined by the configurational entropy change upon folding of the biopolymer in a given solvent (6). The evolutionary choice of a given biopolymer–solvent pair poses an informational requirement for folding. Second, the folding of a biopolymer in a given solvent is associated with a favorable change in the free energy of the system. At the atomic scale, folding involves breaking a myriad of molecular interactions between solvent molecules and between solvent and biopolymer and forming precise intramolecular interactions in the biopolymer. The free energy balance associated with this process is also dictated by the properties of the solvent–biopolymer pair (51). Third, the properties of the solvent determine the efficiency for the conversion of free energy into sequence information. Interestingly, the efficiency expected for different solvents is predicted to be at least half of the maximal one in most solvents examined (Fig. 4). The maximal amount of information that is gained by a biopolymer by folding in a solvent is dictated by the properties of both solvent and biopolymer. If a biopolymer is to evolve and fold in a certain solvent, the amount of information gained by conversion of folding free energy should match the amount of information required to specify its folded conformation (6). Both of these quantities are determined by physicochemical properties of the solvent-biopolymer pair. We speculate that biopolymers associated with alternative solvents must be chemically different from those found in water-based terrestrial life. Alternative chemical structures previously suggested for biopolymers include polyelectrolytes in general (53), protein-like polyesters (54, 55), peptides made from beta- and gamma-amino acids (56), and polymers originating in the opening of common cyclic prebiotic chemicals (57).

The theory presented here is universal for biopolymers that fold and function in regular fluids, those for which we can estimate the dynamic transition temperature from the glass transition temperature. In its present form, the theory can not address several solvents that have been linked to extraterrestrial life. For example, solvents such as ammonia (37) do not have a known dynamic or glass transition in the 40 to 400 K range. Supercritical fluids such as carbon dioxide (58) and biopolymer folding in vacuo (59) also fall out of the scope of the theory. Some naturally occurring biopolymers also challenge the theory because they do not fold in a conventional manner but follow downhill folding (60) because they function through liquid–liquid phase separation (61) or because they are intrinsically disordered (62). Nevertheless, the foldable and evolved biopolymers considered here seem to be highly relevant to Life.

All solvents considered here support biopolymer folding under some conditions, with efficiencies for the conversion of free energy into information in the same order of magnitude (Fig. 3B). This result can be traced back to the correlations between the characteristic temperatures of these substances (SI Appendix, Fig. S1), which can be explained by the law of corresponding states. This law recasts the properties of fluids in terms of their critical points. In this scenario, the properties that vary among fluids determine their critical temperature and pressure. The degree to which all fluids deviate from ideal gas behavior is the same at the same reduced temperature and pressure, which in turn leads to the correlations between the characteristic temperatures (63). The law of corresponding states assumes that intermolecular interactions in the fluid are a function only of the ratio between the intermolecular distance R and a characteristic distance R0 (64). This assumption is accurate for large values of R where the intermolecular potential energy is proportional to R6. To a point, our results can then be traced back to the existence of Van der Waals interactions, which is undoubtedly a universal phenomenon. Substances which are not spherically symmetrical and for which other intermolecular forces are relevant deviate to varying degrees from the law of corresponding states, leading to deviations from the correlations between characteristic temperatures and to the diversity of behaviors shown in Fig. 3. Thus, the differences in efficiency and in the range of liquid temperatures where a solvent supports folding and evolution are fundamentally related to the rotational asymmetry and polarity of its constituent molecules. We have shown here how to derive the basic characteristics expected for biopolymers from these fundamental principles.

Materials and Methods

In estimating Tf/Tg, Tg/Tsel, and efficiency for biopolymer folding in nonaqueous solvents, we utilized known solvent melting and boiling points from literature sources, presented in SI Appendix, Table S1 and SI Text. Tg values were obtained from literature or calculated using established empirical relationships between solvent characteristic temperatures. Additionally, we estimated the dynamic transition temperature for nonaqueous solvents by leveraging linear relationships between glass transition temperatures and characteristic temperatures of various substances, as detailed in SI Appendix, SI Text. Tf values were set within the liquid state temperature range. To assess efficiency, we applied Eq. 3, assuming Tf ~ Tphys as detailed in SI Appendix, SI Text.

Supplementary Material

Appendix 01 (PDF)

pnas.2318905121.sapp.pdf (276.1KB, pdf)

Acknowledgments

This work was supported by the Consejo de Investigaciones Científicas y Técnicas (CONICET) (I.E.S. and D.U.F. are CONICET researchers and E.A.G. is a postdoctoral fellow); CONICET Grant PIP2022-2024—11220210100704CO and UBACYT Grant 20020220200106BA. Additional support from NAI and Grant Number 80NSSC18M0093 Proposal ENIGMA: Evolution of Nanomachines in Geospheres and Microbial Ancestors (NASA Astrobiology Institute Cycle 8). We call the attention of the international scientific community about the potential erosion of Argentina’s strong scientific tradition due to current funding constraints and the sudden termination of long-term policies.

Author contributions

I.E.S., E.A.G., and D.U.F. designed research; performed research; contributed new reagents/analytic tools; analyzed data; and wrote the paper.

Competing interests

The authors declare no competing interest.

Footnotes

This article is a PNAS Direct Submission.

Contributor Information

Ignacio E. Sánchez, Email: isanchez@qb.fcen.uba.ar.

Diego U. Ferreiro, Email: ferreiro@qb.fcen.uba.ar.

Data, Materials, and Software Availability

All study data are included in the article and/or SI Appendix.

Supporting Information

References

  • 1.Runnels C. M., et al. , Folding, assembly, and persistence: The essential nature and origins of biopolymers. J. Mol. Evol. 86, 598–610 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bryngelson J. D., Wolynes P. G., Spin glasses and the statistical mechanics of protein folding. Proc. Natl. Acad. Sci. U.S.A. 84, 7524–7528 (1987). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ferreiro D. U., Komives E. A., Wolynes P. G., Frustration in biomolecules. Quart. Rev. Biophys. 47, 285–363 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Onuchic J. N., Luthey-Schulten Z., Wolynes P. G., Theory of protein folding: The energy landscape perspective. Annu. Rev. Phys. Chem. 48, 545–600 (1997). [DOI] [PubMed] [Google Scholar]
  • 5.Maynard Smith J., Natural selection and the concept of a protein space. Nature 225, 563–564 (1970). [DOI] [PubMed] [Google Scholar]
  • 6.Sánchez I. E., Galpern E. A., Garibaldi M. M., Ferreiro D. U., Molecular information theory meets protein folding. J. Phys. Chem. B 126, 8655–8668 (2022). [DOI] [PubMed] [Google Scholar]
  • 7.Wolynes P. G., “Counterfactual biomolecular physics: Protein folding and molecular recognition in water and other fluid environments” in Water and Life, Lynden-Bell R. M., Morris S. C., Barrow J. D., Finney J. L., Harper C.. Eds. (CRC Press, ed. 1, 2010), pp. 209–214. [Google Scholar]
  • 8.Ballesteros F. J., Fernandez-Soto A., Martínez V. J., Diving into exoplanets: Are water seas the most common? Astrobiology 19, 642–654 (2019). [DOI] [PubMed] [Google Scholar]
  • 9.Pande V. S., Grosberg A. Y., Tanaka T., Statistical mechanics of simple models of protein folding and design. Biophys. J. 73, 3192–3210 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Morcos F., Schafer N. P., Cheng R. R., Onuchic J. N., Wolynes P. G., Coevolutionary information, protein folding landscapes, and the thermodynamics of natural selection. Proc. Natl. Acad. Sci. U.S.A. 111, 12408–12413 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Schneider T. D., 70% efficiency of bistate molecular machines explained by information theory, high dimensional geometry and evolutionary convergence. Nucleic Acids Res. 38, 5995–6006 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wolynes P. G., Evolution, energy landscapes and the paradoxes of protein folding. Biochimie 119, 218–230 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Debenedetti P. G., Supercooled and glassy water. J. Phys.: Condens. Matter 15, R1669–R1726 (2003). [Google Scholar]
  • 14.Sokolov A. P., Hurst J., Quitmann D., Dynamics of supercooled water: Mode-coupling theory approach. Phys. Rev. B 51, 12865–12868 (1995). [DOI] [PubMed] [Google Scholar]
  • 15.Giovambattista N., Loerting T., Lukanov B. R., Starr F. W., Interplay of the glass transition and the liquid-liquid phase transition in water. Sci. Rep. 2, 390 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Gupta M., Chakravarty C., Bandyopadhyay S., Sensitivity of protein glass transition to the choice of water model. J. Chem. Theory Comput. 12, 5643–5655 (2016). [DOI] [PubMed] [Google Scholar]
  • 17.Caliskan G., et al. , Dynamic transition in tRNA is solvent induced. J. Am. Chem. Soc. 128, 32–33 (2006). [DOI] [PubMed] [Google Scholar]
  • 18.Tournier A. L., Smith J. C., Principal components of the protein dynamical transition. Phys. Rev. Lett. 91, 208106 (2003). [DOI] [PubMed] [Google Scholar]
  • 19.Doster W., Cusack S., Petry W., Dynamical transition of myoglobin revealed by inelastic neutron scattering. Nature 337, 754–756 (1989). [DOI] [PubMed] [Google Scholar]
  • 20.Ringe D., Petsko G. A., The ‘glass transition’ in protein dynamics: What it is, why it occurs, and how to exploit it. Biophys. Chem. 105, 667–680 (2003). [DOI] [PubMed] [Google Scholar]
  • 21.Frank A. T., Zhang Q., Al-Hashimi H. M., Andricioaei I., Slowdown of interhelical motions induces a glass transition in RNA. Biophys. J. 108, 2876–2885 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Chu X., Fratini E., Baglioni P., Faraone A., Chen S.-H., Observation of a dynamic crossover in RNA hydration water which triggers a dynamic transition in the biopolymer. Phys. Rev. E 77, 011908 (2008). [DOI] [PubMed] [Google Scholar]
  • 23.Yoon J., Lin J.-C., Hyeon C., Thirumalai D., Dynamical transition and heterogeneous hydration dynamics in RNA. J. Phys. Chem. B 118, 7910–7919 (2014). [DOI] [PubMed] [Google Scholar]
  • 24.Chakraborty K., Bandyopadhyay S., Correlated dynamical crossovers of the hydration layer of a single-stranded DNA oligomer. J. Phys. Chem. B 118, 413–422 (2014). [DOI] [PubMed] [Google Scholar]
  • 25.Norberg J., Nilsson L., Glass transition in DNA from molecular dynamics simulations. Proc. Natl. Acad. Sci. U.S.A. 93, 10173–10176 (1996). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Chen S.-H., et al. , Experimental evidence of fragile-to-strong dynamic crossover in DNA hydration water. J. Chem. Phys. 125, 171103 (2006). [DOI] [PubMed] [Google Scholar]
  • 27.Sokolov A. P., Grimm H., Kisliuk A., Dianoux A. J., Slow relaxation process in DNA. J. Biol. Phys. 27, 313–327 (2001). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Frauenfelder H., Fenimore P. W., Chen G., McMahon B. H., Protein folding is slaved to solvent motions. Proc. Natl. Acad. Sci. U.S.A. 103, 15469–15472 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gault S., Higgins P. M., Cockell C. S., Gillies K., A meta-analysis of the activity, stability, and mutational characteristics of temperature-adapted enzymes. Biosci. Rep. 41, BSR20210336 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Miyazawa S., Selection originating from protein stability/foldability: Relationships between protein folding free energy, sequence ensemble, and fitness. J. Theor. Biol. 433, 21–38 (2017). [DOI] [PubMed] [Google Scholar]
  • 31.Qiu L., Pabit S. A., Roitberg A. E., Hagen S. J., Smaller and faster: The 20-residue Trp-cage protein folds in 4 μs. J. Am. Chem. Soc. 124, 12952–12953 (2002). [DOI] [PubMed] [Google Scholar]
  • 32.Knubovets T., Osterhout J. J., Connolly P. J., Klibanov A. M., Structure, thermostability, and conformational flexibility of hen egg-white lysozyme dissolved in glycerol. Proc. Natl. Acad. Sci. U.S.A. 96, 1262–1267 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Katina N. S., et al. , Apomyoglobin mutants with single point mutations at Val10 can form amyloid structures at permissive temperature. Biochem. Moscow 76, 555–563 (2011). [DOI] [PubMed] [Google Scholar]
  • 34.Cho K. C., Choy C. L., Thermal stability of hemoglobin and myoglobin. Biochim. Biophys. Acta Protein Struct. 622, 320–330 (1980). [DOI] [PubMed] [Google Scholar]
  • 35.Sosic A., et al. , Bis-3-chloropiperidines targeting TAR RNA as a novel strategy to impair the HIV-1 nucleocapsid protein. Molecules 26, 1874 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Li H. J., Brand B., Rotter A., Thermal denaturation of calf thymus DNA: Existence of a GC-richer fraction. Nucl. Acids Res. 1, 257–266 (1974). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.National Research Council (U.S.) et al. , Eds., The Limits of Organic Life in Planetary Systems (National Academies Press, 2007). [Google Scholar]
  • 38.Benner S. A., Ricardo A., Carrigan M. A., Is there a common chemical model for life in the universe? Curr. Opin. Chem. Biol. 8, 672–689 (2004). [DOI] [PubMed] [Google Scholar]
  • 39.Bains W., Many chemistries could be used to build living systems. Astrobiology 4, 137–167 (2004). [DOI] [PubMed] [Google Scholar]
  • 40.Encrenaz T., Water in the solar system. Annu. Rev. Astron. Astrophys. 46, 57–87 (2008). [Google Scholar]
  • 41.Wagner W., Pruß A., The IAPWS formulation 1995 for the thermodynamic properties of ordinary water substance for general and scientific use. J. Phys. Chem. Ref. Data 31, 387–535 (2002). [Google Scholar]
  • 42.Atake T., Angell C. A., Pressure dependence of the glass transition temperature in molecular liquids and plastic crystals. J. Phys. Chem. 83, 3218–3223 (1979). [Google Scholar]
  • 43.Meadows V., et al. , Community report from the biosignatures standards of evidence workshop. arXiv [Preprint] (2022). https://arxiv.org/abs/2210.14293 (Accessed 17 October 2023).
  • 44.Bonner G., Klibanov A. M., Structural stability of DNA in nonaqueous solvents. Biotechnol. Bioeng. 68, 339–344 (2000). [DOI] [PubMed] [Google Scholar]
  • 45.Xu K., Griebenow K., Klibanov A. M., Correlation between catalytic activity and secondary structure of subtilisin dissolved in organic solvents. Biotechnol. Bioeng. 56, 485–491 (1997). [DOI] [PubMed] [Google Scholar]
  • 46.Schmitke J. L., Stern L. J., Klibanov A. M., Comparison of x-ray crystal structures of an acyl-enzyme intermediate of subtilisin Carlsberg formed in anhydrous acetonitrile and in water. Proc. Natl. Acad. Sci. U.S.A. 95, 12918–12923 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Schmitke J. L., Stern L. J., Klibanov A. M., The crystal structure of subtilisin Carlsberg in anhydrous dioxane and its comparison with those in water and acetonitrile. Proc. Natl. Acad. Sci. U.S.A. 94, 4250–4255 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Rariy R. V., Klibanov A. M., Correct protein folding in glycerol. Proc. Natl. Acad. Sci. U.S.A. 94, 13520–13523 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Klibanov A., Enzymatic catalysis in anhydrous organic solvents. Trends in Biochem. Sci. 14, 141–144 (1989). [DOI] [PubMed] [Google Scholar]
  • 50.Clark D. S., Characteristics of nearly dry enzymes in organic solvents: Implications for biocatalysis in the absence of water. Phil. Trans. R. Soc. Lond. B 359, 1299–1307 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Nick Pace C., Treviño S., Prabhakaran E., Martin Scholtz J., Protein structure, stability and solubility in water and other solvents. Phil. Trans. R. Soc. Lond. B 359, 1225–1235 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Saven J. G., Wolynes P. G., Statistical mechanics of the combinatorial synthesis and analysis of folding macromolecules. J. Phys. Chem. B 101, 8375–8389 (1997). [Google Scholar]
  • 53.Benner S. A., Hutter D., Phosphates, DNA, and the search for nonterrean life: A second generation model for genetic molecules. Bioorganic Chem. 30, 62–80 (2002). [DOI] [PubMed] [Google Scholar]
  • 54.Deechongkit S., et al. , Context-dependent contributions of backbone hydrogen bonding to β-sheet folding energetics. Nature 430, 101–105 (2004). [DOI] [PubMed] [Google Scholar]
  • 55.Chandru K., Mamajanov I., Cleaves H. J., Jia T. Z., Polyesters as a model system for building primitive biologies from non-biological prebiotic chemistry. Life 10, 6 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Seebach D., Beck A. K., Bierbaum D. J., The world of β- andγ-peptides comprised of homologated proteinogenic amino acids and other components. Chem. Biodivers. 1, 1111–1239 (2004). [DOI] [PubMed] [Google Scholar]
  • 57.Chandru K., Jia T. Z., Mamajanov I., Bapat N., Cleaves H. J., Prebiotic oligomerization and self-assembly of structurally diverse xenobiological monomers. Sci. Rep. 10, 17560 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Kamat S. V., Beckman E. J., Russell A. J., Enzyme activity in supercritical fluids. Crit. Rev. Biotechnol. 15, 41–71 (1995). [Google Scholar]
  • 59.Wolynes P. G., Biomolecular folding in vacuo!!!(?) Proc. Natl. Acad. Sci. U.S.A. 92, 2426–2427 (1995). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Muñoz V., Campos L. A., Sadqi M., Limited cooperativity in protein folding. Curr. Opin. Struct. Biol. 36, 58–66 (2016). [DOI] [PubMed] [Google Scholar]
  • 61.Brangwynne C. P., Tompa P., Pappu R. V., Polymer physics of intracellular phase transitions. Nat. Phys. 11, 899–904 (2015). [Google Scholar]
  • 62.Uversky V. N., A decade and a half of protein intrinsic disorder: Biology still waits for physics: Protein Intrinsic Disorder. Protein Sci. 22, 693–724 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Yalkowsky S. H., Krzyzaniak J. F., Myrdal P. B., Relationships between melting point and boiling point of organic compounds. Ind. Eng. Chem. Res. 33, 1872–1877 (1994). [Google Scholar]
  • 64.Guggenheim E. A., The principle of corresponding states. J. Chem. Phys. 13, 253–261 (1945). [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix 01 (PDF)

pnas.2318905121.sapp.pdf (276.1KB, pdf)

Data Availability Statement

All study data are included in the article and/or SI Appendix.


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES