Skip to main content
Biophysical Journal logoLink to Biophysical Journal
. 2013 Aug 6;105(3):545–546. doi: 10.1016/j.bpj.2013.06.020

Efficiently Refining a Transition Path Using Clustering

Ian F Thorpe 1,
PMCID: PMC3736680  PMID: 23931301

In this study, Wang et al. (1) use molecular simulations and free energy calculations to characterize the transition pathway between open and closed states in RNA polymerase II. This important enzyme is responsible for the majority of mRNA production in eukaryotes. Wang et al. (1) studied conformational changes of a regulatory element in the enzyme known as the trigger loop (TL). In response to nucleotide triphosphate binding, the TL undergoes structural changes thought to regulate the conversion between open and closed conformational states of the enzyme. Information from conventional all-atom molecular dynamics trajectories, Hamiltonian replica exchange simulations, and targeted molecular dynamics were combined to assemble a connectivity map of the conformational transition (Fig. 1).

Figure 1.

Figure 1

Schematic illustration of the use of clustering to refine a transition pathway. (Topographical map) Hypothetical free energy landscape. (Upper panel) Initial path (dashed line) drawn from the minimum in the upper left to the basin in the upper right. Crosses represent structural data present in snapshots taken from molecular simulations performed on this free energy landscape. The density of crosses is greatest at the location of the free energy minima, which tend to give rise to a larger number of structures that are similar to each other. The increased density of structurally similar conformations at these locations can be identified by clustering the data. This information can then be used to refine the route between the two states (lower panel) to more accurately describe a low free energy transition pathway between them.

The authors find that with a cognate nucleotide triphosphate present, the closed TL conformation is energetically favored by ∼2 kcal/mol over the open conformation. This observation indicates that the closing of TL should be a spontaneous process under these conditions. The authors also identify key structural features of states along the transition pathway. The potential of mean force along the pathway features a stable intermediate between two transition states with significant free energy barriers. The presence of the stable intermediate raises the possibility that partial rather than complete opening may be sufficient to allow completion of the transcription cycle during processive elongation. In this case, it would only be necessary to overcome one of the two barriers observed for elongation to proceed. This would suggest that the open state is only attained during relatively rare events, such as pausing or backtracking of the polymerase. However, the authors note that conclusively answering this question will require additional studies.

One of the notable features of the article is that the authors employ clustering of the protein coordinates in an innovative way to identify intermediates along the transition pathway. In this way, conformational minima (which are by definition low free energy states) are selected as locations to help define the path rather than a path being imposed based on an arbitrary reaction coordinate. Determining relevant transition pathways for biophysical processes is an important component of understanding how these processes occur (2,3). However, such efforts can be challenging and methods to define transition paths continue to be a highly active research area (4,5). In many cases, pathway-finding algorithms can be computationally demanding (6,7).

A benefit of the approach employed by Wang et al. (1) is that it should have modest computational cost because clustering methods have undergone significant algorithmic development in recent years and employ sophisticated techniques to reduce their computational expense (8–10). The authors’ general approach could be employed in a wide variety of contexts and would be particularly useful in complex systems where precisely defining a reaction coordinate can be difficult.

References

  • 1.Wang B., Predeus A.V., Feig M. Energetic and structural details of the trigger loop closing transition in RNA polymerase II. Biophys. J. 2013;105:767–775. doi: 10.1016/j.bpj.2013.05.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Gan W.X., Yang S.C., Roux B. Atomistic view of the conformational activation of Src kinase using the string method with swarms-of-trajectories. Biophys. J. 2009;97:L8–L10. doi: 10.1016/j.bpj.2009.06.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Huang K., García A.E. Free energy of translocating an arginine-rich cell-penetrating peptide across a lipid bilayer suggests pore formation. Biophys. J. 2013;104:412–420. doi: 10.1016/j.bpj.2012.10.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Zhao R.J., Shen J.F., Skeel R.D. Maximum flux transition paths of conformational change. J. Chem. Theory Comput. 2010;6:2411–2423. doi: 10.1021/ct900689m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Dickson B.M., Huang H., Post C.B. Unrestrained computation of free energy along a path. J. Phys. Chem. B. 2012;116:11046–11055. doi: 10.1021/jp304720m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Juraszek J., Bolhuis P.G. Rate constant and reaction coordinate of Trp-cage folding in explicit water. Biophys. J. 2008;95:4246–4257. doi: 10.1529/biophysj.108.136267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Ovchinnikov V., Karplus M., Vanden-Eijnden E. Free energy of conformational transition paths in biomolecules: the string method and its application to myosin VI. J. Chem. Phys. 2011;134:085103. doi: 10.1063/1.3544209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Harder T., Borg M., Hamelryck T. Fast large-scale clustering of protein structures using Gauss integrals. Bioinformatics. 2012;28:510–515. doi: 10.1093/bioinformatics/btr692. [DOI] [PubMed] [Google Scholar]
  • 9.Zhao Y., Sheong F.K., Huang X. A fast parallel clustering algorithm for molecular simulation trajectories. J. Comput. Chem. 2013;34:95–104. doi: 10.1002/jcc.23110. [DOI] [PubMed] [Google Scholar]
  • 10.Wolf A., Kirschner K.N. Principal component and clustering analysis on molecular dynamics data of the ribosomal L11·23S subdomain. J. Mol. Model. 2013;19:539–549. doi: 10.1007/s00894-012-1563-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Biophysical Journal are provided here courtesy of The Biophysical Society

RESOURCES