Abstract
Constructing reduced representations of high-dimensional systems is a fundamental problem in physical chemistry. Many unsupervised machine learning methods can automatically find such low-dimensional representations. However, an often overlooked problem is what high-dimensional representation should be used to describe systems before dimensionality reduction. Here, we address this issue using a recently developed method called the reweighted diffusion map [J. Chem. Theory Comput.2022, 18, 7179–7192]. We show how high-dimensional representations can be quantitatively selected by exploring the spectral decomposition of Markov transition matrices built from data obtained from standard or enhanced sampling atomistic simulations. We demonstrate the performance of the method in several high-dimensional examples.
Machine learning is becoming widely used for performing dimensionality reduction in atomistic simulations and general analysis of high-dimensional systems in physical chemistry. Many unsupervised methods have been developed to extract a few variables encoding macroscopic information about complex processes hidden in high-dimensional observations from simulation data.1−12 The quality of variables resulting from such methods depends heavily on the input data consisting of many configuration variables referred to as features. However, the selection of such high-dimensional representations used subsequently for dimensionality reduction is often overlooked or performed by trial and error.
A possible basis for the interpretable construction of high-dimensional representations in complex systems is the preservation of their time scale separation between slow and fast variables. The slow variables are intrinsically related to the kinetics of rare transitions between long-lived metastable states in configuration space,13,14 which are essential in many processes, for instance, catalysis,15 crystallization,16 or conformational transitions.17,18 The fast variables, however, are adiabatically slaved to the dynamics of the slow variables and correspond mainly to equilibration within metastable states. Therefore, we can consider different representations of the same system equivalent if the same time scale separation characterizes them.
In this Letter, we exploit this idea and develop a method for a quantitative and interpretable selection of the high-dimensional configuration space based on the spectral decomposition of Markov transition matrices from simulation data. Eigendecomposition has a long history of being used to analyze complex high-dimensional spaces, particularly for finding a low-dimensional manifold on which the data reside.19−27 In contrast to such approaches, our method does not use eigenfunctions to parametrize the slow coarse-grained variables. Instead, it iteratively removes variables from the complete high-dimensional representation to find a partial selection of configuration variables while preserving kinetic information about the complete high-dimensional representation.
Let us first introduce the concept of the high-dimensional representation
of physical systems. Each microscopic configuration of the system
is described by n configuration variables (i.e.,
features) x = (x1, ..., xn). In the case of the microscopic
coordinates, the configuration variables are sampled from an equilibrium
probability distribution given by the Boltzmann density p(x) ∝ e–βU(x), where U(x)
is the potential energy of the system, and
is the inverse temperature. However, the
equilibrium probability distribution is usually unknown for other
high-dimensional spaces (e.g., invariant representations).
To estimate the kinetic information encoded in the high-dimensional space, we collect N samples of n configuration variables from a simulation to construct the Markov transition matrix and perform its spectral decomposition. To this aim, a data set consisting of these samples is
| 1 |
where the samples are augmented by statistical weights w if we sample a biased probability distribution, such as in enhanced sampling simulations. The weights are given as follows:
| 2 |
where p(xk) and q(xk) are the unbiased and biased probability distributions at the k-th sample from X, respectively. For unbiased simulations, the weights are reduced to unity as they are sampled from the equilibrium distribution.
Next, let us introduce the reweighted diffusion map.12 We start by constructing an auxiliary Gaussian kernel which encodes information about the local geometry of the configuration space, gε(xk, xl) = exp(−∥xk – xl∥2/2ε2), where xk and xl are n-dimensional samples from the data set (eq 1) and ε is a scale constant which is chosen depending on the data set usually selected so that it matches the distance between neighboring samples. Here, we calculate the scale constant as the median of Euclidean distances ∥·∥ for simplicity; see the Supporting Information for details. Other techniques can also be used when further adjustment of the scale constant is required.28−30
Then, a reweighted anisotropic kernel is introduced to employ additional information about the density and importance of the configuration space:
| 3 |
where ϱ(x) = ∑kgε(x, xk) is up to a multiplicative constant a kernel density estimate. In eq 3, we reweight the anisotropic kernel using r(xk, xl), which is introduced to correct the effect of sampling from the biased probability distribution q (i.e., diffusion reweighting12). The reweighting factor is given as10,12
| 4 |
where w(xk) and w(xl) are the statistical weights corresponding to the k-th and l-th samples, respectively. (See ref (12) for a derivation of diffusion reweighting and a more general discussion.) For unbiased simulations, eq 4 reduces to the anisotropic diffusion kernel used in the diffusion map.23,31
Equation 3 asymptotically corresponds to a reversible, overdamped approximation to the slow dynamics with the unbiased probability p(x) as the stationary density23,31 even if the underlying dynamics proceeds according to the biased probability distribution.12 As such, the reweighted diffusion kernel given in eq 3 is a reasonable approximation for our method.
Having calculated the reweighted anisotropic diffusion kernel (eq 3), we can finally compute the related row-normalized kernel to define the Markov transition matrix M(xk, xl) with the corresponding transition probabilities mkl:
| 5 |
which is equivalent to an unbiased Markov chain given by Pr{xt+1 = xl | xt = xk} that defines a probability that the system transitions from xk to xl in one time step t. Note that the time in the Markov chain is auxiliary and should not be confused with the simulation time step or a time lag. Regardless of whether the data set is sampled from the biased probability distribution, the Markov transition matrix corresponds to the unbiased Markov chain after the reweighting.12
The reweighted diffusion map is very related to the target measure diffusion map introduced by Banisch et al.32 The main difference between both methods lies in their formulation. Here, we employ general statistical weights from an enhanced sampling simulation (eq 2), and the target measure diffusion map uses target probability distributions. Both techniques can be used in our framework. However, working with statistical weights is more suitable for our purposes as, usually, even approximate target measures are unknown.
Here, we use the Markov transition matrix M to determine if the configuration space spanned by a partial selection of the configuration variables contains similar kinetic information as the high-dimensional representation of the system spanned by all its configuration variables. For this, we first perform an eigendecomposition of the Markov transition matrix constructed from the complete set of the configuration variables Mψ = λψ and calculate its eigenvalues {λk} and eigenfunctions {ψk}. The eigenvalues are sorted by decreasing values. The corresponding eigenfunctions contain kinetic information about the system as the eigenvalues are related to the intrinsical time scales of the system.
Next, the eigendecomposition is carried out for combinations of the configuration variables, which define a data set Xd (d is the number of the configuration variables in the partial representation) and compare it to the eigenvalues of the complete high-dimensional representation. To describe how much kinetic information is conserved in the partial representations, we define a spectral loss:
| 6 |
where α is a normalization constant so that the value of the spectral loss for one variable is equal to 1, and λk and λdk are the eigenvalues of the complete high-dimensional representation and the partial representation consisting of d configuration variables, respectively. Therefore, a combination of the configuration variables preserves kinetic information encoded in the complete representation if the spectral loss is close to zero.
Given the data set X of n configuration variables x = (x1, ..., xn) and its spectral decomposition {λk, ψk} of the related Markov transition matrix M, we search for the partial high-dimensional data set Xd of d configuration variables that upon spectral decomposition of its Markov transition matrix into {λdk, ψdk} contains similar kinetic information as the Markov transition matrix calculated from X. To avoid an exhaustive and computationally demanding search through all combinations of the configuration variables, we use an algorithm that provides a suboptimal result.33 Our algorithm is summarized below:
-
(1)
Start from the complete high-dimensional representation.
-
(2)Repeat until the number of selected configuration variables is d:
-
(a)Remove from the data a variable corresponding to the minimal spectral loss upon removing one variable.
-
(b)Add a configuration variable to the data if the spectral loss upon addition back decreases.
-
(c)Go to (a) if adding any configuration variable does not result in decreasing the spectral loss; else, go to (b).
-
(a)
-
(3)
Return the selected d-dimensional representation.
As an initial test for our method, we apply it to four high-dimensional benchmark data sets of dimension n = 10 sampled from multivariate Gaussian distributions that create clusters of samples on vertices of an m-informative hypercube with interdependence between these variables and additional noise. They consist of a different number of informative variables (m = 4, 6, 8, 10). The remaining variables in these data sets are combinations of the informative variables. Further details about these data sets are available in the Supporting Information.
We present our results in Figure 1. By computing the spectral loss in reference to the eigendecomposition of the Markov transition matrix from the data set X, we can see that when the number of informative variables (m) is lower in the partial data sets, the data resembles X much faster in comparison to the complete data set for m = 10; see Figure 1(a). This observation indicates that the method correctly identifies the number of informative variables, as the improvement of the spectral representation of the high-dimensional space upon adding the remaining variables is negligible.
Figure 1.
High-dimensional data sets (n = 10) sampled from multivariate Gaussian distributions with a different number of informative variables (m). For additional information about the benchmark data sets, see the Supporting Information. (a) Spectral loss σd shown for the data sets consisting of m = 4, 6, 8, 10 informative variables as a function of the number of the variables selected as the partial representation of the data set d. Spectral loss is rescaled so that its value for one variable and m = 10 is equal to one. (b) Similarity between the equilibrium distributions calculated as the zeroth eigenfunctions for two selected data sets (m = 4 and m = 10). The eigenfunctions ψd0 are compared to ψ0, i.e., the eigenfunctions of the Markov transition matrix M computed from the complete data set X.
This observation is also supported by checking how the eigenfunctions ψd0 of the partial high-dimensional representations converge to the eigenvalues calculated based on the Markov transition matrix from the data set X; see Figure 1(b). For example, the eigenfunctions ψn0 for the m = 4 informative variables resemble ψ0 much faster. In contrast, for m = 10 informative variables, all the variables must be included in the data set Xd to match the eigenvalues and eigenfunctions calculated from the Markov transition matrix from X.
As a subsequent example, we consider alanine dipeptide in vacuum, which has two main long-lived metastable states that its Φ dihedral angle can separate; see Figure 2(a). As a high-dimensional representation, we select all heavy-atom distances (the number of variables n = 45) monitored during a 100 ns long simulation. For the data set, we use the last 10 ns of the simulation, sampled every 4 ps. The total number of samples used to construct the Markov transition matrix is N = 2500. The simulation is performed using GROMACS 2019.234 patched with a development version of the PLUMED plugin.35,36 The biased data set X is sampled using well-tempered metadynamics37 at 300 K with a bias factor of 5. The fluctuations of the Φ and Ψ dihedral angles of alanine dipeptide are biased. The statistical weights are calculated using the Tiwary–Parrinello reweighting algorithm38 suitable for a time-dependent bias potential. Further details about simulation parameters are available in the Supporting Information.
Figure 2.
Selecting high-dimensional configuration space for alanine dipeptide (Ace-Ala-Nme) in vacuum. (a) Structure of alanine dipeptide with the Φ and Ψ dihedral angles marked. The system is represented by distances between heavy atoms (n = 45) from the data set of N = 2500 samples generated using well-tempered metadynamics biasing Φ and Ψ with a bias factor of 5. For additional information about the simulation, see the Supporting Information. (b) Equilibrium density calculated as the zeroth eigenvector of the Markov transition matrices using the reweighted diffusion map in the space spanned by the Φ and Ψ dihedral angles. (c) Spectral loss σd showing the convergence of the partial selection of the configuration variables to the reference data set at about d ∼ 10 (the number of the configuration variables in the partial representations d is shown on a logarithmic scale). Spectral loss is rescaled so that its value for one variable is equal to one. (d) Eigenvalues {λdk} (k is the index of the eigenvalue) calculated from the partial data sets Xd (d = 1, 2, 3, 10) shown in blue and X shown in magenta. The last shown spectrum for d = 10 is the same as for the complete data set X.
We present the results in Figure 2. We can see in Figure 2(b) that the equilibrium density spanned by the Ψ and Ψ dihedral angles given by the zeroth eigenfunction ψ0 (the left eigenfunction of M approximating stationary distribution of the Markov chain) is qualitatively identical for the partial representation consisting of d = 10 variables and the complete high-dimensional representation (n = 45). These equilibrium densities correctly identify the long-lived metastable states of alanine dipeptide. The spectral loss can also confirm this conclusion quantitatively; see Figure 2(c). We can see that after including d = 10 variables, which give a steep decrease in the spectral loss, the spectral loss remains constant, and the error compared to the complete representation is roughly zero; see inset in Figure 2(c). Adding the remaining configuration variables does not improve the spectral representation.
We can also notice that for d = 10, the eigenvalues match precisely the eigenvalues of the Markov transition matrix calculated on the complete data set X; see Figure 2(d). Additionally, none of the selected configuration variables for d = 10 describe the distances within the same residue, which means that the method picks only those distances relevant for describing the conformational transitions in alanine dipeptide. Instead, four correspond to the distances between Ace and Ala, five describe the distances between Ala and Nme, and one is between Ace and Nme. A list of the selected variables can be seen in Table S1 in the Supporting Information.
As a final example, we consider the folding and unfolding of a ten-residue protein chignolin in solvent; see inset in Figure 3(a). As before, we perform calculations using the GROMACS 2019.2 code34 patched with a development version of the PLUMED(35,36) plugin. We run a 1-μs well-tempered metadynamics37 simulation at 340 K with a bias factor of 5 and select samples every 8 ps from the last 20 ns of the simulation, amounting to the data set consisting of N = 2500 samples. As biased variables to enhance transitions between the folded and unfolded states of chignolin, we choose the distance between Cα atoms of residues Y1 and Y10 and the radius of gyration. The statistical weights are calculated using the Tiwary–Parrinello reweighting algorithm.38 From the resulting trajectory, we calculate the sines and cosines of the backbone Φ and Ψ dihedral angles of chignolin and use them as the complete high-dimensional representation (n = 36 variables in total).
Figure 3.
Selecting high-dimensional configuration space for chignolin in solvent. The system is represented by the sines and cosines of the Φ and Ψ dihedral angles (n = 36). The data set consists of N = 2500 samples generated using well-tempered metadynamics biasing the distance between the Cα atoms of the Y1 and Y10 residues and the radius of gyration with a bias factor of 5 at 340 K. For details, we refer to the Supporting Information. (a) Spectral loss σd for the partial high-dimensional representations of Xd (the number of the configuration variables in the partial representations d is shown on a logarithmic scale). Spectral loss is rescaled so that its value for one variable is equal to one. (d) Eigenvalues {λdk} (k is the index of the eigenvalue) calculated for the partial representations consisting of d = 1, 2, 3, 16 variables. The eigenvalues of the Markov transition matrix from the complete set (magenta) are identical to the representation of d = 16 variables (blue). (c) Eigenfunctions ψ1 and ψ2 colored as a function of the equilibrium eigenfunction ψ0 (colorbar) shown for the partial selection d = 16 and the complete data set d = 36. The eigenfunctions are calculated from the d-dimensional representation. (d) Distributions of the ψ0 and ψ1 eigenfunctions obtained based on the partial (d = 16) and complete selection (d = 36) of the configuration variables.
We present the results in Figure 3. The spectral loss calculated for every partial selection of the configuration variables is shown in Figure 3(a). We can see that the spectral loss gradually decreases, reaching similarity with the complete selection for d > 15. This fact can also be observed in Figure 3(b), where the eigenvalues of the Markov transition matrix calculated for d = 16 are converged in reference to the eigenvalues computed from the complete data set. Moreover, the eigenfunctions ψ1 and ψ2 shown as a function of the equilibrium eigenfunction ψ0 calculated based on the selections of the configuration variables for d = 16 and d = 36 are very similar; see Figure 3(c). Additionally, in Figure 3(d), we show that the distributions of the ψ0 and ψ1 eigenfunctions for the partial selection of the configuration variables (d = 16) and the complete representation (d = 32) are also in agreement. Our calculations show that less than half of the configuration variables in the complete high-dimensional representation is needed to represent the system. Interestingly, the configuration variables selected for d = 16 correspond to every residue of chignolin, showing that the method identifies the variables that can carry the information about the folding and unfolding of chignolin while neglecting the possibly spurious variables. A list of the selected variables can be seen in Table S2 in the Supporting Information.
In this Letter, we present a simple and practical method for selecting high-dimensional representations of complex physical systems sampled by standard and enhanced sampling atomistic simulations. The presented technique is general and requires only simulation data with statistical weights if the data set is generated using an enhanced sampling technique. We use well-tempered metadynamics here for enhanced sampling and the Tiwary–Parrinello reweighting to generate data. However, data can be obtained from, for instance, parallel tempering, which does not require defining variables to bias before the simulation and is easy to reweight. In general, the presented algorithm can be tailored to any enhanced sampling technique; however, a reweighting scheme should be chosen as appropriate for the used method.
The method constructs unbiased Markov transition matrices from simulation data and calculates their spectral decomposition for combinations of the configuration variables comprising the complete high-dimensional representation of the system. The selection algorithm creates a high-dimensional representation by iteratively removing the configuration variables from the data so that the spectral decomposition performed on the partial selection of the configuration variables resembles that of the complete representation. This kinetic equivalence is obtained by minimizing the spectral loss that measures the deviation between the eigenvalues obtained from the considered high-dimensional representations. Since the selected partial high-dimensional representation is interpretable and preserves the time scale separation of the complete representation of the system, it can be used as an initial representation for subsequent construction of the slow variables, ensuring that the high-dimensional representation includes all relevant information about the system dynamics. When the time scale separation is ensured in a partial high-dimensional representation, further dimensionality reduction has the essential information about the dynamics of the studied complex physical system. In conclusion, our method can become a helpful approach to analyzing high-dimensional physical systems and has the potential to be further explored.
Acknowledgments
Funding from the Polish Science Foundation (START), the National Science Center in Poland (Sonata 2021/43/D/ST4/00920, “Statistical Learning of Slow Collective Variables from Atomistic Simulations”), and the Ministry of Science and Higher Education in Poland is acknowledged. We thank Omar Valsson for critically reading the manuscript and Ming Chen for valuable discussions.
Supporting Information Available
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jpclett.3c00265.
Details about simulations and data sets and constructing Markov transition matrices; selected configuration variables for high-dimensional systems (PDF)
The author declares no competing financial interest.
Supplementary Material
References
- Nadler B.; Lafon S.; Coifman R. R.; Kevrekidis I. G. Diffusion Maps, Spectral Clustering and Reaction Coordinates of Dynamical Systems. Appl. Comput. Harmon. Anal. 2006, 21, 113–127. 10.1016/j.acha.2005.07.004. [DOI] [Google Scholar]
- Tribello G. A.; Ceriotti M.; Parrinello M. Using Sketch-Map Coordinates to Analyze and Bias Molecular Dynamics Simulations. Proc. Natl. Acad. Sci. U.S.A. 2012, 109, 5196–5201. 10.1073/pnas.1201152109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hashemian B.; Millán D.; Arroyo M. Modeling and Enhanced Sampling of Molecular Systems with Smooth and Nonlinear Data-Driven Collective Variables. J. Chem. Phys. 2013, 139, 214101. 10.1063/1.4830403. [DOI] [PubMed] [Google Scholar]
- Rohrdanz M. A.; Zheng W.; Clementi C. Discovering Mountain Passes via Torchlight: Methods for the Definition of Reaction Coordinates and Pathways in Complex Macromolecular Reactions. Annu. Rev. Phys. Chem. 2013, 64, 295–316. 10.1146/annurev-physchem-040412-110006. [DOI] [PubMed] [Google Scholar]
- Rydzewski J.; Nowak W. Machine Learning Based Dimensionality Reduction Facilitates Ligand Diffusion Paths Assessment: A Case of Cytochrome P450cam. J. Chem. Theory Comput. 2016, 12, 2110–2120. 10.1021/acs.jctc.6b00212. [DOI] [PubMed] [Google Scholar]
- McCarty J.; Parrinello M. A Variational Conformational Dynamics Approach to the Selection of Collective Variables in Metadynamics. J. Chem. Phys. 2017, 147, 204109. 10.1063/1.4998598. [DOI] [PubMed] [Google Scholar]
- Rydzewski J.; Jakubowski R.; Nowak W.; Grubmuller H. Kinetics of Huperzine A Dissociation from Acetylcholinesterase via Multiple Unbinding Pathways. J. Chem. Theory Comput. 2018, 14, 2843–2851. 10.1021/acs.jctc.8b00173. [DOI] [PubMed] [Google Scholar]
- Zhang J.; Chen M. Unfolding Hidden Barriers by Active Enhanced Sampling. Phys. Rev. Lett. 2018, 121, 010601. 10.1103/PhysRevLett.121.010601. [DOI] [PubMed] [Google Scholar]
- Rodriguez-Nieva J. F.; Scheurer M. S. Identifying Topological Order through Unsupervised Machine Learning. Nat. Phys. 2019, 15, 790–795. 10.1038/s41567-019-0512-x. [DOI] [Google Scholar]
- Rydzewski J.; Valsson O. Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling. J. Phys. Chem. A 2021, 125, 6286–6302. 10.1021/acs.jpca.1c02869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morishita T. Time-Dependent Principal Component Analysis: A Unified Approach to High-Dimensional Data Reduction using Adiabatic Dynamics. J. Chem. Phys. 2021, 155, 134114. 10.1063/5.0061874. [DOI] [PubMed] [Google Scholar]
- Rydzewski J.; Chen M.; Ghosh T. K.; Valsson O. Reweighted Manifold Learning of Collective Variables from Enhanced Sampling Simulations. J. Chem. Theory Comput. 2022, 18, 7179–7192. 10.1021/acs.jctc.2c00873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Valsson O.; Tiwary P.; Parrinello M. Enhancing Important Fluctuations: Rare Events and Metadynamics from a Conceptual Viewpoint. Annu. Rev. Phys. Chem. 2016, 67, 159–184. 10.1146/annurev-physchem-040215-112229. [DOI] [PubMed] [Google Scholar]
- Bussi G.; Laio A. Using Metadynamics to Explore Complex Free-Energy Landscapes. Nat. Rev. Phys. 2020, 2, 200–212. 10.1038/s42254-020-0153-0. [DOI] [Google Scholar]
- Piccini G.; Lee M.-S.; Yuk S. F.; Zhang D.; Collinge G.; Kollias L.; Nguyen M.-T.; Glezakou V.-A.; Rousseau R. Ab Initio Molecular Dynamics with Enhanced Sampling in Heterogeneous Catalysis. Catal. Sci. Technol. 2022, 12, 12–37. 10.1039/D1CY01329G. [DOI] [Google Scholar]
- Neha; Tiwari V.; Mondal S.; Kumari N.; Karmakar T. Collective Variables for Crystallization Simulations–from Early Developments to Recent Advances. ACS Omega 2023, 8, 127–146. 10.1021/acsomega.2c06310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rydzewski J.; Nowak W. Ligand Diffusion in Proteins via Eenhanced Sampling in Molecular Dynamics. Phys. Life Rev. 2017, 22, 58–74. 10.1016/j.plrev.2017.03.003. [DOI] [PubMed] [Google Scholar]
- Van Speybroeck V.; Vandenhaute S.; Hoffman A. E. J.; Rogge S. M. J. Towards Modeling Spatiotemporal Processes in Metal–Organic Frameworks. Trends Chem. 2021, 3, 605–619. 10.1016/j.trechm.2021.04.003. [DOI] [Google Scholar]
- Bérard P.; Besson G.; Gallot S. Embedding Riemannian Manifolds by their Heat Kernel. Geom. Funct. Anal. 1994, 4, 373–398. 10.1007/BF01896401. [DOI] [Google Scholar]
- Belkin M.; Niyogi P. Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering. Advances in Neural Information Processing Systems 2001, 585–591. [Google Scholar]
- Belkin M.; Niyogi P. Laplacian Eigenmaps for Dimensionality Reduction and Data Representation. Neural Comput. 2003, 15, 1373–1396. 10.1162/089976603321780317. [DOI] [Google Scholar]
- Mezić I. Spectral Properties of Dynamical Systems, Model Reduction and Decompositions. Nonlinear Dyn. 2005, 41, 309–325. 10.1007/s11071-005-2824-x. [DOI] [Google Scholar]
- Coifman R. R.; Lafon S.; Lee A. B.; Maggioni M.; Nadler B.; Warner F.; Zucker S. W. Geometric Diffusions as a Tool for Harmonic Analysis and Structure Definition of Data: Diffusion Maps. Proc. Natl. Acad. Sci. U.S.A. 2005, 102, 7426–7431. 10.1073/pnas.0500334102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coifman R. R.; Lafon S. Diffusion Maps. Appl. Comput. Harmon. Anal. 2006, 21, 5–30. 10.1016/j.acha.2006.04.006. [DOI] [Google Scholar]
- Jones P. W.; Maggioni M.; Schul R. Manifold Parametrizations by Eigenfunctions of the Laplacian and Heat Kernels. Proc. Natl. Acad. Sci. U.S.A. 2008, 105, 1803–1808. 10.1073/pnas.0710175104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coifman R. R.; Kevrekidis I. G.; Lafon S.; Maggioni M.; Nadler B. Diffusion Maps, Reduction Coordinates, and Low Dimensional Representation of Stochastic Systems. Multiscale Model. Simul. 2008, 7, 842–864. 10.1137/070696325. [DOI] [Google Scholar]
- Tiwary P.; Berne B. J. Spectral Gap Optimization of Order Parameters for Sampling Complex Molecular Systems. Proc. Natl. Acad. Sci. U.S.A. 2016, 113, 2839. 10.1073/pnas.1600917113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coifman R. R.; Shkolnisky Y.; Sigworth F. J.; Singer A. Graph Laplacian Tomography from Unknown Random Projections. IEEE Trans. Image Process 2008, 17, 1891–1899. 10.1109/TIP.2008.2002305. [DOI] [PubMed] [Google Scholar]
- Berry T.; Harlim J. Variable Bandwidth Diffusion Kernels. Appl. Comput. Harmon. Anal. 2016, 40, 68–96. 10.1016/j.acha.2015.01.001. [DOI] [Google Scholar]
- Lindenbaum O.; Salhov M.; Yeredor A.; Averbuch A. Gaussian Bandwidth Selection for Manifold Learning and Classification. Data Min. Knowl. Discovery 2020, 34, 1676–1712. 10.1007/s10618-020-00692-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Singer A.; Erban R.; Kevrekidis I. G.; Coifman R. R. Detecting Intrinsic Slow Variables in Stochastic Dynamical Systems by Anisotropic Diffusion Maps. Proc. Natl. Acad. Sci. U.S.A. 2009, 106, 16090–16095. 10.1073/pnas.0905547106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Banisch R.; Trstanova Z.; Bittracher A.; Klus S.; Koltai P. Diffusion Maps Tailored to Arbitrary Non-Degenerate Itô Processes. Appl. Comput. Harmon. Anal. 2020, 48, 242–265. 10.1016/j.acha.2018.05.001. [DOI] [Google Scholar]
- Pudil P.; Novovičová J.; Kittler J. Floating Search Methods in Feature Selection. Pattern Recognit. Lett. 1994, 15, 1119–1125. 10.1016/0167-8655(94)90127-9. [DOI] [Google Scholar]
- Abraham M. J.; Murtola T.; Schulz R.; Páll S.; Smith J. C.; Hess B.; Lindahl E. GROMACS: High Performance Molecular Simulations through Multi-Level Parallelism from Laptops to Supercomputers. SoftwareX 2015, 1–2, 19–25. 10.1016/j.softx.2015.06.001. [DOI] [Google Scholar]
- Tribello G. A.; Bonomi M.; Branduardi D.; Camilloni C.; Bussi G. PLUMED 2: New Feathers for an Old Bird. Comput. Phys. Commun. 2014, 185, 604–613. 10.1016/j.cpc.2013.09.018. [DOI] [Google Scholar]
- Promoting Transparency and Reproducibility in Enhanced Molecular Simulations. Nat. Methods 2019, 16, 670–673. 10.1038/s41592-019-0506-8. [DOI] [PubMed] [Google Scholar]
- Barducci A.; Bussi G.; Parrinello M. Well-Tempered Metadynamics: A Smoothly Converging and Tunable Free-Energy Method. Phys. Rev. Lett. 2008, 100, 020603. 10.1103/PhysRevLett.100.020603. [DOI] [PubMed] [Google Scholar]
- Tiwary P.; Parrinello M. A Time-Independent Free Energy Estimator for Metadynamics. J. Phys. Chem. B 2015, 119, 736–742. 10.1021/jp504920s. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




