Abstract
We show that machine learning (ML) can be used to accurately reproduce nonadiabatic excited-state dynamics with decoherence-corrected fewest switches surface hopping in a 1-D model system. We propose to use ML to significantly reduce the simulation time of realistic, high-dimensional systems with good reproduction of observables obtained from reference simulations. Our approach is based on creating approximate ML potentials for each adiabatic state using a small number of training points. We investigate the feasibility of this approach by using adiabatic spin-boson Hamiltonian models of various dimensions as reference methods.
Excited-state dynamics simulations of molecules and molecular assemblies are as important as challenging. Some of the primary processes in nature (photosynthesis, light detection), medicine (phototherapy, DNA damage), and technology (photovoltaics, photonics) have at least one photoinduced reaction step occurring in the excited state.1−3 The main difficulties in modeling these processes arise from the intricacies of excited-state electronic structure and from the intrinsic nonadiabaticity caused by the coupling between nuclear and electronic degrees of freedom driving the time evolution.
Significant advances in the simulation of nonadiabatic dynamics in excited states have been achieved in recent years.4 The development of on-the-fly nonadiabatic mixed quantum-classical (NA-MQC) strategies, in particular, has boosted the research field in the past decade, allowing full-dimensional simulations of systems with tens of atoms for several picoseconds. In these methods, nonadiabatic phenomena are introduced into a classical ensemble of trajectories through averaging, spawning, or hopping of quantum electronic information. At the same time, they rely on a local approximation, allowing for the computation of electronic properties only at the classical nuclear coordinates.
The on-the-fly strategy is a fundamental advantage because it avoids the costly calculation of multidimensional potential energy surfaces (PESs), a task that is the main bottleneck in full quantum approaches. However, the on-the-fly propagation of the dynamics is computationally demanding because expensive quantum mechanical (QM) quantities—energies, forces, and couplings between the electronic states—must be computed at each time step in the numerical integration of the equations of motion. Consequently, an on-the-fly NA-MQC simulation of a medium-sized molecule for several picoseconds may require hundreds of thousands of CPU hours when using first-principles QM methods.
The emergence of machine learning (ML) algorithms has the potential to change this scenario, ideally leading to situations where ML inexpensively predicts excited-state energies, forces, and couplings for on-the-fly NA-MQC dynamics. Encouragingly, ML has already been successfully applied in many atomistic simulations, for example, to represent PESs, to perform molecular dynamics in the ground state, and to predict excited-state properties.5−22 However, the application of ML to on-the-fly NA-MQC dynamics poses unique challenges. Among the most crucial problems is the higher complexity of the excited-state electronic structure, often leading to a high density of coupled states, with a strongly anharmonic dependence on nuclear coordinates. Moreover, in many cases, the nonadiabatic processes happen on time scales shorter than those of thermal equilibration, requiring the propagation of microcanonical rather than canonical ensembles, which are associated with much stricter conservation requirements.
Only few recent studies have attempted to use ML for such purposes. In a pilot study of ML-enhanced NA-MQC dynamics, ML was used only for the representation of the relevant PESs; however, the generation of training points was rather tedious and time-consuming, whereas the number of QM calculations performed during the training of the ML models and during the dynamics was close to the number of QM calculations typically required for a corresponding on-the-fly QM simulation.6 In another study, the accuracy of direct quantum wavepacket dynamics with ML PESs was shown to deteriorate quickly with increasing number of dimensions so that it became problematic to achieve good accuracy even for as few dimensions as six.17
The main aim of our work is to outline how ML can be used to achieve a significant reduction of the number of required QM calculations in practical on-the-fly NA-MQC simulations of high-dimensional systems. For this purpose, we use the popular decoherence-corrected fewest switches surface hopping (DC-FSSH; see the Supporting Information (SI)) approach in on-the-fly NA-MQC dynamics and an ML approach based on kernel ridge regression (KRR; see the SI for details). To avoid any bias associated with the choice of a specific QM method and a target molecule, we decided to use the two-state spin-boson Hamiltonian in the adiabatic representation (A-SBH; see the SI), which is easily adjustable in terms of the number of degrees of freedom and couplings. This choice of an analytical Hamiltonian allows for an extensive and general assessment of ML capabilities because we can compute many more trajectories and time steps than we would be able to do with any on-the-fly electronic structure method. Note, however, that the use of A-SBH does not lead to any loss of generality, as every element of an atomistic two-state simulation is present in this model, and the generalization to more states is straightforward. As discussed later, because of the strong coupling between different A-SBH dimensions, ML-based NA-MQC dynamics may in some aspects be even more challenging for A-SBH than for an atomistic model.
We start by demonstrating for the 1-D A-SBH model that it is possible, in principle, to create a complete ML model, which can accurately reproduce the reference A-SBH trajectory with all hopping events (Figure 1). This is achieved when using at least Ntr = 128 points in the training set (see the SI for further details).
For high-dimensional systems, it is generally not feasible to build such complete ML models. First, generating a sufficient amount of accurate QM reference data quickly becomes too costly due to the curse of dimensionality. For a realistic 33-D model, it would be necessary to calculate reference values for 12833 = 3.45 × 1069 grid points to ensure sampling with the same density as for our complete 1-D ML model (Figure 1).23 This is obviously impossible. Second, processing large amounts of reference data is also very challenging, in terms of both memory requirements and training time. Third, if the number of training points for ML becomes too large, then it may be more reasonable to run pure QM dynamics. Thus, for ML to significantly speed up nonadiabatic dynamics simulations, the training set has to be as small as possible and to be generated as quickly as possible.
In practical terms, our first aim is to keep the number of points in the training set preferably at most 10 000 points. For comparison, ground-state ML potentials have been trained on many fewer molecular geometries and used successfully for various purposes such as molecular dynamics, calculation of vibrational spectra, and geometry optimization.7,14,15 From our experience with on-the-fly NA-MQC dynamics based on ab initio and semiempirical methods, we know that these simulations are typically run with about 100 trajectories with a time step of 0.5 fs for 1 ps; that is, they require 200 000 QM calculations. Therefore, 10 000 points represents merely 5% of a typical on-the-fly NA-MQC project. Moreover, the gains are potentially much larger. Although 100 trajectories are enough to reveal all main reaction pathways, their pathway yields are delivered with rather low precision. However, after training the machine, it can be used to run thousands of trajectories, producing highly precise results, which would simply be unaffordable with conventional QM approaches.
Proposed approaches for generating training sets are often iterative and rather time-consuming.15 Our second aim is to avoid such handicaps and keep the construction of the training set as simple and inexpensive as possible.
On the basis of these considerations, we target a relatively sparse grid of training points, sampled with a low discrepancy algorithm.7,24 Inevitably, this will lead to some loss of accuracy. However, it is known that ML trained on points sampled along vibrational modes can describe larger molecules and also give rather accurate PESs.18 Furthermore, a very accurate ML PES can be obtained for small molecules.7
The most serious additional challenge in the case of DC-FSSH dynamics is that the nonadiabatic couplings feature sharp, narrow spikes around certain geometries (Figure 2). This necessitates very small time steps or special treatments even in the pure QM DC-FSSH dynamics.25 Sparse sampling of ML training points will, in most cases, miss these spikes (Figure 2), and, consequently, no or too few hops will happen during ML dynamics.
Indeed, we find no hops at all in the 2 ps trajectory produced with an ML model of the 1-D system trained on only 16 points (Figure S1) compared with four hops when we calculate nonadiabatic couplings with A-SBH throughout the trajectory (Figure S2). In previous work on ML for FSSH,6 the Zhu–Nakamura approach26 was used to avoid the calculation of nonadiabatic couplings altogether. Here we solve this problem by performing QM (in this work A-SBH) calculations of nonadiabatic couplings instead of estimating them with ML when the band gap estimated with ML is small. A similar approach was tested for the Zhu–Nakamura dynamics for a different reason (to avoid errors of the ML potential in the vicinity of the conical intersections).6 In the case of our 1-D ML model trained on 16 points, five hops occur if we switch on A-SBH calculations for V2est – V1 < 0.03 hartree (Figure S3). We use this cutoff in the following. Problems with sparse sampling concerning total energy conservation are discussed in the SI.
Next, we compare the performance of ML dynamics with A-SBH simulations for a realistic 33-D system. This system is rather challenging for ML, as exemplified by the fact that during a 50 ps A-SBH trajectory the smallest Euclidian distance between a point at a given time step to any point visited during the first 80% of previous time steps remained very large and did not decrease over time. This means that the same region in the high-dimensional space was not visited again during the 50 ps of nonadiabatic dynamics. Consequently, all of our attempts to use on-the-fly or adaptive learning strategies commonly employed in ground-state ML dynamics for atomistic systems16,22 failed for our 33-D A-SBH system. This may be attributed to the strong coupling between different A-SBH dimensions, whereas in atomistic simulations it is often possible to partition the system into smaller, sufficiently independent substructures.13,15,18,22
To collect enough data for statistical analysis, we ran 1000 trajectories with A-SBH and two different ML models, each starting on the S1 surface. The ML models were trained on 1000 and on 10 000 points. The evolution of the S1 state population during 2 ps dynamics is reproduced very well by both ML models (Figure 3) despite the relatively small training sets generated in a simple, noniterative manner. The excited-state lifetime is estimated (see the SI) to be 114 ± 1, 100 ± 1, and 105 ± 1 fs with A-SBH, ML trained on 1000 points, and ML trained on 10 000 points, respectively. The ML lifetimes are thus in reasonable agreement with the A-SBH lifetime. An additional comparative analysis is provided in the SI.
In our ML dynamics, A-SBH calculations during DC-FSSH dynamics were invoked only in 13–16% of time steps for V2est – V1 < 0.03 hartree. Such QM calculations should be avoided in the future altogether by using local diabatization, which eliminates the problem with narrow couplings.27 In this case, the cost of ML nonadiabatic dynamics will be essentially determined by the cost of training set generation.
In summary, we show that ML can be used to simulate accurate, multidimensional nonadiabatic dynamics with significant cost reduction. We suggest using fairly sparse small training sets sampled from the high-dimensional space to build approximate ML potentials for each adiabatic surface. We show that dynamics runs with these ML potentials well reproduce the time evolution of the adiabatic-state population and the excited-state lifetime obtained from the dynamics with the reference method.
Acknowledgments
M.B. acknowledges support from the Excellence Initiative of Aix-Marseille University (A*MIDEX) and the project Equip@Meso (ANR-10-EQPX-29-01), both funded by the French Government “Investissements d’Avenir” program, and funding from the WSPLIT project (ANR-17-CE05-0005-01). W.T. acknowledges support from an ERC Advanced Grant (OMSQC).
Supporting Information Available
The Supporting Information is available free of charge on the ACS Publications website at DOI: 10.1021/acs.jpclett.8b02469.
Description of methods and additional technical details on nonadiabatic dynamics (PDF)
The authors declare no competing financial interest.
Supplementary Material
References
- Gozem S.; Luk H. L.; Schapiro I.; Olivucci M. Theory and Simulation of the Ultrafast Double-Bond Isomerization of Biological Chromophores. Chem. Rev. 2017, 117, 13502–13565. 10.1021/acs.chemrev.7b00177. [DOI] [PubMed] [Google Scholar]
- Akimov A. V.; Prezhdo O. V. Large-Scale Computations in Chemistry: A Bird’s Eye View of a Vibrant Field. Chem. Rev. 2015, 115, 5797–5890. 10.1021/cr500524c. [DOI] [PubMed] [Google Scholar]
- Brunk E.; Rothlisberger U. Mixed Quantum Mechanical/Molecular Mechanical Molecular Dynamics Simulations of Biological Systems in Ground and Electronically Excited States. Chem. Rev. 2015, 115, 6217–6263. 10.1021/cr500628b. [DOI] [PubMed] [Google Scholar]
- Crespo-Otero R.; Barbatti M. Recent Advances and Perspectives on Nonadiabatic Mixed Quantum-Classical Dynamics. Chem. Rev. 2018, 118, 7026–7068. 10.1021/acs.chemrev.7b00577. [DOI] [PubMed] [Google Scholar]
- Brockherde F.; Vogt L.; Li L.; Tuckerman M. E.; Burke K.; Müller K.-R. Bypassing the Kohn-Sham Equations with Machine Learning. Nat. Commun. 2017, 8, 872. 10.1038/s41467-017-00839-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hu D.; Xie Y.; Li X.; Li L.; Lan Z. Inclusion of Machine Learning Kernel Ridge Regression Potential Energy Surfaces in On-the-Fly Nonadiabatic Molecular Dynamics Simulation. J. Phys. Chem. Lett. 2018, 9, 2725–2732. 10.1021/acs.jpclett.8b00684. [DOI] [PubMed] [Google Scholar]
- Dral P. O.; Owens A.; Yurchenko S. N.; Thiel W. Structure-Based Sampling and Self-Correcting Machine Learning for Accurate Calculations of Potential Energy Surfaces and Vibrational Levels. J. Chem. Phys. 2017, 146, 244108. 10.1063/1.4989536. [DOI] [PubMed] [Google Scholar]
- Chmiela S.; Tkatchenko A.; Sauceda H. E.; Poltavsky I.; Schütt K. T.; Müller K.-R. Machine Learning of Accurate Energy-Conserving Molecular Force Fields. Sci. Adv. 2017, 3, e1603015. 10.1126/sciadv.1603015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hansen K.; Montavon G.; Biegler F.; Fazli S.; Rupp M.; Scheffler M.; von Lilienfeld O. A.; Tkatchenko A.; Müller K.-R. Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies. J. Chem. Theory Comput. 2013, 9, 3404–3419. 10.1021/ct400195d. [DOI] [PubMed] [Google Scholar]
- Rupp M. Machine Learning for Quantum Mechanics in a Nutshell. Int. J. Quantum Chem. 2015, 115, 1058–1073. 10.1002/qua.24954. [DOI] [Google Scholar]
- Ramakrishnan R.; Hartmann M.; Tapavicza E.; von Lilienfeld O. A. Electronic Spectra from TDDFT and Machine Learning in Chemical Space. J. Chem. Phys. 2015, 143, 084111. 10.1063/1.4928757. [DOI] [PubMed] [Google Scholar]
- Unke O. T.; Meuwly M. Toolkit for the Construction of Reproducing Kernel-Based Representations of Data: Application to Multidimensional Potential Energy Surfaces. J. Chem. Inf. Model. 2017, 57, 1923–1931. 10.1021/acs.jcim.7b00090. [DOI] [PubMed] [Google Scholar]
- Bartók A. P.; Csányi G. Gaussian Approximation Potentials: A Brief Tutorial Introduction. Int. J. Quantum Chem. 2015, 115, 1051–1057. 10.1002/qua.24927. [DOI] [Google Scholar]
- Denzel A.; Kästner J. Gaussian Process Regression for Geometry Optimization. J. Chem. Phys. 2018, 148, 094114. 10.1063/1.5017103. [DOI] [Google Scholar]
- Gastegger M.; Behler J.; Marquetand P. Machine Learning Molecular Dynamics for the Simulation of Infrared Spectra. Chem. Sci. 2017, 8, 6924–6935. 10.1039/C7SC02267K. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Z.; Kermode J. R.; De Vita A. Molecular Dynamics with On-the-Fly Machine Learning of Quantum-Mechanical Forces. Phys. Rev. Lett. 2015, 114, 096405. 10.1103/PhysRevLett.114.096405. [DOI] [PubMed] [Google Scholar]
- Richings G. W.; Habershon S. Direct Quantum Dynamics Using Grid-Based Wave Function Propagation and Machine-Learned Potential Energy Surfaces. J. Chem. Theory Comput. 2017, 13, 4012–4024. 10.1021/acs.jctc.7b00507. [DOI] [PubMed] [Google Scholar]
- Smith J. S.; Isayev O.; Roitberg A. E. ANI-1: An Extensible Neural Network Potential with DFT Accuracy at Force Field. Computational Cost. Chem. Sci. 2017, 8, 3192–3203. 10.1039/C6SC05720A. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hase F.; Kreisbeck C.; Aspuru-Guzik A. Machine Learning for Quantum Dynamics: Deep Learning of Excitation Energy Transfer Properties. Chem. Sci. 2017, 8, 8419–8426. 10.1039/C7SC03542J. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang L.; Han J.; Wang H.; Car R.; E W. Deep Potential Molecular Dynamics: A Scalable Model with the Accuracy of Quantum Mechanics. Phys. Rev. Lett. 2018, 120, 143001. 10.1103/PhysRevLett.120.143001. [DOI] [PubMed] [Google Scholar]
- Imbalzano G.; Anelli A.; Giofré D.; Klees S.; Behler J.; Ceriotti M. Automatic Selection of Atomic Fingerprints and Reference Configurations for Machine-Learning Potentials. J. Chem. Phys. 2018, 148, 241730. 10.1063/1.5024611. [DOI] [PubMed] [Google Scholar]
- Botu V.; Ramprasad R. Learning Scheme to Predict Atomic Forces and Accelerate Materials Simulations. Phys. Rev. B: Condens. Matter Mater. Phys. 2015, 92, 094306. 10.1103/PhysRevB.92.094306. [DOI] [Google Scholar]
- Hastie T.; Tibshirani R.; Friedman J.. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed.; Springer-Verlag, 2009. [Google Scholar]
- Sobol’ I. M.; Asotsky D.; Kreinin A.; Kucherenko S. Construction and Comparison of High-Dimensional Sobol’ Generators. Wilmott 2011, 2011, 64–79. 10.1002/wilm.10056. [DOI] [Google Scholar]
- Wang L.; Prezhdo O. V. A Simple Solution to the Trivial Crossing Problem in Surface Hopping. J. Phys. Chem. Lett. 2014, 5, 713–719. 10.1021/jz500025c. [DOI] [PubMed] [Google Scholar]
- Zhu C.; Nakamura H. The Two-State Linear Curve Crossing Problems Revisited. III. Analytical Approximations for Stokes Constant and Scattering Matrix: Nonadiabatic Tunneling Case. J. Chem. Phys. 1993, 98, 6208–6222. 10.1063/1.464814. [DOI] [Google Scholar]
- Plasser F.; Granucci G.; Pittner J.; Barbatti M.; Persico M.; Lischka H. Surface Hopping Dynamics Using a Locally Diabatic Formalism: Charge Transfer in the Ethylene Dimer Cation and Excited State Dynamics in the 2-Pyridone Dimer. J. Chem. Phys. 2012, 137, 22A514. 10.1063/1.4738960. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.