Abstract
The solid acids CsH2PO4 and Cs7(H4PO4)(H2PO4)8 pose significant challenges for the simulation of proton transport phenomena. In this work, we use the recently developed machine-learned force field (MLFF) MACE to model the proton dynamics on nanosecond time scales for these systems and compare its performance with long-term ab initio molecular dynamics (AIMD) simulations. The MACE-MP-0 foundation model shows remarkable performance for all observables derived from molecular dynamics (MD) simulations, but minor quantitative discrepancies remain compared to the AIMD reference data. However, we show that minimal fine-tuningfitting to as little as 1 ps of AIMD dataleads to full quantitative agreement between the radial distribution functions of MACE force field and AIMD simulations. In addition, we show that traditional long-term AIMD simulations fail to capture the correct qualitative trends in diffusion coefficients and activation energies for these solid acids due to the limited accessible time scale. In contrast, accurate and convergent diffusion coefficients can be reliably obtained through multinanosecond long MD simulations using machine-learned force fields. The obtained qualitative and quantitative behavior of the converged diffusion coefficients and activation energies now matches the experimental trends for both solid acids, in contrast to previous AIMD simulations that yielded a qualitatively wrong picture.


Introduction
The unsupervised, black-box-like prediction of atomic forces with ab initio accuracy has been a central goal of the machine-learned force field (MLFF) paradigm ever since Behler and Parrinello published their seminal paper on the representation of potential-energy surfaces with neural networks in 2007. Several generations of machine learning approaches, such as feed-forward and graph neural networks, and kernel regression methods, e.g., Gaussian Approximation Potentials (GAP), have been successfully applied to molecular dynamics simulations. − Each of these methods relies on a descriptor that represents the local atomic environment. A variety of different descriptors have been developed in the past, to name but a few: atom centered symmetry functions, the smooth overlap of atomic positions descriptor and moment tensor potential basis functions.
Inspired by the success of foundational models in natural language processing, recent efforts have proposed training MLFFs on large-scale data sets such as the Materials Project database. , Foundational MLFFs can be thought of as fundamental models for atomistic materials chemistry that can generalize across different classes of materials while remaining easily adaptable to specific applications. ,
These advances are driven by innovations in atomic environment descriptors, such as the atomic cluster expansion (ACE), and the integration of equivariance principles as a core design element in graph neural networks. The synergy between the ACE descriptor and the enforcement of equivariance has enabled the development of accurate and generalizable MLFFs, − which can be considered foundation models.
In this paper, we focus on one of the most promising publicly available equivariant MLFFs, and investigate its fine-tuning and transferability on the example of two particularly challenging compounds for simulating proton transport: the solid acids CsH2PO4 (CDP) and Cs7(H4PO4)(H2PO4)8 (CPP). We benchmark the performance of the MACE foundation model and evaluate the amount of fine-tuning required to achieve quantitative agreement between the radial distribution functions of MACE and ab initio molecular dynamics (AIMD) simulations, as well as the degree of transferability of fine-tuned models to closely related material systems. We show how the limited time scales accessible in AIMD simulations prevent the calculation of diffusion coefficients that are even qualitatively consistent with existing experimental data. Furthermore, we demonstrate that MLFF-based MD simulations on the nanosecond time scale yield converged diffusion coefficients that are in agreement with experiments.
Solid acids are inorganic crystalline compounds with molecular formulas similar to CsH y XO4 (X = S, P, Se; y = 1, 2), where protons can act as charge carriers. These compounds undergo a superprotonic phase transition from a low- to a high-temperature phase, which is associated with a drastic increase in proton conductivity. While the low-temperature phases are almost proton insulators, the high-temperature phases are excellent proton conductors. The technological relevance of these materials arises from the fact that they enable water-free proton conduction, which qualifies them as suitable materials for proton exchange fuel cell membranes operating in the medium temperature range. − The most prominent representative of this class of compounds is CDP, which is already used as a membrane material in a commercially available fuel cell and exhibits the superprotonic phase transition (monoclinic to cubic phase) at 509 K. , A similar but chemically more intriguing solid acid, CPP, is derived from CDP by replacing one in eight cesium ions with the unusual tetrahydroxyphosphonium cation H4PO4 , leading to the unique situation of anionic H2PO4 and cationic H4PO4 phosphate groups coexisting in a single crystalline phase. The coexistence was confirmed by 31P NMR and high-temperature X-ray crystal diffraction by the group of Haile, who also synthesized this compound for the first time in 2020. This finding is all the more remarkable because proton transfer between the phosphate groups is possible via a strong and fluctuating hydrogen bond network. We have previously studied the distribution of protons in this compound using AIMD simulations and were able to show that the H4PO4 cation introduces a complicated distribution of protons in CPP and that the different phosphate groups differ in their proton interaction. If only covalently bound protons are considered, the oxygen atoms of the “formally” cationic phosphate groups H4PO4 are bound to a smaller number of protons compared to the oxygen atoms of the anionic phosphate groups H2PO4 . However, the “formal” protonation states can be maintained if both hydrogen and covalently bonded hydrogen atoms are considered. All oxygen atoms in CPP are in a dynamic equilibrium (fluctuating on the sub-ps time scale) between hydrogen and covalently bonded states, which justifies counting the number of covalently and hydrogen bonded protons per oxygen atom followed by division of this number by two, as each proton is always involved in one hydrogen and one covalent bond. Following this approach, the ‘formal’ protonation states H4PO4 /H2PO4 are obtained. The crystal structures of CDP and CPP, with the anionic H2PO4 and cationic H4PO4 phosphate groups highlighted by green and cyan tetrahedra, are shown in Figures a and a, respectively.
1.
(a) Snapshot from an AIMD simulation of CDP. Color scheme for atoms: hydrogen in blue, oxygen in red, phosphorus in green, and cesium in pink. (b) Comparison of the O–H radial distribution function g(r) obtained from different MACE models (see main text) with g(r) from an AIMD simulation. The inset highlights the peak of g(r) at d OH = 1.5 Å, which is commonly referred to as the ”short strong hydrogen bond” or ”low/barrier hydrogen bond”. − (c) Histogram showing the similarity coefficients SC as defined by eq with respect to the AIMD results, i.e., SC[MACE; AIMD] for all possible bond combinations for the three selected MACE models from b, see legend. The average SC is indicated by vertical dashed lines in the corresponding color.
2.
(a) Snapshot from an AIMD simulation of CPP. Anionic H2PO4 tetrahedra are highlighted in green, and cationic H4PO4 tetrahedra are highlighted in yellow. Color scheme for atoms: hydrogen in blue, oxygen in red, phosphorus in green/yellow (anionic/cationic), and cesium in pink. (b) Comparison of the anionic O–H radial distribution function g(r) obtained from different MACE models (see main text) with g(r) from an AIMD simulation. The inset highlights the peak of g(r) at d OH = 1.5 Å, which is commonly referred to as the “short-strong hydrogen bond” or “low-barrier hydrogen bond”. (c) Same as (b) but for the cationic O–H radial distribution function.
The proper description of these materials using MD simulations faces significant challenges. As discussed in ref . AIMD simulations are not able to predict the correct qualitative trend of the diffusion coefficients. Although the accuracy of the chosen level of theory (here: GGA-DFT) may be questionable, this discrepancy is most likely due to the short time scales accessible to AIMD simulations. In addition, classical force fields are unlikely to effectively capture the complex dynamics of these systems, where the anions and cations exhibit solid-like behavior, while the rotational dynamics of the phosphate groups are comparable to the liquid state. Since the resulting fluctuating hydrogen bond network is the origin of the extraordinarily high proton conductivity in these compounds, its proper treatment is critical. Promising alternatives to overcome these problems are multiscale methods, which combine, for example, MD simulations with Monte Carlo approaches. − Another approach is MLFFs, which aim to combine the speed of classical force fields with the precision of AIMD. For the foundation model approach regarding MLFFs, another issue arises: To our knowledge, the tetrahydroxyphosphonium cation H4PO4 has been described only 5 times in the literature − and is therefore not or rarely included in any database used for the initial training of a foundation model. This leads to the four main questions of this work: (a) How well do foundation models perform for solid acids? (b) How much AIMD data is needed to fine-tune MLFFs for solid acids? (c) How transferable is a fine-tuned model within its chemical class? (d) Is an adequate description of the transport properties of these highly complex materials finally possible?
To investigate these questions, we have performed long-term AIMD and MLFF simulations using the MACE-MP-0 model with or without the empirical D3 dispersion correction (referred to as F and F + D3 in the following) and several fine-tuned versions of the MACE-MP-0 model for both compounds. We fine-tune the MACE-MP-0 on different numbers of snapshots: every 10th snapshot for 0.01 , 0.1 and 1 ps, and every 100th snapshot from a 228 ps trajectory of CDP and 233 ps trajectory of CPP. We name these models accordingly, i.e., FT@X with X ∈ {0.01, 0.1, 1, 228/233 ps}. This corresponds to 2, 20, 200 and 456/466 snapshots used for fine-tuning, respectively. Note that to investigate the transferability of the MLFF models, we will apply the fully fine-tuned MACE model for CPP (FT@233 ps) to molecular dynamics simulations of CDP, and vice versa. In the context of this transferability discussion, we refer to FT@228 ps as FT@CDP and FT@233 ps as FT@CPP. The time step for all MD simulation was set to 0.5 fs.
Methods
AIMD Simulations
We performed ab initio molecular dynamics simulations (Born–Oppenheimer MD scheme) using the CP2K program package to describe the solid acids CDP and CPP. We utilized the Quickstep module and orbital transformation for faster convergence. The electronic structure was calculated with density functional theory utilizing the PBE functional. A basis set of the type DZVP-MOLOPT-SR-GTH and GTH pseudopotentials , were applied. Furthermore, we used the empirical dispersion correction (D3) to improve the accuracy of the trajectories. The temperature was set by a Nosé–Hoover Chain thermostat − (NVT ensemble). All AIMD simulations were performed with a time step of 0.5 fs. We have performed AIMD simulations for 420 ps for CDP at 510 K and for 475 ps for CPP at 513 K. The simulated systems contained 512 atoms (including 128 H atoms) for CDP and 576 atoms (including 160 H atoms) for CPP. The dimensions of the simulation box and the starting configurations of the systems were obtained from crystal structure data from the literature. ,
MACE Fine-Tuning
Starting from the MACE-MP-0 foundation model, we have created several fine-tuned models for CDP and CPP using the MACE python package (version 0.3.6). To provide reference data for refinement, we recalculated energies and forces using CP2K on different numbers of frames from the AIMD trajectories. Compared to the settings for the AIMD simulation, we only switched off the Grimme dispersion correction, as this correction is also available in the MACE python package and can be used optionally during the production run. For both compounds (CDP and CPP) we used every 10th snapshot from a 0.01, 0.1, and 1 ps trajectory, resulting in 2, 20, and 200 snapshots for fine-tuning. The resulting models were named by FT@X with X ∈ {0.01 ps, 0.1 ps, 1 ps, 228 ps/233 ps}. The full fine-tuned models were obtained by retraining the MACE-MP-0 foundation model on every 100th snapshot of a 228 ps (233 ps) trajectory for CDP (CPP). The training error is reported in Supporting Note 2. We fine-tuned on both energies and forces and used the following hyperparameters for fine-tuning: Learning rate: 0.01; Number of epochs: 200; Training batch size: 5; Force error weight: 10; Energy error weight: 0.1.
MACE MD Simulations
We used the ASE Python package to perform MD simulations using the MACE models for force predictions. For the fully fine-tuned models, we performed 10 ns simulation runs of CDP and CPP at 510, 535, 560, 585, 610 and 650 K to investigate trends in the diffusion coefficients. The time step for these simulations was set to 0.5 fs and a Langevin thermostat was used to maintain a constant temperature.
MSD and Diffusion Coefficients
The mean square displacement MSD(τ) is calculated according to
| 1 |
Here, R denotes the position of the hydrogen atoms and the double average ⟨⟨·⟩ H ⟩ t 0 corresponds to an averaging over all hydrogen atoms H as well as an averaging over all possible starting times t 0 within the trajectory. For t 0, a large number of time points along the total simulation is taken, in order to avoid a bias due to a particular choice of the initial configuration (the “zero” in time) for the actual mean square displacement ⟨|R(0 + τ) – R(0)|2⟩ H .
Phosphor-Oxygen Vector Autocorrelation Function
The phosphor-oxygen vector autocorrelation function c(τ) is defined via the unit vector along the chemical bond between P and one of its O atoms R PO according to
| 2 |
where the double average ⟨⟨·⟩ O ⟩ t 0 corresponds to an averaging over all oxygen atoms O as well as an averaging over all possible starting times t 0 within the trajectory. For t 0, a large number of time points along the total simulation is taken, in order to avoid a bias due to a particular choice of the initial configuration (the “zero” in time) for the actual autocorrelation function ⟨R PO (0) · R PO (0 + τ)⟩ O .
Results
The fine-tuned models exhibit very small force errors on the test set, e.g., around 35 meV Å–1 per atom for the fully fine-tuned models. The test set was constructed by selecting equally spaced frames from the extended MD trajectories generated using the fully fine-tuned MACE models. Forces and energies calculated for these frames via DFT were compared to those predicted by various MACE models. Detailed information on the test set errors are given in Supporting Note 3. However, as was recently discussed in ref ., better accuracy in the forces does not always lead to accuracy in the trajectories and derived properties. Therefore, we decide to evaluate the performance of the MLFF models by comparing radial distribution functions (RDF) from AIMD and MLFF simulations at 510 K. This temperature was chosen because CDP undergoes its superprotonic phase transition at 509 K. We calculated the RDFs for all possible atomic combinations found in CDP. The most important RDF in terms of predicting proton mobility, the OH-RDF, is shown in Figure b. The OH-RDF predicted by each MACE model, i.e., both fine-tuned and foundation models, are in excellent agreement with the RDF obtained from our AIMD simulation. Focusing on the region of the OH-RDF associated with “short-strong” or “low-barrier” hydrogen bonds (d OH ∈ [1.25, 1.75] Å), − notable differences between the predictions of the fine-tuned and foundation models become apparent (see inset in Figure b). This region of the OH-RDF is particularly critical for the accurate description of proton transport phenomena in solid acids, as proton jumps within these distances are characterized by low activation energies, making them highly important for the overall process. While the foundation model shows deviations from the AIMD reference data in this region, the fine-tuned models achieve a high degree of quantitative accuracy, effectively capturing the key features of the RDF. Remarkably, the quantitative agreement of the fine-tuned MACE models is given for all levels of fine-tuning, i.e., even the fine-tuned models obtained by training on only one frame (FT@0.01 ps) yield OH-RDFs with excellent accuracy. Comparing F and F + D3 in Supporting Figure 1, we note that the inclusion of the D3 dispersion correction does not significantly change the RDFs. This is due to the fact that the intermolecular interactions of the present system are dominated by strong hydrogen bonds, for which dispersion forces do not play a major role.
To quantify the agreement between RDFs calculated from AIMD and MACE models, we use a similarity coefficient SC previously used in other contexts, , which is defined as follows
| 3 |
where g(r), g̃(r) are the radial distribution functions to be compared. The SC value equals 1 for a perfect match between the functions being compared and decreases, potentially toward negative infinity, as the two functions become increasingly dissimilar. The distribution of similarity coefficients and their mean value for three MACE models (F, FT@228 ps, FT@CPP) are shown in Figure c. Even the worst similarity coefficients for each of the six variants of the MACE model considered are all above 0.85, the worst being SC = 0.864 for the CsP-RDF of the foundation model. All fine-tuned models significantly outperform the foundation model, with all SC ≥ 0.92. The very good agreement between the RDFs obtained by AIMD and each variant of the MACE model can be seen in detail in Supporting Table 1.
We repeat this procedure for CPP, where the longest AIMD trajectory used for fine-tuning is 233 ps long. The OH-RDF between anionic (Oan) and cationic (Ocat) oxygen atoms and hydrogen atoms for CPP is shown in Figure b,c, respectively. Again, the foundation and fine-tuned MACE models are compared with the AIMD. Just as with CDP, we find good qualitative agreement for the foundation model and excellent quantitative agreement for the fine-tuned model. The general shape of the OanH-RDF of CPP is similar to the shape of the OH-RDF of CDP. These RDFs are prototypes for OH-RDFs of excellent proton conductors, which form short and very strong hydrogen bonds. In contrast to the OanH-RDF, the OcatH-RDF differs significantly. The area of the peak corresponding to the “short-strong” hydrogen bonds (d OH ∈ [1.25, 1.75] Å) is larger than that of the covalently bonded hydrogen atoms (d OH ∈ [0.9, 1.2] Å), i.e., more H atoms are bonded to the cationic oxygen atoms by a hydrogen bond than by a covalent bond. If we integrate the OcatH-RDF, we find that there are a total of two protons bound to each oxygen atom, either by a hydrogen or a covalent bond. Thus, each oxygen atom of the cationic phosphate groups is involved in a bifurcated hydrogen bond. This remarkable protonation state is associated with the unusual tetrahydroxyphosphonium cation H4PO4 , which, as mentioned above, is reported only about 5 times in the literature. − Thus, the good qualitative agreement of the foundation model is therefore quite remarkable.
We recall that the atomistic structures of CDP and CPP are closely related. They differ only in the substitution of one in every eight Cs+ ions by the unusual tetrahydroxyphosphonium cation. This observation motivated us to use the fully fine-tuned model for CDP (denoted as FT@CDP) to calculate the RDFs of CPP, see Figure . The OanH-RDFs and the OcatH RDFs are in full agreement with the RDFs obtained from the fully fine-tuned model for CPP (FT@233 ps) and also with the RDFs from the AIMD simulation. The similarity coefficients for the RDFs obtained from all possible atomic combinations in CPP are shown in Supporting Table 2, further underlining the results from the particular example of the OH-RDFs. Applying the model fine-tuned on CPP (denoted as FT@CPP) to CDP works similarly well, see Figure . This is also a remarkable result: For the chemically closely related systems we have studied, fine-tuning on only one compound is required. The resulting model can be generalized to the other compound.
From our discussion of the RDFs of CDP and CPP obtained from AIMD and various MACE model simulations, we can now answer the first three questions posed above: (1) Foundation models perform remarkably well even for exotic solid acids, but they do not provide full quantitative agreement, which may have a noticeable impact on derived properties. (2) Fine-tuning to quantitative agreement is possible with a small number of AIMD snapshots, in extreme cases even a single snapshot, although increasing the number of snapshots improves quantitative agreement. This raises interesting questions about the future need for long-term AIMD simulations. (3) For the compounds investigated in this work fine-tuned MLFFs are transferable. This is somewhat surprising, because even though the compounds are similar, they feature distinct chemical environments, in particular the coexistence of the common phosphate anion with the exotic tetrahydroxyphosphonium cation in CPP.
We now turn to the final question: Can we obtain “correct” transport properties for solid acids? Since solid acids are potential membrane materials for fuel cells, the diffusion coefficients are of much more general interest than the RDFs. The Grotthuss diffusion mechanism of protons in solid acids consist of an alternating sequence of proton jumps between neighboring anions and rotations of these anions. Both processes are crucial and equally important for long-range proton transfer. Only a balanced interplay of both contributions allows efficient proton transport. Without anion rotation, proton motion would be limited to rattling between two neighboring oxygen atoms. Without proton jumps between neighboring anions, no long-range proton transfer could be observed at all, since the centers of mass of anions and cations are fixed due to the crystallinity of the solid acids. The characteristic time scale of phosphate rotation in these compounds is on the order of several hundred picoseconds. , These time scales (1) represent the minimum length of an MD simulation for calculating converged diffusion coefficients, (2) are difficult (or impossible) to achieve with AIMD simulations, (3) but are easily accessible with MLFFs.
We have calculated the diffusion coefficients from multinanosecond MACE simulations using the fully fine-tuned models (FT@228 ps for CDP and FT@233 ps for CPP), see details in the Methods section. For the sake of comparability, we have also performed AIMD simulations for exceptionally long time scales (CDP: 420 ps, CPP: 475 ps). The calculation of these trajectories required a significant amount of computational resources. For example, we used about 5 × 105 CPU hours to compute the 475 ps of CPP, which is equivalent to 12 months on a 64-core compute node. It is obviously not possible to run such long AIMD simulations in standard workflows for predicting the properties of a large number of materials. However, the 10 ns MACE trajectories were calculated well within 10 days on a single computing node with an NVIDIA A100 graphics carda speedup by a factor of around 700.
Figure visualizes the diffusion coefficients as a function of trajectory length. Starting from the 100 ps AIMD simulation (marking the beginning of the lines in Figure ), the diffusion coefficient computed over about 12 months on a 64-core CPU differs significantly (by about a factor of 4) from the converged values obtained with much longer MACE simulations. Convergence of the diffusion coefficients was achieved after about 5 ns for both solid acids. Such long time frames are obviously inaccessible with any reasonable amount of computational resources using AIMD simulations for the system sizes studied. The agreement between the diffusion coefficient obtained from very long AIMD simulations and the fully fine-tuned MACE model is convincing. However, the direct comparison of these values is problematic due to the limited time scales of the AIMD simulations.
3.
Visualization of the convergence behavior of the hydrogen diffusion coefficient with respect to simulation time for CDP and CPP. AIMD simulations at 513 K and fully fine-tuned mace simulations at 510 K are compared. Convergence of the diffusion coefficient is achieved only for time scales above 5 ns, which are not accessible by AIMD simulations. The overall diffusion coefficient (represented by solid and dashed lines) is computed as the average of the diffusion coefficients of all individual protons. The shaded area around each line represents the standard deviation calculated for these sets of individual diffusion coefficients.
Discussion
The converged diffusion coefficient values for the fully finetuned models, especially for CPP, finally resolve an open question. On the short AIMD time scales, a smaller diffusion coefficient of CDP compared to CPP was predicted at 510 K, and the diffusion coefficient of CPP did not differ significantly between 410 and 510 K. These predictions were in direct contradiction with the conductivity measurements (e.g., see Figure 10 in ref . and Figure 8 in ref .) and indicated a qualitative wrong trend for the activation energy. If we analyze the diffusion coefficients obtained from the converged MACE simulations, with trajectory lengths longer than 5 ns, we obtain a significantly altered picture, in agreement with the experimental trends for the diffusion coefficients. While the diffusion coefficients obtained from the long-term AIMD and converged (even longer running) MACE simulations are in good agreement for CDP, they differ by a factor of 4 for CPP. These deviations are the origin of the qualitatively wrong picture we get from the AIMD simulations. In contrast, the activation energies for proton diffusion obtained from the MACE simulations (CDP: 0.40(1) eV, CPP: 0.65(6) eV) are in very good agreement with the experimental activation energies (CDP: 0.40 eV, CPP: 0.65 eV). , Details of the calculation of the activation energies are given in Supporting Note 4.
Using the extended time scales accessible with MLFF simulations, we can not only report but also elucidate the observed trends in the diffusion coefficient by analyzing the PO-vector autocorrelation function (Figure ). For both compounds, the PO-vector autocorrelation exhibits a rapid initial decay from 1 to 0.8 for t < 5 ps, a change attributable exclusively to small angular oscillations. Only at longer simulation times actual phosphate rotation processes become apparent. Notably, the long-term decay of the PO-vector autocorrelation function is significantly faster for CDP than for CPP. This finding is further supported by the rotational time constants derived from a biexponential fit of the autocorrelation function. In both compounds, the fit reveals two distinct time constants: a short one (approximately 5 ps) corresponding to small angular oscillations, which is consistent between the compounds, and a longer one that describes actual phosphate rotation and is considerably greater for CPP (approximately 2 ns) than for CDP (approximately 800 ps).
4.

PO-vector autocorrelation function for the PO4 groups in CDP and CPP. The decay of the autocorrelation function reflects the rotation dynamics of the PO4 groups. A faster decay observed for CDP indicates a higher PO4 rotation rate, which contributes to its increased diffusion coefficient. Details for the calculation of the PO-vector autocorrelation function are given in the Methods section.
The magnitude of the diffusion coefficient can be estimated from two fundamental contributions: (1) the proton rattling frequency and (2) the anion rotation frequency. In a previous study, we demonstrated that the proton rattling frequencydefined as the rate at which a proton changes its nearest oxygen neighbor within a hydrogen bondis similar for both CPP and CDP, and thus cannot explain the difference in the diffusion coefficients. We have also verified that these values are converged on the AIMD time scale and that the proton transfer rates obtained from MLFF and AIMD simulations are in good agreement. Detailed numerical values for the proton rattling rates are provided in the SI. By combining the similar proton rattling rates with the increased rotation rates of the phosphate groups in CDP compared to CPP, we can explain the enhanced diffusion coefficient observed for CDP (at 510 K).
A closer examination of the inset in Figure reveals that accurately capturing the long-term decay of the autocorrelation functions requires very long simulations. The decays observed over the limited time scales accessible by AIMD simulations (between 15 and 40 ps) can be misleading. This example clearly illustrates that, despite improved convergence, the extended time scales provided by MLFF simulations enable new physical insights into the material properties.
Conclusions
In summary, our work highlights the importance of fine-tuning MLFFs to achieve accurate quantitative descriptions of complex proton transport dynamics in solid acids. While the MACE foundation model qualitatively describes all RDFs within this class very well, fine-tuning is essential for precise quantitative agreement with the AIMD reference data. Within the similar compounds investigated in this work, fine-tuned MLFFs are transferable. Using the fine-tuned MACE models, we were finally able to reconcile theoretical predictions with experimental trends for proton diffusion coefficients, especially for CPP. This underscores the utility of the MLFF MACE in advancing accurate simulations of diffusion phenomena in complex materials, especially solid acids. Arguably, these results open the possibility of studying diffusion processes in larger systems at different temperatures and pressures, and potentially indicate that AIMD simulations are now “just” a tool for generating training data for MLFFs. In conclusion, our work examines the performance of state-of-the-art foundational MLFF models on highly complex solid acids, provides insight into their fine-tunability and transferability, and lays the groundwork for future studies of diffusion phenomena across diverse material systems.
Supplementary Material
Acknowledgments
We thank the staff of the Compute Center of the Technische Universität Ilmenau and especially Mr. Henning Schwanbeck for providing an excellent research environment. We gratefully acknowledge funding from the Thringer Aufbaubank (TAB) (KapMemLyse, grant no. 2024 FGR 0081/0082) and the European Social Fund Plus (ESF+).
The data supporting the findings of this study have been included as reference material to the reviewers and will be made publicly available upon publication. Training data used for fine-tuning the foundation model is already openly available at the following URL/DOI: 10.5281/zenodo.14914554. The workflow utilized to produce and postprocess the results presented in this study are available from the corresponding author upon reasonable request.
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jpcc.5c02064.
Additional figures visualizing the radial distribution functions (RDFs) obtained from molecular-dynamics simulations with various MACE models, including comparisons of anionic and cationic O–H peaks and the corresponding similarity coefficients quantifying agreement with ab initio molecular-dynamics (AIMD) results; tables of similarity coefficients (SC) for all bond-pair combinations in both CDP and CPP systems, comparing MACE foundation and fine-tuned models; tables summarizing root-mean-square error (RMSE) metrics for energy and force predictions of each MACE model after training for CDP and CPP; tables comparing MACE-predicted energies and forces against density-functional theory (DFT) reference calculations for CDP and CPP; a figure illustrating the Arrhenius analysis of temperature-dependent diffusion coefficients used to extract the proton-diffusion activation energy; and a table reporting the average number of proton jumps per hydrogen atom per picosecond in both AIMD and fine-tuned MACE simulations (PDF)
∥.
M.Gru. and M.Gro. contributed equally to this work; M.Gru., M.Gro., and C.D. conceived the idea; M.Gru. fine-tuned the MACE models based on AIMD simulation from J.H. and C.D.; C.D ran the MACE MD simulation; M.Gru., M.Gro., and C.D. analyzed the data; M.Gro. visualized the results; M.Gru., M.Gro., and C.D. wrote the manuscript; J.L.W. created scripts for assembling data for fine-tuning; J.O. and A.F. carried out preliminary calculations as a starting point for this work; C.D. and E.R. supervised the work; all authors modified and approved the manuscript.
The authors declare no competing financial interest.
References
- Behler J., Parrinello M.. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 2007;98:146401. doi: 10.1103/PhysRevLett.98.146401. [DOI] [PubMed] [Google Scholar]
- Bartók A. P., Payne M. C., Kondor R., Csnyi G.. Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 2010;104:136403. doi: 10.1103/PhysRevLett.104.136403. [DOI] [PubMed] [Google Scholar]
- Unke O. T., Chmiela S., Sauceda H. E., Gastegger M., Poltavsky I., Schøutt K. T., Tkatchenko A., Møller K. R.. Machine Learning Force Fields. Chem. Rev. 2021;121:10142–10186. doi: 10.1021/acs.chemrev.0c01111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friederich P., Häse F., Proppe J., Aspuru-Guzik A.. Machine-learned potentials for next-generation matter simulations. Nat. Mater. 2021;20:750–761. doi: 10.1038/s41563-020-0777-6. [DOI] [PubMed] [Google Scholar]
- Reiser P., Neubert M., Eberhard A., Torresi L., Zhou C., Shao C., Metni H., van Hoesel C., Schopmans H., Sommer T., Friederich P.. Graph neural networks for materials science and chemistry. Commun. Mater. 2022;3:93. doi: 10.1038/s43246-022-00315-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shapeev A. V.. Moment Tensor Potentials: A Class of Systematically Improvable Interatomic Potentials. Multiscale Model. Sim. 2016;14:1153–1173. doi: 10.1137/15M1054183. [DOI] [Google Scholar]
- Jain A., Ong S. P., Hautier G., Chen W., Richards W. D., Dacek S., Cholia S., Gunter D., Skinner D., Ceder G., Persson K. A.. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL Mater. 2013;1:011002. doi: 10.1063/1.4812323. [DOI] [Google Scholar]
- Ong S. P., Cholia S., Jain A., Brafman M., Gunter D., Ceder G., Persson K. A.. The Materials Application Programming Interface (API): A simple, flexible and efficient API for materials data based on REpresentational State Transfer (REST) principles. Comput. Mater. Sci. 2015;97:209–215. doi: 10.1016/j.commatsci.2014.10.037. [DOI] [Google Scholar]
- Batatia, I. ; Benner, P. ; Chiang, Y. ; Elena, A. M. ; Kov́acs, D. P. ; Riebesell, J. ; Advincula, X. R. ; Asta, M. ; Avaylon, M. ; Baldwin, W. J. . et al. A Foundation Model for Atomistic Materials Chemistry. 2024, arXiv:2401.00096. arXiv.org e-Printarchive. https://arxiv.org/abs/2401.00096.
- Yang, H. ; Hu, C. ; Zhou, Y. ; Liu, X. ; Shi, Y. ; Li, J. ; Li, G. ; Chen, Z. ; Chen, S. ; Zeni, C. . et al. MatterSim: A Deep Learning Atomistic Model Across Elements, Temperatures and Pressures. 2024, arXiv:2405.04967. arXiv.org e-Printarchive. https://arxiv.org/abs/2405.04967.
- Drautz R.. Atomic cluster expansion for accurate and transferable interatomic potentials. Phys. Rev. B. 2019;99:014104. doi: 10.1103/PhysRevB.99.014104. [DOI] [Google Scholar]
- Batatia, I. ; Kovacs, D. P. ; Simm, G. ; Ortner, C. ; Csanyi, G. . Mace: Higher Order Equivariant Message Passing Neural Networks for Fast and Accurate Force Fields. In Advances in Neural Information Processing Systems; Koyejo, S. ; Mohamed, S. ; Agarwal, A. ; Belgrave, D. ; Cho, K. ; Oh, A. , Eds.; Curran Associates, Inc., 2022; Vol. 35, pp 11423–11436. [Google Scholar]
- Batzner S., Musaelian A., Sun L., Geiger M., Mailoa J. P., Kornbluth M., Molinari N., Smidt T. E., Kozinsky B.. E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 2022;13:2453. doi: 10.1038/s41467-022-29939-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kovács D. P., Batatia I., Arany E. S., Csányi G.. Evaluation of the MACE force field architecture: From medicinal chemistry to materials science. J. Chem. Phys. 2023;159:044118. doi: 10.1063/5.0155322. [DOI] [PubMed] [Google Scholar]
- Suh S. B., Kim J. C., Choi Y. C., Yun S., Kim K. S.. Nature of One-Dimensional Short Hydrogen Bonding: Bond Distances, Bond Energies, and Solvent Effects. J. Am. Chem. Soc. 2004;126:2186–2193. doi: 10.1021/ja037607a. [DOI] [PubMed] [Google Scholar]
- Limbach H.-H., Tolstoy P. M., Pérez-Hernández N., Guo J., Shenderovich I. G., Denisov G. S.. OHO Hydrogen Bond Geometries and NMR Chemical Shifts: From Equilibrium Structures to Geometric H/D Isotope Effects, with Applications for Water, Protonated Water, and Compressed Ice. Isr. J. Chem. 2009;49:199–216. doi: 10.1560/IJC.49.2.199. [DOI] [Google Scholar]
- Dominelli-Whiteley N., Brown J. J., Muchowska K. B., Mati I. K., Adam C., Hubbard T. A., Elmi A., Brown A. J., Bell I. A. W., Cockroft S. L.. Strong Short- Range Cooperativity in Hydrogen-Bond Chains. Angew. Chem., Int. Ed. 2017;56:7658–7662. doi: 10.1002/anie.201703757. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dereka B., Yu Q., Lewis N. H. C., Carpenter W. B., Bowman J. M., Tokmakoff A.. Crossover from Hydrogen to Chemical Bonding. Science. 2021;371:160–164. doi: 10.1126/science.abe1951. [DOI] [PubMed] [Google Scholar]
- Baranov A. I.. Crystals with disordered hydrogen-bond networks and superprotonic conductivity. Review. Crystallogr. Rep. 2003;48:1012–1037. doi: 10.1134/1.1627443. [DOI] [Google Scholar]
- Goñi-Urtiaga A., Presvytes D., Scott K.. Solid acids as electrolyte materials for proton exchange membrane (PEM) electrolysis: Review. Int. J. Hydrogen Energy. 2012;37:3358–3372. doi: 10.1016/j.ijhydene.2011.09.152. [DOI] [Google Scholar]
- Haile S. M., Boysen D. A., Chisholm C. R., Merle R. B.. Solid Acids as Fuel Cell Electrolytes. Nature. 2001;410:910–913. doi: 10.1038/35073536. [DOI] [PubMed] [Google Scholar]
- Boysen D. A., Cha S., Chisholm C. R. I., Giapis K. P., Haile S. M., Papandrew A. B., Sasaki K. A.. From Laboratory Breakthrough to Technological Realization: The Development Path for Solid Acid Fuel Cell. Electrochem. Soc. Interface. 2009;18:53. doi: 10.1149/2.f06093if. [DOI] [Google Scholar]
- Haile S. M., Chisholm C. R. I., Sasaki K., Boysen D. A., Uda T.. Solid Acid Proton Conductors: From Laboratory Curiosities to Fuel Cell Electrolytes. Faraday Discuss. 2007;134:17–39. doi: 10.1039/B604311A. [DOI] [PubMed] [Google Scholar]
- Boysen D. A., Uda T., Chisholm C. R. I., Haile S. M.. High-Performance Solid Acid Fuel Cells through Humidity Stabilization. Science. 2004;303:68–70. doi: 10.1126/science.1090920. [DOI] [PubMed] [Google Scholar]
- Yamada K., Sagara T., Yamane Y., Ohki H., Okuda T.. Superprotonic conductor CsH2PO4 studied by 1H, 31P NMR and X-ray diffraction. Solid State Ionics. 2004;175:557–562. doi: 10.1016/j.ssi.2004.03.042. [DOI] [Google Scholar]
- Wang L. S., Patel S. V., Sanghvi S. S., Hu Y.-Y., Haile S. M.. Structure and Properties of Cs7(H4PO4)(H2PO4)8: A New Superprotonic Solid Acid Featuring the Unusual Polycation (H4PO4)+ . J. Am. Chem. Soc. 2020;142:19992–20001. doi: 10.1021/jacs.0c08870. [DOI] [PubMed] [Google Scholar]
- Dreßler C., Hänseroth J., Sebastiani D.. Coexistence of Cationic and Anionic Phosphate Moieties in Solids: Unusual but Not Impossible. J. Phys. Chem. Lett. 2023;14:7249–7255. doi: 10.1021/acs.jpclett.3c01521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Joerg F., Wieder M., Schr̈oder C.. Protex Python utility for proton exchange in molecular dynamics simulations. Front. Chem. 2023;11:1140896. doi: 10.3389/fchem.2023.1140896. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dreßler C., Kabbe G., Sebastiani D.. Proton Conductivity in Hydrogen Phosphate/Sulfates from a Coupled Molecular Dynamics/Lattice Monte Carlo (cMD/LMC) Approach. J. Phys. Chem. C. 2016;120:19913–19922. doi: 10.1021/acs.jpcc.6b05822. [DOI] [Google Scholar]
- Kabbe G., Dreer C., Sebastiani D.. Toward Realistic Transfer Rates within the Coupled Molecular Dynamics/Lattice Monte Carlo Approach. J. Phys. Chem. C. 2016;120:19905–19912. doi: 10.1021/acs.jpcc.6b05821. [DOI] [Google Scholar]
- Dreßler C., Kabbe G., Brehm M., Sebastiani D.. Exploring non-equilibrium molecular dynamics of mobile protons in the solid acid CsH2PO4 at the micrometer and microsecond scale. J. Chem. Phys. 2020;152:164110. doi: 10.1063/5.0002167. [DOI] [PubMed] [Google Scholar]
- Dreßler C., Kabbe G., Brehm M., Sebastiani D.. Dynamical matrix propagator scheme for large-scale proton dynamics simulations. J. Chem. Phys. 2020;152:114114. doi: 10.1063/1.5140635. [DOI] [PubMed] [Google Scholar]
- Hibbert R. C., Logan N.. Multinuclear Nuclear Magnetic Resonance Study of the Interaction of some Phosphorus(V) Compounds with Inorganic Acids. The Protonating Abilities of HNO3, MeSO3H, and HPO2F2 towards the phosphoryl group. J. Chem. Soc., Dalton Trans. 1985:865–866. doi: 10.1039/dt9850000865. [DOI] [Google Scholar]
- Addison C., Bailey J. W., Bruce S. H., Dove M. F. A., Hibbert R. C., Logan N.. Chemistry in Fuming Nitric Acids NMR Spectroscopic Study of PF5, HPO2F2 and P4O10 in the Solvent System 44 wt.% N2O4 in 100% HNO3 . Polyhedron. 1983;2:651–656. doi: 10.1016/S0277-5387(00)81528-4. [DOI] [Google Scholar]
- Minkwitz R., Schneider S.. Synthesis and Characterization of the Tetrahydroxyphosphonium Hexafluorometalates P(OH)4 + MF6 – (M = As, Sb) Angew. Chem., Int. Ed. 1999;38:210–212. doi: 10.1002/(SICI)1521-3773(19990115)38:1/2<210::AID-ANIE210>3.0.CO;2-H. [DOI] [Google Scholar]
- Mathew M., Wong-Ng W.. Crystal Structure of a New Monoclinic Form of Potassium Dihydrogen Phosphate Containing Orthophosphacidium Ion, [H4PO4]+1 . J. Solid State Chem. 1995;114:219–223. doi: 10.1006/jssc.1995.1031. [DOI] [Google Scholar]
- Arlman E. J.. The Basic Properties of Orthophosphoric Acid. Recl. Trav. Chim. Pays-Bas. 1937;56:919–922. doi: 10.1002/recl.19370561002. [DOI] [Google Scholar]
- Grimme S., Antony J., Ehrlich S., Krieg H.. A Consistent and Accurate Ab Initio Parametrization of Density Functional Dispersion Correction (DFT-D) for the 94 Elements H-Pu. J. Chem. Phys. 2010;132:154104. doi: 10.1063/1.3382344. [DOI] [PubMed] [Google Scholar]
- Køhne T. D., Iannuzzi M., Del Ben M., Rybkin V. V., Seewald P., Stein F., Laino T., Khaliullin R. Z., Schøtt O., Schiffmann F.. et al. CP2K: An electronic structure and molecular dynamics software package- Quickstep: Efficient and accurate electronic structure calculations. J. Chem. Phys. 2020;152:194103. doi: 10.1063/5.0007045. [DOI] [PubMed] [Google Scholar]
- VandeVondele J., Krack M., Mohamed F., Parrinello M., Chassaing T., Hutter J.. QUICKSTEP: Fast and Accurate Density Functional Calculations Using a Mixed Gaussian and Plane Waves Approach. Comput. Phys. Commun. 2005;167:103–128. doi: 10.1016/j.cpc.2004.12.014. [DOI] [Google Scholar]
- VandeVondele J., Hutter J.. An Efficient Orbital Transformation Method for Electronic Structure Calculations. J. Chem. Phys. 2003;118:4365–4369. doi: 10.1063/1.1543154. [DOI] [Google Scholar]
- Perdew J. P., Burke K., Ernzerhof M.. Generalized Gradient Approximation Made Simple. Phys. Rev. Lett. 1996;77:3865. doi: 10.1103/PhysRevLett.77.3865. [DOI] [PubMed] [Google Scholar]
- VandeVondele J., Hutter J.. Gaussian Basis Sets for Accurate Calculations on Molecular Systems in Gas and Condensed Phases. J. Chem. Phys. 2007;127:114105. doi: 10.1063/1.2770708. [DOI] [PubMed] [Google Scholar]
- Hartwigsen C., Goedecker S., Hutter J.. Relativistic Separable Dual-Space Gaussian Pseudopotentials from H to Rn. Phys. Rev. B. 1998;58:3641. doi: 10.1103/PhysRevB.58.3641. [DOI] [PubMed] [Google Scholar]
- Krack M.. Pseudopotentials for H to Kr Optimized for Gradient-Corrected Exchange-Correlation Functionals. Theor. Chem. Acc. 2005;114:145–152. doi: 10.1007/s00214-005-0655-y. [DOI] [Google Scholar]
- Nosé S.. A Unified Formulation of the Constant Temperature Molecular Dynamics Methods. J. Chem. Phys. 1984;81:511–519. doi: 10.1063/1.447334. [DOI] [Google Scholar]
- Martyna G. J., Klein M. L., Tuckerman M.. Nosé- Hoover chains: The Canonical Ensemble via Continuous Dynamics. J. Chem. Phys. 1992;97:2635–2643. doi: 10.1063/1.463940. [DOI] [Google Scholar]
- Nosé S.. A Molecular Dynamics Method for Simulations in the Canonical Ensemble. Mol. Phys. 1984;52:255–268. doi: 10.1080/00268978400101201. [DOI] [Google Scholar]
- Fu, X. ; Wu, Z. ; Wang, W. ; Xie, T. ; Keten, S. ; Gomez- Bombarelli, R. ; Jaakkola, T. . Forces are not Enough: Benchmark and Critical Evaluation for Machine Learning Force Fields with Molecular Simulations. 2023, arXiv:2210.07237. arXiv.org e-Printarchive. https://arxiv.org/abs/2210.07237.
- Biswas T., Singh A. K.. pyGWBSE: a high throughput workflow package for GW-BSE calculations. Npj Comput. Mater. 2023;9:22. doi: 10.1038/s41524-023-00976-y. [DOI] [Google Scholar]
- Grunert M., Großmann M., Runge E.. Deep learning of spectra: Predicting the dielectric function of semiconductors. Phys. Rev. Mater. 2024;8:L122201. doi: 10.1103/PhysRevMaterials.8.L122201. [DOI] [Google Scholar]
- Dreßler C., Sebastiani D.. Effect of anion reorientation on proton mobility in the solid acids family CsH y XO4 (X = S, P, Se, y = 1, 2) from ab initio molecular dynamics simulations. Phys. Chem. Chem. Phys. 2020;22:10738–10752. doi: 10.1039/C9CP06473G. [DOI] [PubMed] [Google Scholar]
- Wang L. S., Patel S. V., Truong E., Hu Y.-Y., Haile S. M.. Phase Behavior and Superprotonic Conductivity in the System (1 – x)CsH2PO4 – xH3PO4: Discovery of Off-Stoichiometric α-[Cs1–x H x ]H2PO4 . Chem. Mater. 2022;34:1809–1820. doi: 10.1021/acs.chemmater.1c04061. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data supporting the findings of this study have been included as reference material to the reviewers and will be made publicly available upon publication. Training data used for fine-tuning the foundation model is already openly available at the following URL/DOI: 10.5281/zenodo.14914554. The workflow utilized to produce and postprocess the results presented in this study are available from the corresponding author upon reasonable request.



