Abstract
Describing intermolecular forces is fundamental to modeling and predicting the behavior of molecular systems. In particular, long-range molecular interactionswith electrostatic, induction, and dispersion as the main componentsplay a critical role, especially for low-temperature and low-density regimes. Long-range interactions are often described through perturbation theory, representing the electronic charge distribution via a multipolar series of the moments and polarizability tensors corresponding to each molecule. However, while the theory is well established, obtaining the resulting analytical expressions (and their practical implementation) constitutes a highly complex and system-dependent task. To address this challenge, we developed long-range-fit (LRF), an interactive and user-friendly software package designed to automate the generation and fitting of long-range interaction terms for arbitrary molecules in nondegenerate (ground or excited) electronic states. We have derived and implemented all terms up to 15th order, without approximations, via a spherical tensor representation, with symmetry adaptation to all molecular point-group symmetries. The resulting potential energy surface is compatible with most representations of the close interaction region.


1. Introduction
The description of molecular interactions plays a crucial role in numerous scientific disciplines, with applications that span atmospheric chemistry, environmental chemistry, astrochemistry, and beyond. − In particular, the importance of cold collisionswhich take place in environments where the temperature ranges from perhaps a few to over a hundred Kelvinhas received increasing attention in the past decade, mainly due to significant theoretical and experimental advances in the field. − Due to the rapid development of novel experimental techniques and theoretical approaches, the preparation/manipulation/simulation of a variety of species under cold conditions is a reality with growing capabilities, opening the door to scrutinize the complex chemistry taking place in remote parts of the Universesuch as the interstellar medium (ISM), circumstellar media, planetary atmospheres, and cometary comae. − These efforts complement and guide the interpretation of the rapidly growing amount of data recorded by earth- and space-based telescopes.
For theoretical studies, the representation of a potential energy surface (PES) mapping the interaction energy with the corresponding configuration of the system, whether for a near-global space or a reduced region of interestsuch as the region about the global minimum, along a reaction coordinate, or for some discrete set of geometriesoften constitutes a crucial first step. − Being a function of the coordinates of all atoms, the potential energy for a combined system of molecules can be separated into the intramolecular (the sum of the potential energies for each separated molecule) and intermolecular parts (the difference between the total and intramolecular potential energies). If the intermolecular PES is constructed using the rigid-rotor approximation, which usually works very well to describe nonreactive collisions of small molecules in low-temperature environments, the intramolecular energy becomes a constant, that of the separate monomers, and the dimensionality of the problem is significantly reduced: resulting in PESs being at most 6-dimensional for dimers, 12-dimensional for trimers, and so on. While 3- and higher-body interactions are of growing interest, , most studies of the gas phase are limited to treating dimers, as is reasonable in low-pressure environments.
Although nowadays the energy for arbitrary points on the PES can usually be accurately obtained using high-level ab initio electronic structure methodsand used directly in molecular dynamics simulations, i.e., the “on-the-fly” approachthe computational cost makes this approach impractical in many cases. , The common alternative is to compute a set of reference ab initio energies using a preferred/affordable level of theory, which provides the necessary accuracy, which is then used to fit an analytic PES. Methods to fit the data range from flexible or physically motivated single expansions to interpolative approaches, beginning with simple splines, as well as more sophisticated procedures such as interpolating moving least-squares − and artificial intelligence/machine learning approaches. ,− In the past decade, automated procedures that facilitate/minimize human intervention in the PES-construction process have also become popular, as implemented in programs such as AUTOSURF, AUTOPES, and ROBOSURFER. Our AUTOSURF package is typically employed to produce and refine a fit of the close interaction region, converging a specified configuration space to a specified error tolerance. However, depending on the problem of interest, not all regions of the PES play an equally important role. Given that statistically most gas-phase bimolecular collisions are not head-on, but rather have large impact parameters, then especially for interactions occurring at low temperatures with low collision energies, the system’s dynamics are largely determined by the long-range (LR) region of the PES. − In the LR, small absolute errors can be large relative errors, with important implications. Thus, for theoretical studies of cold collisions, constructing a PES with the correct asymptotic behavior is an essential priority. This is the main focus of the present work: the representation and study of long-range molecular interactions.
There are three main contributions to the LR interactions, namely, electrostatic, induction (Debye interaction) and dispersion (London interaction). Numerous physics-based models have been developed in the last decades to represent LR interactions, such as the Morse-long-range method, a nonlinear fitting approach that combines radial potential functions with flexible parameters to enforce the well-known inverse power behavior into a single analytical function. − Other approaches employ Morse-type variables to enforce slow variation in the LR, suppressing the oscillatory or divergent behavior of ordinary polynomials. Recently, machine learning approaches using different schemes and architectures such as feedforward , and physics-informed neural networks , have been introduced to the field. However, in terms of enforcing a physically correct behavior, methods based on the multipole expansion of the interaction potential are difficult to surpass.
At sufficiently large separations, the interaction between two molecules (A and B) can be treated as two distinct charge distributions separated by the distance R between their respective centers-of-mass. In this scenario, the overlap between their wave functions can be ignoredsince the error made by ignoring it decreases exponentially as R increasesand the system’s Hamiltonian can be considered as a combination of , the Hamiltonian corresponding to the isolated molecules , and the operator , that describes the electrostatic interaction between the particles (nuclei and electrons) in different molecules. Since at large distances , it can be seen as a small perturbation in the system’s Hamiltonian. Then, since electron exchange can be neglected due to the molecular separation, the eigenfunctions of the unperturbed Hamiltonian can be written as simple products of the wave functions corresponding to the molecules A and B (Ψm Ψn ) in a particular state m and n
| 1 |
and standard Rayleigh–Schrödinger perturbation theory can be used to obtain the system’s total interaction energy for any particular nondegenerate state |pq⟩
| 2 |
where the (zeroth-order) energy W pq = W p + W q corresponds to the unperturbed state in which molecule A is in the excited state p (with energy W p ) and molecule B in the excited state q (with energy W q ); the first-order energy correction
| 3 |
is the expectation value of operator for state |pq⟩, which corresponds to the electrostatic energy of the system (E elec); while the second-order energy correction
| 4 |
the polarization energy, splits into the induction (E ind) and dispersion (E disp) energies
| 5 |
where
| 6 |
| 7 |
| 8 |
Notice that the sums in eqs and – go over all possible states of the system, except for |pq⟩.
Without losing generality, we can describe the electrostatic interaction between the molecules considering the charge distribution B under the influence of an external nonuniform electric field generated by molecule A: ; where the sum goes for all particles b ∈ B, q b represents the charge of particle b in position b⃗ (with respect to the center-of-mass of molecule B). Assuming that does not vary significantly over the whole charge distribution of molecule B, we can expand , the potential in particle b, as a Taylor series around the center-of-mass of the molecule
| 9 |
where Greek letters (α and β) stand for the Cartesian coordinates (x, y, or z); b α,β are the Cartesian coordinates of particle b; R⃗ is the vector from the center-of-mass of molecule A to the center-of-mass of molecule B; and we use the notation V α for ∂V/∂b α, V αβ for ∂2 V/∂b α∂b β, and so on. Then, we can write in terms of the multipole moments of molecule B interacting with the weakly varying electrostatic potential generated by molecule A as
| 10 |
where we are using the Einstein summation convention: a repeated suffix implies summation over the three values (x, y, and z) corresponding to that suffix. In the first term, the monopole (zeroth-order) moment q B = ∑q b is the total charge of molecule B; in the second term, the first-order moment μα = ∑q b b α is the dipole moment; in the third term, the second-order moment is the quadrupole moment (δαβ is the Kronecker delta); and so on. Generalizing this notation, the 2p-pole moment operator for molecule B can be written as , a tensor of rank p, with p suffixes
| 11 |
The same way, the 2p-pole moment operator for molecule A can be written as
| 12 |
where the sum goes for all particles a ∈ A; and particle a is at position a⃗ (with respect to the center-or-mass of molecule A) and has a charge q a. By using this notation, we can rewrite eq in a more compact form
| 13 |
where n!! = 1 × 3 × 5 ×...n. If we now expand as a Taylor series about the center-of-mass of molecule A
| 14 |
where the T-tensors ; we can finally obtain the electrostatic interaction in terms of the multipole moments of molecules A and B, the so-called multipole (or multipolar) expansion in Cartesian form
| 15 |
where C lk = (−1) k /[(2k – 1)!!(2l – 1)!!].
The different energy contributions to the LR interactions can be calculated by substituting the multipole expansion in eqs and –. This way, the electrostatic energy for state |pq⟩ will be the expectation value of (cf. eq ) and can be obtained by simply replacing each multipole operator in eq by its expectation value for the state of interest
| 16 |
For simplicity, and to avoid overloading the notation, here and in the following, we do not include another set of indices (p, q) specifying the particular state of the systeme.g., when referring to the expectation value of the multipole operators: ξ = ⟨pq|ξ̂|pq⟩. Notice how each term in the sum depends on a specific combination of the multipole moments corresponding to the charge distributions A and B. It is also worth mentioning that although both ξ- and T-tensors are related to the nth-gradient of a Cartesian vector, they have a critical difference: ξαβ...η are constant, related to the nth-gradient of a “local” particle position, describing a fixed picture of the charge distribution of each molecule with respect to its center-of-mass; the T-tensors, on the other hand, have a functional dependence on the separation and relative orientation of the two molecules.
Since the T-tensors are computed as successive derivatives of 1/R, T αβ...η will be proportional to R –(n+1), and by regrouping the terms by the same power of 1/R, eq can be conveniently written as
| 17 |
where the various multipole–multipole interactions are grouped according to their R-dependence, with the nth term in the expansion (or nth “order”) being proportional to R –n . Similar expressions can be obtained for the dispersion and induction energies
![]() |
18 |
where α represents the dipole–dipole polarizabilities, A is the dipole–quadrupole polarizabilities, B is the dipole–octupole polarizabilities, C is the quadrupole–quadrupole polarizabilities, and D is the dispersion coefficients. This way, the terms in the multipolar expansion are arranged according to their potential importance at various distances with each contribution clearly identified/separated.
At very large R, the lowest-rank moments and polarizabilities are most important, but the others may become increasingly significant as R decreases, depending on their coefficients. For example, at large distances, the most relevant term for the electrostatic interaction between two ions is proportional to 1/R (the charge–charge interaction), but for neutral polar molecules, the leading (first nonzero) term corresponds to the dipole–dipole interaction (∝1/R 3), while for two nonpolar molecules, the leading interaction might be the quadrupole–quadrupole term, proportional to 1/R 5. For higher symmetry cases, the leading term can be an even higher moment, such as for dimers of tetrahedral molecules (e.g., methane), where the leading term corresponds to the octupole–octupole interaction (∝1/R 7), or for two fullerene molecules (I h symmetry), where the leading electrostatic term is given by the interaction between the 26-pole of each molecule (∝1/R 13). Systems with higher-order leading terms are relatively common, including H2 and ethylene, which both have vanishing dipole and octupole moments, making the leading moments quadrupole and hexadecapole (24-pole). Note that for neutral molecules, induction and dispersion interactions both start at sixth-order, and for ions, induction appears even sooner, at fourth-order, thus making these other types of interaction potentially as significantor more sothan electrostatics. This can be the case when leading electrostatic terms are nonzero but small, such as for the CO-dimer, which will be discussed in more detail later, or for the example of two fullerene molecules just mentioned above.
Each term in the expansion for each interaction has a clear physical meaning. The magnitude of the terms in the electrostatic interactions depends on the values of multipole moments; the terms in the induction and dispersion expansions are proportional to the molecular polarizabilities. In most applications, the main challenge will be the precise evaluation of each of the interactions for a given order of the multipole expansion. This raises the issue of how to obtain values for the electrostatic moments and polarizability tensors to implement in the expressions. Experimentally determined data are only available for a limited number of systems and are usually confined to the leading electrostatic moment(s) and perhaps the dipole polarizability. Values can be obtained directly from electronic structure calculations but are not so easily extracted from the high levels of theory commonly employed in constructing PESs (e.g., CBS-extrapolated coupled cluster theory with perturbative triples or even higher-order correlation treatment). To do so, one would need to compute extrapolated energies within a series of finite-field calculations, formulas that become increasingly complicated and numerically sensitive for higher moments. , As we will discuss later, our chosen strategy is simply to fit them using ab initio energies from the LR region of the PES.
The choice of representation has a large impact on the total number of coefficients and therefore the total number of terms to be handled in the multipole expansion. Due to its simplicity, the Cartesian representation described by eqs and is the most straightforward to develop/understand, and as a consequence, it is the most commonly employed. However, the Cartesian representation has a severe drawback. The number of components for each higher multipole moment increases rapidly with order, which soon becomes unmanageable. Cartesian tensors ξ(p) have rank p, and consequently, each has 3p components/coefficients. For the electrostatic energy, for example, this means (cf. eq ) that the total number of new coefficients increases exponentially as ∼3 n–1 with the n-th order in the multipole expansion. Although only 2n – 1 of them are linearly independent for each molecule (and therefore only 4n – 2 need to be computed or known a priori for each new order), the complete set of coefficients still needs to be implemented in the analytical formulas, making the total number of terms increase excessively. Thus, for any practical application, to include high-order terms of the multipole expansion employing a Cartesian representation quickly becomes an infeasible task: to includefor the electrostatic interactions aloneall terms up to sixth-order, a total of 728 coefficients need to be determined; at eighth-order, there are 6560 coefficients; at tenth-order, there are around 59 thousand coefficients; and at 15th-order, the total number of coefficients to be included in the multipole expansion would be close to 14.3 million. It is worth emphasizing that this is only considering the electrostatic interactions: at 15th-order, there are also ∼14 million coefficients in the induction expansion and another ∼300 million in the dispersion energy expansion that would need to be considered. Given that the issue comes from the components of ξ(p) not being linearly independent, the solution is straightforward: switch from Cartesian to a representation in spherical coordinatesan irreducible representation, which for each moment has no linear dependencies, so the number of coefficients involved grows only linearly with the orderthe trade-off: the mathematical complexity of those fewer terms is considerably higher. Using the same example as before, up to sixth-order (for the electrostatic interactions), there are 72 components in the spherical form of the multipole expansion, which is about a factor of 10 fewer than in Cartesian coordinates. Moreover, this reduction in the number of terms with respect to the Cartesian formulation is exponentially more significant as the order of the expansion increasesat the 15th-order, there are only 450 coefficients in the electrostatic expansionmaking the spherical formulation, as can be seen in Figure , dramatically more efficient as higher orders are included.
1.

Comparison between the total number of electrostatic coefficients as a function of the order of the multipole expansion in the Cartesian and spherical representations.
The fundamental theory of intermolecular forces is well established. The mathematical framework for describing long-range interactions based on the multipole expansion has been known for decades, even for molecules in excited states, systems with degenerate electronic states, and higher-order corrections to the perturbation treatment such as the hyperpolarizability contribution. ,− However, the general implementation for specific systems can be challenging, considering both the growing number of terms with the order in the multipole expansion and the complexity of the analytical expressions for each term. Additionally, the general formulas depend on the symmetry of the systemmolecules with lower symmetry have more nonzero moments, many more coefficients and terms, and therefore a more complex multipole expansion. Further considerations should be made if the two molecules are identical or chiral partners. This is why in practice in the construction of global PESs, treatment of the LR interactions has often been an afterthought, where a switch to a simple LR model is added based on at most a few leading electrostatic terms, in a system-specific case-by-case basis.
The software package AUTOPES , is one notable exception. Their approach is a symmetry adapted perturbation theory (SAPT)-based representation of both the short- and long-range interaction regions. Site–site interactions, termed a distributed expansion, are used to represent the close interaction region, and the number of sites in each fragment, including possible off-atom sites, is used to tune the accuracy of the fit. For the LR, they also employ the distributed expansion, but the fit is informed by consideration of the expected behavior of an asymptotic multipole-based expansion as described in their 2016 paper. In their approach, multipoles and polarizabilities are computed for each monomer, and then a constructed set of terms is used to generate energies, which are in turn used to fit the distributed expansion. For the long range, the accuracy of their approach depends on the fitting data (how accurately the moments and polarizabilities can be obtained via the SAPT method) as well as how accurately the distributed expansion can accommodate the topography computed by their multipole-based expansion.
In 2024, Yu et al. presented a multipole-based package for representation of the LR interaction between two molecules in their nondegenerate electronic ground states. Written in FORTRAN, the program ABLRI contains system-dependent subroutines that can be modified by the user to obtain the LR expansion with six possible types of symmetry for each fragment (C 1, C s, C 2v, C ∞v, D ∞h, and spherical), provided the required components of the electrostatic moments and polarizabilities are known/computed a priori and used as input. Using the Cartesian representation, electrostatic interactions are included up to the fifth-order and induction interactions up to the sixth-order (if both monomers have zero charge, then electrostatic interactions are included up to sixth-order and induction interactions up to eighth-order), while the dispersion energy is estimated (up to eighth-order) using the Unsöld approximation , instead of the corresponding polarizability expansion. A recently developed approach for treating cases where one or both fragments are in a degenerate electronic state was reported by Yu and collaborators, based on degenerate perturbation theory. There, a multipole-based LR expansion was implemented to construct the potential energy matrix for two test cases, systems in their ground but degenerate electronic states: O(3P)–OH(2Π), with electrostatic interactions up to sixth-order, induction interactions up to seventh-order, and dispersion interactions up to seventh-order; and OH(2Π)–OH(2Π), with electrostatic interactions up to fifth-order, induction interactions up to seventh-order, and dispersion interactions up to seventh-order. Since only the first few terms were included for each interactionand also due to the symmetry of the fragmentsboth the Cartesian and spherical representations were able to be satisfactorily implemented and compared. This is a promising step in an important area where couplings between the electronic state components can reflect complicated behaviors and topographies within the manifold of states. It will be interesting to see how this description of the long-range meshes with various diabatization protocols used to treat the close interaction region given that most diabatization schemes are somewhat ad hoc and damping is often applied.
Our first approach to this problem used the Cartesian framework, and it was a significant effort to include in the LR expansionfor electrostatic, induction, and dispersion interactionsall terms up to the sixth-order (and some seventh-order terms) for seven different symmetries: C 1, C s, C 2v, C 3v, C∞v, D ∞h, and spherical. Furthermore, that implementation was restricted to ground electronic states. Our goal was to go well beyond this and develop a general methodology that could be used to describe LR intermolecular interactions between two molecules with arbitrary symmetries, either in their ground or excited (nondegenerate) electronic states, with rigorous inclusion of high-order terms in the expansion. There were four main challenges that needed to be addressed: (1) given the electronic state, symmetry, and charge of each molecule, identify the relevant moments, polarizabilities, and dispersion coefficients and construct, for each interaction, the arrangement of terms to be included in the LR expansioni.e., obtain the system-specific equivalent of eqs and ; (2) derive the analytical expression (depending on the intermolecular coordinates) for each of the terms in this system-specific expansion; (3) obtain all the coefficients included in the expansion; and (4) produce user-friendly subroutines that could be used to reproduce the constructed LR-PES in various sets of coordinatesideally black-box, rapid to evaluate, and portable (written in FORTRAN or other popular language), so that it can easily be used by other codes, or combined with other representations of the close interactions to provide a global description of the PES. Here, we introduce our solution to these challenges, released as a freely available software package named long-range-fit (LRF). This first release is designed to treat systems composed of two arbitrary molecules in their ground or excited (nondegenerate) states.
Based on the spherical representation, LRF is designed to automatically generate the complete analytical LR expansion (up to 15th-order) for any system composed of two neutral or charged moleculesin a ground or excited (nondegenerate) stateand determine the coefficients by fitting to ab initio data. Whether they are different, identical, or chiral partners, the symmetry of the system is respected and enforced. To provide insight into the nature of the most important interactions, tables of all relevant terms (classified as electrostatic, induction, and dispersion) are produced based on the charge and point group symmetry for each of the molecules. For each fragment, a fully exhaustive list of all molecular point group symmetries is available, covering every possible scenario. If enough quantities (multipole, polarizability, and dispersion coefficients) are accurately known/computed for each fragment, then those can be directly inputted as fixed parameters and LRF can render a useable representation of the long-range interactions, truncated to any chosen order. More commonly, the molecular parameters are not known accurately, if at all. This is where LRF shines. Given a limited set of single-point energies distributed arbitrarily over the long-range region of the PESoften, but not necessarily, computed at the same level of theory as in the close interaction regionLRF will fit the data using all relevant terms up to a user-specified maximum order. No finite field calculations nor special grids are required. There are no missing terms up to the specified order; the expressions have correct symmetry constraints, and each term has its proper angular and distance dependencies. Beyond producing fits, LRF includes a suite of tools to analyze the quality of those fits and systematically investigate the role of different types of interaction and multipole orders in the long-range behavior of molecular systems. This way, LRF functions both as a practical fitting engine and as a platform for exploring the physics encoded in long-range interactions. LRF will also export a FORTRAN subroutine for external evaluation of the fitted potential, which can also be easily and smoothly connected to any preferred representation of the close interaction region, contributing to providing a global description of the PES.
The rest of the article is organized as follows. In Section , we describe the strategy and underlying algorithms implemented in LRFonly the most important aspects are highlighted here; additional details will be discussed in depth in a future paper. In Section , the program is introduced, and its main features and capabilities are demonstrated using illustrative examples. The final section provides a summary and concluding remarks.
2. Methodology
2.1. Coordinate System
The coordinate system used is shown in Figure . We start by selecting three different frames of reference: one molecule-fixed (MF) frame attached to each molecule, with its origin at its center-of-mass; and a dimer-fixed (DF) frame, with its origin at the center-of-mass of molecule A and its z-axis pointing at the center-of-mass of molecule B. The orientation of each MF frame can be chosen arbitrarilythe principal axes of the moment of inertia tensor of the molecule is a common choice. However, we will explain later how the orientation of the MF frame should be specified to enable use of the molecule’s relevant symmetry-elements in our program. Based on these definitions, two sets of Euler angles, ΩA = (αA, βA, γA) and ΩB = (αB, βB, γB), can be used to specify the orientation of each MF frame with respect to the DF frame, where α and β are the azimuth and polar angles of each molecule, respectively, and γ describes the rotation about their z-axis. When ΩA = ΩB = (0, 0, 0), all z-axes are aligned and all x- and y-axes are parallel. This way, the intermolecular coordinates can be chosen as (R, ΩA, ΩB), where R is the distance between the molecular centers-of-mass. Note that not all of the Euler angles are independent: since the first rotation of both molecules is around the z-axis of the DF frame, it suffices to know only the difference or phase α = αA – αB between the azimuthal angles. Furthermore, the number of angular coordinates is also reduced, depending on the structure of the molecules, as shown in Table .
2.

Intermolecular coordinates (R, α, βA, βB, γA, γB). For clarity, the X- and Y-axes of the DF frame have been omitted.
1. Angular Coordinates Employed by LRF and Corresponding Dimensionality of the System.
| Dim. | molecule A | molecule B | angular coordinates |
|---|---|---|---|
| 1D | atom | atom | - |
| 2D | linear | atom | βA |
| 3D | nonlinear | atom | βA, γA |
| 4D | linear | linear | α, βA, βB |
| 5D | nonlinear | linear | α, βA, βB, γA |
| 6D | nonlinear | nonlinear | α, βA, βB, γA, γB |
2.2. Spherical Tensor Formulation
In the spherical formulation, the electrostatic interaction operator in the DF (space-fixed) frame can be written as
| 19 |
where represent the components of the 2 l -pole moment operators for each molecule, which depend on the polar and azimuthal angular coordinates (θa,b, φa,b) of each particle a ∈ A and b ∈ B
| 20 |
and
| 21 |
where the expression in large brackets is a Wigner 3-j symbol and are irregular spherical harmonics. The multipole moment operators in the DF frame are related to those in the MF frame
| 22 |
where Ω = (α, β, γ) is the rotation that takes the global DF axes to the local MF axes, and are the Wigner rotation matrix elements. Since we will be only using local axes to describe the multipole operators from now on, we do not include another superscript (MF) to avoid overloading the notation.
Now, defining analogues to the T-tensors of the Cartesian formulation
| 23 |
we can rewrite eq in a more convenient notation
| 24 |
where the multipole moment operators are now referred to local molecular axes (different for each molecule) and the angular and distance dependence is all contained in the T-tensors. This can be abbreviated even further if we rearrange the series following the notation introduced by Stone
| 25 |
in which t and u refer to the members of the series of angular momentum labels: {00, 10, 11c, 11s, 20, 21c, 21s, 22c, 22s, 30, ...}. These labels correspond to lm, with c and s (cosine and sine) designating the positive and negative projections of m, respectively. This way, for example, the T 20,11c component of the T-tensor represents the interaction between the component m a = 0 of the spherical quadrupole (l a = 2) of molecule A and the component m b = 1c of the dipole moment (l b = 1) of molecule B. By substituting this multipole expansion of in eq , and eqs –, the different energy contributions to the LR interactions can be obtained in the spherical tensor formulation
| 26 |
| 27 |
| 28 |
| 29 |
where and are the expectation value of the corresponding moment operator for the state (Ψp Ψq ) of interest, and α represents the polarizability of each molecule
| 30 |
where is the expectation value of the transition moment between the states |iq⟩ and |jq⟩, and is the expectation value of the transition moment between the states |pi⟩ and |pj⟩. Finally, D t,u;t′,u′ are the dispersion coefficients
| 31 |
The last term in eq contains all possible combinations of multipole transitions from state |pq⟩ to all lower energy levels in both molecules, generalizing the standard formulation of the dispersion energy to account for molecules in excited states. When both molecules are in the ground state (p = 0 and q = 0), all sums ∑ m<p and ∑ n<q disappear, resulting in the classical expression found in books. When only one molecule is in the ground state (e.g., q = 0), then this expression reduces to more advanced formulations found in the literature, like the one given in ref .
2.3. A System-Specific Representation of LR Intermolecular Interactions
The description of LR interactions provided by eqs – separates all terms in each expansion into (i) a constant partinvolving multipole moment operators, polarizabilities, and dispersion coefficientsi.e., coefficients depending on the symmetry and properties of the molecules’ charge distributions; and (ii) a functional part, given by the T-tensors, that describes the dependence with the distance and relative orientation between the molecules. However, if a molecule has symmetry, some of the coefficients will be zero. Only the interaction between two charged molecules with no symmetry (C1 symmetry point group) contains all possible terms in eqs –. For any other case, the expansions are simplified in such a way that the resulting expressions reflect the nature of the system. Conceptually in our approach, we start with the expression for the most complex scenariowhere both molecules are charged and have C 1 symmetry. Then, the formulas are adapted (pruned) to treat any particular case by removing all terms in the expansion for which the corresponding coefficients are zero.
The point group symmetry of the molecule determines the nonzero components of the multipoles, polarizabilities, and dispersion coefficients, but it is important for a general/practical implementation to adhere to certain conventions. Table lists all of the molecular symmetry point groups: C ∞v, D ∞h, C 1, C s, C i , C n, C nh, C nv, D n, D nh, D nd, S 2n, T, T h, T d, O, O h, I, I h, and indicates our convention for how the molecule should be oriented. For each point group in the table, check marks highlight the corresponding subset of generating elements to be considered when placing the molecule into the MF frame: C n = n-fold rotation axis around z; C 2 = C 2 rotation axis around x; σ xy and σ xz designate xy and xz as reflection planes; i = inversion center at the origin of the MF frame; S n = S n roto-reflection axis around z; C 3 = C 3 rotation axis around x + y + z = 1 (for tetrahedral, octahedral and icosahedral symmetries only); C 5 = C 5 rotation axis around (icosahedral symmetry only).
2. LRF’s Conventions for Orientation of Molecules of All Symmetry Point Groups with Respect to Relevant Symmetry Elements .
| C n | C 2 | σ xy | σ xz | i | S n | C 3 | C 5 | |
|---|---|---|---|---|---|---|---|---|
| C ∞v | √ | – | – | – | – | – | – | – |
| D ∞h | √ | – | √ | – | – | – | – | – |
| C 1 | – | – | – | – | – | – | – | – |
| C s | – | – | √ | – | – | – | – | – |
| C i | – | – | – | – | √ | – | – | – |
| C n | √ | – | – | – | – | – | – | – |
| C nh | √ | – | √ | – | – | – | – | – |
| C nv | √ | – | – | √ | – | – | – | – |
| D n | √ | √ | – | – | – | – | – | – |
| D nh | – | √ | – | – | – | √ | – | – |
| D nd | √ | √ | – | – | – | √ | – | – |
| S 2n | – | – | – | – | – | √ | – | – |
| T | √ | – | – | – | – | – | √ | – |
| T h | √ | – | √ | – | – | – | √ | – |
| T d | – | – | – | – | – | √ | √ | – |
| O | √ | – | – | – | – | – | √ | – |
| O h | √ | – | √ | – | – | – | √ | – |
| I | √ | – | – | – | – | – | √ | √ |
| I h | √ | – | √ | – | – | – | √ | √ |
Check marks indicate which elements should be present when the molecule is appropriately oriented in the MF frame. See the text for details.
Our procedure to identify the nonzero components in the spherical representation is similar to how character tables often list how vectors transform in the Cartesian representation. For example, consider a molecule with C 3v symmetry. The multipole moment operators must first be invariant with respect to a 3-fold rotation axis around z: , which dictates that
| 32 |
Therefore, Q lm = 0 unless m = 3k . In our final notation (cf. eq ), this means that after considering the C 3 rotation operation, the nonzero coefficients will be {Q 00, Q 10, Q 20, Q 30, Q 33c , Q 33s, ··· }.
This process is then extended to other point groups in the C nv family: multipole moments Q lm must be zero, unless m = nk
| 33 |
Surviving terms must also satisfy the other operations in the C nv group. We have completed the same analysis for all the symmetry operations listed in Table and thus determined the nonzero spherical multipoles for every point group, with the results stored in the LRF code. Once the nonzero multipole moments are known for a given symmetry, they can be used to obtain the nonzero polarizabilities, which in turn can be used to obtain the dispersion coefficients.
Once all the nonzero coefficients in eqs – are established, we compute the corresponding T-tensors by using their recursive relationshipsthis practical solution to obtain the T-tensors in the spherical formulation was introduced by Hättig in 1996. Upon specification of the system, LRF generates and stores the complete case-specific expansion up to the 15th-order. This takes only a few seconds. For the fitting, at each step, only the specified terms are employed.
2.4. Fitting the Expansion Coefficients
After the system-specific full expansion is constructed, fitting the coefficients in this representation is intrinsically nonlinear. Unlike linear fits, nonlinear optimization demands iterative strategies, careful initialization, and is vulnerable to convergence to local minima, making the process increasingly complex as the number of terms grows. The fitting in LRF is performed by the MATLAB fitnlm routine (in the Statistics and Machine Learning Toolbox), which is an implementation of the Levenberg–Marquardt algorithm. LRF’s default tolerances for fitnlm can be adjusted in the advanced fitting options. The initialization takes advantage of system specifications, such as charge, symmetry, and whether the two molecules are identical or chiral partners, to constrain the coefficients. The user canbut is not required toinput any known quantities, either as fixed values or as initial values for the fitting. By default, the initialization routine also looks at a portion of the data set furthest into the long range and does a preliminary fit to the coefficients from the first two leading orders, estimating also the asymptotic energy. In LRF, the user does not need to “zero” the data set; rather, LRF finds the best fit for the asymptote as one of the coefficients. Once initialized, in our experience, the most robust way to proceed toward a satisfactory fit is to introduce the terms of the expansion, order by order, updating the fit at each step, monitoring the behavior and statistics of the fit, until sufficient accuracy is achieved.
The dimensionality of the PESs treated by LRF ranges from one (atom–atom) to six (two nonlinear molecules). For construction of PESs describing the close interaction region, the difficulty of achieving an accurate fit, and the amount of ab initio data required, usually scales strongly with dimension. In contrast, for representation of the long range, the amount of data required is mainly dictated by the number of parameters in the expansion and is less closely tied to dimensionality. Upon specification of the system, charges, symmetry, identical or different molecules, and the order of the expansion, LRF will determine the number of parameters. The fitting data set should then be computed to sample at least the minimal symmetry subspace of angles and range of R and be generous with respect to the number of parameters to be fitted (ca. 5–10 times larger) but generally requires far fewer points than the close interaction region. The fitting itself takes between a few seconds and a few minutes. This depends on the number of ab initio data points and the number of coefficients (which scales with the order of the expansion). For the 4D example shown in Figure , which includes an overly generous set of 4000 data points, fitting the first few orders using LRF on a laptop PC takes only a few seconds. The time-to-solution increases as higher terms are added, reaching 2–3 min for the eighth-order expression.
6.
Once the system is defined and the data file is read, the app leads the user to the “FITTING” tab, where the data can be fitted and the coefficients exported once the desired accuracy is reached. The table in the upper-right side follows the progression of an LRF fit to an ab initio data set for the CF+–H2 system. The fit is performed sequentially, where each row in the table introduces a higher maximum order in the interaction expressions (third through eighth), including all relevant terms from the electrostatic, induction, and dispersion expansions. The fitting statistics improve rapidly and then plateau (see the text).
3. LRF: Features and Capabilities
LRF applies the methodology outlined above for systems in nondegenerate states by implementing the analytic long-range expansions (up to 15th-order) via a tensorial spherical basis with recursive relations. Symmetry, fragment charges, and intermolecular relationships are automatically accounted for, providing a symmetry-adapted expansion for any pair of neutral or chargedidentical, chiral, or distinctpartners. The entire procedure is controlled through a Windows MATLAB-based graphic user interface (GUI). The interface gives users control over the fitting model and constraints, includes diagnostics to assess fit quality, flag anomalies in the ab initio data, and provides 1D/2D visualizations of theelectrostatic, induction, or dispersioninteractions with respect to the various coordinates. LRF provides an accurate representation of the long-range PES, and it is simple to use even for nonspecialists.
As can be seen in the program workflow, shown in Figure , the process begins with (step 1) the “system definition”, specifying each molecule by symmetry group and net charge; and (step 2) uploading an input file (if any) containing the ab initio data to be used in the fit. There are no special requirements regarding the number of data points or their distribution. This is all done in the “SETUP” tab (see Figure ). Relevant information concerning the input file (type of file, structure, choices of energy and coordinate units of the input data, and more) can be found in the Supporting Information. Once the initialization is complete, tables of the relevant interaction terms organized by order can be consulted in the “EXPANSION” tab (see Figure ). This valuable information is available after the symmetry of the molecules is specified, even if no fit is to take place. Finally, in the “FITTING” tab (see Figure ), the user can (step 3) select which interaction terms to include and fit the coefficients, and once the fit has converged to the desired accuracy (step 4) export the coefficients.
3.

LRF flowchart for construction of the long-range representation.
4.
The “SETUP” tab is employed for the first two steps in the flowchart: define the system and initialize the program.
5.
The “EXPANSION” tab renders tables highlighting the active interaction terms for any specified system. Here, for CF+–H2, the electrostatics table was selected, and the relevant terms are seen highlighted in red, beginning at third-order. Analogous tables can be rendered for induction or dispersion by selecting them from the menu in the top left of the figure.
The LRF MATLAB-based GUI is freely available from the authors for noncommercial purposes. Although based on MATLAB, the use of LRF does not require the installation of MATLAB or a MATLAB license. Only a freely available MATLAB Runtime library is needed (version 2024a for WINDOWS), which is downloaded and installed by the LRF installer. More details about obtaining and using LRF can be found in the Supporting Information.
In the rest of this section, we demonstrate some of the main features and capabilities of LRF by using the software to study the LR interactions for some illustrative examples.
3.1. CF+–H2
Although fluorine is one of the most reactive elements in the ISM, the dominant F-bearing species (HF and CF+) are governed by a compact set of reactions whose rates are largely controlled by the local abundances of atomic F, H2, and C+. , In this context, the CF+ and H2 molecules are considered crucial species for monitoring fluorine chemistry, as their relatively simple formation pathway makes them promising tracer of atomic fluorine in diverse ISM environments. Our previously reported PES for this system was constructed to support scattering dynamics calculations relevant to fluorine chemistry in the ISM and includes an analytic LR.
Fitting the LR of this system helps to illustrate several aspects of our approach. Here, we show how upon setup and specifying the fragment charges and symmetries, LRF produces tables of the interaction terms organized by order (Figure ). Our usual practice is then to introduce the interaction terms, order by order, updating the fit at each step, simply by selecting the desired order of each interaction type in the fitting menu. Figure shows a table that is produced, providing some details of the fit and the derived coefficients. Each fit to a successively higher order adds a row to the table, where the root-mean-square error (RMSE) and the coefficient of determination R 2 are reported along with the first few multipole coefficients. Statistics of the fit are produced, which help the user determine when a satisfactory fit has been obtained. Figure shows examples of the plots of histograms of fitting residuals, making it easy to see when an accurate balanced fit has been obtained. Once an acceptable fit has been obtained, a complete set of coefficients is exported, formatted to use with an LRF external subroutine to evaluate the PES representation.
7.

Histograms of fitting residuals are produced by LRF as each order of interaction terms is introduced. Here, for CF+–H2, the results are plotted for the third-, fifth-, and eighth-order fits (upper, middle, and lower panels, respectively). Using only the leading third-order charge–dipole term, the residuals are large and unbalanced. When sufficient terms are added, they become orders of magnitude smaller and balanced (see the text).
As we reported previously for this system, the PES reflects remarkably significant angular dependencies far into the LR. Even at R ≥ 25 Å, where the interaction is dominated by the electrostatic charge–quadrupole term q AΘB ∝ R –3 (see Figure ) and the induction leading term , there are still energy variations of nearly 2 cm–1 with respect to the orientation of H2, and moving to shorter distances, these variations become rapidly larger. Moreover, even at longer distances, employing only the leading terms is not sufficient to accurately fit the ab initio data. This is reflected in both the fitting error (see the table in Figure , second row) and the observation of an unbalanced residual distribution (see Figure , upper panel) produced by error analysis tools implemented in the LRF code.
When an insufficient number of terms are employed in the fitting expansion, the coefficients for those few leading terms will deviate substantially from physically realistic valuesin order to best accommodate the dataand lacking the correct angular and radial dependencies, the fit will have relatively large and unbalanced errors. A systematically nonrandom distribution of residuals reflects an inadequate model. On the other hand, if one fixes the coefficients for only a few leading terms to known benchmarks (either from experiment or theory), but excludes other relevant terms, then again the detailed topography of accurate energy data will not be well reflected.
Note also that when fitting the set of interaction terms, there may be ambiguities. For example, if one fits a data set of interaction energies between two different molecules A and B, using only the dipole–dipole term, expressed as E 3 = (μ10 μ10 )T 10,10, then the derived coefficient is the product μ10 μ10 of the two dipoles, with no information about the separate magnitudes. Even if the two molecules are identical, there is still a sign ambiguity. However, if even one coefficient is known (such as the charge of fragment A in the CF+–H2 system), then a cascade of determinations occurs through the set of coefficients. Here, the known charge value determines the related multipoles in the electrostatic expansion, such as Θ20 in the leading term q AΘ20 , which in turn defines others, until all included multipoles are determined. Once the multipoles are known, the polarizability coefficients become determinable in the induction terms, which also define the dispersion coefficients. A similar effect can be achieved by knowing and fixing at least one coefficient in neutral systems, which is convenient since often at least one quantity might be available or easily computed.
For the CF+–H2 system, our data set for the PES was generated at the CCSD(T)-F12b/CBS level, with all electrons included in the correlation treatment, which is generally spectroscopically accurate in light-atom systems where affordable. To estimate the CBS limit, total energy data from the CVTZ-F12 and CVQZ-F12 basis sets were extrapolated using the l –3 formula. At this high level of electronic structure theory, one expects physically realistic coefficients from the fitting, though generally some differences can be expected due to basis set incompleteness, order of correlation treatment, inclusion/exclusion of other corrections, truncation of the fitting expansion, and fitting error. The data span 10–30 Å of separation, and with only the leading third-order interaction (electrostatic charge–quadrupole), the RMSE is large, about 1.2 cm–1 and the R 2 is only 0.8. The histogram of residuals shown in the first panel of Figure is very unbalanced, and the fit is deemed poor. At fourth-order (third row in the table, Figure ), adding the dipole–quadrupole and introducing the first induction term dramatically improve the fitting error and R 2 to 0.08 cm–1 and 0.999, respectively. Induction seems quite important, and by fifth-order, the overall fit is already quite good. Dispersion enters at sixth-order, but its inclusion appears to be less important. The fitting error improves by roughly an order of magnitude with each increase in the order of the fit, before plateauing between seventh- and eighth-orders. No significant further improvements were found in tests up to 12th-order. The hexadecapole of H2 first enters at fifth-order (row 4 in the table, Figure ) and its coefficient can be seen to jump to a realistic value in that row, from being essentially undetermined at lower orders. As seen in the table, the coefficients adjust slightly as new terms are introduced but are relatively stable. The final RMSE of 0.0005 cm–1 is on the order of the convergence of the electronic structure data and together with the balanced histogram of residuals is deemed to be a practically perfect fit. We interpret the abrupt plateau in the convergence behavior to indicate that at that order, all of the relevant terms needed to represent the coordinate range in the data set are included. If the data were extended to shorter interaction distances, then higher-order terms would become valuable/necessary.
For this system, it is remarkable how accurately the known quantities of H2 were obtained, incidental to our fitting. A 2018 paper by Miliordos and Hunt studied the bond distance dependence of electrostatic and polarizability expansions of H2 at the CCSD level, which is Full-CI for isolated H2. Their results are consistent with previous high-level studies of H2. In table V of their paper, they list their best computed values for the quadrupole, hexadecapole, and both components of the dipole-polarizability tensor for r H2 = r 0. For the quadrupole and hexadecapole, our values from fitting the LR of the CF+–H2 data set are 0.4817 and 0.3155 au, respectively, extremely close to the values of 0.4823 and 0.3139 au reported by Miliordos and Hunt. For dipole polarizability, they report the two Cartesian tensor components α zz and α xx at 6.7178 and 4.7308 au, respectively. These values can be directly compared to the two components of our spherical tensor representation, α10,10 and α1c,1c, for which we obtain values of 6.7023 and 4.5633 au, respectively.
3.2. C10H––H2
Carbon-chain anions have been identified in interstellar and circumstellar media fairly recently, offering insights into the physical and chemical conditions in different astrophysical contexts. − Their quantities also inform the free electron density, which relate to the rate of cloud collapse and star formation. Carbon-chain anions are much less abundant than their neutral counterparts, and current chemical models fail to accurately replicate the observed anion-to-neutral ratios. In cool, low-density areas where anions are found, thermal equilibrium statistics generally do not hold, and accurate emission line modeling must consider radiative and collisional rates with H2, the most commonly encountered molecular species. ,
This system, shown in Figure , is composed of two linear molecules: the anion C10H– (with symmetry C ∞v) and H2, which is a neutral linear molecule (with symmetry D ∞h). Since the fragment symmetries and the presence of a charge (negative in this case) are the same as the previous example, we expect to see the same set of interaction types, beginning with the electrostatic charge–quadrupole term (∝R –3) and the induction leading term (∝R –4). The anion is a long chain of 11 atoms, making the PES extremely anisotropic. However, the anisotropy is more complicated and interesting than just the obvious steric effect of the long C10H– fragment but can be interpreted via the leading electrostatic terms. Figure shows radial cuts through the PES for various orientations of the two fragments. The plot also includes the close interaction region, combining two fits into a global representation. The electronic structure data set for this PES was computed at the CCSD(T)-F12b/VTZ-F12 level. Anisotropy in this context is defined as a strong sensitivity to angular coordinates. For this system, far into the LR, the charge–quadrupole term is dominant and thus the PES has little dependence on the orientation of the anion but depends strongly on the orientation of H2 (which governs the quadrupole term). Moving to closer distances, the PES also takes on a strong dependence on the orientation of the C10H– anion, first partly due to its large dipole moment and then due also to the steric effect. The shape of C10H– dictates that the onset of short-range repulsion occurs much further out for approach to the ends of the molecule, relative to approaching from the side. Indeed, in this way, the boundary defining the onset of the long range by a negligible charge overlap depends on the orientation of C10H–. Figure shows that when C10H– has a side-on orientation (βA = 90), curves for various orientations of H2 (βB) all have repulsive walls at much closer distances. However, it is fascinating to see that even in the regions of deepest attraction, electrostatic effects are retained such that the PES is enormously sensitive to the orientation of H2. For example, around R = 9 Å, the collinear (βA = 180, βB = 0) global minimum is located with a well depth of more than 600 cm–1, but by only rotating H2 from end-on to side-on (βB = 90), the well is eliminated almost completely, becoming more than 500 cm–1 less stable (compare the black dashed and light blue traces in the figure). At even closer distances, similar behavior is noted for the cuts with minima around R = 3.5 Å, representing the approach to the side of the anion. Thus, overall, the PES has strong sensitivity to the orientation of H2, all the way from the far long range to the close interaction regions.
8.

Representation of the internal coordinates for the C10H––H2 system.
9.

Radial cuts through a combined close interaction/LR PES for C10H––H2 are plotted for various orientations of the two fragments. Behavior with respect to the center-of-mass distance R is shown in the plot. The angles listed in the figure legend in degrees are βA, βB, and α, respectively, which are the angles shown in Figure . Remarkable behavior is noted with respect to the orientation of both C10H– and H2 (βA and βB); see the text.
The effect of the shape of molecules such as C10H– on where the onset of the long range can be defined has motivated developments such as distributed multipoles. Here, we maintain a single expansion but have implemented a strategy to allow some of the data from approach to the side of C10H– to be included, since it still meets the negligible overlap criterion. LRF includes an optional “LeRoy Filter” algorithm, with a specified distance parameter, that determines whether a particular geometrical configuration belongs to the fitting set or not, based on the minimal distance for any pair of atoms belonging to different molecules (r ij ). This simple approach captures the steric aspects of the charge distribution, allowing classification of the boundary regions. This methodology allows for a more robust fit in cases of large anisotropy since more data points can be included (see Figure ), providing broader coverage and a smoother switch to the short-range representation. C10H––H2 is provided as an example system in Supporting Information, including instructions on how to employ this LeRoy Filter algorithm.
10.

Points from the C10H––H2 data set are plotted, indicating intermolecular separation R and orientation of C10H– (βA). The plot illustrates the idea behind LRF’s algorithm for treating highly anisotropic data sets (see the text). The method carves out a region (red points), which can be added to what is normally considered the long range beyond a simple distance cutoff (blue points). A hypothetical fixed long-range cutoff at R LR = 15 Å partitions the data into R < 15 Å (red, ≈35% of all points) and R ≥ 15 Å (blue).
C10H––H2 has the same partner (H2) as the previous CF+–H2 system, but it is much more challenging to represent C10H– as a single expansion than it is for CF+. Results for the fitting are shown in Figure S3 of Supporting Information. In contrast to the behavior of the CF+–H2 fit, the fitting error for C10H––H2 drops more slowly with order, reaching 0.1 cm–1 at the 10th-order.
Another important feature is outlier detection. Data sets obtained from ab initio calculations may occasionally contain unconverged points inconsistent with the rest of the data set. Wild errors completely outside the energy range are easily identified, but subtle irregularities can appear with magnitudes of only fractions of a cm–1. For example, in some cases, such small disruptions have been attributed to the use of incomplete auxiliary bases in explicitly correlated (F12) calculations. Small errors are harder to identify, especially if the PES includes both attractive and repulsive energies for various combinations of coordinates. LRF identifies outliers to the fit with a weighting that emphasizes relative error, thus becoming more sensitive at large R. Any outliers that are not accommodated by a generous set of physically correct interaction terms can be selected for removal. No such points were noted in the example systems discussed here, except in some preliminary data for C20H20–Ne, as discussed in that section.
3.3. CO-Dimer
LRF handles identical-monomer dimers by enforcing exchange symmetry in its angular basis so that the fit is invariant to the exchange of molecules, and the two sets of expansion coefficients are constrained to be equal. (CO)2 is one of many examples of such systems. CO is a common species in cometary comae, produced by sublimation of its ice as the comet passes near a star, and therefore, scattering data for CO with a variety of collision partnersother common components of the comaeare needed for modeling those environments. Other components to be considered as collision partners include water, CO2, and HCN/HNC, but collisions with other CO molecules are also important to model. −
The nature of LR interactions in (CO)2 helps to illustrate the strength of our LRF method and emphasizes some of the points we have been making about the possible inadequacy of a few leading terms to represent the long range. CO is a polar molecule, so the leading electrostatic term is the dipole–dipole interaction, proportional to R –3, appearing three orders before the induction and dispersion interactions which both enter at sixth-order and are proportional to R –6. The dipole of CO is known to be small (0.043 au) in the ground vibrational state and, in an interesting contradiction to expectations based on atomic electronegativities, is oriented with the negative end on the C atom. Our ab initio data set is again at the CCSD(T)-F12b/CBS level with all electrons correlated and was used to construct a PES of demonstrated spectroscopic accuracy and so is expected to accurately reflect the electronic structure. Fitting the data set in the distance range of 6–30 Å finds the single leading dipole–dipole term almost useless, yielding an RMSE of 3.9 cm–1 and an R 2 of only 0.07, with the resulting model qualitatively different from the PES. Adding the fourth- and fifth-order electrostatics, which includes the quadrupole–quadrupole interaction among other terms, yields only a modest improvement with values of 3.3 cm–1 and 0.35 for the RMSE and R 2, respectively. The fit quality changes dramatically at the sixth-order. Adding the sixth-order electrostatic terms alone does not produce any improvement, but with the sixth-order induction and dispersion included, the fitting error improves by an order of magnitude, and the R 2 jumps above 0.99. At this stage, the model is in qualitative agreement with the ab initio data, though still higher terms do continue to improve the fit, which reaches an RMSE of 0.01 cm–1 at 12th-order. The coefficient for the dipole, obtained by fitting the data as an LR expansion is 0.094 au, small as expected, but somewhat larger than the experimental value (0.043 au). To explore this difference a bit further, we performed a four-point finite field calculation of the monomer dipole at the same level of electronic structure theory as the PES (CCSD(T)-F12b/CBS), which yielded a result of 0.045 au, very close to the experimental value. The CO-dimer system has long been known to be significantly impacted by even higher-order correlation not captured by CCSD(T). , Thus, our interpretation of the fitting behavior for this case is that while the monomer dipole is well captured at the employed level of theory, higher terms in the interaction are less well represented and the dipole coefficient adjusts slightly to compensate for other terms in order to minimize the fitting error.
Figures and illustrate the behavior of the fits and the relevance of various contributions. In Figure , the combined electrostatic terms from third- to sixth-order are compared with the total energy for distances beyond 16 Å. As seen in the figure, the electrostatic contribution matches the total energy quite well in the far-range, and the two merge near 25 Å. In contrast, for shorter distances, as seen in Figure , the combined electrostatic terms from third- to sixth-order simply lack the necessary form (both radial and angular) to represent the data and gross errors are seen in comparison to the total energy. This highlights the key role played by induction/dispersion in this system and the importance of including a sufficient set of interactions to obtain an accurate and balanced representation.
11.

CO–CO long-range potential: total energy (blue) and electrostatic contribution (orange) as functions of separation R and orientation β1. Angles α and β2 are both fixed at 0 degrees. In the R = 16–26 Å regime, the interaction energy is small and the residual anisotropy is governed by the R –3 dipole–dipole and R –4 dipole–quadrupole terms, while induction/dispersion (∝R –6) becomes negligible. The decomposition highlights that far-field behavior is dominated by electrostatics (see the text).
12.

CO–CO midrange potential: total energy (blue) and electrostatic contribution (orange) versus R and β1 for R = 6–10 Å. Despite electrostatics beginning at third-order, the well depth and strong orientational dependence arise primarily from induction and dispersion. The gap between the two surfaces highlights the dominance of nonelectrostatic interactions in the near field (see the text).
3.4. C20H20–Ne
Fascination with highly symmetric systems goes back at least as far as Plato’s dialogues, and society as a whole has shown special interest in molecular manifestations of the Platonic solids such as tetrahedrane, cubane, octahedrane, dodecahedrane, and of course fullerene. Scientists are studying such systems not only for reasons such as their synthesis and reactivity, but also for their possible unusual behavior and properties, as hosts of encapsulated molecules, for cold control, and even as possible molecular qubits. −
Symmetry has strong implications for LR interactions. For example, as we mentioned in the Introduction, the leading electrostatic term for interactions between two C60 fullerenes is at 13th-order, while dispersion begins at sixth. The efficiency of cooling rotational degrees of freedom via collisions, e.g., the rate of rotationally inelastic collisions, is largely determined by the anisotropy of the PES. Although dispersion begins at the sixth-order, it is higher-order dispersion terms that provide anisotropy, the subtle corrugation of the PES for these systems. This is why rotationally cooling C60 by collisions with atomic bath gases like He or Ne has proven to be so inefficient.
Studying such systems computationally also presents challenges. Dodecahedrane and C60 have 140 and 360 electrons, respectively, even without a collision partner, making them prohibitively expensive to treat with large basis sets and high-order correlation treatments in supersystem calculations. On the other hand, trying to estimate the interactions from a monomer property-based approach is also difficult given the high order of the relevant terms.
Here, as an illustration, we surveyed the LR interactions of dodecahedrane (C20H20) with a neon atom (Ne). First, we computed a small data set of interaction energies distributed in distance from R = 6–30 Å using a new local correlation method called PNO-LCCSD(T)-F12 as implemented in MOLPRO 2025.2, which has been introduced as a cost-effective beyond-DFT approximation to explicitly correlated coupled-cluster theory. , We employed recommended tight settings of DOMOPT = TIGHT, DOMOPT_INTMOL = TIGHT+, but noted some behavior in the data that made it unsuitable for this purpose. We observed a small discontinuity (17 cm–1) in a radial cut, just past 10.5 Å, and also some general noise throughout (e.g., erratic points along angular test cuts), on the order of a few wavenumbers. This is likely due to the domain approximations in the local correlation method. The tightness of the domain approximations for PNO-LCCSD(T)-F12 significantly affects the cost, so we did not tighten them further since the cost for each geometry was already quite high, and the goal here was to illustrate capabilities of LRF, rather than performing a high-level study of dodecahedrane. Though perhaps less accurate, we switched to the DF-MP2-F12 method, which is not a local correlation method, in order to get more precisely converged data to better determine the subtle anisotropy of this system. Ultimately, about 170 energies were computed using MOLPRO at the DF-MP2-F12/VDZ-F12 level, each taking about 2 h to converge using 4 cores. This relatively small number of points is generous due to the very high symmetry, which dictates that there are only a few nonzero coefficients.
Dodecahedrane has icosahedral symmetry, and Ne has spherical symmetry, which significantly limits the number of terms in the LR expansion. For the interaction with Ne, there are no electrostatic or induction terms through the 15th-order, but dispersion appears in even numbered orders beginning at the sixth (sixth, eighth, 10th, ...). The sixth- and eighth-order contributions are isotropic, so anisotropy enters at 10th-order, which limits its effect to relatively close distances as discussed also by Klos et al. for C60.
The interaction between C20H20 and Ne is perhaps counterintuitive, considering the enormous size of C20H20, but can be rationalized based on the high-order nonzero terms in the LR expansion. The system is shown in Figure , with the H atoms extending slightly more than 3 Å in all directions from the origin at the center-of-mass. Due to the high symmetry of C20H20, and the spherical collision partner, there is very little interaction at longer distances. As shown in Figure , the interaction, governed by the even ordered dispersion series, turns on significantly inside of 7.5 Å, which given the size of C20H20 is quite close. In our radial test cuts with the DF-MP2-F12 method, we found the minimum to be near 6 Å, and further in, at 5 Å, is already significantly repulsive (above the asymptote). The anisotropy is also apparent in the plot as waviness of the lines. As mentioned above, anisotropy begins with the 10th-order term in the dispersion series, which greatly limits the range of its relevance. The symmetry and anisotropy can be better appreciated in Figure , which plots the PES with respect to the two angles with R fixed at 7.0 Å. There is 4.7 cm–1 of variation in Figure , where the mean value is −29 cm–1, so the relative anisotropy is about 16%. This is in contrast to the nature of the C60 with He or Ar systems mentioned earlier, where the anisotropy in the long range was mentioned to be only about 1% and would correspond to extremely low efficiency for collisionally cooling the rotational temperature. Note, however, that the relative anisotropy is strongly dependent on distance. In our results, the relative anisotropy drops off rapidly to only 0.16% at R = 12 Å, a 100-fold decrease. He and Ne are the most commonly employed cooling buffer gases in experimental studies. Those authors mention that for C60, anisotropy does increase in the close interaction region due to nondispersive contributions, and similar effects are expected for C20H20 also. We ascribe the markedly more significant anisotropy in the LR of C20H20 vs C60, being due to the hydrogen atom sticking out from each C atom in C20H20. Thus, even with the same high I h symmetry, the anisotropy can vary significantly, but since the anisotropy is governed by terms starting at the 10th-order, it is quite constrained in terms of the range of distances for which it is relevant. In our results, the relative anisotropy for C20H20 and Ne drops off at a rate with an R-dependence between 1/R 7 and 1/R 8, but this is interpreted with caution due to the modest level of theory and small basis set. Lowering the symmetry of the system of interest via functionalization, or perhaps that of the buffer gas, can change the nature of the interactions dramatically and would affect the efficiency of cooling. No doubt, there are practical constraints on the choice of buffer gases, but it is interesting to note that (without modifying the system of interest) switching to a molecular collidereven H2would turn on lower-order terms with anisotropy, including those from the electrostatic and induction series.
13.

Dodecahedrane is shown interacting with the Ne atom, with C20H20 oriented according to LRF’s convention for I h symmetry (see Table ).
14.

2D plot of R and γ with β fixed at 90 deg. Energy is plotted in units of cm–1. The modest, but significant anisotropy is apparent as well as the repeating pattern due to symmetry (see the text).
15.

2D plot of γ and β, with R fixed at 7.0 Å. Energy is plotted in units of cm–1. The high symmetry and corrugated nature of the PES are illustrated (see the text).
4. Conclusions
We have introduced long-range-fit (LRF), a software package to construct and study a representation of LR molecular interactions based on a spherical tensor expansion of electrostatic, induction, and dispersion interactions. LRF can implement specified values of electrostatic moments and polarizabilities, or more typically, can process ab initio data from any electronic structure package and fit the coefficients for a selected set of interaction terms. The program can handle any two atoms or rigid molecules, charged or neutral, in ground or excited (nondegenerate) electronic states, and proper treatment is given to identical or chiral partners. LRF is adapted to all molecular symmetry point groups and upon specification of the two interacting species will produce a table of all relevant interaction terms, organized by electrostatics, induction, and dispersion, up to 15th-order (R –15). Terms to be included in the expansion can then be specified by order for each interaction type.
LRF is built under a MATLAB GUI, which can be run on a WINDOWS PC and provides a user-friendly interface with various menus and check boxes to specify the workflow, with many options. For strongly anisotropic molecules, LRF includes an algorithm that goes beyond a simple radial cutoff when specifying the long range. The package also provides an analysis platform to obtain information and interpretation of the different interactions and their individual components. There are built-in tools to assess fit quality, provide outlier detection, PES visualization, and export efficient (FORTRAN) routines to allow evaluation of the LR interactions and integration with short-range PESs and/or dynamics packages.
We have highlighted some of the features and capabilities of LRF by treating four example systems that span cases of ions, extreme anisotropy, unusual electrostatics/dispersion, and high symmetry. LRF is designed as a convenient tool to extend any developed ab initio PES with an accurate long range, including as an easy update to existing PESs. Accuracy of the LR is especially relevant to spectroscopic and scattering studies in low-density and low-temperature environments such as the atmosphere, ISM, circumstellar environments, and atmospheres of moons, planets, and comets.
Supplementary Material
Acknowledgments
This work was supported by the Kummer Institute for Student Success of the Missouri University of Science and Technology and the United States Department of Energy (DOE), Grant No. DE-SC0025420.
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jctc.5c01984.
LRF user manual describing how to obtain and submit the signed license agreement and then the software, which is provided with two example data sets; installed file directory (S1); an image of the LRF dashboard following the two examples (S2 and S3); and an image illustrating how the exported fit can be called as a Fortran routine (S4) (PDF)
The authors declare no competing financial interest.
References
- Brasseur, G. P. ; Jacob, D. J. . Modeling of Atmospheric Chemistry; Cambridge University Press, 2017. [Google Scholar]
- Schlegel H. B.. Exploring potential energy surfaces for chemical reactions: an overview of some practical methods. J. Comput. Chem. 2003;24:1514–1527. doi: 10.1002/jcc.10231. [DOI] [PubMed] [Google Scholar]
- Biczysko M., Bloino J., Puzzarini C.. Computational challenges in Astrochemistry. Wiley Interdiscip. Rev.:Comput. Mol. Sci. 2018;8:e1349. doi: 10.1002/wcms.1349. [DOI] [Google Scholar]
- Sandford S. A., Nuevo M., Bera P. P., Lee T. J.. Prebiotic astrochemistry and the formation of molecules of astrobiological interest in interstellar clouds and protostellar disks. Chem. Rev. 2020;120:4616–4659. doi: 10.1021/acs.chemrev.9b00560. [DOI] [PubMed] [Google Scholar]
- Puzzarini C.. Gas-phase chemistry in the interstellar medium: the role of laboratory astrochemistry. Front. Astron. Space Sci. 2022;8:811342. doi: 10.3389/fspas.2021.811342. [DOI] [Google Scholar]
- Desrousseaux B., Quintas-Sánchez E., Dawes R., Marinakis S., Lique F.. Collisional excitation of interstellar PN by H2: New interaction potential and scattering calculations. J. Chem. Phys. 2021;154:034304. doi: 10.1063/5.0039145. [DOI] [PubMed] [Google Scholar]
- Toscano J., Lewandowski H., Heazlewood B. R.. Cold and controlled chemical reaction dynamics. Phys. Chem. Chem. Phys. 2020;22:9180–9194. doi: 10.1039/D0CP00931H. [DOI] [PubMed] [Google Scholar]
- Heazlewood B. R., Softley T. P.. Towards chemistry at absolute zero. Nat. Rev. Chem. 2021;5:125–140. doi: 10.1038/s41570-020-00239-0. [DOI] [PubMed] [Google Scholar]
- Liu Y., Luo L.. Molecular collisions: From near-cold to ultra-cold. Front. Phys. 2021;16:42300. doi: 10.1007/s11467-020-1037-6. [DOI] [Google Scholar]
- Balakrishnan N., Jambrina P. G., Croft J. F., Guo H., Aoiz F. J.. Quantum stereodynamics of cold molecular collisions. Chem. Commun. 2024;60:1239–1256. doi: 10.1039/D3CC04762H. [DOI] [PubMed] [Google Scholar]
- Schnell M., Meijer G.. Cold molecules: preparation, applications, and challenges. Angew. Chem., Int. Ed. 2009;48:6010–6031. doi: 10.1002/anie.200805503. [DOI] [PubMed] [Google Scholar]
- Liu Y., Ni K.-K.. Bimolecular chemistry in the ultracold regime. Annu. Rev. Phys. Chem. 2022;73:73–96. doi: 10.1146/annurev-physchem-090419-043244. [DOI] [PubMed] [Google Scholar]
- Bohn J. L., Rey A. M., Ye J.. Cold molecules: Progress in quantum engineering of chemistry and quantum matter. Science. 2017;357:1002–1010. doi: 10.1126/science.aam6299. [DOI] [PubMed] [Google Scholar]
- Truhlar, D. Potential Energy Surfaces and Dynamics Calculations: For Chemical Reactions and Molecular Energy Transfer; Springer Science & Business Media, 2013. [Google Scholar]
- Schatz G. C.. The analytical representation of electronic potential-energy surfaces. Rev. Mod. Phys. 1989;61:669. doi: 10.1103/RevModPhys.61.669. [DOI] [Google Scholar]
- Jiang B., Li J., Guo H.. High-fidelity potential energy surfaces for gas-phase and gas–surface scattering processes from machine learning. J. Phys. Chem. Lett. 2020;11:5120–5131. doi: 10.1021/acs.jpclett.0c00989. [DOI] [PubMed] [Google Scholar]
- Simkó I., Felker P. M., Bačić Z.. HCl trimer: HCl-stretch excited intramolecular and intermolecular vibrational states from 12D fully coupled quantum calculations employing contracted intra-and inter-molecular bases. J. Chem. Phys. 2024;160:164304. doi: 10.1063/5.0207366. [DOI] [PubMed] [Google Scholar]
- Simkó I., Felker P. M., Bačić Z.. H2O trimer: Rigorous 12D quantum calculations of intermolecular vibrational states, tunneling splittings, and low-frequency spectrum. J. Chem. Phys. 2025;162:034301. doi: 10.1063/5.0250018. [DOI] [PubMed] [Google Scholar]
- Li Z., Kermode J. R., De Vita A.. Molecular dynamics with on-the-fly machine learning of quantum-mechanical forces. Phys. Rev. Lett. 2015;114:096405. doi: 10.1103/PhysRevLett.114.096405. [DOI] [PubMed] [Google Scholar]
- Pratihar S., Ma X., Homayoon Z., Barnes G. L., Hase W. L.. Direct chemical dynamics simulations. J. Am. Chem. Soc. 2017;139:3570–3590. doi: 10.1021/jacs.6b12017. [DOI] [PubMed] [Google Scholar]
- Sathyamurthy N., Raff L. M.. Quasiclassical trajectory studies using 3D spline interpolation of ab initio surfaces. J. Chem. Phys. 1975;63:464–473. doi: 10.1063/1.431126. [DOI] [Google Scholar]
- Dawes R., Thompson D. L., Guo Y., Wagner A. F., Minkoff M.. Interpolating moving least-squares methods for fitting potential energy surfaces: Computing high-density potential energy surface data from low-density ab initio data points. J. Chem. Phys. 2007;126:184108. doi: 10.1063/1.2730798. [DOI] [PubMed] [Google Scholar]
- Dawes R., Wang X.-G., Jasper A. W., Carrington T.. Nitrous oxide dimer: A new potential energy surface and rovibrational spectrum of the nonpolar isomer. J. Chem. Phys. 2010;133:134304. doi: 10.1063/1.3494542. [DOI] [PubMed] [Google Scholar]
- Majumder M., Ndengue S. A., Dawes R.. Automated construction of potential energy surfaces. Mol. Phys. 2016;114:1–18. doi: 10.1080/00268976.2015.1096974. [DOI] [Google Scholar]
- Dral P. O., Owens A., Dral A., Csányi G.. Hierarchical machine learning of potential energy surfaces. J. Chem. Phys. 2020;152:204110. doi: 10.1063/5.0006498. [DOI] [PubMed] [Google Scholar]
- Schmitz G., Godtliebsen I. H., Christiansen O.. Machine learning for potential energy surfaces: An extensive database and assessment of methods. J. Chem. Phys. 2019;150:244113. doi: 10.1063/1.5100141. [DOI] [PubMed] [Google Scholar]
- Kang P.-L., Shang C., Liu Z.-P.. Large-scale atomic simulation via machine learning potentials constructed by global potential energy surface exploration. Acc. Chem. Res. 2020;53:2119–2129. doi: 10.1021/acs.accounts.0c00472. [DOI] [PubMed] [Google Scholar]
- Bowman J. M., Qu C., Conte R., Nandi A., Houston P. L., Yu Q.. Δ-machine learned potential energy surfaces and force fields. J. Chem. Theory Comput. 2023;19:1–17. doi: 10.1021/acs.jctc.2c01034. [DOI] [PubMed] [Google Scholar]
- Quintas-Sánchez E., Dawes R.. AUTOSURF: A Freely Available Program to Construct Potential Energy Surfaces. J. Chem. Inf. Model. 2019;59:262–271. doi: 10.1021/acs.jcim.8b00784. [DOI] [PubMed] [Google Scholar]
- Metz M. P., Piszczatowski K., Szalewicz K.. Automatic Generation of Intermolecular Potential Energy Surfaces. J. Chem. Theory Comput. 2016;12:5895–5919. doi: 10.1021/acs.jctc.6b00913. [DOI] [PubMed] [Google Scholar]
- Győri T., Czakó G.. Automating the Development of High-Dimensional Reactive Potential Energy Surfaces with the ROBOSURFER Program System. J. Chem. Theory Comput. 2020;16:51–66. doi: 10.1021/acs.jctc.9b01006. [DOI] [PubMed] [Google Scholar]
- Weck P. F., Balakrishnan N.. Importance of long-range interactions in chemical reactions at cold and ultracold temperatures. Int. Rev. Phys. Chem. 2006;25:283–311. doi: 10.1080/01442350600791894. [DOI] [Google Scholar]
- Dörfler A. D., Eberle P., Koner D., Tomza M., Meuwly M., Willitsch S.. Long-range versus short-range effects in cold molecular ion-neutral collisions. Nat. Commun. 2019;10:5429. doi: 10.1038/s41467-019-13218-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Patkowski K., Cencek W., Jankowski P., Szalewicz K., Mehl J. B., Garberoglio G., Harvey A. H.. Potential energy surface for interactions between two hydrogen molecules. J. Chem. Phys. 2008;129:094304. doi: 10.1063/1.2975220. [DOI] [PubMed] [Google Scholar]
- Georgievskii Y., Klippenstein S. J.. Long-range transition state theory. J. Chem. Phys. 2005;122:194103. doi: 10.1063/1.1899603. [DOI] [PubMed] [Google Scholar]
- Le Roy R. J., Huang Y., Jary C.. An accurate analytic potential function for ground-state N2 from a direct-potential-fit analysis of spectroscopic data. J. Chem. Phys. 2006;125:164310. doi: 10.1063/1.2354502. [DOI] [PubMed] [Google Scholar]
- Roy R. J. L., Henderson R. D. E.. A new potential function form incorporating extended long-range behaviour: Application to ground-state Ca2 . Mol. Phys. 2007;105:663–677. doi: 10.1080/00268970701241656. [DOI] [Google Scholar]
- Le Roy R. J., Haugen C. C., Tao J., Li H.. Long-range damping functions improve the short-range behaviour of MLR potential energy functions. Mol. Phys. 2011;109:435–446. doi: 10.1080/00268976.2010.527304. [DOI] [Google Scholar]
- Braams B. J., Bowman J. M.. Permutationally Invariant Potential Energy Surfaces in High Dimensionality. Int. Rev. Phys. Chem. 2009;28:577–606. doi: 10.1080/01442350903234923. [DOI] [PubMed] [Google Scholar]
- Valentini P., Grover M. S., Josyula E.. Constructing feed-forward artificial neural networks to fit potential energy surfaces for molecular simulation of high-temperature gas flows. Phys. Rev. E. 2020;102:053302. doi: 10.1103/physreve.102.053302. [DOI] [PubMed] [Google Scholar]
- Manzhos S., Carrington T. Jr. Neural Network Potential Energy Surfaces for Small Molecules and Reactions. Chem. Rev. 2021;121:10187–10217. doi: 10.1021/acs.chemrev.0c00665. [DOI] [PubMed] [Google Scholar]
- Jiang B., Li J., Guo H.. Potential energy surfaces from high fidelity fitting of ab initio points: the permutation invariant polynomial - neural network approach. Int. Rev. Phys. Chem. 2016;35:479–506. doi: 10.1080/0144235X.2016.1200347. [DOI] [Google Scholar]
- Fu B., Zhang D. H.. Accurate Fundamental Invariant-Neural Network Representation of Ab Initio Potential Energy Surfaces. Natl. Sci. Rev. 2023;10:nwad321. doi: 10.1093/nsr/nwad321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jeziorski B., Moszynski R., Szalewicz K.. Perturbation theory approach to intermolecular potential energy surfaces of van der Waals complexes. Chem. Rev. 1994;94:1887–1930. doi: 10.1021/cr00031a008. [DOI] [Google Scholar]
- Stone, A. J. Theory of Intermolecular Forces; Oxford University Press, 2013. [Google Scholar]
- Elking D. M., Perera L., Duke R., Darden T., Pedersen L. G.. A finite field method for calculating molecular polarizability tensors for arbitrary multipole rank. J. Comput. Chem. 2011;32:3283–3295. doi: 10.1002/jcc.21914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maroulis G., Karamanis P.. Finite-field Møller–Plesset perturbation theory and coupled cluster calculations of the electric multipole moments and the dipole polarizability of As2. Chem. Phys. 2001;269:137–146. doi: 10.1016/S0301-0104(01)00340-8. [DOI] [Google Scholar]
- Stone A., Tough R.. Spherical tensor theory of long-range intermolecular forces. Chem. Phys. Lett. 1984;110:123–129. doi: 10.1016/0009-2614(84)80160-8. [DOI] [Google Scholar]
- Price S., Stone A., Alderton M.. Explicit formulae for the electrostatic energy, forces and torques between a pair of molecules of arbitrary symmetry. Mol. Phys. 1984;52:987–1001. doi: 10.1080/00268978400101721. [DOI] [Google Scholar]
- Buckingham A., Fowler P., Hutson J. M.. Theoretical studies of van der Waals molecules and intermolecular forces. Chem. Rev. 1988;88:963–988. doi: 10.1021/cr00088a008. [DOI] [Google Scholar]
- Metz M. P., Szalewicz K.. Automatic Generation of Flexible-Monomer Intermolecular Potential Energy Surfaces. J. Chem. Theory Comput. 2020;16:2317–2339. doi: 10.1021/acs.jctc.9b01241. [DOI] [PubMed] [Google Scholar]
- Yu Y., Yang D., Hu X., Xie D.. ABLRI: A program for calculating the long-range interaction energy between two monomers in their non-degenerate states. J. Chem. Phys. 2024;160:172501. doi: 10.1063/5.0205486. [DOI] [PubMed] [Google Scholar]
- Unsöld A.. Quantentheorie des Wasserstoffmolekülions und der Born-Landéschen Abstoßungskräfte. Z. Phys. 1927;43:563–574. doi: 10.1007/BF01397633. [DOI] [Google Scholar]
- London F.. Über einige eigenschaften und anwendungen der molekularkräfte. Z. Phys. Chem. B. 1930;11:222–251. [Google Scholar]
- Yu Y., Hu X., Guo H., Xie D.. Long-Range Interaction between Molecules with Electronic Degeneracy: Beyond the Born–Oppenheimer Approximation. JACS Au. 2025;5:5079. doi: 10.1021/jacsau.5c00966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lepers, M. ; Dulieu, O. . Cold Chemistry: Molecular Scattering and Reactivity near Absolute Zero; The Royal Society of Chemistry, 2017. [Google Scholar]
- Hättig C.. Recurrence relations for the direct calculation of spherical multipole interaction tensors and Coulomb-type interaction energies. Chem. Phys. Lett. 1996;260:341–351. doi: 10.1016/0009-2614(96)00952-9. [DOI] [Google Scholar]
- Neufeld D. A., Wolfire M. G., Schilke P.. The Chemistry of Fluorine-bearing Molecules in Diffuse and Dense Interstellar Gas Clouds. Astrophys. J. 2005;628:260–274. doi: 10.1086/430663. [DOI] [Google Scholar]
- Neufeld D. A., Schilke P., Menten K. M., Wolfire M. G., Black J. H., Schuller F., Müller H. S. P., Thorwirth S., Güsten R., Philipp S.. Discovery of interstellar CF+ . Astron. Astrophys. 2006;454:L37–L40. doi: 10.1051/0004-6361:200600015. [DOI] [Google Scholar]
- Desrousseaux B., Quintas-Sánchez E., Dawes R., Lique F.. Collisional excitation of CF+ by H2: Potential energy surface and rotational cross sections. J. Phys. Chem. A. 2019;123:9637–9643. doi: 10.1021/acs.jpca.9b05538. [DOI] [PubMed] [Google Scholar]
- Straume M., Johnson M. L.. Analysis of residuals: criteria for determining goodness-of-fit. Methods Enzymol. 1992;210:87. doi: 10.1016/0076-6879(92)10007-z. [DOI] [PubMed] [Google Scholar]
- Feller D., Peterson K. A., Crawford T. D.. Sources of error in electronic structure calculations on small chemical systems. J. Chem. Phys. 2006;124:054107. doi: 10.1063/1.2137323. [DOI] [PubMed] [Google Scholar]
- Miliordos E., Hunt K. L.. Dependence of the multipole moments, static polarizabilities, and static hyperpolarizabilities of the hydrogen molecule on the H–H separation in the ground singlet state. J. Chem. Phys. 2018;149:234103. doi: 10.1063/1.5066308. [DOI] [PubMed] [Google Scholar]
- Harada N., Herbst E.. Modeling carbon chain anions in L1527. Astrophys. J. 2008;685:272. doi: 10.1086/590468. [DOI] [Google Scholar]
- Cordiner M., Millar T., Walsh C., Herbst E., Lis D., Bell T., Roueff E.. Organic molecular anions in interstellar and circumstellar environments. Proc. Int. Astron. Union. 2008;4:157–160. doi: 10.1017/S1743921308021431. [DOI] [Google Scholar]
- McCarthy M., Gottlieb C., Gupta H., Thaddeus P.. Laboratory and astronomical identification of the negative molecular ion C6H. Astrophys. J. 2006;652:L141. doi: 10.1086/510238. [DOI] [Google Scholar]
- Walker K. M., Dumouchel F., Lique F., Dawes R.. The first potential energy surfaces for the C6H––H2 AND C6H––He collisional systems and their corresponding inelastic cross sections. J. Chem. Phys. 2016;145:024314. doi: 10.1063/1.4955200. [DOI] [PubMed] [Google Scholar]
- Walker K. M., Lique F., Dumouchel F., Dawes R.. Inelastic rate coefficients for collisions of C6H- with H2 and He. Mon. Not. R. Astron. Soc. 2017;466:831–837. doi: 10.1093/mnras/stw3065. [DOI] [Google Scholar]
- Rob F., Szalewicz K.. Distributed molecular polarisabilities and asymptotic intermolecular interaction energies. Mol. Phys. 2013;111:1430–1455. doi: 10.1080/00268976.2013.808770. [DOI] [Google Scholar]
- Dawes R., Wang X.-G., Carrington T. Jr. CO dimer: New potential energy surface and rovibrational calculations. J. Phys. Chem. A. 2013;117:7612–7630. doi: 10.1021/jp404888d. [DOI] [PubMed] [Google Scholar]
- Bostan D., Mandal B., Joy C., Żółtowski M., Lique F., Loreau J., Quintas-Sánchez E., Batista-Planas A., Dawes R., Babikov D.. Mixed quantum/classical calculations of rotationally inelastic scattering in the CO+ CO system: a comparison with fully quantum results. Phys. Chem. Chem. Phys. 2024;26:6627–6637. doi: 10.1039/D3CP05369E. [DOI] [PubMed] [Google Scholar]
- Castro-Juárez E., Wang X.-G., Carrington T., Quintas-Sánchez E., Dawes R.. Computational study of the ro-vibrational spectrum of CO–CO2 . J. Chem. Phys. 2019;151:084307. doi: 10.1063/1.5119762. [DOI] [PubMed] [Google Scholar]
- Zadrożny A., Jóźwiak H., Quintas-Sánchez E., Dawes R., Wcisło P.. Ab initio quantum scattering calculations for the CO–O2 system and a new CO–O2 potential energy surface: O2 and air broadening of the R(0) line in CO. J. Chem. Phys. 2022;157:174310. doi: 10.1063/5.0115654. [DOI] [PubMed] [Google Scholar]
- Godard Palluet A., Dawes R., Quintas-Sánchez E., Batista-Planas A., Lique F.. A promising statistical approach for studying the collisional excitation induced by CO: Application to the CS–CO system. J. Chem. Phys. 2025;162:241101. doi: 10.1063/5.0273126. [DOI] [PubMed] [Google Scholar]
- Tonolo F., Quintas-Sánchez E., Batista-Planas A., Dawes R., Lique F.. Collisional excitation of HCN by CO to refine the modeling of cometary comae. J. Phys. Chem. A. 2025;129:9583–9590. doi: 10.1021/acs.jpca.5c05448. [DOI] [PubMed] [Google Scholar]
- Rode M., Sadlej J., Moszynski R., Wormer P. E., van der Avoird A.. The importance of high-order correlation effects for the CO–CO interaction potential. Chem. Phys. Lett. 1999;314:326–332. doi: 10.1016/S0009-2614(99)01168-9. [DOI] [Google Scholar]
- Kodrycka M., Patkowski K.. Platinum, gold, and silver standards of intermolecular interaction energy calculations. J. Chem. Phys. 2019;151:070901. doi: 10.1063/1.5116151. [DOI] [PubMed] [Google Scholar]
- Komatsu K., Murata M., Murata Y.. Encapsulation of molecular hydrogen in fullerene C60 by organic synthesis. Science. 2005;307:238–240. doi: 10.1126/science.1106185. [DOI] [PubMed] [Google Scholar]
- Benjamin S. C., Ardavan A., Briggs G. A. D., Britz D. A., Gunlycke D., Jefferson J., Jones M. A., Leigh D. F., Lovett B. W., Khlobystov A. N.. et al. others Towards a fullerene-based quantum computer. J. Phys.: Condens. Matter. 2006;18:S867–S883. doi: 10.1088/0953-8984/18/21/s12. [DOI] [Google Scholar]
- Changala P. B., Weichman M. L., Lee K. F., Fermann M. E., Ye J.. Rovibrational quantum state resolution of the C60 fullerene. Science. 2019;363:49–54. doi: 10.1126/science.aav2616. [DOI] [PubMed] [Google Scholar]
- Kłos J., Tiesinga E., Kotochigova S.. Quantum scattering of icosahedron fullerene C60 with noble-gas atoms. Sci. Rep. 2024;14:9267. doi: 10.1038/s41598-024-59481-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Werner, H.-J. ; Knowles, P. J. ; Celani, P. ; Györffy, W. ; Hesselmann, A. ; Kats, D. ; Knizia, G. ; Köhn, A. ; Korona, T. ; Kreplin, D. ; Lindh, R. ; Ma, Q. ; Manby, F. R. ; Mitrushenkov, A. ; Rauhut, G. ; Schütz, M. ; Shamasundar, K. R. ; Adler, T. B. ; Amos, R. D. ; Bennie, S. J. ; Bernhardsson, A. ; Berning, A. ; Black, J. A. ; Bygrave, P. J. ; Cimiraglia, R. ; Cooper, D. L. ; Coughtrie, D. ; Deegan, M. J. O. ; Dobbyn, A. J. ; Doll, K. ; Dornbach, M. ; Eckert, F. ; Erfort, S. ; Goll, E. ; Hampel, C. ; Hetzer, G. ; Hill, J. G. ; Hodges, M. ; Hrenar, T. ; Jansen, G. ; Köppl, C. ; Kollmar, C. ; Lee, S. J. R. ; Liu, Y. ; Lloyd, A. W. ; Mata, R. A. ; May, A. J. ; Mussard, B. ; McNicholas, S. J. ; Meyer, W. ; Miller, T. F., III ; Mura, M. E. ; Nicklass, A. ; O’Neill, D. P. ; Palmieri, P. ; Peng, D. ; Peterson, K. A. ; Pflüger, K. ; Pitzer, R. ; Polyak, I. ; Reiher, M. ; Richardson, J. O. ; Robinson, J. B. ; Schröder, B. ; Schwilk, M. ; Shiozaki, T. ; Sibaev, M. ; Stoll, H. ; Stone, A. J. ; Tarroni, R. ; Thorsteinsson, T. ; Toulouse, J. ; Wang, M. ; Welborn, M. ; Ziegler, B. . MOLPRO: A Package of Ab Initio Programs, 2025. https://www.molpro.net/.
- Hansen A., Knowles P. J., Werner H.-J.. Accurate calculation of noncovalent interactions using PNO-LCCSD (T)-F12 in Molpro. J. Phys. Chem. A. 2025;129:4812–4833. doi: 10.1021/acs.jpca.5c02316. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




