Linear-Scaling Implementation of Multilevel Hartree–Fock Theory

Linda Goletto; Eirik F Kjønstad; Sarai D Folkestad; Ida-Marie Høyvik; Henrik Koch

doi:10.1021/acs.jctc.1c00299

. 2021 Nov 8;17(12):7416–7427. doi: 10.1021/acs.jctc.1c00299

Linear-Scaling Implementation of Multilevel Hartree–Fock Theory

Linda Goletto ^†, Eirik F Kjønstad ^†, Sarai D Folkestad ^†, Ida-Marie Høyvik ^†, Henrik Koch ^†,^‡,^*

PMCID: PMC8675138 PMID: 34747179

Abstract

We introduce a new algorithm for the construction of the two-electron contributions to the Fock matrix in multilevel Hartree–Fock (MLHF) theory. In MLHF, the density of an active molecular region is optimized, while the density of an inactive region is fixed. The MLHF equations are solved in a reduced molecular orbital (MO) basis localized to the active region. The locality of the MOs can be exploited to reduce the computational cost of the Fock matrix: the cost related to the inactive density becomes linear scaling, while the iterative cost related to the active density is independent of the system size. We demonstrate the performance of this new algorithm on a variety of systems, including amino acid chains, water clusters, and solvated systems.

Introduction

The most expensive step in a Hartree–Fock (HF) calculation is typically the construction of the two-electron contributions to the Fock matrix. While the formal scaling is Inline graphic , where N is a measure of the system size, it reduces asymptotically to ; only integrals are non-zero in the limit of large N. Furthermore, for sparse density matrices, the number of numerically significant exchange terms is reduced to , even if identifying these terms strictly implies a steeper scaling.¹ Much effort has been devoted to lower the quadratic scaling of the Coulomb term in the Fock matrix. For sufficiently large N, the Coulomb contributions can also be calculated in Inline graphic time.²

One strategy to achieve an Inline graphic Coulomb matrix is to introduce hierarchies of fine and coarse grains for close and remote interactions, respectively. With the Barnes–Hut method,³ the scaling was lowered to , while the continuous fast multipole method (CFMM) of White et al.² was the first scheme to reach linear scaling. Many alternative tree-like algorithms have since been developed, with the main goal of reducing the prefactor.^4,5 For the exchange term, the focus has been on efficiently identifying the numerically significant exchange integrals. The widely adopted LinK algorithm of Ochsenfeld et al.¹ presorts the contributing integrals while also incorporating permutational symmetry. Other strategies to further reduce the prefactor have been suggested.^6,7

An important reduction in the time required by the computation of the two-electron integrals has also been obtained through the density fitting (DF)—or resolution-of-identity (RI)—approximation.⁸ Applied on the Coulomb term first,^9,10 and later on the exchange component,¹¹ this approach approximates the four-center electron repulsion integrals by two- and three-center expressions. The method itself does not scale linearly with respect to the system size, but it has been combined with CFMM¹² and localized orbitals¹³ to yield an asymptotic Inline graphic scaling. As an alternative to RI, Cholesky decomposition can be used in the integral approximation.^14,15

Graphical processing units (GPUs) have also proven to be an important asset in the speed-up of the two-electron integral computation;¹⁶ the introduction of double precision support has allowed for mixed precision approaches that balance accuracy and GPU performance.¹⁷

Once the Fock matrix has been constructed, a self-consistent field (SCF) algorithm often performs an Inline graphic diagonalization step to obtain the next guess for the molecular orbital (MO) coefficients. However, due to the sparsity of the atomic orbital (AO) density matrix, this step can be replaced by an density optimization.¹⁸⁻²¹ A purification procedure, such as McWeeny’s purification,^22,23 is used to enforce hermiticity, N-representability, and idempotency. A detailed review of linear-scaling SCF methods can be found in Ref (24).

Another strategy to achieve linear-scaling HF is to use fragmentation methods that divide the full space into boxes or monomers.²⁵ After the definition of the fragments, an SCF procedure is typically performed on each of them. The interaction between fragments can be accounted for in several ways, such as through overlapping buffer regions around the fragments.^26,27 When the property of interest is localized in a known region of the system, multiscale and multilevel methods can be used. The rationale behind these techniques is that one can—without loss of accuracy in the targeted property—restrict the most expensive quantum mechanical treatment to an active region of the system. The environment is treated either as a continuum,²⁸⁻³⁰ at a molecular mechanics level,³¹⁻³³ or by using a less expensive quantum mechanical model.³⁴⁻⁴¹

The multilevel Hartree–Fock (MLHF) method was introduced by Sæther et al.⁴² This approach bears some resemblance to the local SCF method^43,44 and is closely related to the QM/ELMO method recently proposed by Macetti and Genoni.⁴⁵

In MLHF, the total density is written as a sum of an active and an inactive density matrix, where only the active density is optimized. Interactions with the environment are included through a constant contribution to the Fock matrix. The MLHF method is designed for systems where the active region is small with respect to the full system size, such as solvated systems or proteins with a well-defined active site. It offers a reliable reference wave function for reduced space coupled cluster calculations of intensive properties, where the correlation treatment is restricted to a set of active MOs.⁴⁶⁻⁴⁸

Due to the active–inactive partitioning, the MLHF equations can be solved in the space of the localized active MOs. The cost of diagonalization is therefore independent of the system size. Furthermore, the locality of the MOs can be used to reduce the cost of the AO Fock matrix; several terms do not contribute to the active MO matrix and can be neglected.^42,49 This fact has, however, only been partially exploited in previous implementations.^42,46,50

In this article, we present an efficient MLHF Fock matrix algorithm that fully exploits the local nature of the active MOs. The environment density contributions can be calculated at a cost that scales as Inline graphic , while the iterative cost, consisting of active density contributions, is independent of the system size. Our MLHF implementation is based on a conventional direct HF implementation. We emphasize that any improvement in HF algorithms—such as RI or CFMM—can be incorporated into an implementation of the MLHF method.

MLHF Theory

In MLHF,⁴² the total density matrix is partitioned into an active and an environment (or inactive) density, D^a and D^e

The active, environment, and total density matrices are required to separately fulfill the hermiticity, trace, and idempotency conditions. The environment density is determined and fixed at the beginning of the calculation, whereas the active density is obtained by minimizing the HF energy.

Using eq 1, with terms given in the AO basis, we can express the HF energy for a closed-shell system as

where

Here, h_nuc is the nuclear repulsion energy, h is the one-electron Hamiltonian integral matrix, and

is the two-electron contribution to the Fock matrix. The two-electron Hamiltonian integrals are denoted as g_αβγδ, where α, β, γ, and δ are AO indices.

The environment density, D^e, enters the energy minimization through the Fock matrix

By projecting the Fock matrix onto the localized MO basis, we obtain a set of MO Roothaan–Hall equations that are solved iteratively to optimize D^a. Convergence acceleration can be achieved through, for example, direct inversion of the iterative subspace.^49,51,52 The h and G(D^e) terms are computed once at the beginning of the calculation and transformed to the current MO basis in every iteration.^42,49 Therefore, one only needs to accurately represent the two-electron contributions in the active MO basis. In this basis, G(D^x) is given by

Here, p and q refer to MO indices, and C contains the active MO coefficients.

The active and inactive orbital spaces can be obtained from an idempotent starting guess for the total density. A common starting guess is a superposition of atomic densities⁵³ (SAD), D^SAD. However, D^SAD is not idempotent. To fulfill idempotency, D^SAD can be used to build a Fock matrix which is then diagonalized.⁴² Due to the sparsity of the SAD guess, which is block-diagonal, this is an Inline graphic Fock matrix construction with a small prefactor.⁵³ Alternatively, it is possible to use a more accurate starting guess, such as a superposition of molecular densities (SMD),⁵⁴ with methods like McWeeny’s purification.^22,23 The small prefactor of matrix multiplications can make this Inline graphic procedure advantageous compared to the construction and diagonalization of a Fock matrix.

To determine the initial active occupied orbitals, we perform a restricted partial Cholesky decomposition of the initial idempotent density^55,56

where the index p is restricted to the active occupied MOs. The decomposition is restricted in the sense that pivoting elements are required to correspond to AOs on a set of active atoms.

For the active virtual space, we use projected atomic orbitals (PAOs).^57,58 The PAOs are generated by projecting out the occupied components (both active and inactive) from the subset of AOs centered on the active atoms, {α̅}

Since the obtained PAOs are linearly dependent, an orthonormalization procedure, for example Löwdin orthonormalization,⁵⁹ is required to form non-redundant and orthogonal PAOs.

Linear-Scaling Algorithm for the Fock Matrix

The MLHF Fock matrix has two-electron contributions arising from both the active and the environment density, that is, G(D^e) and G(D^a). The G(D^e) matrix is calculated at the beginning of the calculation and subsequently transformed to the initial active MO basis. In the SCF procedure, G(D^e) is updated to the current MO basis in each iteration through an MO-to-MO basis transformation. In contrast, G(D^a) must be recalculated in every iteration.

The two-electron contributions, and especially G(D^e), have been found to dominate the computational cost in most MLHF calculations.^42,46 However, in previous implementations of MLHF, these terms were not constructed using sufficiently optimized Fock matrix algorithms. In the original algorithm, which was implemented in a local version of LSDALTON,⁶⁰ the locality of the active MOs was only exploited to truncate the AO basis: the AOs that did not contribute to any of the active MOs were discarded at the beginning of the calculation. This screening algorithm, since it only considers contributions to the MOs, does not exploit all the information available when constructing specific Fock matrix elements. While the algorithm reduces the asymptotic scaling, it was found to be ineffective, except for very large systems.⁴²

The implementation in e^T 1.0,⁴⁶ on the other hand, relied on a specialized Fock matrix algorithm which made use of the MO coefficients to skip negligible contributions to G(D^a). However, while this reduced the iterative cost, it did not strictly change the scaling of the underlying Fock construction algorithm. It also did not apply screening to the construction of G(D^e),⁴⁶ thus making the non-iterative cost higher than necessary.

The scaling of G(D^e) and G(D^a) can be reduced to Inline graphic and by fully exploiting the local nature of the active MOs. This reduced scaling is readily understood by considering the restriction of the AO indices to active and inactive sets, as implied by the G(D^x) expression in eq 6. Here, we define the set of active AOs as the AOs that contribute to the active MOs, that is, the AOs that correspond to significant elements in the active MO coefficients. Note that these active AOs are not only centered on the active atoms but can also belong to atoms in the inactive region that are close to the active atoms. Similarly, we define the set of inactive AOs as those that contribute to the environment density. The sets of active and inactive AOs overlap.

Since the coefficients C_αp and C_βq in eq 6 refer to the active set of MOs, only active α and β (in the sense defined above) will contribute to G(D^x). In the case of G(D^a), the γ and δ indices in eq 6 are also active due to the D_γδ^a factor. All the AO indices (α, β, γ, and δ) are thus active, and so the cost of G(D^a) will be Inline graphic .

For G(D^e), the Coulomb and exchange terms must be considered separately. In the Coulomb contribution

the γ and δ indices are inactive, but they are also located on atoms separated by a small distance; otherwise g_αβγδ would be zero. The number of surviving pairs γδ, and consequently the cost of Inline graphic , therefore scales as . On the other hand, the exchange contribution

can be calculated as Inline graphic because δ and γ are close to the active indices α and β, respectively; otherwise g_αδγβ would be zero. The localization of the AO indices in the various two-electron terms is depicted in Figure 1.

Localization of the AO indices in the Coulomb and exchange contributions. In the Coulomb contribution to G(D^e), the α and β indices are active, in the sense that they contribute to the active MOs, whereas γ and δ are inactive. For G(D^a), and the exchange contribution to G(D^e), all AO indices are active.

The G(D^e) term can be computed once in the beginning of the MLHF calculation at an Inline graphic cost. The iterative cost of MLHF is dominated by the construction of G(D^a). The scaling is reduced by at least one order compared to conventional HF, where the Coulomb and exchange terms have a quadratic and linear-scaling cost, respectively.

The index restrictions required to efficiently calculate these terms can be determined in a prescreening procedure. In our implementation, lists of significant shell pairs are prepared prior to entering the construction loop for the two-electron contribution to the Fock matrix. These lists are shell-based, instead of AO-based, because the integrals are computed in shell batches by Libint 2.⁶¹ Prescreening allows us to avoid looping over negligible terms when calculating the two-electron contributions, thereby ensuring the correct scaling.

The screening algorithm is designed to calculate contributions to the MO Fock matrix to a given precision. The algorithm is based on the observation that an element of the AO matrix can be neglected when all contributions to the corresponding MO matrix are below some specified threshold

Here, C_α = max_p|C_αp|, and Inline graphic and are the Coulomb and exchange thresholds, respectively. The magnitude of the integrals is estimated using the Cauchy–Schwarz inequality

For compatibility with the integral program,⁶¹ these conditions are implemented for shells rather than individual AOs. When expressed in terms of AO shells {s_i}, and with Cauchy–Schwarz estimates for the integrals, the conditions in eqs 11 and 12 become

where we have defined the shell-based quantities

In the following, we will also make use of the quantities

The active MOs determine which G(D^x) contributions are negligible. When the screening is applied to G(D^a), we always use the current active MOs. On the other hand, when it is applied to G(D^e), we use the initial active MOs. As a result, the introduced error in G(D^e) is proportional to, and not bounded by, the threshold. In practice, it is sufficient to use the same thresholds without a significant loss of accuracy.

The screening conditions in eqs 14 and 15 assume information about the four shells s₁, s₂, s₃, and s₄, which is only available in the inner-most loop of a Fock matrix construction. An efficient implementation, however, must exploit the information available at any given level of the nested loop. This is accomplished using a set of looser screening conditions, derived from eqs 14 and 15, where all information available at a given level is used to screen out negligible terms.

The procedures used to calculate the Coulomb and exchange terms are given in algorithms 1 and 2. In both algorithms, the first step is to determine the set of shell pairs s₁s₂ that correspond to non-negligible two-electron integrals. The significant shell pair list

is prepared at the beginning of the MLHF calculation. Here, τ is an integral cutoff threshold, while Inline graphic and g^1/2 are defined in eqs 16 and 20, respectively. In the outermost loop, over the s₁s₂ in , we can use screening conditions derived from eqs 14 and 15 for the given s₁ and s₂ (see line 3 of algorithms 1 and 2). Note that these conditions also take into account permutational symmetry. A shortened list of significant shell pairs s₁s₂ ( Inline graphic ) is thus constructed, in addition to a list of the significant s₁ () and a list of significant s₂ for each s₁ (). The dimensions of , , and all scale linearly with the size of the system for and are constant for and G(D^a). This prescreening step is detailed in lines 2–9 of algorithms 1 and 2.

The elements of Inline graphic are ordered in different ways for and . To allow for an early exit in the algorithm, the ordering follows the magnitude of the products. In the case, the desired scaling is already achieved, and is sorted in the ascending order. The list is ordered according to ascending s₂ to efficiently exploit permutational symmetries (see lines 23 and 19 in algorithms 1 and 2).

In the construction loop for Inline graphic and , we first loop over and . At this point, we can formulate screening criteria, from eqs 14 and 15, for the given s₁, s₂, and s₃. These criteria are used to either exit the s₃ loop or to cycle to the next s₃; see lines 14 and 17–20 in algorithm 1 and lines 15–16 in algorithm 2. When the inner-most s₄ loop is reached, all the shells are known. Therefore, the Coulomb and exchange conditions in eqs 14 and 15 can be used, though some minor modifications are required to account for permutational symmetry; see lines 24–25 in algorithm 1 and lines 20–21 in algorithm 2.

It is also possible to compute Inline graphic and in the same construction loop. In this case, we use the structure in algorithm 1, but the exchange conditions given in algorithm 2 are added in the corresponding loops.

Our discussion so far has focused on the scaling of the G(D^x) construction loops. In general, the prescreening steps scale more steeply. In both algorithms 1 and 2, the prescreening loop scales linearly with the system size. In the case of Inline graphic , the reordering scales as , while it is independent of the system size for and G(D^a). Furthermore, some of the quantities in eqs 16–23 have a cost that scales quadratically, albeit with small prefactors. However, for the systems we are targeting (10³ to 10⁵ AOs), their cost is negligible when compared to the cost of constructing the Fock matrix.

An overview of the computational scaling of terms related to G(D^x) is given in Table 1; in particular, the table shows the effects of the C-screening. Furthermore, it presents the scaling of the prescreening lists g^1/2 and D, as well as terms related to the construction of the SAD Fock matrix.

Table 1. Computational Scaling of Terms in the MLHF Implementation, with and without Screening with Respect to the MOs (C-Screening).

computational scaling

step

no C-screening

C-screening

iterative

reordering ( Inline graphic

)

iterative

reordering ( Inline graphic

, G(D^a))

iterative

non-iterative

iterative

non-iterative

g^1/2 list

non-iterative

D lists

iterative

Open in a new tab

There are additional steps which may scale more steeply than the terms in Table 1. At the beginning of the MLHF calculation, linear dependence is eliminated from the AO basis by Inline graphic Cholesky decomposition (or, alternatively, by diagonalization) of the overlap matrix. The one-electron Hamiltonian integrals are also computed at this stage; this step has a small prefactor and can be made linear with the same multipole strategies that have been developed for the Coulomb matrix in HF theory.²⁴ These non-iterative steps are the same as in standard HF. The MLHF procedure also includes a non-iterative step to determine the initial active orbitals, a procedure which is Inline graphic scaling.

In addition to the cost of G(D^a), and the related prescreening steps, the iterative cost of MLHF includes the cost of adding the elements [G(D^a)]_αβ to the AO Fock matrix, as well as the subsequent AO-to-MO transformation. These steps are Inline graphic scaling processes. The Roothaan–Hall optimization is performed in the MO basis and therefore does not entail any steps that scale with the size of the system. The initial Roothaan–Hall diagonalization of the SAD Fock matrix, however, is performed in the AO basis and is therefore an Inline graphic step. However, for the systems we are targeting, the computational cost is invariably dominated by the construction of G(D^e) and G(D^SAD).

OpenMP parallelization is applied to the outer index s₁s₂ of the main construction loops in algorithms 1 and 2. Each thread can either have its own copy of the Fock matrix or add calculated contributions to a shared copy. With a copy for each thread, one avoids the overhead resulting from threads having to wait for access to memory locations. The memory penalty of keeping a copy for each thread may become a bottleneck for sufficiently large systems. One approach to remove this memory bottleneck is to have a number of threads share a copy of the Fock matrix.⁶² An alternative is to compress the Fock matrix,⁶³ so that every thread can hold a copy.

In the MLHF approach, the selection of the significant elements for the compressed Fock matrix can be performed using the same screening conditions applied in algorithms 1 and 2. This results in an asymptotically non-scaling memory requirement for the copies of the Fock matrix in MLHF. In HF, on the other hand, the memory requirement is asymptotically linear with respect to the system size when the density matrix is sparse. In this paper, compression is adopted when the memory requirement becomes a limiting factor.

Results and Discussion

Algorithms 1 and 2 have been implemented in a development version of the e^T program.⁴⁶ We use a Cholesky decomposition to obtain the occupied orbital space and PAOs to obtain the virtual active MOs. A threshold of 10^–1 is used for the Cholesky decomposition. In all calculations, we apply a gradient threshold of 10^–6, giving default values for Inline graphic and equal to 10^–12 and 10^–10, respectively. The different thresholds are all expressed in atomic units.

Unless otherwise stated, the initial idempotent density guess is obtained from SAD through a diagonalization of the corresponding Fock matrix.

All geometries can be found in ref (64), and we use UCSF Chimera⁶⁵ to visualize them.

Scaling Properties

The scaling properties of the implementation are demonstrated on two sets of model systems: linear chains of amino acids, constructed by repeating the unit shown in Figure 2, and water clusters of increasing radius, the smallest of which is shown in Figure 3.

Shortest amino acid chain used in our calculations. The active atoms (those of the alanine at the N-terminal of the chain) are highlighted in blue.

Smallest cluster of water molecules used in our calculations. The active water molecule is highlighted in blue.

For the amino acid chain, we define the alanine at the N-terminal as active and use both the cc-pVDZ and aug-cc-pVDZ basis sets. The timings for the Coulomb and exchange contributions to G(D^SAD), G(D^e), and G(D^a) are given in Tables 2 and 3 and depicted in Figure 4. The tables highlight the improvement in the scaling due to the C-screening. Without the C-screening, the active density reduces the scaling by a factor of N, but the information in the active MO coefficients is not exploited. This results in Inline graphic scaling linearly with the size of the system, while is independent of the system size. For G(D^e), since the density is not localized to the active shells, the scaling is the same as in a general Fock matrix construction, that is, for non-C-screened algorithm, the Coulomb term scales quadratically and the exchange term scales linearly. The results in Tables 2 and 3 and Figure 4 show that the C-screening implementation reduces the costs for all two-electron contributions to the Fock matrix and reduces the scaling for Inline graphic and G(D^e).

Table 2. Wall Time for MLHF/cc-pVDZ Calculations on the Linear Amino Acid Chains^a.

no C-screening

C-screening

G(D^SAD)

G(D^e)

G(D^a)

G(D^SAD)

G(D^e)

G(D^a)

#AOs

1254

2484

226

3714

508

119

4944

115

887

175

107

114

120

6174

179

1448

208

134

180

152

7404

273

2040

249

166

261

181

8634

352

2797

300

193

348

220

9864

470

3657

336

220

460

253

Open in a new tab

The timings are expressed in seconds without and with C-screening. All calculations were performed on two Intel Xeon-Gold 6138 processors with 20 cores each with 160 GB memory available.

Table 3. Wall Time for MLHF/aug-cc-pVDZ Calculations on the Linear Amino Acid Chains^a.

no C-screening

C-screening

G(D^SAD)

G(D^e)

G(D^a)

G(D^SAD)

G(D^e)

G(D^a)

#AOs

2112

28s

10s

29s

10s

4183

22s

23s

6254

37s

102

36s

8325

49s

179

48s

10396

282

12467

411

109

14538

562

133

101

16609

754

150

112

117

Open in a new tab

The timings are expressed in minutes when not stated otherwise, without and with C-screening. All calculations were performed on two Intel Xeon-Gold 6138 processors with 20 cores each with 160 GB memory available.

Wall time for MLHF calculations on the linear amino acid chains, with C-screening. All calculations were performed on two Intel Xeon-Gold 6138 processors with 20 cores each with 160 GB memory available.

As mentioned before, the C-screening, like all screening methods based on the overlap of orbitals, performs better with non-diffuse basis sets. However, these results show that the Inline graphic scaling can be reached with both basis sets. The wall time for the prescreening steps and for some relevant non-iterative procedures in the calculations is reported in the Supporting Information.

The calculations on the amino acid chains illustrate the behavior of the algorithm for a one-dimensional system. Since many systems of interest are three-dimensional, we also consider the scaling properties on water clusters where the central water molecule is active. Several combinations of basis sets have been selected; in the following, the notation x/y (e.g., aug-cc-pVDZ/STO-3G) is used to denote that the active water molecule is treated with the basis x and the environment with the basis y.

Wall time for aug-cc-pVDZ/STO-3G calculations is shown in the first row of Figure 5. When the environment is treated with a minimal basis, the calculations rapidly exhibit the correct scaling, even if diffuse basis functions are used on the active atoms. This may be of some practical importance since the active atoms must have diffuse functions for correlated methods to predict intensive properties with quantitative accuracy. Furthermore, an adequate frozen environment density may not require a high-quality basis set.

Wall time for MLHF calculations on clusters of water molecules of increasing radius, with C-screening. All calculations were performed on two Intel Xeon-Gold 6138 processors with 20 cores each. The calculations with the environment treated with STO-3G were given 160 GB memory; the calculations with a cc-pVDZ environment were performed with 360 GB memory available.

In the last two rows of Figure 5, we report the wall time with the aug-cc-pVDZ/cc-pVDZ and aug-cc-pVTZ/cc-pVDZ basis set combinations. The computational cost of the Inline graphic and terms is approximately constant with respect to the cluster size. On the other hand, the term has a scaling in-between and , and the term scales as . The observed scaling is thus different from the asymptotic scaling of these terms. Due to the larger number of AOs per atom, these are calculations on smaller water clusters than those with the STO-3G environment. Hence, these calculations show that one must extend the environment further to reach the asymptotic scaling. Despite this, the time to construct G(D^e) still becomes smaller than the time required to construct G(D^SAD) when the system exceeds 15 000 AOs. The non-iterative cost is therefore dominated by the G(D^SAD) in the largest systems. Tables with the wall time are given in the Supporting Information.

Comparison to HF

The MLHF method has already been shown to be significantly cheaper than standard HF.^42,46 The C-screening detailed in algorithms 1 and 2 reduces the cost and scaling of MLHF even further.

We illustrate these savings by performing MLHF and HF calculations on the system shown in Figure 3, treated with the aug-cc-pVTZ/cc-pVDZ basis set combination. The wall time for the G(D^x) terms (t^x), which completely dominate the corresponding Fock matrix constructions, is given in Table 4. The total wall time for the full calculations, t^tot, is also reported. Compared to the MLHF implementation without C-screening, the total wall time t^tot is reduced by approximately a factor of 3. With respect to standard HF, t^tot is reduced by approximately a factor of 5. In particular, the C-screening reduces t^e by a factor of 2.5 and t^a by a factor of 4 for G(D^a). The timings for G(D^SAD) are reported for reference, but are, as expected, the same in the three calculations.

Table 4. Wall Time, Expressed in minutes, for MLHF and HF Calculations on a Water Cluster with a 10 Å Radius^a.

method	t^SAD (min)	t^e (min)	t^a (min)	t^tot (min)
MLHF C-screening	1	4	2	12
MLHF no C-screening	1	10	8	40
HF	1		10	57

Open in a new tab

The times to construct G(D^SAD), G(D^e), and G(D^a) of the first iteration are denoted as t^SAD, t^e, and t^a. t^tot is the total wall time of the full calculation. The aug-cc-pVTZ/cc-pVDZ combination of basis sets is used, and there are 3236 AOs. The calculations were performed on two Intel Xeon Gold 6152 processors, with 44 threads and 1.4 TB memory available.

It should be emphasized that the computational savings compared to non-screened MLHF and standard HF depend on the basis set. In particular, the addition of diffuse functions to the basis set has a significant impact on the screening. Although the screening becomes effective at a sufficient distance from the active region, this distance may be quite large. For large basis sets with many diffuse functions, other strategies—for example, RI or Cholesky decomposition—could be incorporated into an MLHF implementation.

Validating the Screening Algorithm with CC2 Excitation Energies

Our implementation applies C-screening on both active and inactive electron repulsion terms. In this section, we demonstrate that the results are insensitive to the use of the C-screened MLHF wave function as a reference in post-HF calculations of intensive properties.

We present CC2 excitation energies of different moieties in aqueous solution, obtained with and without C-screening. The systems—SO₂, 4-aminophthalimide, and para-nitroaniline in water—are depicted in Figure 6. In all cases, the solute is chosen as active and treated with aug-cc-pVDZ, while the surrounding water molecules are treated with cc-pVDZ. Table 5 shows that the C-screening does not affect the computed excitation energies.

Three solvated moieties—SO₂, 4-aminophthalimide, and *para*-nitroaniline—treated at the CC2-in-MLHF/HF level. The solute is active in MLHF.

Table 5. CC2-in-MLHF/HF Excitation Energies, Obtained Using aug-cc-pVDZ on the Active Atoms and cc-pVDZ on the Inactive Atoms.

	C-screening (eV)	no C-screening (eV)
SO₂ + water	3.236	3.236
4-aminophthalimide + water	3.845	3.845
para-nitroaniline + water	4.036	4.036

Open in a new tab

Density Purification and Memory Compression for Large Systems

For large systems, the memory required to keep a copy of the AO Fock matrix for each OpenMP thread can become impractical. Additionally, the G(D^SAD) construction can become the bottleneck since it scales as Inline graphic with a significant prefactor. To avoid the G(D^SAD) step and the diagonalization of the corresponding Fock matrix, we make use of McWeeny’s purification^22,23 on an SMD starting guess.⁵⁴ The memory usage for G(D^e) and G(D^a) is reduced by applying compression⁶³ to the copies of the Fock matrix.

We use these strategies on erythromycin-in-water systems, treated with aug-cc-pVTZ/cc-pVDZ. The smallest system, with 42 119 AOs, is depicted in Figure 7. In Table 6, we report timings for the SMD guess t^SMD, the purification t^pur, the memory compression t^com, and the Inline graphic , , and G(D^a) terms, along with the required memory of a single copy of the compressed matrices. Note that the calculations were carried out on two different machines (A and B), so that the timings cannot be directly compared. The compression scheme entails a computational penalty; however, it makes it possible to reach systems with more than 10⁵ AOs.

Smallest erythromycin-in-water system used in our calculations. The active erythromycin molecule is highlighted in blue.

Table 6. Wall Time and Memory Requirements for MLHF/aug-cc-pVTZ/cc-pVDZ Calculations on a Series of Erythromycin-in-Water Systems^a.

G(D^e)

G(D^a)

#AOs

t^SMD

t^pur

t^com

42 119

623

676

62 111

619

672

82 103

644

697

102 119

644

697

Open in a new tab

The timings are expressed in hours, while the memory is given in megabytes. The two calculations with 42 119 and 62 111 AOs were performed on machine A, with two Intel Xeon Gold 6138 processors and 40 threads, while the two calculations with 82 103 and 102 119 AOs were performed on machine B, with two Intel Xeon-Gold 6130 processors and 64 threads.

The cost of memory compression for the exchange term is non-negligible. However, this compression step does not scale with the system size. The cost is mainly due to the lack of OpenMP parallelization. The calculations are still dominated by the Coulomb term. Timings for the Coulomb compression step are not reported as it requires less than a minute in all calculations. This compression scales as Inline graphic for , and as for , so its cost will always be negligible compared to other terms.

Due to the need to hold in memory some N_AO² matrices, the memory requirement of the full calculation scales quadratically; in the largest system, a peak memory usage of 518 GB was reached. The memory usage for the compressed Fock matrices is small and scales as Inline graphic with the system size.

From Table 6, we see that the cost of the SMD construction is significant. It is dominated by the HF calculation on erythromycin. While solvated systems are trivially separated into subsystems, large covalently bound systems require a fragmentation procedure. This would also reduce the cost of SMD for erythromycin-in-water.

Summary and Concluding Remarks

We have introduced a new algorithm for the two-electron contributions to the Fock matrix in the MLHF method. This algorithm exploits the locality of the active MOs to efficiently screen contributions to the active MO Fock matrix. We achieve Inline graphic scaling for the construction of and scaling in the and G(D^a) terms. Although the MLHF implementation includes steps that scale more steeply, the Fock matrix construction dominates the iterative and overall costs of calculations on systems with up to 10⁴ to 10⁵ AOs.

To demonstrate the scaling of the implementation, we have presented a number of calculations on one- and three-dimensional systems of increasing size. The efficiency of the implementation was also tested on a water cluster, which provides an illustration of the savings relative to non-screened MLHF and HF. Our algorithm involves additional screening based on the MOs with respect to previous algorithms. We have therefore tested its accuracy by performing excited-state CC2 calculations.

Since the memory required to hold a copy of the AO Fock matrix for every OpenMP thread increases as Inline graphic , the memory usage can become the limiting factor for large systems. At the same time, in these systems the SAD Fock matrix construction dominates the computational cost. We have therefore combined the two-electron integrals screening with memory compression of the Fock matrix⁶³ and McWeeny’s purification^22,23 of an SMD starting guess,⁵⁴ in order to reach larger system sizes. Calculations on erythromycin-in-water systems with up to 100 000 basis functions have been performed.

In the limit of large N, the cost to construct G(D^e) becomes effectively independent of the system size. This is because of the long-range decay of the Coulomb interactions, which is used in HF theory to reduce the asymptotic Coulomb matrix scaling from Inline graphic to .²⁴ For the Coulomb contribution of G(D^e), the scaling similarly reduces to . This is not to say that all costs are independent of the system size: as in other Fock construction algorithms, there may be preparation steps that scale more steeply. Possible further improvements could include an adaptation of the well-established CFMM method,² as well as a combination of the MLHF approach with DF⁸ or Cholesky decomposition.^14,15

Acknowledgments

We thank Rolf H. Myhre for useful discussions and Anders Hutcheson for contributions to e^T. We acknowledge computing resources through UNINETT Sigma2—the National Infrastructure for High Performance Computing and Data Storage in Norway, through project number NN2962k. We acknowledge funding from the Marie Skłodowska-Curie European Training Network “COSINE—COmputational Spectroscopy In Natural sciences and Engineering,”grant agreement no. 765739 and the Research Council of Norway through FRINATEK projects 263110 and 275506.

Supporting Information Available

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jctc.1c00299.

Wall time comparison for the prescreening procedures of MLHF/cc-pVDZ calculations and MLHF/aug-cc-pVDZ calculations; wall time for non-iterative procedures of MLHF/cc-pVDZ calculations and MLHF/aug-cc-pVDZ calculations on the linear amino acid chains; wall time comparison for the MLHF/aug-cc-pVDZ/STO-3G calculations; and wall time comparison for the MLHF/aug-cc-pVDZ/cc-pVDZ calculations on water clusters of increasing radius (PDF)

The authors declare no competing financial interest.

Supplementary Material

ct1c00299_si_001.pdf^{(191.4KB, pdf)}

References

Ochsenfeld C.; White C. A.; Head-Gordon M. Linear and sublinear scaling formation of Hartree–Fock-type exchange matrices. J. Chem. Phys. 1998, 109, 1663–1669. 10.1063/1.476741. [DOI] [Google Scholar]
White C. A.; Johnson B. G.; Gill P. M. W.; Head-Gordon M. The continuous fast multipole method. Chem. Phys. Lett. 1994, 230, 8–16. 10.1016/0009-2614(94)01128-1. [DOI] [Google Scholar]
Barnes J.; Hut P. A hierarchical O (N log N) force-calculation algorithm. Nature 1986, 324, 446–449. 10.1038/324446a0. [DOI] [Google Scholar]
Strain M. C.; Scuseria G. E.; Frisch M. J. Achieving linear scaling for the electronic quantum Coulomb problem. Science 1996, 271, 51–53. 10.1126/science.271.5245.51. [DOI] [Google Scholar]
Challacombe M.; Schwegler E.; Almlöf J. Fast assembly of the Coulomb matrix: A quantum chemical tree code. J. Chem. Phys. 1996, 104, 4685–4698. 10.1063/1.471163. [DOI] [Google Scholar]
Neese F.; Wennmohs F.; Hansen A.; Becker U. Efficient, approximate and parallel Hartree–Fock and hybrid DFT calculations. A ‘chain-of-spheres’ algorithm for the Hartree–Fock exchange. Chem. Phys. 2009, 356, 98–109. 10.1016/j.chemphys.2008.10.036. [DOI] [Google Scholar]
Guidon M.; Hutter J.; VandeVondele J. Auxiliary density matrix methods for Hartree–Fock exchange calculations. J. Chem. Theory Comput. 2010, 6, 2348–2364. 10.1021/ct1002225. [DOI] [PubMed] [Google Scholar]
Whitten J. L. Coulombic potential energy integrals and approximations. J. Chem. Phys. 1973, 58, 4496–4501. 10.1063/1.1679012. [DOI] [Google Scholar]
Vahtras O.; Almlöf J.; Feyereisen M. W. Integral approximations for LCAO-SCF calculations. Chem. Phys. Lett. 1993, 213, 514–518. 10.1016/0009-2614(93)89151-7. [DOI] [Google Scholar]
Eichkorn K.; Treutler O.; Öhm H.; Häser M.; Ahlrichs R. Auxiliary basis sets to approximate Coulomb potentials. Chem. Phys. Lett. 1995, 240, 283–290. 10.1016/0009-2614(95)00621-a. [DOI] [Google Scholar]
Polly R.; Werner H.-J.; Manby F. R.; Knowles P. J. Fast Hartree–Fock theory using local density fitting approximations. Mol. Phys. 2004, 102, 2311–2321. 10.1080/0026897042000274801. [DOI] [Google Scholar]
Sodt A.; Subotnik J. E.; Head-Gordon M. Linear scaling density fitting. J. Chem. Phys. 2006, 125, 194109. 10.1063/1.2370949. [DOI] [PubMed] [Google Scholar]
Köppl C.; Werner H.-J. Parallel and low-order scaling implementation of Hartree–Fock exchange using local density fitting. J. Chem. Theory Comput. 2016, 12, 3122–3134. 10.1021/acs.jctc.6b00251. [DOI] [PubMed] [Google Scholar]
Boman L.; Koch H.; Sánchez de Merás A. Method specific Cholesky decomposition: Coulomb and exchange energies. J. Chem. Phys. 2008, 129, 134107. 10.1063/1.2988315. [DOI] [PubMed] [Google Scholar]
Aquilante F.; Gagliardi L.; Pedersen T. B.; Lindh R. Atomic Cholesky decompositions: A route to unbiased auxiliary basis sets for density fitting approximation with tunable accuracy and efficiency. J. Chem. Phys. 2009, 130, 154107. 10.1063/1.3116784. [DOI] [PubMed] [Google Scholar]
Ufimtsev I. S.; Martínez T. J. Quantum chemistry on graphical processing units. 1. Strategies for two-electron integral evaluation. J. Chem. Theory Comput. 2008, 4, 222–231. 10.1021/ct700268q. [DOI] [PubMed] [Google Scholar]
Luehr N.; Ufimtsev I. S.; Martínez T. J. Dynamic precision for electron repulsion integral evaluation on graphical processing units (GPUs). J. Chem. Theory Comput. 2011, 7, 949–954. 10.1021/ct100701w. [DOI] [PubMed] [Google Scholar]
Helgaker T.; Jørgensen P.; Olsen J.. Molecular Electronic-Structure Theory; John Wiley & Sons, 2014. [Google Scholar]
Li X.-P.; Nunes R. W.; Vanderbilt D. Density-matrix electronic-structure method with linear system-size scaling. Phys. Rev. B: Condens. Matter Mater. Phys. 1993, 47, 10891–10894. 10.1103/physrevb.47.10891. [DOI] [PubMed] [Google Scholar]
Millam J. M.; Scuseria G. E. Linear scaling conjugate gradient density matrix search as an alternative to diagonalization for first principles electronic structure calculations. J. Chem. Phys. 1997, 106, 5569–5577. 10.1063/1.473579. [DOI] [Google Scholar]
Challacombe M. A simplified density matrix minimization for linear scaling self-consistent field theory. J. Chem. Phys. 1999, 110, 2332–2342. 10.1063/1.477969. [DOI] [Google Scholar]
McWeeny R. Hartree–Fock theory with nonorthogonal basis functions. Phys. Rev. 1959, 114, 1528. 10.1103/physrev.114.1528. [DOI] [Google Scholar]
McWeeny R. Some recent advances in density matrix theory. Rev. Mod. Phys. 1960, 32, 335. 10.1103/revmodphys.32.335. [DOI] [Google Scholar]
Kussmann J.; Beer M.; Ochsenfeld C. Linear-scaling self-consistent field methods for large molecules. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2013, 3, 614–636. 10.1002/wcms.1138. [DOI] [Google Scholar]
Gordon M. S.; Fedorov D. G.; Pruitt S. R.; Slipchenko L. V. Fragmentation methods: A route to accurate calculations on large systems. Chem. Rev. 2012, 112, 632–672. 10.1021/cr200093j. [DOI] [PubMed] [Google Scholar]
Yang W.; Lee T. S. A density-matrix divide-and-conquer approach for electronic structure calculations of large molecules. J. Chem. Phys. 1995, 103, 5674–5678. 10.1063/1.470549. [DOI] [Google Scholar]
He X.; Merz K. M. Jr. Divide and conquer Hartree–Fock calculations on proteins. J. Chem. Theory Comput. 2010, 6, 405–411. 10.1021/ct9006635. [DOI] [PMC free article] [PubMed] [Google Scholar]
Miertuš S.; Scrocco E.; Tomasi J. Electrostatic interaction of a solute with a continuum. A direct utilization of ab initio molecular potentials for the prevision of solvent effects. Chem. Phys. 1981, 55, 117–129. 10.1016/0301-0104(81)85090-2. [DOI] [Google Scholar]
Tomasi J.; Mennucci B.; Cammi R. Quantum mechanical continuum solvation models. Chem. Rev. 2005, 105, 2999–3094. 10.1021/cr9904009. [DOI] [PubMed] [Google Scholar]
Mennucci B. Polarizable continuum model. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2012, 2, 386–404. 10.1002/wcms.1086. [DOI] [Google Scholar]
Warshel A.; Levitt M. Theoretical studies of enzymic reactions: dielectric, electrostatic and steric stabilization of the carbonium ion in the reaction of lysozyme. J. Mol. Biol. 1976, 103, 227–249. 10.1016/0022-2836(76)90311-9. [DOI] [PubMed] [Google Scholar]
Senn H. M.; Thiel W. QM/MM methods for biomolecular systems. Angew. Chem., Int. Ed. 2009, 48, 1198–1229. 10.1002/anie.200802019. [DOI] [PubMed] [Google Scholar]
Lin H.; Truhlar D. G. QM/MM: what have we learned, where are we, and where do we go from here?. Theor. Chem. Acc. 2007, 117, 185–199. 10.1007/s00214-006-0143-z. [DOI] [Google Scholar]
Wesolowski T. A.; Shedge S.; Zhou X. Frozen-density embedding strategy for multilevel simulations of electronic structure. Chem. Rev. 2015, 115, 5891–5928. 10.1021/cr500502v. [DOI] [PubMed] [Google Scholar]
Svensson M.; Humbel S.; Froese R. D. J.; Matsubara T.; Sieber S.; Morokuma K. ONIOM: a multilayered integrated MO+ MM method for geometry optimizations and single point energy predictions. A test for Diels- Alder reactions and Pt (P (t-Bu) 3) 2+ H2 oxidative addition. J. Phys. Chem. 1996, 100, 19357–19363. 10.1021/jp962071j. [DOI] [Google Scholar]
Hratchian H. P.; Parandekar P. V.; Raghavachari K.; Frisch M. J.; Vreven T. QM: QM electronic embedding using Mulliken atomic charges: Energies and analytic gradients in an ONIOM framework. J. Chem. Phys. 2008, 128, 034107. 10.1063/1.2814164. [DOI] [PubMed] [Google Scholar]
Govind N.; Wang Y. A.; Carter E. A. Electronic-structure calculations by first-principles density-based embedding of explicitly correlated systems. J. Chem. Phys. 1999, 110, 7677–7688. 10.1063/1.478679. [DOI] [Google Scholar]
Li W.; Piecuch P. Multilevel Extension of the Cluster-in-Molecule Local Correlation Methodology: Merging Coupled-Cluster and Møller- Plesset Perturbation Theories. J. Phys. Chem. A 2010, 114, 6721–6727. 10.1021/jp1038738. [DOI] [PubMed] [Google Scholar]
Myhre R. H.; Sánchez de Merás A. M. J.; Koch H. Multi-level coupled cluster theory. J. Chem. Phys. 2014, 141, 224105. 10.1063/1.4903195. [DOI] [PubMed] [Google Scholar]
Höfener S.; Visscher L. Calculation of electronic excitations using wave-function in wave-function frozen-density embedding. J. Chem. Phys. 2012, 137, 204120. 10.1063/1.4767981. [DOI] [PubMed] [Google Scholar]
Mata R. A.; Werner H.-J.; Schütz M. Correlation regions within a localized molecular orbital approach. J. Chem. Phys. 2008, 128, 144106. 10.1063/1.2884725. [DOI] [PubMed] [Google Scholar]
Sæther S.; Kjærgaard T.; Koch H.; Høyvik I.-M. Density-Based Multilevel Hartree–Fock Model. J. Chem. Theor. Comput. 2017, 13, 5282–5290. 10.1021/acs.jctc.7b00689. [DOI] [PubMed] [Google Scholar]
Assfeld X.; Rivail J.-L. Quantum chemical computations on parts of large molecules: the ab initio local self consistent field method. Chem. Phys. Lett. 1996, 263, 100–106. 10.1016/s0009-2614(96)01165-7. [DOI] [Google Scholar]
Ferré N.; Assfeld X.; Rivail J.-L. Specific force field parameters determination for the hybrid ab initio QM/MM LSCF method. J. Comput. Chem. 2002, 23, 610–624. 10.1002/jcc.10058. [DOI] [PubMed] [Google Scholar]
Macetti G.; Genoni A. Quantum mechanics/extremely localized molecular orbital method: A fully quantum mechanical embedding approach for macromolecules. J. Phys. Chem. A 2019, 123, 9420–9428. 10.1021/acs.jpca.9b08882. [DOI] [PubMed] [Google Scholar]
Folkestad S. D.; et al. eT 1.0: An open source electronic structure program with emphasis on coupled cluster and multilevel methods. J. Chem. Phys. 2020, 152, 184103. 10.1063/5.0004713. [DOI] [PubMed] [Google Scholar]
Folkestad S. D.; Kjønstad E. F.; Goletto L.; Koch H. Multilevel CC2 and CCSD in reduced orbital spaces: electronic excitations in large molecular systems. J. Chem. Theory Comput. 2021, 17, 714–726. 10.1021/acs.jctc.0c00590. [DOI] [PMC free article] [PubMed] [Google Scholar]
Goletto L.; Giovannini T.; Folkestad S. D.; Koch H. Combining multilevel Hartree–Fock and multilevel coupled cluster approaches with molecular mechanics: a study of electronic excitations in solutions. Phys. Chem. Chem. Phys. 2021, 23, 4413–4425. 10.1039/d0cp06359b. [DOI] [PubMed] [Google Scholar]
Høyvik I.-M. Convergence acceleration for the multilevel Hartree–Fock model. Mol. Phys. 2020, 118, 1626929. 10.1080/00268976.2019.1626929. [DOI] [Google Scholar]
Marrazzini G.; Giovannini T.; Scavino M.; Egidi F.; Cappelli C.; Koch H. Multilevel density functional theory. J. Chem. Theory Comput. 2021, 17, 791–803. 10.1021/acs.jctc.0c00940. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pulay P. Convergence acceleration of iterative sequences. The case of SCF iteration. Chem. Phys. Lett. 1980, 73, 393–398. 10.1016/0009-2614(80)80396-4. [DOI] [Google Scholar]
Pulay P. Improved SCF convergence acceleration. J. Comput. Chem. 1982, 3, 556–560. 10.1002/jcc.540030413. [DOI] [Google Scholar]
Van Lenthe J. H.; Zwaans R.; Van Dam H. J. J.; Guest M. F. Starting SCF calculations by superposition of atomic densities. J. Comput. Chem. 2006, 27, 926–932. 10.1002/jcc.20393. [DOI] [PubMed] [Google Scholar]
Neugebauer J.; Louwerse M. J.; Baerends E. J.; Wesolowski T. A. The merits of the frozen-density embedding scheme to model solvatochromic shifts. J. Chem. Phys. 2005, 122, 094115. 10.1063/1.1858411. [DOI] [PubMed] [Google Scholar]
Sánchez de Merás A. M. J.; Koch H.; Cuesta I. G.; Boman L. Cholesky decomposition-based definition of atomic subsystems in electronic structure calculations. J. Chem. Phys. 2010, 132, 204105. 10.1063/1.3431622. [DOI] [PubMed] [Google Scholar]
Aquilante F.; Bondo Pedersen T.; Sánchez de Merás A.; Koch H. Fast noniterative orbital localization for large molecules. J. Chem. Phys. 2006, 125, 174101. 10.1063/1.2360264. [DOI] [PubMed] [Google Scholar]
Pulay P. Localizability of dynamic electron correlation. Chem. Phys. Lett. 1983, 100, 151–154. 10.1016/0009-2614(83)80703-9. [DOI] [Google Scholar]
Saebo S.; Pulay P. Local treatment of electron correlation. Annu. Rev. Phys. Chem. 1993, 44, 213–236. 10.1146/annurev.pc.44.100193.001241. [DOI] [Google Scholar]
Löwdin P.-O.Advances in Quantum Chemistry; Elsevier, 1970; Vol. 5, pp 185–199. [Google Scholar]
Aidas K.; Angeli C.; Bak K. L.; Bakken V.; Bast R.; Boman L.; Christiansen O.; Cimiraglia R.; Coriani S.; Dahle P.; et al. The Dalton quantum chemistry program system. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2014, 4, 269–284. 10.1002/wcms.1172. [DOI] [PMC free article] [PubMed] [Google Scholar]
Valeev E. F.Libint: A Library for the Evaluation of Molecular Integrals of Many-Body Operators over Gaussian Functions, 2017.
Huang H.; Sherrill C. D.; Chow E. Techniques for high-performance construction of Fock matrices. J. Chem. Phys. 2020, 152, 024122. 10.1063/1.5129452. [DOI] [PubMed] [Google Scholar]
Rudberg E.; Rubensson E. H.; Sałek P. Hartree–Fock calculations with linearly scaling memory usage. J. Chem. Phys. 2008, 128, 184106. 10.1063/1.2918357. [DOI] [PubMed] [Google Scholar]
Goletto L.; Kjønstad E. F.; Folkestad S. D.; Høyvik I.-M.; Koch H.. Geometries for “Linear-scaling implementation of multilevel Hartree–Fock theory”, 2021, https://doi.org/10.5281/zenodo.4639506. [DOI] [PMC free article] [PubMed]
Pettersen E. F.; Goddard T. D.; Huang C. C.; Couch G. S.; Greenblatt D. M.; Meng E. C.; Ferrin T. E. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 2004, 25, 1605–1612. 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ct1c00299_si_001.pdf^{(191.4KB, pdf)}

[ref1] Ochsenfeld C.; White C. A.; Head-Gordon M. Linear and sublinear scaling formation of Hartree–Fock-type exchange matrices. J. Chem. Phys. 1998, 109, 1663–1669. 10.1063/1.476741. [DOI] [Google Scholar]

[ref2] White C. A.; Johnson B. G.; Gill P. M. W.; Head-Gordon M. The continuous fast multipole method. Chem. Phys. Lett. 1994, 230, 8–16. 10.1016/0009-2614(94)01128-1. [DOI] [Google Scholar]

[ref3] Barnes J.; Hut P. A hierarchical O (N log N) force-calculation algorithm. Nature 1986, 324, 446–449. 10.1038/324446a0. [DOI] [Google Scholar]

[ref4] Strain M. C.; Scuseria G. E.; Frisch M. J. Achieving linear scaling for the electronic quantum Coulomb problem. Science 1996, 271, 51–53. 10.1126/science.271.5245.51. [DOI] [Google Scholar]

[ref5] Challacombe M.; Schwegler E.; Almlöf J. Fast assembly of the Coulomb matrix: A quantum chemical tree code. J. Chem. Phys. 1996, 104, 4685–4698. 10.1063/1.471163. [DOI] [Google Scholar]

[ref6] Neese F.; Wennmohs F.; Hansen A.; Becker U. Efficient, approximate and parallel Hartree–Fock and hybrid DFT calculations. A ‘chain-of-spheres’ algorithm for the Hartree–Fock exchange. Chem. Phys. 2009, 356, 98–109. 10.1016/j.chemphys.2008.10.036. [DOI] [Google Scholar]

[ref7] Guidon M.; Hutter J.; VandeVondele J. Auxiliary density matrix methods for Hartree–Fock exchange calculations. J. Chem. Theory Comput. 2010, 6, 2348–2364. 10.1021/ct1002225. [DOI] [PubMed] [Google Scholar]

[ref8] Whitten J. L. Coulombic potential energy integrals and approximations. J. Chem. Phys. 1973, 58, 4496–4501. 10.1063/1.1679012. [DOI] [Google Scholar]

[ref9] Vahtras O.; Almlöf J.; Feyereisen M. W. Integral approximations for LCAO-SCF calculations. Chem. Phys. Lett. 1993, 213, 514–518. 10.1016/0009-2614(93)89151-7. [DOI] [Google Scholar]

[ref10] Eichkorn K.; Treutler O.; Öhm H.; Häser M.; Ahlrichs R. Auxiliary basis sets to approximate Coulomb potentials. Chem. Phys. Lett. 1995, 240, 283–290. 10.1016/0009-2614(95)00621-a. [DOI] [Google Scholar]

[ref11] Polly R.; Werner H.-J.; Manby F. R.; Knowles P. J. Fast Hartree–Fock theory using local density fitting approximations. Mol. Phys. 2004, 102, 2311–2321. 10.1080/0026897042000274801. [DOI] [Google Scholar]

[ref12] Sodt A.; Subotnik J. E.; Head-Gordon M. Linear scaling density fitting. J. Chem. Phys. 2006, 125, 194109. 10.1063/1.2370949. [DOI] [PubMed] [Google Scholar]

[ref13] Köppl C.; Werner H.-J. Parallel and low-order scaling implementation of Hartree–Fock exchange using local density fitting. J. Chem. Theory Comput. 2016, 12, 3122–3134. 10.1021/acs.jctc.6b00251. [DOI] [PubMed] [Google Scholar]

[ref14] Boman L.; Koch H.; Sánchez de Merás A. Method specific Cholesky decomposition: Coulomb and exchange energies. J. Chem. Phys. 2008, 129, 134107. 10.1063/1.2988315. [DOI] [PubMed] [Google Scholar]

[ref15] Aquilante F.; Gagliardi L.; Pedersen T. B.; Lindh R. Atomic Cholesky decompositions: A route to unbiased auxiliary basis sets for density fitting approximation with tunable accuracy and efficiency. J. Chem. Phys. 2009, 130, 154107. 10.1063/1.3116784. [DOI] [PubMed] [Google Scholar]

[ref16] Ufimtsev I. S.; Martínez T. J. Quantum chemistry on graphical processing units. 1. Strategies for two-electron integral evaluation. J. Chem. Theory Comput. 2008, 4, 222–231. 10.1021/ct700268q. [DOI] [PubMed] [Google Scholar]

[ref17] Luehr N.; Ufimtsev I. S.; Martínez T. J. Dynamic precision for electron repulsion integral evaluation on graphical processing units (GPUs). J. Chem. Theory Comput. 2011, 7, 949–954. 10.1021/ct100701w. [DOI] [PubMed] [Google Scholar]

[ref18] Helgaker T.; Jørgensen P.; Olsen J.. Molecular Electronic-Structure Theory; John Wiley & Sons, 2014. [Google Scholar]

[ref19] Li X.-P.; Nunes R. W.; Vanderbilt D. Density-matrix electronic-structure method with linear system-size scaling. Phys. Rev. B: Condens. Matter Mater. Phys. 1993, 47, 10891–10894. 10.1103/physrevb.47.10891. [DOI] [PubMed] [Google Scholar]

[ref20] Millam J. M.; Scuseria G. E. Linear scaling conjugate gradient density matrix search as an alternative to diagonalization for first principles electronic structure calculations. J. Chem. Phys. 1997, 106, 5569–5577. 10.1063/1.473579. [DOI] [Google Scholar]

[ref21] Challacombe M. A simplified density matrix minimization for linear scaling self-consistent field theory. J. Chem. Phys. 1999, 110, 2332–2342. 10.1063/1.477969. [DOI] [Google Scholar]

[ref22] McWeeny R. Hartree–Fock theory with nonorthogonal basis functions. Phys. Rev. 1959, 114, 1528. 10.1103/physrev.114.1528. [DOI] [Google Scholar]

[ref23] McWeeny R. Some recent advances in density matrix theory. Rev. Mod. Phys. 1960, 32, 335. 10.1103/revmodphys.32.335. [DOI] [Google Scholar]

[ref24] Kussmann J.; Beer M.; Ochsenfeld C. Linear-scaling self-consistent field methods for large molecules. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2013, 3, 614–636. 10.1002/wcms.1138. [DOI] [Google Scholar]

[ref25] Gordon M. S.; Fedorov D. G.; Pruitt S. R.; Slipchenko L. V. Fragmentation methods: A route to accurate calculations on large systems. Chem. Rev. 2012, 112, 632–672. 10.1021/cr200093j. [DOI] [PubMed] [Google Scholar]

[ref26] Yang W.; Lee T. S. A density-matrix divide-and-conquer approach for electronic structure calculations of large molecules. J. Chem. Phys. 1995, 103, 5674–5678. 10.1063/1.470549. [DOI] [Google Scholar]

[ref27] He X.; Merz K. M. Jr. Divide and conquer Hartree–Fock calculations on proteins. J. Chem. Theory Comput. 2010, 6, 405–411. 10.1021/ct9006635. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref28] Miertuš S.; Scrocco E.; Tomasi J. Electrostatic interaction of a solute with a continuum. A direct utilization of ab initio molecular potentials for the prevision of solvent effects. Chem. Phys. 1981, 55, 117–129. 10.1016/0301-0104(81)85090-2. [DOI] [Google Scholar]

[ref29] Tomasi J.; Mennucci B.; Cammi R. Quantum mechanical continuum solvation models. Chem. Rev. 2005, 105, 2999–3094. 10.1021/cr9904009. [DOI] [PubMed] [Google Scholar]

[ref30] Mennucci B. Polarizable continuum model. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2012, 2, 386–404. 10.1002/wcms.1086. [DOI] [Google Scholar]

[ref31] Warshel A.; Levitt M. Theoretical studies of enzymic reactions: dielectric, electrostatic and steric stabilization of the carbonium ion in the reaction of lysozyme. J. Mol. Biol. 1976, 103, 227–249. 10.1016/0022-2836(76)90311-9. [DOI] [PubMed] [Google Scholar]

[ref32] Senn H. M.; Thiel W. QM/MM methods for biomolecular systems. Angew. Chem., Int. Ed. 2009, 48, 1198–1229. 10.1002/anie.200802019. [DOI] [PubMed] [Google Scholar]

[ref33] Lin H.; Truhlar D. G. QM/MM: what have we learned, where are we, and where do we go from here?. Theor. Chem. Acc. 2007, 117, 185–199. 10.1007/s00214-006-0143-z. [DOI] [Google Scholar]

[ref34] Wesolowski T. A.; Shedge S.; Zhou X. Frozen-density embedding strategy for multilevel simulations of electronic structure. Chem. Rev. 2015, 115, 5891–5928. 10.1021/cr500502v. [DOI] [PubMed] [Google Scholar]

[ref35] Svensson M.; Humbel S.; Froese R. D. J.; Matsubara T.; Sieber S.; Morokuma K. ONIOM: a multilayered integrated MO+ MM method for geometry optimizations and single point energy predictions. A test for Diels- Alder reactions and Pt (P (t-Bu) 3) 2+ H2 oxidative addition. J. Phys. Chem. 1996, 100, 19357–19363. 10.1021/jp962071j. [DOI] [Google Scholar]

[ref36] Hratchian H. P.; Parandekar P. V.; Raghavachari K.; Frisch M. J.; Vreven T. QM: QM electronic embedding using Mulliken atomic charges: Energies and analytic gradients in an ONIOM framework. J. Chem. Phys. 2008, 128, 034107. 10.1063/1.2814164. [DOI] [PubMed] [Google Scholar]

[ref37] Govind N.; Wang Y. A.; Carter E. A. Electronic-structure calculations by first-principles density-based embedding of explicitly correlated systems. J. Chem. Phys. 1999, 110, 7677–7688. 10.1063/1.478679. [DOI] [Google Scholar]

[ref38] Li W.; Piecuch P. Multilevel Extension of the Cluster-in-Molecule Local Correlation Methodology: Merging Coupled-Cluster and Møller- Plesset Perturbation Theories. J. Phys. Chem. A 2010, 114, 6721–6727. 10.1021/jp1038738. [DOI] [PubMed] [Google Scholar]

[ref39] Myhre R. H.; Sánchez de Merás A. M. J.; Koch H. Multi-level coupled cluster theory. J. Chem. Phys. 2014, 141, 224105. 10.1063/1.4903195. [DOI] [PubMed] [Google Scholar]

[ref40] Höfener S.; Visscher L. Calculation of electronic excitations using wave-function in wave-function frozen-density embedding. J. Chem. Phys. 2012, 137, 204120. 10.1063/1.4767981. [DOI] [PubMed] [Google Scholar]

[ref41] Mata R. A.; Werner H.-J.; Schütz M. Correlation regions within a localized molecular orbital approach. J. Chem. Phys. 2008, 128, 144106. 10.1063/1.2884725. [DOI] [PubMed] [Google Scholar]

[ref42] Sæther S.; Kjærgaard T.; Koch H.; Høyvik I.-M. Density-Based Multilevel Hartree–Fock Model. J. Chem. Theor. Comput. 2017, 13, 5282–5290. 10.1021/acs.jctc.7b00689. [DOI] [PubMed] [Google Scholar]

[ref43] Assfeld X.; Rivail J.-L. Quantum chemical computations on parts of large molecules: the ab initio local self consistent field method. Chem. Phys. Lett. 1996, 263, 100–106. 10.1016/s0009-2614(96)01165-7. [DOI] [Google Scholar]

[ref44] Ferré N.; Assfeld X.; Rivail J.-L. Specific force field parameters determination for the hybrid ab initio QM/MM LSCF method. J. Comput. Chem. 2002, 23, 610–624. 10.1002/jcc.10058. [DOI] [PubMed] [Google Scholar]

[ref45] Macetti G.; Genoni A. Quantum mechanics/extremely localized molecular orbital method: A fully quantum mechanical embedding approach for macromolecules. J. Phys. Chem. A 2019, 123, 9420–9428. 10.1021/acs.jpca.9b08882. [DOI] [PubMed] [Google Scholar]

[ref46] Folkestad S. D.; et al. eT 1.0: An open source electronic structure program with emphasis on coupled cluster and multilevel methods. J. Chem. Phys. 2020, 152, 184103. 10.1063/5.0004713. [DOI] [PubMed] [Google Scholar]

[ref47] Folkestad S. D.; Kjønstad E. F.; Goletto L.; Koch H. Multilevel CC2 and CCSD in reduced orbital spaces: electronic excitations in large molecular systems. J. Chem. Theory Comput. 2021, 17, 714–726. 10.1021/acs.jctc.0c00590. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref48] Goletto L.; Giovannini T.; Folkestad S. D.; Koch H. Combining multilevel Hartree–Fock and multilevel coupled cluster approaches with molecular mechanics: a study of electronic excitations in solutions. Phys. Chem. Chem. Phys. 2021, 23, 4413–4425. 10.1039/d0cp06359b. [DOI] [PubMed] [Google Scholar]

[ref49] Høyvik I.-M. Convergence acceleration for the multilevel Hartree–Fock model. Mol. Phys. 2020, 118, 1626929. 10.1080/00268976.2019.1626929. [DOI] [Google Scholar]

[ref50] Marrazzini G.; Giovannini T.; Scavino M.; Egidi F.; Cappelli C.; Koch H. Multilevel density functional theory. J. Chem. Theory Comput. 2021, 17, 791–803. 10.1021/acs.jctc.0c00940. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref51] Pulay P. Convergence acceleration of iterative sequences. The case of SCF iteration. Chem. Phys. Lett. 1980, 73, 393–398. 10.1016/0009-2614(80)80396-4. [DOI] [Google Scholar]

[ref52] Pulay P. Improved SCF convergence acceleration. J. Comput. Chem. 1982, 3, 556–560. 10.1002/jcc.540030413. [DOI] [Google Scholar]

[ref53] Van Lenthe J. H.; Zwaans R.; Van Dam H. J. J.; Guest M. F. Starting SCF calculations by superposition of atomic densities. J. Comput. Chem. 2006, 27, 926–932. 10.1002/jcc.20393. [DOI] [PubMed] [Google Scholar]

[ref54] Neugebauer J.; Louwerse M. J.; Baerends E. J.; Wesolowski T. A. The merits of the frozen-density embedding scheme to model solvatochromic shifts. J. Chem. Phys. 2005, 122, 094115. 10.1063/1.1858411. [DOI] [PubMed] [Google Scholar]

[ref55] Sánchez de Merás A. M. J.; Koch H.; Cuesta I. G.; Boman L. Cholesky decomposition-based definition of atomic subsystems in electronic structure calculations. J. Chem. Phys. 2010, 132, 204105. 10.1063/1.3431622. [DOI] [PubMed] [Google Scholar]

[ref56] Aquilante F.; Bondo Pedersen T.; Sánchez de Merás A.; Koch H. Fast noniterative orbital localization for large molecules. J. Chem. Phys. 2006, 125, 174101. 10.1063/1.2360264. [DOI] [PubMed] [Google Scholar]

[ref57] Pulay P. Localizability of dynamic electron correlation. Chem. Phys. Lett. 1983, 100, 151–154. 10.1016/0009-2614(83)80703-9. [DOI] [Google Scholar]

[ref58] Saebo S.; Pulay P. Local treatment of electron correlation. Annu. Rev. Phys. Chem. 1993, 44, 213–236. 10.1146/annurev.pc.44.100193.001241. [DOI] [Google Scholar]

[ref59] Löwdin P.-O.Advances in Quantum Chemistry; Elsevier, 1970; Vol. 5, pp 185–199. [Google Scholar]

[ref60] Aidas K.; Angeli C.; Bak K. L.; Bakken V.; Bast R.; Boman L.; Christiansen O.; Cimiraglia R.; Coriani S.; Dahle P.; et al. The Dalton quantum chemistry program system. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2014, 4, 269–284. 10.1002/wcms.1172. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref61] Valeev E. F.Libint: A Library for the Evaluation of Molecular Integrals of Many-Body Operators over Gaussian Functions, 2017.

[ref62] Huang H.; Sherrill C. D.; Chow E. Techniques for high-performance construction of Fock matrices. J. Chem. Phys. 2020, 152, 024122. 10.1063/1.5129452. [DOI] [PubMed] [Google Scholar]

[ref63] Rudberg E.; Rubensson E. H.; Sałek P. Hartree–Fock calculations with linearly scaling memory usage. J. Chem. Phys. 2008, 128, 184106. 10.1063/1.2918357. [DOI] [PubMed] [Google Scholar]

[ref64] Goletto L.; Kjønstad E. F.; Folkestad S. D.; Høyvik I.-M.; Koch H.. Geometries for “Linear-scaling implementation of multilevel Hartree–Fock theory”, 2021, https://doi.org/10.5281/zenodo.4639506. [DOI] [PMC free article] [PubMed]

[ref65] Pettersen E. F.; Goddard T. D.; Huang C. C.; Couch G. S.; Greenblatt D. M.; Meng E. C.; Ferrin T. E. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 2004, 25, 1605–1612. 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]

PERMALINK

Linear-Scaling Implementation of Multilevel Hartree–Fock Theory

Linda Goletto

Eirik F Kjønstad

Sarai D Folkestad

Ida-Marie Høyvik

Henrik Koch

Abstract

Introduction

MLHF Theory

Linear-Scaling Algorithm for the Fock Matrix

Figure 1.

Table 1. Computational Scaling of Terms in the MLHF Implementation, with and without Screening with Respect to the MOs (C-Screening).

Results and Discussion

Scaling Properties

Figure 2.

Figure 3.

Table 2. Wall Time for MLHF/cc-pVDZ Calculations on the Linear Amino Acid Chainsa.

Table 3. Wall Time for MLHF/aug-cc-pVDZ Calculations on the Linear Amino Acid Chainsa.

Figure 4.

Figure 5.

Comparison to HF

Table 4. Wall Time, Expressed in minutes, for MLHF and HF Calculations on a Water Cluster with a 10 Å Radiusa.

Validating the Screening Algorithm with CC2 Excitation Energies

Figure 6.

Table 5. CC2-in-MLHF/HF Excitation Energies, Obtained Using aug-cc-pVDZ on the Active Atoms and cc-pVDZ on the Inactive Atoms.

Density Purification and Memory Compression for Large Systems

Figure 7.

Table 6. Wall Time and Memory Requirements for MLHF/aug-cc-pVTZ/cc-pVDZ Calculations on a Series of Erythromycin-in-Water Systemsa.

Summary and Concluding Remarks

Acknowledgments

Supporting Information Available

Supplementary Material

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Table 2. Wall Time for MLHF/cc-pVDZ Calculations on the Linear Amino Acid Chains^a.

Table 3. Wall Time for MLHF/aug-cc-pVDZ Calculations on the Linear Amino Acid Chains^a.

Table 4. Wall Time, Expressed in minutes, for MLHF and HF Calculations on a Water Cluster with a 10 Å Radius^a.

Table 6. Wall Time and Memory Requirements for MLHF/aug-cc-pVTZ/cc-pVDZ Calculations on a Series of Erythromycin-in-Water Systems^a.