Super-resolution techniques to simulate electronic spectra of large molecular systems

Matthias Kick; Ezra Alexander; Anton Beiersdorfer; Troy Van Voorhis

doi:10.1038/s41467-024-52368-5

. 2024 Sep 12;15:8001. doi: 10.1038/s41467-024-52368-5

Super-resolution techniques to simulate electronic spectra of large molecular systems

Matthias Kick ^1,^✉, Ezra Alexander ¹, Anton Beiersdorfer ², Troy Van Voorhis ¹

PMCID: PMC11393058 PMID: 39266582

Abstract

An accurate treatment of electronic spectra in large systems with a technique such as time-dependent density functional theory is computationally challenging. Due to the Nyquist sampling theorem, direct real-time simulations must be prohibitively long to achieve suitably sharp resolution in frequency space. Super-resolution techniques such as compressed sensing and MUSIC assume only a small number of excitations contribute to the spectrum, which fails in large molecular systems where the number of excitations is typically very large. We present an approach that combines exact short-time dynamics with approximate frequency space methods to capture large narrow features embedded in a dense manifold of smaller nearby peaks. We show that our approach can accurately capture narrow features and a broad quasi-continuum of states simultaneously, even when the features overlap in frequency. Our approach is able to reduce the required simulation time to achieve reasonable accuracy by a factor of 20-40 with respect to standard Fourier analysis and shows promise for accurately predicting the whole spectrum of large molecules and materials.

Subject terms: Theoretical chemistry, Computational methods, Photochemistry, Materials for optics

Calculating electronic spectra of large systems is computationally challenging. Here, the authors combine exact short-time dynamics with approximate frequency space methods to capture narrow features embedded in a dense manifold of smaller peaks.

Introduction

Electronic excitations in molecules and materials are important for understanding various kinds of phenomena such as photo-excitation in solar cells, optical excitations in OLEDS and quantum dots^1–11. Theoretically, electronic excitations can be obtained by analysing the frequency components of the time-dependent dipole moment obtained from real-time propagation¹². Among various other methods such GW/BSE¹³, EOM^14–16 or ADC¹⁷, time-dependent density functional theory (RT-TDDFT)¹² is the most promising method to calculate the whole spectrum of large systems due to its superior scaling with respect to system size compared to other methods. Because of the computational complexity of real-time simulations for large molecules and materials, one is typically restricted to fairly short time dynamics (e.g. tens of fs). Due to the Nyquist sampling theorem, discrete Fourier analysis of the short-time dynamics fails to capture the narrow features that are critical fingerprints of molecular spectra. Meanwhile, standard super-resolution methods - such as compressed sensing (CS)^18–20, MUSIC²¹ and orthogonal matching pursuit^22,23 - typically fail for large molecular systems because they require the number of narrow features to be small, whereas the spectra of large molecules tends to be quite densely populated. Similarly, linear response approaches like the Casida²⁴ or Sternheimer equation²⁵ typically require one-at-a-time identification of roots and likewise fail when the number of desired roots is very large. In this paper we show how exact short-time dynamics can be combined with approximate frequency space results to accurately capture narrow features and a quasi-continuum of states in large molecular systems.

Our approach (BYND—Broad Yet Narrow Description) is illustrated in Fig. 1, for the case of a molecular chromophore adsorbed on a surface of a semiconductor nanocrystal. In this case, a super-resolution method only captures a small number of peaks in the overall spectrum, while discrete Fourier Transform (FT) of the short-time signal recovers only a broad quasi-continuum. In our approach, one first obtains an approximate spectrum - in this case using small matrix approximation (SMA)²⁶ – that has the right number of peaks in roughly the right locations. Next, the most important narrow features in the spectrum are optimized to match the short-time dynamics. Finally, linear prediction is used to match the intensities of the approximate and optimized spectral features - exactly recovering the short-time signal and yielding a spectrum that is substantially more accurate than CS or discrete FT alone can provide.

BYND successfully finds electronic excitations for large molecular systems where CS and other algorithms fail due to the presence of a quasi-continuum. For our largest test systems, we see standard mean errors between 0.01 and 0.14 eV in narrow feature position with respect to reference long-time RT-TDDFT. Considering the typical error of TDDFT with respect to experiment is around 0.25 eV²⁷, our method yields high quality results consistent with standard theoretical practice, useful in interpreting experimental results. Further, we see a reduction in the required computational time between 20- and 40-fold compared to standard FT due to the smaller number of time steps required by BYND. Thus, BYND enables the simulation of large molecular systems which otherwise would be computationally prohibitive even on modern computer hardware.

This article is structured as follows. We first briefly introduce the theory behind frequency-resolved approximations and exact short-time dynamics. We then move forward to a step-by-step explanation of the working equations of our method. We discuss the performance of BYND on a challenging set of large systems and conclude by discussing future directions for the method.

Results and discussion

Linear prediction

Modeling entire spectra from time-dependent signals can, in principle, be achieved by linear prediction^28–30. The basic idea is that one can predict spectral features from linear combinations of past output values. One simply determines all relevant model parameters directly from the short-time signal³¹. There are techniques which achieve this in time or frequency domain and in principle, if the number of samples is sufficient and if the distance between time steps is adequate, linear prediction is able to model the spectrum with good accuracy^32,33. However, for the system sizes we are aiming for, sampling enough time steps is computationally prohibitive. Further, if we want to model a spectrum, using linear prediction only, one usually needs an idea of how many frequencies there are and where they are located^32,33. Even if we would have this information available, the number of frequencies usually exceeds the number of data points by a large amount resulting in an under-determined system which makes it nearly impossible to extract meaningful spectra (see Fig. 2). We discuss these problems in more detail later on in this article 2.4 where we also provide examples.

Fig. 2 — a The short-time signal is accurately reproduced in each case, however, narrow features are completely absent in the resulting spectrum. We used a time signal with 1000 time steps and small matrix approximation (SMA) frequencies. To determine the model parameters (amplitudes) for the SMA frequencies we make use of equation (2). b We show an artificial generated spectrum where the first black exact result has no SMA frequency at the corresponding energy. In this case, linear prediction is not able to capture this feature. Intensity is abbreviated with Int.

SMA

The input required for BYND is an approximate excited state spectrum which shows the right number of peaks at approximately the right energies. To this end, we approximate the pseudo eigenvalue problem of the Casida equations²⁴ by employing the SMA. In this approximation, the electronic excitation energies (in frequency space) are given by a simple analytical expression, allowing one to obtain a large number of excitation energies without directly solving the extremely costly pseudo eigenvalue problem. We have implemented the SMA within the FHIaims infrastructure. This implementation allows the rapid evaluation of several thousand exited states easily in systems containing more than 1000 atoms (to be discussed elsewhere).

RT-TDDFT

Our goal is to improve the frequency information of the SMA by combining it with exact short-time dynamics from a real-time TDDFT (RT-TDDFT) simulation. In RT-TDDFT, the time-dependent Kohn-Sham states are explicitly propagated in time under the influence of an electric field (E_λ), which usually has the form of a sharp δ-pulse^34,35. The effect of the electric field pulse is the excitation of all possible electronic excitation modes. Thus, the oscillation of the time-dependent dipole moment from the real-time propagation can be directly linked to the excitation energies of the system. It should be emphasized that BYND can be used with any real-time propagation method. However, due to its superior scaling with respect to system size, real-time TDDFT is the clear choice over other electronic structure methods as we attempt to push toward larger systems. In fact, real-time TDDFT is already widely employed to capture electron dynamics in intermediately-sized molecular and solid-state systems^36–42.

Throughout the text, λ and μ will indicate the direction of the electric field and observed time-dependent dipole moment respectively. For a full optical excitation spectrum one needs to perform three propagations with different orientations of E_λ (x, y and z).

Combining SMA with RT-TDDFT

In this section we will introduce how our method is able to capture both narrow features and the quasi-continuum by combining approximate frequency results from the SMA with short-time RT-TDDFT data. In order to illustrate each step of our approach, the excitation spectrum of Cd₃₈Se₃₈-ZnPc-32(NH₂CH₃) will serve as a prototype (Fig. 1). Within this system, bulk, surface and molecular states can easily mix and the excited states of the system blur into a quasi-continuum, requiring the evaluation of a large number of excited states in a small energy window. Specifically, in our example one needs to evaluate roughly 47,000 excited states in order to calculate the spectrum up to an excitation energy of 10 eV. With 3482 total electrons, this system is highly challenging for standard TDDFT and is thus an ideal test of our approach (Fig. 3). The CdSe nanocrystal has been extended into a test set of signals representative of a broad range of common large systems through the addition of aromatic molecules, which add narrow features, and through increasing the size of the nanocrystal, which enhances the continuum region. While these systems are an excellent test bed to study convergence and accuracy of signals with challenging wave forms, additional test systems such as dye-sensitized solar cells, surfaces slabs, molecular aggregates and nano tubes will also be used here to further illustrate the broad applicability of BYND.

Fig. 3 — a Dipole spectrum obtained from the small matrix approximation (SMA) with the corresponding time-dependent dipole signal (Dip.). b Selected narrow features from the SMA calculation. c Narrow feature position after optimization. The error with respect to exact short-time dynamics is notably reduced. d The full time-dependent signal is now reproduced with high accuracy and all features are correctly reproduced in the spectrum. The SMA signal in (a) and (b) was scaled to match the maximum amplitude of the time-dependent density functional theory (RT-TDDFT) reference within the given time window. This is only done for a better comparison of the signal wave forms. For the sake of simplicity and without loss of generality, we only show the dipole moment in x-direction after an electric field pulse in the same direction. Int. is the abbreviation for intensity.

In the following, we will use 1000 time steps of RT-TDDFT data of Cd₃₈Se₃₈-ZnPc-32(NH₂CH₃) as reference. As we show in Fig. 1a, this signal length is far too short for techniques like Fourier analysis or CS to give any meaningful results. In fact, a standard Fourier transform requires the simulation of 20,000 time steps in order to yield the desired resolution.

Attempts to model the spectral features with the frequencies from SMA in a linear prediction fashion utterly fail due to the number of excitations far exceeding the number of available data points. As a consequence meaningful model parameters (amplitudes) are not extractable even by applying regularization techniques. In other cases the model frequencies from SMA might be in the wrong place and linear prediction alone even with sufficient data points is unable to extract all information. We illustrate these two cases in Fig. 2. In (a), the short-time signal is accurately reproduced, however, the parameters are under-determind and there are many ways to reproduce the signal. It is not possible to select a meaningful spectrum. The bright states are completely absent. In (b), there exists no proper solution at all as there is no SMA frequency available at the position of the first bright feature. All these considerations ultimately lead to the necessity of optimizing the SMA frequencies and to restrict their number.

Narrow feature selection

Our method is based on the realization that the spectrum of large systems can be separated in a sparse part and continuum part. It is important to realize that the SMA is accurate enough to give us an estimate of how many narrow features should be present, where the narrow features are located (up to 0.5 eV accuracy), how many continuum states are present and in which frequency range the continuum is. Therefore, our decisive step is to use the SMA as an initial guess (Fig. 3a). We select the initial set of narrow features by selecting each frequency for which the SMA transition dipole moment is above a certain threshold. The threshold needs to be chosen according to ensure that only bright excitations are included. In our example we use a threshold of 1.5 a.u. for the intensity (Fig. 3b).

Narrow feature optimization

The task of finding a set of optimal frequencies ω_k translates to finding a signal f^sparse which minimizes the error with respect to the short-time dynamics dipole target signal y. For this purpose, f^sparse at a certain time step t_i, can be defined as

f_{λ μ}^{sparse} (A_{k}^{λ μ}, ω_{k}, t_{i}) = - \sum_{k} A_{k}^{λ μ} \sin (ω_{k} t_{i}),

where we make use of the fact that all excitation modes start with an in-phase oscillation right after a sharp δ-pulse⁴³. Note, for cases where the electric field is parallel with the dipole operator, we are able to employ a non-negative constraint on the amplitudes.

The first step of our algorithm is to determine the amplitudes $A_{k}^{λ μ}$ of our target frequencies ω_k. For this purpose we make use of ridge regression⁴⁴, also known as Tikhonov regularization⁴⁵,

\min \frac{1}{n} \sum_{i} ∣ ∣ y_{i}^{λ μ} - f_{λ μ}^{sparse} (A_{k}^{λ μ}, ω_{k}, t_{i}) ∣ ∣_{2}^{2} + α_{sparse} ∣ ∣ f^{sparse} ∣ ∣_{2}^{2} .

Here, α_sparse is the regularization coefficient (for discussion on how to choose α_sparse, see Supplementary information section 4). Note that, in contrast to methods like CS, we do not need to enforce sparsity here as our SMA initial guess provides us with a good approximation of how many narrow features should be present. In principle, one could use CS or MUSIC for sparse feature extraction, however, we find that direct optimization of SMA provides more accurate results (see Supplementary Figs. 8–12).

Finding the optimal frequencies ω_k is a non-linear optimization problem⁴⁶ and can be solved efficiently by performing a line-search around the initial guess for these frequencies. Our algorithm aims to minimize the following objective function,

L (A_{k}^{λ μ}, ω_{k}) = \sum_{λ μ} \sum_{i} ∣ ∣ y_{i}^{λ μ} - f_{λ μ}^{sparse} (A_{k}^{λ μ}, ω_{k}, t_{i}) ∣ ∣_{2}^{2} + β \sum_{i} A_{k}^{λ μ} ∣ ∣ \sin (ω_{k} t_{i}) - \sin (ω_{k}^{init} t_{i}) ∣ ∣_{2}^{2},

where the first term measures the error with respect to the target signal. The last term acts as a penalty on frequencies which are too far away from their initial guess $ω_{k}^{init}$ with β determining the strength of the penalty. Our procedure is realized as a greedy-algorithm⁴⁷, which means our algorithm starts with the frequencies which have the highest amplitude and performs a line-search with a frequency search space ± Δω around the initial frequency. If a minimum is found the algorithm updates the old value with the newly found optimum frequency value and performs an additional amplitude adjustment step. It then moves forward to the next frequency. When all frequencies have been updated we start again by finding optimum amplitudes for the new set of frequencies. Both steps, amplitude adjustment and line-search are repeated until frequencies and amplitudes are converged. The entire procedure is described in Box 1. For more details the reader is referred to the discussion in section 1 of our Supplementary information.

As one can see from Fig. 3c, our procedure is able to successfully recover the narrow features in Cd₃₈Se₃₈-ZnPc-32(NH₂CH₃). It should be emphasized that there are, in principle, infinitely many sets of frequencies which minimize the objective function L. By starting with a somewhat-accurate initial guess for the number of frequencies, we dramatically reduce the number of possible solutions. Only through this initial guess are we able to locate the correct position of the narrow features within the quasi-continuum of excitations. For the sake of simplicity in this demonstration of our approach, we have set β to zero in all our test scenarios. Another possible simplification is to start only with signal components where λ = μ. We observe that this can improve convergence behaviour by introducing more constraints on the feature space, only allowing non-negative amplitudes.

Box 1: Line-search.

1: initial guess for ω_k from SMA

2: While not converged do

3: $A_{k}^{λ μ} \leftarrow \min \frac{1}{n} \sum_{i} ∣ ∣ y_{i}^{λ μ} - f_{λ μ}^{sparse} (A_{k}^{λ μ}, ω_{k}, t_{i}) ∣ ∣_{2}^{2} + α_{sparse} ∣ ∣ f^{sparse} ∣ ∣_{2}^{2}$

4: if iteration = 1 then

5: Δω ← Δω_init

6: randomly modify $A_{k}^{λ μ}$

7: else

8: Δω ← Δω_def

9: for $ω_{i} \in \{ω_{1}, . . ., ω_{k}\}$ do

10: for $ω \in \{ω_{i} - Δ ω, . . ., ω_{i}, . . ., ω_{i} + Δ ω\}$ do

11: $f_{λ μ} \leftarrow - \sum_{k \neq i} A_{k}^{λ μ} \sin (ω_{k} t_{i}) + A_{i}^{λ μ} \sin (ω t_{i})$

12: Compute $L (A_{k}^{λ μ}, A_{i}^{λ μ}, ω_{k}, ω), k \neq i$

13: $ω_{i} \leftarrow \min (L)$

14: $A_{k}^{λ μ} \leftarrow \min \frac{1}{n} \sum_{i} ∣ ∣ y_{i}^{λ μ} - f_{λ μ}^{sparse} (A_{k}^{λ μ}, ω_{k}, t_{i}) ∣ ∣_{2}^{2} + α_{sparse} ∣ ∣ f^{sparse} ∣ ∣_{2}^{2}$

Relaxation of quasi continuum

After optimization of the narrow features, we calculate the residual between the target signal and f^sparse,

y_{λ μ}^{cont} (t) = y_{λ μ} (t) - f_{λ μ}^{sparse} (t) .

By subtracting f^sparse from our target, y^cont contains only information about the continuum region of the spectrum. We now make use of the fact that the SMA contains also information about the spectral density of the continuum region and perform an additional regression in order to obtain the correct amplitudes for the continuum,

\min \frac{1}{n} \sum_{i} ∣ ∣ y_{i}^{cont, λ μ} - f_{λ μ}^{cont} (A_{k}^{λ μ}, ω_{k}, t_{i}) ∣ ∣_{2}^{2} + α_{cont} ∣ ∣ f^{cont} ∣ ∣_{2}^{2} .

Note that, in this linear prediction, the index k indicates the frequencies obtained from the SMA. As the target signal does not contain any narrow feature components, we set the regularization coefficient α_cont to the default value of 100. Our final reconstructed dipole signal is then given by

f_{λ μ} (t) = f_{λ μ}^{cont} (t) + f_{λ μ}^{sparse} (t) .

As we show in Fig. 3d, our algorithm is able to accurately reproduce the exact short-time dynamics signal. We obtain amplitudes and frequencies of the bright states as well as correct amplitudes for the continuum region. We would like to highlight that our algorithm is completely independent of the underlying electronic structure code and can be realized in a Python implementation which easily runs on standard local desktop and laptop computers.

Convergence and accuracy

Figure 4 shows the convergence of the calculated absorption spectrum with respect to the number of time steps of the target electronic dipole signals for three different systems Cd₃₈Se₃₈-ZnPc-32(NH₂CH₃), Cd₃₈Se₃₈-ZnPc-DPA-32(NH₂CH₃) and Cd₃₃Se₃₃/Zn₉₃S₉₃-2(ZnPc). These systems demonstrate how our method performs with different types of spectra and signals. For Cd₃₈Se₃₈-ZnPc-DPA-32(NH₂CH₃) we expect the emergence of additional narrow features due to the presence of the DPA molecule on top of ZnPc (Fig. 4). On the contrary, the Cd₃₃Se₃₃/Zn₉₃S₉₃-2(ZnPc) system has two ZnPc molecules and a significantly larger nanocrystal size. This larger nanocrystal leads to more blurring of the bright, localized excitations into the continuum. In addition, the two ZnPc molecules mimic a higher surface coverage and are on top bound to two different facets of the nanocrystal. Overall this system consists of 7572 electrons and is thus roughly two times larger than the other two nanocrystals and thus can be regarded as a highly challenging test case for our method.

Fig. 4 — We use Cd₃₈Se₃₈-ZnPc-32(NH₂CH₃) (left), Cd₃₈Se₃₈-ZnPc-DPA-32(NH₂CH₃) (middle) and Cd₃₃Se₃₃/Zn₉₃S₉₃-2(ZnPc) (right) as a prototypical examples. We show the result for the absorption spectrum by varying the length of the short-time dynamics dipole signals between 500 and 5000 time steps. The reference RT-TDDFT absorption spectrum was simulated with in total 20,000 time steps. For details regarding the input frequencies, the reader is referred to Supplementary Tables 2–5. Intensity was abbreviated with Int.

To further support our visual analysis, we calculate the Pearson correlation coefficient (ρ) between various approximate methods and the 20,000 time step reference spectrum in the range of 1 to 12 eV. This allows us to quantify similarities between two spectra regarding overall shape and intensity across a broad frequency range. For this purpose, we utilize our two largest nanocrystal systems, Cd₃₃Se₃₃/Zn₉₃S₉₃-2(ZnPc) and Cd₃₃Se₃₃/Zn₉₃S₉₃-2(ZnPc)-DPA (see Supplementary Fig. 20), and average over the obtained correlation coefficients. Due to their high spectral density and spectral narrow feature characteristics, these systems pose significant challenges for any super-resolution technique. Figure 5 shows the results for BYND compared to other super-resolution approaches.

Fig. 5 — The Pearson correlation coefficient (ρ) is displayed as $\log (1 - ρ)$ . The reference spectrum is the Fourier transform from a 20,000 time step RT-TDDFT simulation. We compare the performance of BYND with compressed sensing (CS), Fourier-Padé (Pade), and Fourier transformation of a short-time signal (short). The correlation coefficients have been calculated for a spectral window from 1 to 12 eV. For technical reasons ( $\log (1 - ρ) \to - \infty$ ), we have omitted the final point of the Fourier transform of the short-time signal.

While the Pearson correlation is useful to quantify spectral similarity, we would like to emphasize that in many cases visual inspection reveals that BYND spectra are more similar to the converged spectra than the Pearson coefficient suggests (see methods section for more details). Thus, Fig. 5 serves as something of an upper bound on the error of BYND compared to other methods.

We have additionally demonstrated the accuracy of BYND for purely molecular systems (see Supplementary Fig. 15).

Observations, trends and limitations

As one can see in Fig. 4, we begin to obtain relatively accurate spectra compared to RT-TDDFT for our simplest system starting at only 500 time steps. Generally, all narrow features are reproduced for all test systems. Unsurprisingly, more challenging systems, namely Cd₃₃Se₃₃/Zn₉₃S₉₃-2(ZnPc), require more data points to achieve high accuracy results. However, we note that even for this system the spectrum is well reproduced using only 1500 time steps.

In cases where the splitting between the bright features is small, BYND requires more short-time data to fully resolve these details. For example, for the bright states at an excitation energy of around 3.5 eV in the Cd₃₈Se₃₈-ZnPc-32(NH₂CH₃) system, our algorithm predicts one single highly bright feature instead of the two exact, less bright excitations. With such a small number of data points our algorithm is not able to distinguish between these two frequencies and more time steps are needed to resolve them. We first observe the emergence of the second bright excitation upon including 3000 time steps, which is still roughly seven times shorter than the signal required for standard Fourier analysis. This is a general trend, and we obtain detailed resolved narrow features for all test systems when using 3000 time steps.

Similar observations can be made regarding the relative intensities. The narrow features can be clearly distinguished from the quasi-continuum background for all lengths of the short-time signal, but finding the correct relative intensities requires more time steps. The correct relative amplitudes of the narrow features are reproduced using 3000 time steps for all considered systems. The improvement coincides with a significant better assignment of continuum amplitudes. Continuum amplitudes are obtained from the residual signal $y_{λ μ}^{cont}$ which is described in eq. (4). As $f_{λ μ}^{sparse}$ becomes more and more accurate $y_{λ μ}^{cont}$ will be as well. On the other hand, an overestimation of amplitudes for bright excitations thus naturally leads to underestimation of the continuum region.

Going from 500 to 5000 time steps shows a clear convergence behaviour in accuracy. At 5000 time steps, we achieve already almost excellent agreement with the exact result which is still four times less data points compared to the full RT-TDDFT run. However, while significantly mitigated, errors in amplitude are still evident. The fact that BYND shows a clear convergence is supported by Fig. 5 where we compare the similarity with the long-time reference. Convergence towards the exact results underlines that BYND is not an approximate method. Provided with enough data points, BYND will yield the exact time dynamics. Thus, it demonstrates that BYND also fulfills the Thomas-Reiche-Kuhn sum rule^48–50 for the oscillator strength. Fourier-Padé approximation and CS do not show a systematic convergence behaviour. Both typically work best for a few well-separated narrow features which does not hold true anymore for the densely populate spectra of our systems^18,51. Contrary, BYND shows a significant better correlation with the reference spectrum for any number of time steps from 1500 onwards.

BYND is capable of describing the excited state spectra not only of nanocrystals but also for a broad variety of other systems. The field of dye-sensitized solar cells⁵², solar batteries⁵³, energy transfer⁵⁴ or catalysis⁵⁵, as well as chemical sensing⁵⁶ are just a few examples where BYND can be applied. To highlight this aspect, Fig. 6 displays spectra of a molecular aggregate, a nanotube, and two surface systems which are all accurately reproduced. Even when broad and narrow features coincide, as evident in Fig. 6a, d, BYND yields reliable results. Furthermore, the broad features in Fig. 6b/c at around 7 eV emerges from the continuum amplitude fitting and was not part of the narrow feature optimization. We conclude that if the dominant narrow features are correctly reproduced, additional broad features can be captured by the continuum fitting procedure. Our observations indicate that, separating the signal into sparse and continuum components is robust enough even for challenging spectral patterns. Thus this approach should not be purely restricted to electronic structure applications only. As long as the signal is separable into continuum and sparse parts by any kind of initial guess, BYND should be able to yield the correct spectral information.

Fig. 6 — a Cis-[Ru(4,4'-COOH-2,2'-bpy)₂(NCS)₂] on an anatase (101) cluster as an example for a dye-sensitized solar cell^65,66. b Molecular ZnPc j-aggregate. c ZnPc film on a Si (111) surface. d Zinc-porphyrin molecules on a carbon nanotube⁵⁴. Time steps: a 2500, b 1500, c 2000 and d 3000. Time steps have been chosen to provide a good trade-off between accuracy and minimizing the amount of data. Due to the large system size, we only show system (c) with a reference spectrum of 10,000 time steps; otherwise, we use 20,000 time steps. Intensity is abbreviated with Int.

In the case of electronic excitation spectra, BYND’s performance will generally depend on the quality of the SMA input frequencies used to identify the sparse contribution. In addition to errors in these frequencies themselves, there is also the possibility that the SMA generates too few or too many frequencies. We demonstrate in Supplementary Figs. 2 and 3 that the case where too many narrow feature frequencies are selected is usually not of concern. When too few frequencies are selected, BYND can possibly miss narrow features. The same holds true if ω_init is too small and input frequencies are too far away from their target. We find that measuring the quality of the fit between the target and BYND signals provides a useful tool for identifying both cases (see discussion in Supplementary information section 2). Another limitation of BYND arises from the TDDFT dipole signal’s lack of information about “dark" excitations; thus, BYND in the context of electronic excitations is limited to excitation energies with non-vanishing oscillator strengths.

Overall, our analysis shows that BYND is able to correctly predict the full excitation spectrum of large systems. Narrow features embedded in the continuum are clearly evident; bright molecular features, CT features, and contributions from the continuum are all well reproduced. This is achieved while significantly reducing the required computational workload. The range of 500 to 1500 time steps corresponds to a speed up by a factor of 13 to 40 compared to high resolution results. For reference, this cost reduction brings the calculation of a full TDDFT spectrum for a large system close to the computational cost of a standard ground state geometry optimization. Excellent agreement is then achieved by including more data points.

System size considerations

In order to give a perspective on which system sizes are possible with BYND, we display in Fig. 7 the scaling of BYND’s computational cost with system size. The filled data points represent full simulations of f-cororene, Cd₃₈Se₃₈-ZnPc-32(NH₂CH₃), and Cd₃₃Se₃₃/Zn₉₃S₉₃-2(ZnPc). Unfilled markers represent additional nanocrystal systems (see Supplementary Fig. 21) where we have used the RT-TDDFT wall-time estimates provided by FHIaims to predict their computational cost. At a given walltime, BYND enables the simulation of significantly larger systems than a standard RT-TDDFT run, even when using an input signal of 3000 time steps. To quantify this further, for Cd₃₃Se₃₃/Zn₉₃S₉₃-2(ZnPc) – one of our largest nanocrystals – we achieve very good narrow feature accuracy with a signal length of only 1500 time steps. For this number of time steps, BYND needs only 5138 CPUh, compared to the 58,359 CPUh required for the full long-time dynamics run – a reduction in required computational time by factor of 11.3.

In this work we showed how approximate frequency space results can be combined with short-time dynamics simulations in order to accurately capture narrow features and a quasi-continuum of states for large systems. Due to the ability of BYND to use only short-time dynamics, we are able to significantly reduce the computational time which is needed for the underlying electronic structure simulations. For one of our highly challenging systems we observe a reduction by a factor of 11. The reduction of computational time is due to two key components of our approach. First, we use the SMA as an estimate for how many narrow features can be expected and which frequencies they have. Second, we use this information to further optimize their position and amplitudes by minimizing the error with respect to the short-time dynamics signal which on its own would have insignificant resolution to capture the spectrum. Thus, our approach allows researchers to understand the electronic properties of large systems which were previously computationally inaccessible. In contrast to methods such as filter diagonalization⁵⁷, which only shows promising results if the spectrum is not too dense⁵¹, BYND is explicitly designed to work with high spectral densities. Further, we would like to emphasize that BYND is not an approximation: if enough data is provided, the results will always converge towards the full-time dynamics of the chosen electronic structure method. This is in contrast to other methods such as simplified TDDFT⁵⁸, simplified GW/sBSE⁵⁹, or TD-INDO/S⁶⁰ which employ approximations to the electron interaction integrals in order to achieve computational speedup. To increase the data available without computing more time steps, future work on BYND is aimed towards including quadrupole or higher multipole moments. More data points will then enable the use of even shorter time dynamics. Improvement in accuracy could be achieved by using the Casida equations to explicitly describe just the first few excited states, which can be then fixed in our non-linear optimization to yield an even more efficient localization of the remaining narrow features. Furthermore, we see potential in improving our line-search routine by employing advanced machine-learning techniques, which may allow us to further increase the range of the spectrum covered by the sparse signal. Further one can use SMA results from semi-local exchange-correlation kernels to approximate hybrid TDDFT or GW results, thus saving additional simulation time.

In conclusion, we have combined frequency domain results with exact short-time dynamics in order to create a super-resolution technique (BYND) which allows for the ab initio description of the entire excitation spectrum for systems which are beyond the system size boundaries of current electronic structure methods.

Methods

TDDFT simulations

The time-dependent Kohn-Sham states are explicitly propagated in time under the influence of an electric field, $E_{λ} (t) = V_{λ} δ (t)$ ^34,35. Once the time-dependent dipole moment ( $μ_{ν} (t)$ ) is obtained from the simulation, one can use the polarizability tensor in frequency space,^12,61

α_{λ ν} (ω) = \frac{1}{V_{λ}} \int_{0}^{\infty} d t e^{- i ω t} [μ_{ν} (t) - μ_{ν} (t_{0})],

to calculate the final excitation spectrum⁶¹

S (ω) = \frac{2 ω}{3 π} Tr \{ℑ [α (ω)]\} .

All TDDFT calculations for our nanocrystal systems have been carried out using the FHIaims⁶² program. Exchange-correlation interactions have been treated using the PBE⁶³ functional. Light tier1 settings have been used for the integration grid and basis set. All RT-TDDFT⁶¹ calculations have been performed with a time step of 0.2 a.u. and an electric field strength of 0.01 a.u. Total simulation time was 4000 a.u.

Quantifying similarities between two spectra

In order to quantify the similarity between two spectra, we make use of the Pearson correlation coefficient,

ρ = \frac{\sum_{i} (x_{i} - \bar{x}) (y_{i} - ȳ)}{\sqrt{\sum_{i} {(x_{i} - \bar{x})}^{2}} \sqrt{\sum_{i} {(y_{i} - ȳ)}^{2}}} .

Here, both spectra (x and y) are represented as vectors with their corresponding mean values $\bar{x}$ and $ȳ$ . We can interpret Eq. (9) as the overlap of the variation of the spectrum from its average; thus, the overlap of a completely smooth distribution will be exactly zero. We find that the Pearson correlation is therefore more sensitive when comparing spectra obtained from very short-time signals, as opposed to, for example, the spectral angle mapper which only accounts for the overlap. One limitation of quantifying spectral overlaps is that spectral shifts and differences in peak positions may not adequately captured. Eq. (9) compares the two spectra bin by bin, which means that if two very narrow features are just shifted slightly, their overlap would be zero. However, a visual inspection would clearly indicate a large similarity in this situation. In order to mitigate this effect, we convolute the obtained spectrum with a Gaussian function, which effectively spreads out the spectral peaks and increases their width. Note that this Gaussian broadening is not applied to our 20,000 RT-TDDFT reference. For each method (see Fig. 5) we apply a range of broadening factors for each number of time steps and calculate the correlation coefficient between the broadened spectrum and the reference spectrum. We then pick the broadening that gives the optimal Pearson coefficient.

In general, numerous peaks as well as broad features make quantifying the difference between very complex spectra quite challenging. As a consequence, quantifying the difference between two spectra as a whole may lack sensitivity. Nevertheless, this analysis is an excellent support for Fig. 4 where we also provide visual representations of the obtained spectra to allow for more comprehensive understanding.

SMA

In this approximation, the electronic excitation energies in frequency space are simply given by

ω_{i} = \sqrt{{(ϵ_{a} - ϵ_{i})}^{2} + 4 (ϵ_{a} - ϵ_{i}) ⟨i a ∣ f_{H x c} ∣ i a⟩},

with ϵ_a and ϵ_i being the eigenstate energies of the a-th virtual and i-th occupied Kohn-Sham state while f_Hxc denotes the Hartree and exchange-correlation kernel. Thus, the SMA gives us a simple analytical expression for obtaining a large number of excitation energies without the necessity of solving the Casida equations directly. Strictly speaking, the SMA is only exact if the single-particle excitations show vanishing overlap, and thus in realistic systems the SMA is error prone and can only serve as a first approximate step. However, it remains a significant improvement over, for example, just using the ground state spectrum as it contains a great deal of information about the relative location of the bright states.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Supplementary information

supplementary_information^{(4MB, pdf)}

Peer Review File^{(199.9KB, pdf)}

41467_2024_52368_MOESM3_ESM.pdf^{(32.8KB, pdf)}

Description of Additional Supplementary Files

Supplementary Data 1^{(103.3KB, zip)}

Reporting Summary^{(247.3KB, pdf)}

Source data

Source Data^{(1.3MB, zip)}

Acknowledgements

M.K. acknowledges support from the German Research Foundation (DFG, KI 2558/1-1, 505191319). Further, M.K. would like to thank Prof. Harald Oberhofer and Cristina Grosu for their valuable input and support. T.V.V. acknowledges support from the National Science Foundation (Award no. CHE-2154938). This work used expanse at the San Diego Supercomputing Center through allocation CHE200006 from the Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support (ACCESS) program, which is supported by the National Science Foundation grants (Award no. OAC-2138259, OAC-2138286, OAC-2138307, OAC-2137603, and OAC-2138296).

Author contributions

M.K. performed all electronic structure calculations, designed the algorithm and performed all necessary code implementations. M.K. also wrote the manuscript. E.A. helped editing the manuscript and contributed fruitfully in various discussions. A.B. carried out test calculations on small molecular systems. T.V.V. edited the manuscript and helped with algorithm design.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Data availability

SMA initial guess frequenies, xyz-structure files and TDDFT input files are provided in the Supplementary Information/Supplementary Data 1 file. Source data for Figs. 1–7 are provided with this paper in form of a Source Data file. Source data are provided with this paper.

Code availability

The code and a tutorial on how to use it can be obtained from our GitHub repository (BYND)⁶⁴.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

The online version contains supplementary material available at 10.1038/s41467-024-52368-5.

References

1.Ren, Y. Refined standards for simulating UV-VIS absorption spectra of acceptors in organic solar cells by TD-DFT. J. Photochem. Photobiol. A Chem.407, 113087 (2021). [Google Scholar]
2.Goldzak, T., McIsaac, A. R. & Van Voorhis, T. Colloidal CdSe nanocrystals are inherently defective. Nat. Commun.12, 890 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Ali, A. et al. TD-DFT benchmark for UV-visible spectra of fused-ring electron acceptors using global and range-separated hybrids. Phys. Chem. Chem. Phys.22, 7864–7874 (2020). [DOI] [PubMed] [Google Scholar]
4.Neef, A. et al. Orbital-resolved observation of singlet fission. Nature616, 275–279 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Slowik, I. et al. Novel organic light-emitting diode design for future lasing applications. Org. Electron.48, 132–137 (2017). [Google Scholar]
6.Kinoshita, T. et al. Spectral splitting photovoltaics using perovskite and wideband dye-sensitized solar cells. Nat. Commun.6, 8834 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Gasparini, N. et al. Adjusting the energy of interfacial states in organic photovoltaics for maximum efficiency. Nat. Commun.12, 1772 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Coppola, C. et al. DFT and TDDFT investigation of four triphenylamine/phenothiazine-based molecules as potential novel organic hole transport materials for perovskite solar cells. Mater. Chem. Phys.278, 125603 (2022). [Google Scholar]
9.Lyakurwa, M. & Numbury, S. B. DFT and TD-DFT study of optical and electronic properties of new donor-acceptor-donor monomers for polymer solar cells. Oxf. Open Mater. Sci.3, itad003 (2023). [Google Scholar]
10.Zaier, R., Hajaji, S., Kozaki, M. & Ayachi, S. DFT and TD-DFT studies on the electronic and optical properties of linear π-conjugated cyclopentadithiophene (cpdt) dimer for efficient blue oled. Opt. Mater.91, 108–114 (2019). [Google Scholar]
11.Moradpour, B. & Omidyan, R. DFT/TD-DFT study of electronic and phosphorescent properties in cycloplatinated complexes: implications for oleds. RSC Adv.12, 34217–34225 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Jornet-Somoza, J. & Lebedeva, I. Real-time propagation TDDFT and density analysis for exciton coupling calculations in large systems. J. Chem. Theory Comput.15, 3743–3754 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Perfetto, E., Pavlyukh, Y. & Stefanucci, G. Real-time gw: toward an ab initio description of the ultrafast carrier and exciton dynamics in two-dimensional materials. Phys. Rev. Lett.128, 016801 (2022). [DOI] [PubMed] [Google Scholar]
14.Vila, F. D., Rehr, J. J., Kas, J. J., Kowalski, K. & Peng, B. Real-time coupled-cluster approach for the cumulant green’s function. J. Chem. Theory Comput.16, 6983–6992 (2020). [DOI] [PubMed] [Google Scholar]
15.Rehr, J. J. et al. Equation of motion coupled-cluster cumulant approach for intrinsic losses in x-ray spectra. J. Chem. Phys.152, 174113 (2020). [DOI] [PubMed] [Google Scholar]
16.Vila, F. D. et al. Real-time equation-of-motion CC cumulant and CC Green’s function simulations of photoemission spectra of water and water dimer. J. Chem. Phys.157, 044101 (2022). [DOI] [PubMed] [Google Scholar]
17.Ruberti, M., Decleva, P. & Averbukh, V. Multi-channel dynamics in high harmonic generation of aligned CO2: ab initio analysis with time-dependent b-spline algebraic diagrammatic construction. Phys. Chem. Chem. Phys.20, 8311–8325 (2018). [DOI] [PubMed] [Google Scholar]
18.Candès, E. J., Romberg, J. K. & Tao, T. Stable signal recovery from incomplete and inaccurate measurements. Commun. pure appl. math.59, 1207–1223 (2006). [Google Scholar]
19.Sejdic, E., Orovic, I. & Stankovic, S. Compressive sensing meets time-frequency: an overview of recent advances in time-frequency processing of sparse signals. Digit. Signal Process.77, 22–35 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Orović, I., Papić, V., Ioana, C., Li, X. & Stanković, S. Compressive sensing in signal processing: algorithms and transform domain formulations. Math. Probl. Eng.2016, 7616393 (2016). [Google Scholar]
21.Schmidt, R. Multiple emitter location and signal parameter estimation. IEEE Trans. Antenn. Propag.34, 276–280 (1986). [Google Scholar]
22.Wang, J., Kwon, S. & Shim, B. Generalized orthogonal matching pursuit. IEEE Trans. Signal Process.60, 6202–6216 (2012). [Google Scholar]
23.Mallat, S. & Zhang, Z. Matching pursuits with time-frequency dictionaries. IEEE Trans. Signal Process.41, 3397–3415 (1993). [Google Scholar]
24.Casida, M. & Huix-Rotllant, M. Progress in time-dependent density-functional theory. Annu. Rev. Phys. Chem.63, 287–323 (2012). [DOI] [PubMed] [Google Scholar]
25.Sternheimer, R. On nuclear quadrupole moments. Phys. Rev.84, 244–253 (1951). [Google Scholar]
26.Vasiliev, I., Öğüt, S. & Chelikowsky, J. R. Ab initio excitation spectra and collective electronic response in atoms and clusters. Phys. Rev. Lett.82, 1919–1922 (1999). [Google Scholar]
27.Jacquemin, D., Wathelet, V., Perpète, E. A. & Adamo, C. Extensive TD-DFT benchmark: singlet-excited states of organic molecules. J. Chem. Theory Comput.5, 2420–2435 (2009). [DOI] [PubMed] [Google Scholar]
28.Led, J. J. & Gesmar, H. Application of the linear prediction method to NMR spectroscopy. Chem. Rev.91, 1413–1426 (1991). [Google Scholar]
29.Koehl, P. Linear prediction spectral analysis of NMR data. Prog. Nucl. Magn. Reson. Spectrosc.34, 257–299 (1999). [Google Scholar]
30.Swagel, E., Paul, J., Bristow, A. D. & Wahlstrand, J. K. Analysis of complex multidimensional optical spectra by linear prediction. Opt. Express29, 37525–37533 (2021). [DOI] [PubMed] [Google Scholar]
31.Li, R., Li, H. & Shi, W. Human activity recognition based on LPA. Multimed. Tools Appl.79, 31069–31086 (2020). [Google Scholar]
32.Makhoul, J. Spectral linear prediction: properties and applications. IEEE T. Acoust. Speech23, 283–296 (1975). [Google Scholar]
33.Makhoul, J. Linear prediction: a tutorial review. Proc. IEEE63, 561–580 (1975). [Google Scholar]
34.Tussupbayev, S., Govind, N., Lopata, K. & Cramer, C. J. Comparison of real-time and linear-response time-dependent density functional theories for molecular chromophores ranging from sparse to high densities of states. J. Chem. Theory Comput.11, 1102–1109 (2015). [DOI] [PubMed] [Google Scholar]
35.Pela, R. R. & Draxl, C. All-electron full-potential implementation of real-time TDDFT in exciting. Electron. Struct.3, 037001 (2021). [Google Scholar]
36.Falke, S. M. et al. Coherent ultrafast charge transfer in an organic photovoltaic blend. Science344, 1001–1005 (2014). [DOI] [PubMed] [Google Scholar]
37.Wachter, G. et al. Ab initio simulation of electrical currents induced by ultrafast laser excitation of dielectric materials. Phys. Rev. Lett.113, 087401 (2014). [DOI] [PubMed] [Google Scholar]
38.Meng, S. & Kaxiras, E. Electron and hole dynamics in dye-sensitized solar cells: influencing factors and systematic trends. Nano Lett.10, 1238–1247 (2010). [DOI] [PubMed] [Google Scholar]
39.Lian, C., Guan, M., Hu, S., Zhang, J. & Meng, S. Photoexcitation in solids: first-principles quantum simulations by real-time TDDFT. Adv. Theory Simul.1, 1800055 (2018). [Google Scholar]
40.Provorse, M. R. & Isborn, C. M. Electron dynamics with real-time time-dependent density functional theory. Int. J. Quantum Chem.116, 739–749 (2016). [Google Scholar]
41.Lopata, K. & Govind, N. Modeling fast electron dynamics with real-time time-dependent density functional theory: application to small molecules and chromophores. J. Chem. Theory Comput.7, 1344–1355 (2011). [DOI] [PubMed] [Google Scholar]
42.Shepard, C., Zhou, R., Yost, D. C., Yao, Y. & Kanai, Y. Simulating electronic excitation and dynamics with real-time propagation approach to TDDFT within plane-wave pseudopotential formulation. J. Chem. Phys.155, 100901 (2021). [DOI] [PubMed] [Google Scholar]
43.Schelter, I. & Kümmel, S. Accurate evaluation of real-time density functional theory providing access to challenging electron dynamics. J. Chem. Theory Comput.14, 1910–1927 (2018). [DOI] [PubMed] [Google Scholar]
44.Hoerl, A. E. & Kennard, R. W. Ridge regression: biased estimation for nonorthogonal problems. Technometrics12, 55–67 (1970). [Google Scholar]
45.Tikhonov, A. N. Solution of incorrectly formulated problems and the regularization method. Soviet Math. Dokl.4, 1035–1038 (1963). [Google Scholar]
46.Lange, H., Brunton, S. L. & Kutz, J. N. From fourier to koopman: spectral methods for long-term time series prediction. J. Mach. Learn Res.22, 1881–1918 (2021). [Google Scholar]
47.Curtis, S. The classification of greedy algorithms. Sci. Comput. Program.49, 125–157 (2003). [Google Scholar]
48.Thomas, W. Über die zahl der dispersionselektronen, die einem stationären zustande zugeordnet sind. (vorläufige mitteilung). Naturwissenschaften13, 627–627 (1925). [Google Scholar]
49.Reiche, F. & Thomas, W. Über die zahl der dispersionselektronen, die einem stationären zustand zugeordnet sind. Zeit. f. Phys.34, 510–525 (1925). [Google Scholar]
50.Kuhn, W. Über die gesamtstärke der von einem zustande ausgehenden absorptionslinien. Zeit. f. Phys.33, 408–412 (1925). [Google Scholar]
51.Bruner, A., LaMaster, D. & Lopata, K. Accelerated broadband spectra using transition dipole decomposition and padé approximants. J. Chem. Theory Comput.12, 3741–3750 (2016). [DOI] [PubMed] [Google Scholar]
52.Hagfeldt, A., Boschloo, G., Sun, L., Kloo, L. & Pettersson, H. Dye-sensitized solar cells. Chem. Rev.110, 6595–6663 (2010). [DOI] [PubMed] [Google Scholar]
53.Gouder, A. & Lotsch, B. V. Integrated solar batteries: design and device concepts. ACS Energy Lett.8, 3343–3355 (2023). [Google Scholar]
54.Arellano, L. M. et al. Charge stabilizing tris(triphenylamine)-zinc porphyrin-carbon nanotube hybrids: synthesis, characterization and excited state charge transfer studies. Nanoscale9, 7551–7558 (2017). [DOI] [PubMed] [Google Scholar]
55.Lin, C.-H. et al. Density-functional theory studies on photocatalysis and photoelectrocatalysis: challenges and opportunities. Sol. RRL8, 2300948 (2024). [Google Scholar]
56.Swager, T. M. & Mirica, K. A. Introduction: chemical sensors. Chem. Rev.119, 1–2 (2019). [DOI] [PubMed] [Google Scholar]
57.Wall, M. R. & Neuhauser, D. Extraction, through filter diagonalization, of general quantum eigenvalues or classical normal mode frequencies from a small number of residues or a short time segment of a signal. I. Theory and application to a quantum dynamics model. J. Chem. Phys.102, 8011–8022 (1995). [Google Scholar]
58.Bannwarth, C. & Grimme, S. A simplified time-dependent density functional theory approach for electronic ultraviolet and circular dichroism spectra of very large molecules. Comput. Theor. Chem.1040-1041, 45–53 (2014). [Google Scholar]
59.Cho, Y., Bintrim, S. J. & Berkelbach, T. C. Simplified gw/bse approach for charged and neutral excitation energies of large molecules and nanomaterials. J. Chem. Theory Comput.18, 3438–3446 (2022). [DOI] [PubMed] [Google Scholar]
60.Ghosh, S., Andersen, A., Gagliardi, L., Cramer, C. J. & Govind, N. Modeling optical spectra of large organic systems using real-time propagation of semiempirical effective hamiltonians. J. Chem. Theory Comput.13, 4410–4420 (2017). [DOI] [PubMed] [Google Scholar]
61.Hekele, J., Yao, Y., Kanai, Y., Blum, V. & Kratzer, P. All-electron real-time and imaginary-time time-dependent density functional theory within a numeric atom-centered basis function framework. J. Chem. Phys.155, 154801 (2021). [DOI] [PubMed] [Google Scholar]
62.Blum, V. et al. Ab initio molecular simulations with numeric atom-centered orbitals. Comput. Phys. Commun.180, 2175–2196 (2009). [Google Scholar]
63.Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett.77, 3865–3868 (1996). [DOI] [PubMed] [Google Scholar]
64.Kick, M. & Van Voorhis, T. Super-resolution techniques to simulate electronic spectra of large molecular systems. https://github.com/mk8819/bynd (2024). [DOI] [PMC free article] [PubMed]
65.Klein, M., Pankiewicz, R., Zalas, M. & Stampor, W. Magnetic field effects in dye-sensitized solar cells controlled by different cell architecture. Sci. Rep.6, 30077 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
66.Lundqvist, M. J., Nilsing, M., Persson, P. & Lunell, S. DFT study of bare and dye-sensitized TiO2 clusters and nanocrystals. Int. J. Quantum Chem.106, 3214–3234 (2006). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supplementary_information^{(4MB, pdf)}

Peer Review File^{(199.9KB, pdf)}

41467_2024_52368_MOESM3_ESM.pdf^{(32.8KB, pdf)}

Description of Additional Supplementary Files

Supplementary Data 1^{(103.3KB, zip)}

Reporting Summary^{(247.3KB, pdf)}

Source Data^{(1.3MB, zip)}

Data Availability Statement

The code and a tutorial on how to use it can be obtained from our GitHub repository (BYND)⁶⁴.

[CR1] 1.Ren, Y. Refined standards for simulating UV-VIS absorption spectra of acceptors in organic solar cells by TD-DFT. J. Photochem. Photobiol. A Chem.407, 113087 (2021). [Google Scholar]

[CR2] 2.Goldzak, T., McIsaac, A. R. & Van Voorhis, T. Colloidal CdSe nanocrystals are inherently defective. Nat. Commun.12, 890 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Ali, A. et al. TD-DFT benchmark for UV-visible spectra of fused-ring electron acceptors using global and range-separated hybrids. Phys. Chem. Chem. Phys.22, 7864–7874 (2020). [DOI] [PubMed] [Google Scholar]

[CR4] 4.Neef, A. et al. Orbital-resolved observation of singlet fission. Nature616, 275–279 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.Slowik, I. et al. Novel organic light-emitting diode design for future lasing applications. Org. Electron.48, 132–137 (2017). [Google Scholar]

[CR6] 6.Kinoshita, T. et al. Spectral splitting photovoltaics using perovskite and wideband dye-sensitized solar cells. Nat. Commun.6, 8834 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Gasparini, N. et al. Adjusting the energy of interfacial states in organic photovoltaics for maximum efficiency. Nat. Commun.12, 1772 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Coppola, C. et al. DFT and TDDFT investigation of four triphenylamine/phenothiazine-based molecules as potential novel organic hole transport materials for perovskite solar cells. Mater. Chem. Phys.278, 125603 (2022). [Google Scholar]

[CR9] 9.Lyakurwa, M. & Numbury, S. B. DFT and TD-DFT study of optical and electronic properties of new donor-acceptor-donor monomers for polymer solar cells. Oxf. Open Mater. Sci.3, itad003 (2023). [Google Scholar]

[CR10] 10.Zaier, R., Hajaji, S., Kozaki, M. & Ayachi, S. DFT and TD-DFT studies on the electronic and optical properties of linear π-conjugated cyclopentadithiophene (cpdt) dimer for efficient blue oled. Opt. Mater.91, 108–114 (2019). [Google Scholar]

[CR11] 11.Moradpour, B. & Omidyan, R. DFT/TD-DFT study of electronic and phosphorescent properties in cycloplatinated complexes: implications for oleds. RSC Adv.12, 34217–34225 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Jornet-Somoza, J. & Lebedeva, I. Real-time propagation TDDFT and density analysis for exciton coupling calculations in large systems. J. Chem. Theory Comput.15, 3743–3754 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Perfetto, E., Pavlyukh, Y. & Stefanucci, G. Real-time gw: toward an ab initio description of the ultrafast carrier and exciton dynamics in two-dimensional materials. Phys. Rev. Lett.128, 016801 (2022). [DOI] [PubMed] [Google Scholar]

[CR14] 14.Vila, F. D., Rehr, J. J., Kas, J. J., Kowalski, K. & Peng, B. Real-time coupled-cluster approach for the cumulant green’s function. J. Chem. Theory Comput.16, 6983–6992 (2020). [DOI] [PubMed] [Google Scholar]

[CR15] 15.Rehr, J. J. et al. Equation of motion coupled-cluster cumulant approach for intrinsic losses in x-ray spectra. J. Chem. Phys.152, 174113 (2020). [DOI] [PubMed] [Google Scholar]

[CR16] 16.Vila, F. D. et al. Real-time equation-of-motion CC cumulant and CC Green’s function simulations of photoemission spectra of water and water dimer. J. Chem. Phys.157, 044101 (2022). [DOI] [PubMed] [Google Scholar]

[CR17] 17.Ruberti, M., Decleva, P. & Averbukh, V. Multi-channel dynamics in high harmonic generation of aligned CO2: ab initio analysis with time-dependent b-spline algebraic diagrammatic construction. Phys. Chem. Chem. Phys.20, 8311–8325 (2018). [DOI] [PubMed] [Google Scholar]

[CR18] 18.Candès, E. J., Romberg, J. K. & Tao, T. Stable signal recovery from incomplete and inaccurate measurements. Commun. pure appl. math.59, 1207–1223 (2006). [Google Scholar]

[CR19] 19.Sejdic, E., Orovic, I. & Stankovic, S. Compressive sensing meets time-frequency: an overview of recent advances in time-frequency processing of sparse signals. Digit. Signal Process.77, 22–35 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Orović, I., Papić, V., Ioana, C., Li, X. & Stanković, S. Compressive sensing in signal processing: algorithms and transform domain formulations. Math. Probl. Eng.2016, 7616393 (2016). [Google Scholar]

[CR21] 21.Schmidt, R. Multiple emitter location and signal parameter estimation. IEEE Trans. Antenn. Propag.34, 276–280 (1986). [Google Scholar]

[CR22] 22.Wang, J., Kwon, S. & Shim, B. Generalized orthogonal matching pursuit. IEEE Trans. Signal Process.60, 6202–6216 (2012). [Google Scholar]

[CR23] 23.Mallat, S. & Zhang, Z. Matching pursuits with time-frequency dictionaries. IEEE Trans. Signal Process.41, 3397–3415 (1993). [Google Scholar]

[CR24] 24.Casida, M. & Huix-Rotllant, M. Progress in time-dependent density-functional theory. Annu. Rev. Phys. Chem.63, 287–323 (2012). [DOI] [PubMed] [Google Scholar]

[CR25] 25.Sternheimer, R. On nuclear quadrupole moments. Phys. Rev.84, 244–253 (1951). [Google Scholar]

[CR26] 26.Vasiliev, I., Öğüt, S. & Chelikowsky, J. R. Ab initio excitation spectra and collective electronic response in atoms and clusters. Phys. Rev. Lett.82, 1919–1922 (1999). [Google Scholar]

[CR27] 27.Jacquemin, D., Wathelet, V., Perpète, E. A. & Adamo, C. Extensive TD-DFT benchmark: singlet-excited states of organic molecules. J. Chem. Theory Comput.5, 2420–2435 (2009). [DOI] [PubMed] [Google Scholar]

[CR28] 28.Led, J. J. & Gesmar, H. Application of the linear prediction method to NMR spectroscopy. Chem. Rev.91, 1413–1426 (1991). [Google Scholar]

[CR29] 29.Koehl, P. Linear prediction spectral analysis of NMR data. Prog. Nucl. Magn. Reson. Spectrosc.34, 257–299 (1999). [Google Scholar]

[CR30] 30.Swagel, E., Paul, J., Bristow, A. D. & Wahlstrand, J. K. Analysis of complex multidimensional optical spectra by linear prediction. Opt. Express29, 37525–37533 (2021). [DOI] [PubMed] [Google Scholar]

[CR31] 31.Li, R., Li, H. & Shi, W. Human activity recognition based on LPA. Multimed. Tools Appl.79, 31069–31086 (2020). [Google Scholar]

[CR32] 32.Makhoul, J. Spectral linear prediction: properties and applications. IEEE T. Acoust. Speech23, 283–296 (1975). [Google Scholar]

[CR33] 33.Makhoul, J. Linear prediction: a tutorial review. Proc. IEEE63, 561–580 (1975). [Google Scholar]

[CR34] 34.Tussupbayev, S., Govind, N., Lopata, K. & Cramer, C. J. Comparison of real-time and linear-response time-dependent density functional theories for molecular chromophores ranging from sparse to high densities of states. J. Chem. Theory Comput.11, 1102–1109 (2015). [DOI] [PubMed] [Google Scholar]

[CR35] 35.Pela, R. R. & Draxl, C. All-electron full-potential implementation of real-time TDDFT in exciting. Electron. Struct.3, 037001 (2021). [Google Scholar]

[CR36] 36.Falke, S. M. et al. Coherent ultrafast charge transfer in an organic photovoltaic blend. Science344, 1001–1005 (2014). [DOI] [PubMed] [Google Scholar]

[CR37] 37.Wachter, G. et al. Ab initio simulation of electrical currents induced by ultrafast laser excitation of dielectric materials. Phys. Rev. Lett.113, 087401 (2014). [DOI] [PubMed] [Google Scholar]

[CR38] 38.Meng, S. & Kaxiras, E. Electron and hole dynamics in dye-sensitized solar cells: influencing factors and systematic trends. Nano Lett.10, 1238–1247 (2010). [DOI] [PubMed] [Google Scholar]

[CR39] 39.Lian, C., Guan, M., Hu, S., Zhang, J. & Meng, S. Photoexcitation in solids: first-principles quantum simulations by real-time TDDFT. Adv. Theory Simul.1, 1800055 (2018). [Google Scholar]

[CR40] 40.Provorse, M. R. & Isborn, C. M. Electron dynamics with real-time time-dependent density functional theory. Int. J. Quantum Chem.116, 739–749 (2016). [Google Scholar]

[CR41] 41.Lopata, K. & Govind, N. Modeling fast electron dynamics with real-time time-dependent density functional theory: application to small molecules and chromophores. J. Chem. Theory Comput.7, 1344–1355 (2011). [DOI] [PubMed] [Google Scholar]

[CR42] 42.Shepard, C., Zhou, R., Yost, D. C., Yao, Y. & Kanai, Y. Simulating electronic excitation and dynamics with real-time propagation approach to TDDFT within plane-wave pseudopotential formulation. J. Chem. Phys.155, 100901 (2021). [DOI] [PubMed] [Google Scholar]

[CR43] 43.Schelter, I. & Kümmel, S. Accurate evaluation of real-time density functional theory providing access to challenging electron dynamics. J. Chem. Theory Comput.14, 1910–1927 (2018). [DOI] [PubMed] [Google Scholar]

[CR44] 44.Hoerl, A. E. & Kennard, R. W. Ridge regression: biased estimation for nonorthogonal problems. Technometrics12, 55–67 (1970). [Google Scholar]

[CR45] 45.Tikhonov, A. N. Solution of incorrectly formulated problems and the regularization method. Soviet Math. Dokl.4, 1035–1038 (1963). [Google Scholar]

[CR46] 46.Lange, H., Brunton, S. L. & Kutz, J. N. From fourier to koopman: spectral methods for long-term time series prediction. J. Mach. Learn Res.22, 1881–1918 (2021). [Google Scholar]

[CR47] 47.Curtis, S. The classification of greedy algorithms. Sci. Comput. Program.49, 125–157 (2003). [Google Scholar]

[CR48] 48.Thomas, W. Über die zahl der dispersionselektronen, die einem stationären zustande zugeordnet sind. (vorläufige mitteilung). Naturwissenschaften13, 627–627 (1925). [Google Scholar]

[CR49] 49.Reiche, F. & Thomas, W. Über die zahl der dispersionselektronen, die einem stationären zustand zugeordnet sind. Zeit. f. Phys.34, 510–525 (1925). [Google Scholar]

[CR50] 50.Kuhn, W. Über die gesamtstärke der von einem zustande ausgehenden absorptionslinien. Zeit. f. Phys.33, 408–412 (1925). [Google Scholar]

[CR51] 51.Bruner, A., LaMaster, D. & Lopata, K. Accelerated broadband spectra using transition dipole decomposition and padé approximants. J. Chem. Theory Comput.12, 3741–3750 (2016). [DOI] [PubMed] [Google Scholar]

[CR52] 52.Hagfeldt, A., Boschloo, G., Sun, L., Kloo, L. & Pettersson, H. Dye-sensitized solar cells. Chem. Rev.110, 6595–6663 (2010). [DOI] [PubMed] [Google Scholar]

[CR53] 53.Gouder, A. & Lotsch, B. V. Integrated solar batteries: design and device concepts. ACS Energy Lett.8, 3343–3355 (2023). [Google Scholar]

[CR54] 54.Arellano, L. M. et al. Charge stabilizing tris(triphenylamine)-zinc porphyrin-carbon nanotube hybrids: synthesis, characterization and excited state charge transfer studies. Nanoscale9, 7551–7558 (2017). [DOI] [PubMed] [Google Scholar]

[CR55] 55.Lin, C.-H. et al. Density-functional theory studies on photocatalysis and photoelectrocatalysis: challenges and opportunities. Sol. RRL8, 2300948 (2024). [Google Scholar]

[CR56] 56.Swager, T. M. & Mirica, K. A. Introduction: chemical sensors. Chem. Rev.119, 1–2 (2019). [DOI] [PubMed] [Google Scholar]

[CR57] 57.Wall, M. R. & Neuhauser, D. Extraction, through filter diagonalization, of general quantum eigenvalues or classical normal mode frequencies from a small number of residues or a short time segment of a signal. I. Theory and application to a quantum dynamics model. J. Chem. Phys.102, 8011–8022 (1995). [Google Scholar]

[CR58] 58.Bannwarth, C. & Grimme, S. A simplified time-dependent density functional theory approach for electronic ultraviolet and circular dichroism spectra of very large molecules. Comput. Theor. Chem.1040-1041, 45–53 (2014). [Google Scholar]

[CR59] 59.Cho, Y., Bintrim, S. J. & Berkelbach, T. C. Simplified gw/bse approach for charged and neutral excitation energies of large molecules and nanomaterials. J. Chem. Theory Comput.18, 3438–3446 (2022). [DOI] [PubMed] [Google Scholar]

[CR60] 60.Ghosh, S., Andersen, A., Gagliardi, L., Cramer, C. J. & Govind, N. Modeling optical spectra of large organic systems using real-time propagation of semiempirical effective hamiltonians. J. Chem. Theory Comput.13, 4410–4420 (2017). [DOI] [PubMed] [Google Scholar]

[CR61] 61.Hekele, J., Yao, Y., Kanai, Y., Blum, V. & Kratzer, P. All-electron real-time and imaginary-time time-dependent density functional theory within a numeric atom-centered basis function framework. J. Chem. Phys.155, 154801 (2021). [DOI] [PubMed] [Google Scholar]

[CR62] 62.Blum, V. et al. Ab initio molecular simulations with numeric atom-centered orbitals. Comput. Phys. Commun.180, 2175–2196 (2009). [Google Scholar]

[CR63] 63.Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett.77, 3865–3868 (1996). [DOI] [PubMed] [Google Scholar]

[CR64] 64.Kick, M. & Van Voorhis, T. Super-resolution techniques to simulate electronic spectra of large molecular systems. https://github.com/mk8819/bynd (2024). [DOI] [PMC free article] [PubMed]

[CR65] 65.Klein, M., Pankiewicz, R., Zalas, M. & Stampor, W. Magnetic field effects in dye-sensitized solar cells controlled by different cell architecture. Sci. Rep.6, 30077 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR66] 66.Lundqvist, M. J., Nilsing, M., Persson, P. & Lunell, S. DFT study of bare and dye-sensitized TiO2 clusters and nanocrystals. Int. J. Quantum Chem.106, 3214–3234 (2006). [DOI] [PubMed] [Google Scholar]

PERMALINK

Super-resolution techniques to simulate electronic spectra of large molecular systems

Matthias Kick

Ezra Alexander

Anton Beiersdorfer

Troy Van Voorhis

Abstract

Introduction

Fig. 1. Working principle of BYND.

Results and discussion

Linear prediction

Fig. 2. Linear prediction of the excitation dipole spectra of Cd38Se38-ZnPc-32(NH2CH3).

SMA

RT-TDDFT

Combining SMA with RT-TDDFT

Fig. 3. Fourier transform of the time-dependent dipole moment of Cd38Se38-ZnPc-32(NH2CH3).

Narrow feature selection

Narrow feature optimization

Box 1: Line-search.

Relaxation of quasi continuum

Convergence and accuracy

Fig. 4. Convergence behaviour with respect to the number of data points.

Fig. 5. Averaged Pearson correlation coefficient of Cd33Se33/Zn93S93-2(ZnPc) and Cd33Se33/Zn93S93-2(ZnPc)-DPA.

Observations, trends and limitations

Fig. 6. Additional systems.

System size considerations

Fig. 7. Possible system sizes.

Methods

TDDFT simulations

Quantifying similarities between two spectra

SMA

Reporting summary

Supplementary information

Source data

Acknowledgements

Author contributions

Peer review

Peer review information

Data availability

Code availability

Competing interests

Footnotes

Supplementary information

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Fig. 2. Linear prediction of the excitation dipole spectra of Cd₃₈Se₃₈-ZnPc-32(NH₂CH₃).

Fig. 3. Fourier transform of the time-dependent dipole moment of Cd₃₈Se₃₈-ZnPc-32(NH₂CH₃).

Fig. 5. Averaged Pearson correlation coefficient of Cd₃₃Se₃₃/Zn₉₃S₉₃-2(ZnPc) and Cd₃₃Se₃₃/Zn₉₃S₉₃-2(ZnPc)-DPA.