Skip to main content
Structural Dynamics logoLink to Structural Dynamics
. 2023 Jun 27;10(3):034103. doi: 10.1063/4.0000189

Projection to extract the perpendicular component (PEPC) method for extracting kinetics from time-resolved data

H Ki 1,2,1,2, J Gu 1,2,1,2, Y Cha 1,2,1,2, K W Lee 1,2,1,2, H Ihee 1,2,1,2,a)
PMCID: PMC10306411  PMID: 37388296

Abstract

Time-resolved x-ray liquidography (TRXL) is a potent method for investigating the structural dynamics of chemical and biological reactions in the liquid phase. It has enabled the extraction of detailed structural aspects of various dynamic processes, the molecular structures of intermediates, and kinetics of reactions across a wide range of systems, from small molecules to proteins and nanoparticles. Proper data analysis is key to extracting the information of the kinetics and structural dynamics of the studied system encrypted in the TRXL data. In typical TRXL data, the signals from solute scattering, solvent scattering, and solute–solvent cross scattering are mixed in the q-space, and the solute kinetics and solvent hydrodynamics are mixed in the time domain, thus complicating the data analysis. Various methods developed so far generally require prior knowledge of the molecular structures of candidate species involved in the reaction. Because such information is often unavailable, a typical data analysis often involves tedious trial and error. To remedy this situation, we have developed a method named projection to extract the perpendicular component (PEPC), capable of removing the contribution of solvent kinetics from TRXL data. The resulting data then contain only the solute kinetics, and, thus, the solute kinetics can be easily determined. Once the solute kinetics is determined, the subsequent data analysis to extract the structural information can be performed with drastically improved convenience. The application of the PEPC method is demonstrated with TRXL data from the photochemistry of two molecular systems: [Au(CN)2]3 in water and CHI3 in cyclohexane.

I. INTRODUCTION

Time-resolved x-ray liquidography (TRXL), also known as time-resolved x-ray solution scattering, has been established as a useful method for studying the structural dynamics and kinetics of chemical and biological reactions in the liquid solution phase.1–5 Molecular structures of reaction intermediates and their kinetics have been investigated with TRXL at third generation synchrotrons for a variety of molecules spanning from diatomic molecules to proteins and nanoparticles.6–28 Thanks to the recent development of x-ray free-electron lasers (XFELs), the time resolution of TRXL has reached the sub-picosecond regime29–38 and has even allowed for tracking time-dependent positions of nuclear wavepackets.34,36,39 Although the development of x-ray sources and facilities has greatly improved the time resolution and signal-to-noise ratio (SNR) of TRXL data, it is the development of appropriate data analysis methods that has enabled the expansion of the depth and breadth of information extracted from TRXL data.

The key step in the analysis of TRXL data is to determine the kinetics and structures of reaction intermediates, and many useful analysis methodologies have been developed.40–48 In typical TRXL data, the signals from solute scattering, solvent scattering, and solute–solvent cross scattering are mixed in the q-space,43 and the solute kinetics and solvent hydrodynamics (or solvent kinetics) are mixed in the time domain. For this reason, the data analysis of TRXL data is not trivial. Various methods that have been developed so far generally require prior knowledge of the molecular structures of candidate species involved in the reaction.7,10,13,43,49 Since such information is often unavailable; a typical data analysis often involves tedious trial and error.

In this work, we report a method to circumvent this situation. In this method named projection to extract the perpendicular component (PEPC), we take advantage of the fact that the scattering profiles of the solvent term are well-known. We show that by subtracting a scaled portion of the known solvent term from the TRXL signals, the resulting signals, i.e., the PEPC-treated signals, exhibit no contribution from the solvent kinetics, although they retain the intact solute kinetics. The absence of solvent kinetics in the PEPC-treated signals allows the straightforward determination of the solute kinetics without any prior knowledge of the molecular structures of the participating solute species.

II. RESULTS AND DISCUSSION

A. TRXL data

Usually, experimental difference scattering curves, ΔS(q, t)exp, obtained by subtracting the scattering curve at a negative time delay (tref) from the scattering curve at a positive time delay (t) of interest, as given in Eq. (1), are subject to data analysis,

ΔS(q,t)exp=S(q,t)expS(q,tref)exp. (1)

To extract the kinetics and structural dynamics information from ΔS(q, t)exp, the theoretical ΔS(q, t) and ΔS(q, t)theory need to be calculated and compared with ΔS(q, t)exp. For the purpose of analysis, we categorize the total scattering signal into the following three terms based on whether the atom belongs to the solute or solvent molecules. Accordingly, ΔS(q, t)theory comprises three components: (i) solute-only term [ΔS(q, t)solute], (ii) solute–solvent cross term [ΔS(q, t)cage], and (iii) solvent-only term [ΔS(q, t)solvent], as given in the following equation:43

ΔS(q,t)theory=ΔS(q,t)solute+ΔS(q,t)cage+ΔS(q,t)solvent. (2)

The solute-only term contains information about the change in the molecular structure of solute molecules, whose concentrations change according to the solute kinetics. The cage term contains information about the change in the cage structure according to the solute kinetics. The solvent-only term contains the changes in the arrangement of solvent molecules in the bulk solvent, mainly due to the changes in temperature and density of the solvent, which are caused because the photon energy absorbed by the solute molecules is transferred to the solvent molecules. The solute-only term can be calculated using the Debye equation. For this calculation, the molecular structure of the solute molecules needs to be known. Often, the structure from density functional theory (DFT) calculations is used as the starting structure, and the structural parameters can be refined through a fitting process that employs those structural parameters as fitting parameters. The cage term can be calculated using pair distribution functions, g(r), from molecular dynamics (MD) simulations. The sum of the solute-only term and the solute–solvent cross term can be categorized into the solute-related term, as follows:

ΔS(q,t)solrel=ΔS(q,t)solute+ΔS(q,t)cage, (3)

then, ΔS(q, t)theory can be written as the sum of ΔS(q, t)sol-rel and ΔS(q, t)solvent

ΔS(q,t)theory=ΔS(q,t)solrel+ΔS(q,t)solvent. (4)

The ΔS(q, t)sol-rel term can be expressed as the sum of the contributions of relevant solute species, i.e., the sum of difference scattering curves of different solute species. More specifically, it can be written as follows:

ΔS(q,t)solrel=1/R×k(fk(t)×SADSk(q)), (5)

where R is the ratio of the number of solvent molecules with respect to that of solute molecules, fk(t) is the molar fraction of the kth solute species among the total solute molecules at time t, and SADSk(q) is the solute-related term per unit concentration of the kth solute species. SADS stands for the species-associated difference scattering curve. As shown in Eq. (5), in a typical TRXL analysis, the solute-related term is multiplied by the factor of 1/R so that the term is scaled to indicate the amplitude of the signal per a solvent molecule, not per a solute molecule. In principle, Eq. (5) is applicable to general structural changes, even including the case of continuous structural changes. For such a continuous structural change, the structure at each time point would have its own SADSk(q) with its corresponding fk(t) be a delta function. SADSk(q) can be expressed as follows:

SADSk(q)=ΔSk(q)solute+ΔSk(q)cage, (6)
SADSk(q)=Sk(q)soluteS0(q)solute+Sk(q)cageS0(q)cage, (7)

where ΔSk(q)solute and ΔSk(q)cage are the difference solute-only term and the difference solute–solvent cross term of the kth solute species, respectively; Sk(q)solute and Sk(q)cage are the solute-only term and the solute–solvent cross term of the kth solute species, respectively; and S0(q)solute and S0(q)cage are the solute-only term and the solute–solvent cross term of reactants, respectively.

Actually, as expressed in Eqs. (3) and (6), the solvent cages surrounding the solute molecules, as well as the solute molecules, contribute to ΔS(q, t)sol-rel, thus to its component, SADSk(q). Here, for the simplicity of explanation, the term “solute species” is used as a term that includes both a solute species and its surrounding cage structure. We note that the kinetics of surrounding cage structure and solute molecules can be different, particularly at early time delays when non-equilibrium cage structures can be observed. For such a case, in Eq. (5), a large number of SADSk(q)'s are required to describe the continuous structural changes accompanying the non-equilibrium dynamics. Also the corresponding fk(t)'s would show more complex behavior on the t-axis to describe the kinetics of both solute molecules and surrounding cage structure, which are independent of each other.

The solvent-only term, ΔS(q, t)solvent, given in Eq. (8), consists of two basis differentials (∂S/∂T)ρ and (∂S/∂ρ)T,43,49 which can be obtained in a separate solvent heating experiment either by exciting the dye molecules in a dye solution50 or by directly exciting the vibrational overtone of the solvent molecules in a neat solvent using near-infrared excitation,51

ΔS(q,t)solvent=ΔT(t)×(S/T)ρ+Δρ(t)×(S/ρ)T. (8)

In this equation, (∂S/∂T)ρ is the change of the solvent scattering intensity in response to a temperature change at a constant density, (∂S/∂ρ)T is the change of the solvent scattering intensity in response to a density change at a constant temperature, and ΔT(t) and Δρ(t) are the time-dependent changes in the temperature and density of the solvent at the time delay t, respectively.

A combination of Eqs. (4), (5), and (8) gives the following relationship:43,49

ΔS(q,t)theory=1/R×k(fk(t)×SADSk(q))+ΔT(t)×(S/T)ρ+Δρ(t)×(S/ρ)T. (9)

The goal of data analysis is to determine (i) the kinetic framework of the reaction and related rate constants, which are manifested in fk(t), and (ii) the molecular structures of reaction intermediates, which are manifested in SADSk(q) that generate ΔS(q, t)theory in the best agreement with ΔS(q, t)exp.

B. Previous data analysis methods

In general, two types of information are desired to be extracted: (i) kinetics and (ii) molecular structures. The kinetic information, which includes the number of participating species and their associated rate constants, can also be obtained from time-resolved optical or vibrational spectroscopies. Once the kinetic information is obtained, the next step is to extract the structural information hidden in the TRXL data. The conventional analysis protocols for TRXL data can be classified into three methods: (i) linear combination fitting (LCF) with iterative least squares minimization (or iterative weighted least squares minimization), (ii) non-orthogonal decomposition (NOD), which is LCF with the direct method, and (iii) global fitting analysis (GFA).

LCF52 is one of the most straightforward approaches. In LCF, the difference curve at each time delay is fitted separately with a theoretical curve calculated using Eq. (9). Although fk(t)s are linked to ΔT(t) and Δρ(t) via the energy conservation and hydrodynamics equation,43 their relationships are ignored. Instead, fk(t), ΔT(t), and Δρ(t) are treated as independent fitting parameters. In the fitting, the chi-square (χ2) value representing the discrepancy between the experimental and calculated data is minimized via optimizing the fitting parameters, which are simply the proportionality coefficients of the contributing terms. Plotting the obtained proportionality coefficients of the solute terms, which correspond to fk(t)s, as a function of time displays the kinetic profiles of the solute molecules, and plotting those of the solvent heating terms, which correspond to ΔT(t) and Δρ(t), as a function of time delays displays how the temperature and density of the solvent changes with time, i.e., the hydrodynamic response of the solvent. This method requires prior knowledge on the identities of the solute molecules and their molecular structures because it is required to calculate SADSk(q)'s based on the identities and structures for the fitting. If the solute species are not known, all plausible candidate species should be tested.49

In the LCF method, the least squares refinement is performed for the difference curve at each time delay. Because the difference curve is a linear combination of various contributing terms, one can use linear algebra to directly determine the proportionality coefficients (direct method).53,54 In this way, the iterative least squares refinement can be bypassed. To distinguish this method from the LCF with the iterative minimization, we name the LCF with the direct method as “NOD.”47 In both LCF and NOD, the structural parameters of the candidate solute species can be varied, and in this way, the structure can also be refined. In terms of time and accuracy, NOD is advantageous over LCF with the least squares minimization, which always has a risk of being trapped in local minima.

In both LCF and NOD, the difference curve at each time delay is treated separately, with no constraints between the difference curves at different time delays. In reality, there are at least two constraints. First of all, the concentrations, fk(t), are not independent of one another. Instead, they should obey the kinetic rate equations of the reactions. Second, the total energy of the system should be conserved. The sum of the electronic and vibrational energy of various solute species, the energy released through radiative emissions, and the thermal energy dissipated to the bulk solvent should be conserved throughout the progress of the photoreaction via energy conservation. In GFA,9,10,20,30,43 these constraints can be taken advantage of by fitting the difference curves at all time delays simultaneously. Naturally, the fitting parameters are not the proportionality coefficients, fk(t), ΔT(t), and Δρ(t), at individual time delays, but the rate constants and some parameters required to calculate the change in the temperature and density of the solvent (such as the relative energy values of each chemical species participating in the reaction, thermodynamic parameters of the solvent, and the quantum yield of the reaction). In the fitting, the sum of the χ2 values at all time delays is minimized. Consequently, the total number of fitting parameters is greatly reduced. Nevertheless, this method requires prior knowledge of the solute species because SADSk(q)s are needed for the fitting, as in LCF and NOD. Moreover, a kinetic model needs to be assumed, and many such candidate kinetic models need to be tested via trial and error. The kinetic profiles from preliminary LCF or NOD analyses can provide a starting point for a plausible kinetic model.

C. Drawback of previous data analysis methods

In all three methods discussed in Sec. II B, the theoretical scattering curves of solute molecules, which are used to calculate SADSk(q)s, need to be known or at least be assumed. In some cases, the first guess of the plausible reaction pathways and the DFT structures of the corresponding reaction intermediates work properly. Nevertheless, as the unknowns and the complexity of the studied molecular system increase, a more systematic approach is required. The simplest and most straightforward way is to test all plausible scenarios of kinetic frameworks and structures. Such an approach often works suitably, but a more efficient approach is desirable.

At this point, it should be noted that the data analysis protocol for TRXL data from proteins is quite different from that for small molecules. For the protein data, the kinetics can be first determined using a method to extract the kinetic framework, even without knowing the structures of the protein intermediates, by treating the TRXL data as if they are time-resolved spectroscopic data.15,16,27,38,48 Then, the resulting species-associated difference scattering curves (SADSs) consistent with the determined kinetics are subject to the subsequent structural analysis. The kinetics of proteins can be more complicated than those of small molecules. For example, the K30D mutant of homodimeric hemoglobin (HbI) exhibits one of the most complicated kinetic frameworks studied so far with TRXL.48 Nevertheless, the kinetics can be determined owing to various methods developed for this purpose, such as singular value decomposition (SVD) with reduced time ranges (SVD-RTR)48 and SVD-aided pseudo-principal-component analysis (SAPPA).45 The reason why the kinetics can be readily determined without any prior knowledge or guess of the molecular structures of the solute species is as follows. In the case of the protein TRXL data, the solvent term can be easily subtracted from the experimental data to yield the solute-related term because the solvent term appears in a high-q region where the solute-related term does not show significant contribution in the high-q region.15,16,48 A ramification from this consideration of the protein TRXL data is that even for the small-molecule data, if the solvent term can be removed, the kinetics can be readily determined from the resulting solute-related data. Toward this goal, we reported the SVD-aided non-orthogonal decomposition (SANOD) method,47 where the components from singular value decomposition (SVD)55–57 are used as the terms to be used in NOD instead of the calculated solute terms, circumventing the requirement to have prior knowledge of the solute species and their structures. The fact that the solvent term is known greatly facilitates SANOD. Nevertheless, to use the SANOD method, all necessary components need to be assembled, making its application not straightforward.

D. A new method: Projection to extract the perpendicular component (PEPC)

In the discussion so far, it has been shown that the solute kinetic model can be easily determined if the solvent term can be removed from the TRXL data. Therefore, one straightforward approach would be to remove the solvent term. However, it is generally not possible to completely remove the contribution of the solvent term from the TRXL data even if the shape of the solvent term is known. To overcome this issue, our new strategy is to remove the solvent “kinetics” from the TRXL data instead of trying to remove the solvent term itself from the TRXL data. The schematic diagram of this new strategy is illustrated in Fig. 1(a). As long as the solvent kinetics can be removed without altering the solute kinetics, the solute kinetics can be readily determined from the resulting data as demonstrated in the case of the K30D HbI,48 as explained in Sec. II C. In the following, we first prove that such a task not only is possible but also can be performed with a simple mathematical operation based on vector calculation.

FIG. 1.

FIG. 1.

(a) Concept of the PEPC (projection to extract the perpendicular component) method. TRXL data obtained from an experiment [ΔS(q, t)exp, middle] are generally composed of multiple components. As a simple example, here, we assumed that ΔS(q, t)exp comprises a combination of the solvent-derived (∂S/∂T)ρ term (left, red) and the species-associated difference scattering curve (SADS, left, blue) arising from a single transient solute species. Each component contributes to ΔS(q, t)exp with their own kinetics, making the analysis of ΔS(q, t)exp complicated. The PEPC method facilitates the kinetic analysis of TRXL data by extracting the solute kinetics from ΔS(q, t)exp. This is achieved by removing the kinetic contribution of (∂S/∂T)ρ using prior knowledge on the shape of (∂S/∂T)ρ in q-space. (b) The signal obtained from the TRXL experiment and its components are conceptually considered as vectors and expressed by arrows. A signal obtained from the TRXL experiment [ΔS(q, t)exp, magenta] is composed of two different components: the solvent term [ΔS(q, t)solvent, red] and the solute-related term [solute term plus solute–solvent cage term, ΔS(q, t)sol-rel, blue]. In general, the shape of the solvent term in q-space (the direction of the vector) is well-known, and, therefore, the unit vector in this direction [(∂S/∂T)ρ/|(∂S/∂T)ρ|, black] is also well-known. The PEPC method removes the signal components that are parallel to the solvent term [ΔS(q, t)exp ∥ solvent, gray] from the total signal, leaving only the component perpendicular to the solvent term [ΔS(q, t)exp ⊥ solvent, green], ΔS(q, t)PEPC. (c) Illustration demonstrating that when the PEPC method is applied to a series of difference scattering curves measured at multiple time delays, the kinetics of the solvent term, that is, the increase or decrease in the amplitude of the solvent term with time, no longer contributes to the series of PEPC-treated difference scattering curves. Whereas the original TRXL data contain both the solute kinetics and solvent kinetics, the PEPC-treated curves contain only the solute kinetics, with no contribution from the solvent kinetics.

To explain this concept, let us first consider TRXL data at a given time delay as a vector. For example, if the number of q-points in ΔS(q, t) is nq, then ΔS(q, t) at a single time point (t) can be considered as a vector in a space of nq dimensions. Since ΔS(q, t) = ΔS(q, t)sol-rel + ΔS(q, t)solvent, both ΔS(q, t)sol-rel and ΔS(q, t)solvent can be also considered as vectors in the same space of nq dimensions. For the sake of simplicity, we can first consider vectors in a space of two dimensions for convenient visualization (Fig. 1). In addition, let us consider a simplistic case where ΔS(q, t)solvent has only the (∂S/∂T)ρ term, instead of both (∂S/∂T)ρ and (∂S/∂ρ)T terms, and ΔS(q, t)sol-rel contains the kinetics of only a single solute species,

ΔS(q,t)solvent=ΔT(t)×(S/T)ρ, (10)
ΔS(q,t)solrel=1/R×f1(t)×SADS1(q), (11)

where ΔT(t) is the solvent temperature change, (∂S/∂T)ρ is the solvent scattering change per unit temperature change, f1(t) is the molar fraction of the solute species at time t, and SADS1(q) is the solute-related term per unit concentration of the first solute species. In this simplistic case, the directions of ΔS(q, t)sol-rel and ΔS(q, t)solvent vectors do not change with time, whereas their magnitudes can change with time, making it much easier to explain the concept of our approach.

Here, a two-dimensional vector ΔS(q, t)exp is the vector sum of a two-dimensional vector ΔS(q, t)sol-rel and a two-dimensional vector ΔS(q, t)solvent, as shown in Fig. 1(b). In fact, determining ΔS(q, t)sol-rel and ΔS(q, t)solvent vectors is our goal, but the issue is that it is impossible to determine ΔS(q, t)sol-rel and ΔS(q, t)solvent vectors without prior knowledge. Here, one critical piece of prior knowledge is that we know the direction of the ΔS(q, t)solvent vector, which is determined by the shape of the solvent term [(∂S/∂T)ρ]. In other words, the solvent unit vector [(∂S/∂T)ρ/|(∂S/∂T)ρ|], in the direction of the ΔS(q, t)solvent vector, is known from (∂S/∂T)ρ. Therefore, we can decompose the original TRXL data [ΔS(q, t)exp] into components parallel [ΔS(q, t)exp ∥ solvent] and perpendicular [ΔS(q, t)exp ⊥ solvent] to the solvent unit vector using vector operations. For instance, the magnitude of the component ΔS(q, t)exp ∥ solvent can be obtained by taking the dot product of ΔS(q, t)exp and the solvent unit vector. Multiplying this magnitude to the solvent unit vector gives ΔS(q, t)exp ∥ solvent, which can be expressed as follows:

ΔS(q,t)expsolvent=(ΔS(q,t)exp·(S/T)ρ/|(S/T)ρ|)×(S/T)ρ/|(S/T)ρ|=proj(S/T)ρ/|(S/T)ρ|ΔS(q,t)exp=proj(S/T)ρΔS(q,t)exp, (12)

where the symbol · denotes the dot product, and projAB denotes the projection of a vector B to a vector A. This component parallel to the solvent unit vector [ΔS(q, t)exp ∥ solvent], which is the projection of the original data [ΔS(q, t)exp] to the solvent unit vector [(∂S/∂T)ρ/|(∂S/∂T)ρ|], is then subtracted from the original data [ΔS(q, t)exp] to obtain the component perpendicular to the solvent unit vector [ΔS(q, t)exp ⊥ solvent]. For this reason, we named this method to extract the perpendicular component as the “projection to extract the perpendicular component (PEPC).” The resulting PEPC-treated data, ΔS(q, t)PEPC, can be expressed as follows:

ΔS(q,t)PEPC=ΔS(q,t)expsolvent=ΔS(q,t)expΔS(q,t)expsolvent=ΔS(q,t)expproj(S/T)ρΔS(q,t)exp. (13)

Let us now consider a series of ΔS(q, t)exp vectors at multiple time delays. For typical TRXL experimental data, the magnitudes of both ΔS(q, t)sol-rel and ΔS(q, t)solvent change with time. Figure 1(c) shows that, even for such a case, the solvent kinetics is completely removed in ΔS(q, t)PEPC, and the solute kinetics, represented by the relative magnitudes of the original ΔS(q, t)sol-rel vectors, is retained in the ΔS(q, t)PEPC vectors.

So far, for heuristic purposes, the vectors were represented in two-dimensional space, and we considered simplistic cases where only the magnitudes of the ΔS(q, t)solvent and ΔS(q, t)sol-rel vectors change with time, whereas their directions do not because we intentionally supposed cases where each vector has only a single component. In real ΔS(q, t)exp vectors, the nq dimensions need to be considered instead of the two dimensions. Moreover, because the ΔS(q, t)solvent vectors have the (∂S/∂ρ)T component as well as the (∂S/∂T)ρ component, both the direction and magnitude of the ΔS(q, t)solvent vectors change with time. Furthermore, because the ΔS(q, t)sol-rel vectors can contain contributions from multiple solute species, their directions as well as their magnitudes will change with time. In such a general case, ΔS(q, t)exp can be expressed as ΔS(q, t)theory given in Eq. (9), as follows:

ΔS(q,t)exp=1/R×k(fk(t)×SADSk(q))+ΔT(t)×(S/T)ρ+Δρ(t)×(S/ρ)T. (14)

Of course, at this stage, ΔT(t), Δρ(t), fk(t)'s, and SADSk(q)'s are not known. Still, conceptually, SADSk(q) can be decomposed into the components parallel and perpendicular to (∂S/∂T)ρ and (∂S/∂ρ)T, as follows:

SADSk(q)=SADSk,(q)+SADSk,(q)=dk,T×(S/T)ρ+dk,ρ×(S/ρ)T+SADSk,(q). (15)

Here, SADSk, ∥(q) represents the projection of SADSk(q) onto a plane containing two vectors (∂S/∂T)ρ and (∂S/∂ρ)T, and the coefficients dk,T and dk, ρ correspond to the contributions of (∂S/∂T)ρ and (∂S/∂ρ)T, respectively, to SADSk, ∥(q). SADSk, ⊥(q) denotes the component of SADSk(q) perpendicular to both (∂S/∂T)ρ and (∂S/∂ρ)T. A detailed description of the procedure and the mathematical background for decomposing SADSk(q) into SADSk, ∥(q) and SADSk, ⊥(q) as well as the process for determining the coefficients dk, T and dk, ρ are provided in section “Mathematical background for the decomposition of SADSk into SADSk, and SADSk,” of the supplementary material. Substituting Eq. (15) into Eq. (14) gives the following equation:

ΔS(q,t)exp=1/R×k(fk(t)×SADSk,(q))+ΔT(t)+1/R×k(fk(t)×dk,T)×(S/T)ρ+Δρ(t)+1/R×k(fk(t)×dk,ρ)×(S/ρ)T. (16)

Via PEPC, the vector component parallel to (∂S/∂T)ρ and (∂S/∂ρ)T, which is ΔT(t)+1/R×k(fk(t)×dk,T)×(S/T)ρ+Δρ(t)+1/R×k(fk(t)×dk,ρ)×(S/ρ)T, is subtracted from the ΔS(q, t)exp vector, yielding the vector component perpendicular to (∂S/∂T)ρ and (∂S/∂ρ)T, as follows:

ΔS(q,t)PEPC=1/R×k(fk(t)×SADSk,(q))=1/R×k(fk(t)×(SADSk(q)dk,T×(S/T)ρdk,ρ×(S/ρ)T)). (17)

In ΔS(q, t)PEPC, the amounts of the subtracted solvent term are ΔT(t)+1/R×k(fk(t)×dk,T) and Δρ(t)+1/R×k(fk(t)×dk,ρ) for (∂S/∂T)ρ and (∂S/∂ρ)T, respectively. These amounts deviate from the correct ones, ΔT(t) and Δρ(t), by 1/R×k(fk(t)×dk,T) and 1/R×k(fk(t)×dk,ρ), respectively. In other words, the solvent term is subtracted more than necessary, making the shapes of ΔS(q, t)PEPC in the q-space altered from the true vector, 1/R×k(fk(t)×SADSk(q)). Nevertheless, as can be seen in Eq. (17), the terms contributing to the solvent kinetics, such as ΔT(t) and Δρ(t), are completely removed, and only the terms contributing to the solute kinetics, fk(t), are left in ΔS(q, t)PEPC. Therefore, a kinetic analysis such as the one applied to the TRXL data for HbI K30D48 can be used to extract kinetics from ΔS(q, t)PEPC without any prior knowledge of the solute species and their molecular structures.

E. Extracting the solute kinetics from ΔS(q, t)PEPC

Once the ΔS(q, t)PEPC data are obtained via PEPC, the subsequent kinetic analysis is straightforward. Here, we provide a brief description. The first step is to use SVD on ΔS(q, t)PEPC to decompose ΔS(q, t)PEPC into the time-independent components in the q-space (left singular vectors, LSVs) and their temporal profiles in the time axis (right singular vectors, RSVs). If the experimental data have nq q-points at each time delay and nt time delays, the SVD of these data gives a total of nt LSVs and a total of nt RSVs. However, in general, most of these LSVs and RSVs express only the noise of the experimental data, and the number of components expressing a meaningful shape on the experimental data is limited to only a few. For this reason, we first use the low-rank approximation to determine the number of meaningful components and discard the remaining noise from the experimental data prior to the kinetic analysis. The relative contributions of the decomposed vectors are represented by their singular values. Based on these singular values and the autocorrelation values, the number of significant components is determined.

The significant RSVs that survive the low-rank approximation are then fitted simultaneously with a sum of exponential functions sharing common time constants. The number of kinetic components required to satisfactorily fit RSVs and the number of significant singular vectors from the low-rank approximation play important roles as constraints in the subsequent kinetic analysis to determine the kinetic frameworks. A good example can be found in recent work on the K30D mutant of HbI.

F. Application of PEPC to real data

To demonstrate the effectiveness of the PEPC method, we applied the technique to actual experimental data obtained from the photoreaction of the gold trimer complex (GTC), [Au(CN)2]3 in water.30 The results of GFA of the data have been reported.30 The photoreaction pathway revealed from the GFA, including the molecular structure of the reactants and intermediates, as well as the time constants of the reaction pathways, is shown in Fig. 2(a). We employed the PEPC technique to process the original ΔS(q, t)exp, which led to the generation of PEPC-treated ΔS(q, t)PEPC data as depicted in Fig. 2(b). For PEPC, the signal component which is parallel to (∂S/∂T)ρ of water is removed from ΔS(q, t)exp. Normally, the solvent term consists of two signal components (∂S/∂T)ρ and (∂S/∂ρ)T, and, thus, the signal component which is parallel to either of the two signal components should be removed in the PEPC method. Nevertheless, here, only one of the two components (∂S/∂ρ)T is considered and removed in the PEPC method because (∂S/∂T)ρ and (∂S/∂ρ)T for water have similar shapes.30 The shape of (∂S/∂T)ρ is obtained from a separate experiment using a dye solution (FeCl3 dissolved in water). It should be noted that, in Fig. 2 and in all subsequent figures, the difference scattering curves, ΔS(q, t), are displayed in the form of qΔS(q, t), i.e., ΔS(q, t) multiplied by the magnitude of the momentum-transfer vector, q, to better visualize the small signals at large q values. SVD analysis was performed to investigate the solute-only kinetics in ΔS(q, t)PEPC. Examination of the singular values, autocorrelation values, LSVs, and RSVs indicates that the first three components make a substantial contribution to ΔS(q, t)PEPC (Fig. S1). The three major RSVs obtained from the SVD analysis of ΔS(q, t)PEPC are shown in Fig. 2(c). A satisfactory fit of the RSVs of ΔS(q, t)PEPC gives three time constants of 1.7 ± 0.1 ps, 1.0 ± 0.1 ns, and 114 ± 3 ns. The time constants are in excellent agreement with those identified from the previous study using GFA (1.6 ± 0.1 ps, 3.0 ± 0.5 ns, and 100 ± 20 ns)30 except for some discrepancy in the second time constant. The three components and three time constants immediately suggest a kinetic model with three species in sequential transitions. Hence, we used this kinetic model and applied kinetics-constrained analysis (KCA) to obtain the SADSs of the three species. We note that when KCA is applied to extract SADSs, a specific kinetic model is applied, and SADSs compatible with the applied kinetic model are extracted. The extracted SADSs are not equal to LSVs. Instead, each SADS must be a linear combination of LSVs. The resulting SADSs (SADSPEPC) are shown in Fig. 2(d) (black solid lines). When compared to the SADSs for the three intermediates identified from the previous GFA [SADSreal, Fig. 2(d), red solid lines],30 at first glance, the shape of each SADSPEPC is noticeably different from its corresponding SADSreal. Nevertheless, the determined kinetic framework and the time constants are consistent with the reported ones obtained from the GFA. The excellent agreement shows that the solute kinetics can be easily obtained from ΔS(q, t)PEPC. One might notice that the contour plot of the PEPC-treated data in Fig. 2(b) [and Figs. 3(a) and 4(c) to be presented later] exhibits some horizontal stripes. We note that these stripes are not generated by the PEPC procedure but rather are present in the original data, ΔS(q, t)exp, except for the case of Fig. 3(a). The stripes are less visible in ΔS(q, t)exp due to its higher overall signal amplitude compared to that of ΔS(q, t)PEPC. In the case of Fig. 3(a), the horizontal stripes were significantly enhanced with the PEPC procedure. However, the increase in the amplitude of the artifact features is not inherent to the PEPC method itself but is a result of the poor signal-to-noise ratio of the signal components used in PEPC, particularly ΔS(q, t = 100 ps)exp. A more in-depth discussion of this issue is presented in the “Horizontal stripes in the contour plot of the PEPC-treated data” section of the supplementary material.

FIG. 2.

FIG. 2.

(a) Photoreaction mechanism of [Au(CN)2]3 (gold trimer complex, GTC) in water, adapted from previous work.30 The molecular structures of the reactants and intermediates involved in the photoreaction are shown, along with the time constants and their errors. (b) ΔS(q, t)exp from a TRXL experiment on [Au(CN)2]3 in water,30 and ΔS(q, t)PEPC obtained by applying the PEPC method to ΔS(q, t)exp. The kinetic contribution of (∂S/∂T)ρ of water was eliminated using PEPC. (c) The three right singular vectors (RSVs) obtained from the SVD analysis of ΔS(q, t)PEPC. The dots represent the RSVs from experimental data. The three RSVs were globally fitted using a sum of three exponential functions convoluted with a Gaussian instrumental response function (IRF), yielding time constants of 1.7 ps, 1.0 ns, and 114 ns. The lines represent the theoretical fitting curves. (d) The three SADSs obtained from the kinetic analysis of ΔS(q, t)PEPC (SADSPEPC, black solid lines) are compared with those from the global fitting analysis (GFA) of ΔS(q, t)exp (SADSreal, red solid lines). (e) Three corrected SADSs obtained by correcting the amount of (∂S/∂T)ρ excessively removed in SADSPEPC's (SADSPEPC, corr, black dash lines) are compared with those from the GFA of ΔS(q, t)exp (SADSreal, red solid lines). Each SADSPEPC, corr is obtained by optimizing the weight, α, of (∂S/∂T)ρ for each SADSPEPC so that to minimize the discrepancy between the sum of SADSPEPC and α × (∂S/∂T)ρ and its corresponding SADSreal. (f) Time-dependent changes of solvent temperature obtained by fitting ΔS(q, t)exp for each t with the sum of the contributions of three SADSPEPC, corr and ΔT(t) × (∂S/∂T)ρ, where ΔT(t) is the fitting parameter for each t.

FIG. 3.

FIG. 3.

(a) ΔS(q, t)exp from a TRXL experiment on [Au(CN)2]3 in water,30 and ΔS(q, t)PEPC obtained by applying the PEPC method to ΔS(q, t)exp. The contour plot shown for ΔS(q, t)exp is the same as that shown in Fig. 2(b). Here, the kinetic contributions of two components, (∂S/∂T)ρ of water and ΔS(q, t = 100 ps)exp, were eliminated using PEPC, leaving only the signal component perpendicular to the two components. The PEPC method is applied to two subsets of ΔS(q, t)exp, each corresponding to t < 100 ps [ΔS(q, t)XFEL] and t ≥ 100 ps [ΔS(q, t)synch], yielding ΔS(q, t)XFEL, PEPC and ΔS(q, t)synch, PEPC, respectively. (b) and (c) The first RSVs obtained from the SVD analysis of (b) ΔS(q, t)XFEL, PEPC and (c) ΔS(q, t)synch, PEPC. For (b), the first RSV was fitted using an exponential decay function with zero amplitude at t < 0, convoluted with a Gaussian IRF, yielding a time constant for the decay of 1.54 ps. For (c), the first RSV was fitted by the sum of an exponential rise function and an exponential decay function with time constants of 1.14 and 81 ns, respectively.

FIG. 4.

FIG. 4.

(a) Reaction pathways of CHI3 photolysis in methanol and cyclohexane, adapted from the literature.9,20 (b) Potential signal components for CHI3 photolysis in cyclohexane, predicted based on the null hypothesis that the reaction pathway for cyclohexane is the same as that assigned for methanol. (c) ΔS(q, t)exp from a TRXL experiment on CHI3 in cyclohexane,20 and ΔS(q, t)PEPC obtained by applying the PEPC method to ΔS(q, t)exp. The contributions of the four signal components shown in (b) are removed from ΔS(q, t)exp via PEPC. The remaining signal in ΔS(q, t)PEPC indicates that the null hypothesis is not true, and the reaction pathways are different for cyclohexane and methanol. (d) The first RSV obtained from the SVD analysis of ΔS(q, t)PEPC (dots). The kinetics of the first RSV agrees well with the time-dependent concentration profile of iso-CHI2–I retrieved using GFA (solid line), indicating that the remaining signal in ΔS(q, t)PEPC is due to the contribution of the reaction pathway yielding iso-CHI2–I. (e) The first LSV obtained from the SVD analysis of ΔS(q, t)PEPC (black solid line). The LSV is compared with the SADS corresponding to the formation of iso-CHI2–I [SADS(iso-CHI2–I), red solid line]. Although the first LSV did not correspond well with SADS(iso-CHI2–I), the LSVcorr (black dashed line), which was adjusted for the contributions of the signal components excessively removed during the PEPC procedure, showed excellent agreement with SADS(iso-CHI2–I). The LSVcorr is described as the sum of five terms, LSV, α × (∂S/∂T)ρ, β × (∂S/∂ρ)T, γ × SADS(CHI3 → CHI2 + I), and δ × SADS(CHI3 → CHI2 + ½ I2) where α, β, γ, and δ are the fitting parameters. (f) and (g) The time traces of ΔT(f) and Δρ(g), retrieved from the correct kinetic framework that includes the contribution of iso-CHI2–I (dots). Once the kinetics of the reaction is determined using PEPC and the solute-related term for each relevant chemical species is determined through a subsequent structural analysis, the reconstruction of the hydrodynamic response of the solvent is straightforward. ΔT and Δρ for each time delay can be retrieved as described in the text. The resulting ΔT and Δρ match those determined using GFA (solid lines).

G. Structural analysis using the SADSs from ΔS(q, t)PEPC

When the kinetic model is determined in this way, SADSs corresponding to each species suitable for the model can be obtained. By applying structural analysis to these SADSs, information on the structure of each species can be obtained. It should be noted that the PEPC process preserves the solute kinetics but alters the shapes of SADSs in q-space, as mentioned in Sec. II D. Therefore, the alteration of SADSs needs to be taken into account in the structural analysis based on the SADSs obtained from ΔS(q, t)PEPC, denoted as SADSk(q)PEPC. The alteration of the shapes of SADSk(q)PEPC is caused because the solvent term is excessively removed in the PEPC process, as given in Eq. (17). Here, SADSk(q)PEPC and SADSk(q)real correspond to SADSk, ⊥(q) and SADSk(q) in Eq. (17), respectively.

This relation is expressed in the following equation:

SADSk(q)PEPC=SADSk,(q)=SADSk(q)dk,T×(S/T)ρdk,ρ×(S/ρ)T=SADSk(q)realdk,T×(S/T)ρdk,ρ×(S/ρ)T. (18)

This equation shows that in SADSk(q)PEPC, the (∂S/∂T)ρ term is further removed by dk, T × (∂S/∂T)ρ, and the (∂S/∂ρ)T term is further removed by dk, ρ × (∂S/∂ρ)T. Considering this, it is necessary to conduct structural analysis using the amount of solvent term excessively removed in the PEPC process as a fitting parameter. The corrected SADSs can be expressed as follows:

SADSk(q)corr=SADSk(q)PEPC+gk×(S/T)ρ+hk×(S/ρ)T. (19)

Here, gk and hk are the coefficients for (∂S/∂T)ρ and (∂S/∂ρ)T, respectively, and ideally should be the same as dk, T and dk, ρ, respectively. Then, in the fitting process, the χ2 value representing the discrepancy between SADSk(q)corr and SADSk(q) is minimized using gk and hk as well as the structural parameters of intermediates and reactants which are used to calculate Sk(q)solute, S0(q)solute, Sk(q)cage, and S0(q)cage in Eq. (7), as fitting parameters. By correcting the contribution of the solvent term that is excessively removed in the PEPC process, specifically gk × (∂S/∂T)ρ and hk × (∂S/∂ρ)T, the shape of the difference scattering curve for each species obtained from the GFA technique30 can be reproduced [Fig. 2(f)]. Here, the contribution of hk × (∂S/∂ρ)T is ignored for this specific example with water as a solvent. It is noteworthy that, despite adjusting gk and hk to minimize the discrepancy between SADSk(q)corr and SADSk(q), the satisfactory fitting of SADSk(q)corr to SADSk(q) is only achieved for the correct structure of the species, thus enabling the identification of the species and optimization of the detailed molecular structure. For instance, Fig. S5 highlights that SADS1(q)corr is only consistent with SADS calculated for the structure corresponding to the T1′ state and exhibits a noticeable deviation from those calculated for the structures corresponding to the T1 state or the tetramer (Fig. S5, left panels). Similarly, SADS2(q)corr and SADS3(q)corr match only to SADS calculated for the structures corresponding to the T1 state and the tetramer, respectively (Fig. S5, middle and right panels). These findings unequivocally demonstrate that SADSk(q)corr accurately matches SADS calculated only for the correct molecular structure of the species. In other words, despite the altered shape of SADSk(q)PEPC in q-space, it remains feasible to extract the relevant structural information corresponding to each species by analyzing SADSk(q)PEPC.

With this, both the kinetic analysis and the structural analysis for the solute term are completed. Here, we note that consideration of gk and hk in the structural fitting using PEPC-treated data does not increase the complexity of the structural fitting. In other words, gk and hk are not independent fitting parameters in the structural fitting but are functions expressed by independent fitting parameters. Accordingly, the structural analysis of a SADSk(q)PEPC has the same level of complexity (the number of independent fitting parameters) as the structural analysis of a correct SADS, a SADSk(q), that is not altered by the PEPC process. The method for determining the coefficients gk and hk is described in section “Determination of coefficients gk and hk during structural analysis” of the supplementary material.

The last remaining step is to reconstruct the information about the hydrodynamic response of the solvent (solvent kinetics). The analysis of solvent kinetics is straightforward once the solute-related term, ΔS(q, t)sol-rel, is determined from the kinetic analysis and structural analysis. At this stage, fk(t) for the solute-related term in Eq. (5) is obtained through kinetic analysis. The subsequent structural analysis refines the molecular structures and determines SADSk(q). The solvent-only term, ΔS(q, t)solvent = ΔT(t) × (∂S/∂T)ρ + Δρ(t) × (∂S/∂ρ)T, can be determined by subtracting ΔS(q, t)sol-rel, i.e., 1/R × Σk[fk(t) × SADSk(q)], from ΔS(q, t)exp, as shown in Eq. (14). ΔT(t) and Δρ(t) can be retrieved from the ΔS(q, t)solvent by simply performing NOD of ΔS(q, t)solvent as a linear combination of two terms (∂S/∂T)ρ and (∂S/∂ρ)T. In the case of the [Au(CN)2]3 in water shown earlier, the profile of ΔT(t) reconstructed through the corresponding process is shown in Fig. 2(f). Note that the two terms (∂S/∂T)ρ and (∂S/∂ρ)T for the solvent response have similar shapes in q-space, especially in the case of water, making it difficult to distinguish the contributions of the two terms in ΔS(q, t)solvent. Accordingly, here, the hydrodynamics response is described with only the (∂S/∂T)ρ term, as in Eq. (10), instead of the linear combination of the two terms, and the resulting ΔT(t) is indicated. In general, except for water, the two terms (∂S/∂T)ρ and (∂S/∂ρ)T for the solvent response have distinct shapes in q-space, and, therefore, the contribution of each term in the solvent response can be easily determined. An example of such a general case, CHI3 in cyclohexane, is given in Figs. 4(f) and 4(g).

H. PEPC for simultaneous removal of the contributions of multiple signal components

As demonstrated in Sec. II F, PEPC is a powerful tool for eliminating the contribution of a single component, such as (∂S/∂T)ρ. In fact, PEPC can also be used to simultaneously remove multiple signal components, making it a versatile method with applications extending beyond solvent subtraction. For example, PEPC can be employed to eliminate the contributions of known, trivial reaction intermediates, allowing for the isolation of potential contributions from unknown intermediates that may be hidden in experimental signals. Additionally, PEPC can be applied to eliminate the kinetic contributions of an intermediate dominating at a specific time delay, enabling the extraction of contributions from other intermediates in the reaction.

Prior to discussing applications of PEPC, we provide an overview of its theoretical principles. We describe how PEPC can eliminate the contributions of multiple signal components, which we refer to as “trivial components” for brevity, and provide a mathematical representation of the resulting PEPC-treated signal. The vector space spanned by these trivial components is referred to as the “trivial space (TS).” Assuming the presence of m trivial components, denoted as {trv1, trv2, …, trvm}, PEPC eliminates signal components parallel to these m trivial components and retains only the component perpendicular to all m components. The relation between ΔS(q, t)exp and ΔS(q, t)PEPC can be expressed as follows:

ΔS(q,t)exp=i=1mwi×trvi+ΔS(q,t)PEPC. (20)

Here, wi represents the weight of ith trivial component, trvi, removed via PEPC. As illustrated in Fig. 1(c), the kinetics of a trivial component does not contribute in the direction perpendicular to that component. Therefore, by finding ΔS(q, t)PEPC that is perpendicular to all m trivial components, the kinetics of the signal components other than the m trivial components can be extracted. This can be achieved by satisfying the following equation for all i:

ΔS(q,t)PEPC·trvi=0. (21)

The main concern is whether a set {w1, w2, …, wn} for Eq. (20) that satisfies Eq. (21) for all i can be readily calculated.

In fact, the set {w1, w2, …, wn} that satisfies Eq. (21) for all i can be readily obtained by calculating the mathematical projection. The projection of a vector v onto a subspace U, spanned by a set of vectors {u1, u2, …, un}, denoted by projUv, provides the vector component that lies within the subspace, while the residual of the projection, i.e., v − projUv, gives the component perpendicular to all vectors, u1, u2, …, and un.58 Thus, the first term on the right-hand side of Eq. (20) can be obtained by calculating the projection, projTSΔS(q, t)exp, and the second term, ΔS(q, t)PEPC, that satisfies Eq. (21) can be obtained by calculating the residual of the projection, ΔS(q, t)exp − projTSΔS(q, t)exp. Mathematically, this can be expressed as follows:

ΔS(q,t)PEPC=ΔS(q,t)expi=1mwi×trvi=ΔS(q,t)expprojTSΔS(q,t)exp. (22)

Equation (22) shows how PEPC can eliminate the contributions of multiple trivial components, retaining only the component perpendicular to the trivial components by removing signal components parallel to the trivial components. Mathematically, this PEPC treatment can be expressed using a projection operator. However, the projection formula may not be very intuitive in practice. To address this, we propose a practical and intuitive method based on a mathematical property of the projection operator: the projection, projUv, is given by the linear combination of {u1, u2, …, un} that minimizes the magnitude of the residual, v − projUv.58 By utilizing this property, we can obtain the projection, projTSΔS(q, t)exp, by finding the linear combination of {trv1, trv2, …, trvm} that minimizes the magnitude of the residual of the projection, ΔS(q, t)PEPC. Based on this, the projection, projTSΔS(q, t)exp, can be obtained by finding the least squares solution of the following system of linear equations:59–61

ΔS(q,t)exp=i=1mxi×trvi. (23)

Here, xi represents the weight of ith trivial component, trvi. Note that the least squares solution, which we denote as {wLS, 1, wLS, 2, …, wLS, m}, can be obtained unambiguously based on well-established numerical algorithms.62 Once the least squares solution is determined, ΔS(q, t)PEPC can be derived as follows:

projTSΔS(q,t)exp=i=1mwLS,i×trvi, (24)
ΔS(q,t)PEPC=ΔS(q,t)expprojTSΔS(q,t)exp=ΔS(q,t)expi=1mwLS,i×trvi. (25)

This approach based on least squares fitting provides a straightforward and practical method for numerically calculating the PEPC-treated signal, ΔS(q, t)PEPC, for multiple trivial components, {trv1, trv2, …, trvm}.

The kinetic analysis of ΔS(q, t)PEPC resulting from the PEPC treatment with multiple trivial components can be performed in the same way as for ΔS(q, t)PEPC treated with a single trivial component, namely, (∂S/∂T)ρ. Once the kinetic framework is established from the kinetic analysis, the SADS corresponding to the kinetic framework can be obtained using KCA. To extract molecular structural information, the resulting SADSk(q)PEPC can be analyzed similarly, with corrections similar to those in Eq. (19) but with the equation modified as follows:

SADSk(q)corr=SADSk(q)PEPC+i=1mαi,k×trvi. (26)

Here, αi, k is the weight of the correction for the ith trivial component, trvi. By employing Eq. (26), the molecular structure of each species can be optimized by minimizing the deviation between the experimentally obtained SADSk(q)corr, which has been corrected for the distortion from PEPC treatment, and the calculated SADSk(q)real derived from the molecular structure.

I. Application of PEPC to determine the kinetic framework

To showcase the versatility of PEPC, particularly in handling multiple trivial components, we utilized it in analyzing two distinct datasets: one for the photoreaction of [Au(CN)2]3 in water30 and the other for the photoreaction of CHI3 in cyclohexane.20 For [Au(CN)2]3 in water, the TRXL data consist of two sub-datasets measured at two different facilities: a synchrotron and an XFEL.30 The data corresponding to t ≥ 100 ps [ΔS(q, t)synch] and t < 100 ps [ΔS(q, t)XFEL] were obtained from the experiments at the synchrotron and XFEL, respectively. Typically, a synchrotron experiment using the same sample is performed before the XFEL experiment. In this case, the preliminary information, ΔS(q, t)synch, can be used when analyzing ΔS(q, t)XFEL to identify any contributions from short-lived intermediates that were not identified in ΔS(q, t)synch. Moreover, when analyzing ΔS(q, t)synch, PEPC can be used to identify any contributions from long-lived intermediates other than those present at the earliest time delay, t = 100 ps, by removing the contribution of the intermediate present at t = 100 ps from the entire ΔS(q, t)synch. Here, in contrast to the PEPC process illustrated in Fig. 2(b), the contribution of ΔS(q, t = 100 ps)exp is removed, in addition to the contribution of solvent heating signal (∂S/∂T)ρ. As a result, only the signal component perpendicular to these two components remains after removing the signal components parallel to (∂S/∂T)ρ and ΔS(q, t = 100 ps)exp. The resulting ΔS(q, t)PEPC from ΔS(q, t)XFEL and ΔS(q, t)synch is shown in Fig. 3(a) as ΔS(q, t)XFEL, PEPC and ΔS(q, t)synch, PEPC, respectively. The observation of a remaining signal in ΔS(q, t)XFEL, PEPC and ΔS(q, t)synch, PEPC suggests the presence of reaction intermediates other than those exists at t = 100 ps, which was previously assigned as an intermediate in the T1 state based on GFA.30,34 SVD analysis revealed that each of ΔS(q, t)XFEL, PEPC and ΔS(q, t)synch, PEPC exhibits only one significant LSV (Figs. S6 and S7), indicating the presence of an additional early intermediate before the intermediate in the T1 state within the t < 100 ps range and the other late intermediate in the t > 100 ps range. The significant RSVs are shown in Figs. 3(b) and 3(c). Kinetic analysis of the RSV of ΔS(q, t)XFEL, PEPC reveals that the RSV rises instantaneously within the instrument response function (IRF, 480 fs) of the experiment and decays with a single time constant of 1.54 ± 0.07 ps. Obviously, this observation indicates that the early intermediate (an intermediate in the T1′ state)34 forms within the IRF and transforms to the intermediate in the T1 state with a 1.54 ps time constant. For ΔS(q, t)synch, PEPC, the RSV rises with a 1.14 ± 0.07 ns time constant and decays with an 81 ± 4 ns time constant, indicating that the T1 intermediate transforms to the other late intermediate (a tetramer)30 with 1.14 ns, and the late intermediate decays to ground state with 81 ns.

From the description of the kinetic analysis earlier, one would notice that the kinetic analysis of two sub-datasets, ΔS(q, t)XFEL, PEPC and ΔS(q, t)synch, PEPC, is even more straightforward than that of ΔS(q, t)PEPC demonstrated in Sec. II F. This is due to two reasons. First, the data are divided into two subsets and analyzed separately, thereby reducing the numbers of signal components and time constants in each sub-dataset. Specifically, ΔS(q, t)XFEL, PEPC has one signal component and one time constant, and ΔS(q, t)synch, PEPC has one signal component and two time constants, compared to the three signal components and three time constants in the entire ΔS(q, t)PEPC shown in Fig. 2(b). However, the advantage of dividing the data into subsets for kinetic analysis will not be further discussed in this study, as it is not relevant to the benefit of using PEPC. Second, and more importantly, PEPC eliminates the contribution of an additional trivial component, i.e., ΔS(q, t = 100 ps)exp, reducing the number of signal components, i.e., the number of relevant chemical species contributing to the signal. Thus, applying PEPC reduces the number of solute signal components in ΔS(q, t)XFEL and ΔS(q, t)synch from two [T1′ and T1 in ΔS(q, t)XFEL and T1 and tetramer in ΔS(q, t)synch] to one [T1′ only in ΔS(q, t)XFEL and tetramer only in ΔS(q, t)synch]. Consequently, the kinetics of the remaining solute species, T1′ and tetramer, is more clearly exposed in the resulting significant RSVs of ΔS(q, t)XFEL, PEPC and ΔS(q, t)synch, PEPC, respectively, facilitating the kinetic analysis.

The photoreaction of CHI3 presents a fundamental question concerning whether the reaction pathway depends on the solvent, and if so, how. Therefore, it is crucial to determine whether the reaction pathways differ in different solvents. Previous studies examined the reaction pathway in methanol using TRXL [Fig. 4(a), pathways wrapped in a blue dashed line], identifying that only the dissociation channel yielding the CHI2 radical and I radical or the iodine molecule is active.9 Afterward, the same reaction was examined in a different solvent, cyclohexane, using TRXL.20 In the study, the reaction pathway in cyclohexane was identified by using GFA, confirming that the pathway is different in cyclohexane [Fig. 4(a), pathways wrapped in a red dashed line].

If one were to analyze the TRXL data for cyclohexane, again, an essential question to address is whether the reaction pathway differs from that observed in methanol. To address this question using GFA, one would need to verify whether the fitting using the kinetic framework determined for methanol works well with the TRXL data for cyclohexane, which requires optimizing several parameters such as rate constants and excitation ratio. However, such a verification task can be tedious and time-consuming. In this study, we demonstrate that PEPC offers a more straightforward approach for confirming any differences in reaction pathways between solvents.

The key is that, by applying PEPC, the contributions of the signal components corresponding to the reaction pathways identified for methanol can be removed from the TRXL data for cyclohexane, allowing for an assessment of whether the reaction pathway differs in the two solvents. Under the null hypothesis that the reaction pathway for cyclohexane is the same as that assigned for methanol, it can be predicted that the two solute-related terms corresponding to the formation of CHI2 and the iodine radical [Fig. 4(b), blue solid line] and the subsequent transformation of the iodine radical to the iodine molecule [Fig. 4(b), green solid line] as well as the two solvent terms of cyclohexane [Fig. 4(b), black and red solid lines for (∂S/∂T)ρ and (∂S/∂ρ)T of cyclohexane, respectively] would contribute to the TRXL signal for the photolysis of CHI3 in cyclohexane. PEPC enables removal of the contributions of these four signal components from ΔS(q, t)exp, and the resulting ΔS(q, t)PEPC can be used to test the null hypothesis. If the null hypothesis is true, then there would be no meaningful signal in the ΔS(q, t)PEPC. However, if the reaction pathway is different and a signal component not identified for methanol exists, then a significant amplitude of the signal would remain in the ΔS(q, t)PEPC, indicating the presence of a distinct signal component that was not be completely subtracted during the PEPC process.

The contour plots of the measured ΔS(q, t)exp for cyclohexane and the ΔS(q, t)PEPC are depicted in Fig. 4(c). The presence of a remaining signal in ΔS(q, t)PEPC suggests that the null hypothesis is invalid, and the reaction pathways differ for cyclohexane and methanol. Further investigation, involving kinetic and structural analysis of the residual signal in ΔS(q, t)PEPC, reveals that the residual signal originates from the pathway that produces an isomer of CHI3, iso-CHI2–I,20 a pathway that is inactive in methanol9 [Fig. 4(a)]. SVD analysis of ΔS(q, t)PEPC results in a single major component (Fig. S8). Regarding kinetics, the major RSV obtained from the SVD analysis [Fig. 4(d), dots] matches with the time-dependent concentration profile of iso-CHI2–I, obtained using GFA20 [Fig. 4(d), solid line], indicating that the residual signal in ΔS(q, t)PEPC is due to iso-CHI2–I. Furthermore, regarding structural analysis, the LSVcorr [Fig. 4(e), black dashed line] calculated based on Eq. (26), i.e., LSVcorr = LSV + α × (∂S/∂T)ρ + β × (∂S/∂ρ)T + γ × SADS(CHI3→CHI2 + I) + δ × SADS(CHI3→CHI2 + ½ I2), where α, β, γ, and δ are the fitting parameters, agrees excellently with the SADS calculated for iso-CHI2–I20 [SADS(iso-CHI2–I), Fig. 4(e), red solid line], thus confirming that the residual signal in ΔS(q, t)PEPC originates from iso-CHI2–I.

The demonstration of PEPC to the TRXL data for the photoreaction of CHI3 shows how PEPC can be utilized to determine the kinetic framework of the reaction and to analyze the structural dynamics of the reaction. The significant residual signal observed in ΔS(q, t)PEPC confirms the difference in the reaction pathways for cyclohexane and methanol. Further analysis of ΔS(q, t)PEPC reveals (1) a reaction intermediate, iso-CHI2–I, that is involved in the reaction pathways in cyclohexane, along with (2) the kinetics of iso-CHI2–I, especially, when and how iso-CHI2–I forms and decays, and (3) the molecular structure of iso-CHI2–I. The determined kinetics and molecular structures can be utilized for the analysis of the hydrodynamics of the solvent, cyclohexane, as explained in Sec. II G. The resulting ΔT(t) and Δρ(t) are shown in Figs. 4(f) and 4(g), respectively.

Although we have presented and demonstrated the PEPC method as a powerful tool for analyzing the kinetic behavior of TRXL signals, it is worth noting that a conventional method, referred to as the “nodal-point method (NP method),” can also provide similar benefits. However, as discussed in the “Comparison of the PEPC method and the nodal-point method (NP method)” section of the supplementary material, our comparison shows that the PEPC method outperforms the NP method in several crucial aspects.

J. Potential of the PEPC method for other types of data

In this study, we demonstrated the ability of the PEPC method to remove the solvent kinetics from the TRXL data by utilizing the prior knowledge of the shape of the solvent term. In principle, the application of PEPC is not limited to TRXL data and can be extended to other types of data where a certain component is known. By using the PEPC method, the kinetics of the known component can be removed regardless of the type of the data. Furthermore, the PEPC method can be applied to datasets other than time-resolved data. In time-resolved data, spectra change with time. For other types of data, the spectra change with experimental variables other than time, such as temperature or pressure. In such cases, if the spectrum of a species is known and its magnitude changes with the experimental variable, the PEPC method can be used to subtract its variable-dependent changes from the original data. For example, for a set of spectra measured as a function of temperature, the temperature-dependent changes in the spectrum of a species can be removed using the PEPC method. Overall, the PEPC method has a great potential as a versatile and powerful tool for the analysis of various types of data, extending beyond the scope of TRXL data analysis.

III. CONCLUSION

In this work, we introduced PEPC, a new analytic method for the analysis of time-resolved data, and demonstrated its application through various examples. By removing the kinetic contributions of known signal components, such as solvent heating or well-known intermediates or products, the PEPC technique greatly facilitates the kinetic analysis of the treated signal. As a representative example, we showed that the PEPC technique can be used to easily extract kinetic information of solute-related terms from TRXL data by removing the kinetic contributions of solvent terms. Although the solvent term being excessively removed in this process alters the shape of the solute-related term in q-space, structural analysis can still be performed on the resulting PEPC-treated data to obtain structural information. As the PEPC process is fully arithmetic, it has a fast computation time, allowing for real-time monitoring of the status of the data being obtained during the experiment. Overall, the PEPC technique has the potential to serve as a valuable tool for both post-experiment data analysis and real-time monitoring of time-resolved data, with the ability to extend beyond the scope of TRXL data analysis. Further research is required to explore and demonstrate the full range of applications of this powerful and versatile method in various fields of science.

SUPPLEMENTARY MATERIAL

See the supplementary material for detailed discussions on various topics, including the mathematical background for the decomposition of SADSk into SADSk, and SADSk, , the appearance of horizontal stripes in the contour plot of the PEPC-treated data, the determination of coefficients gk and hk during structural analysis, and a comparison of the PEPC method and the nodal-point method.

ACKNOWLEDGMENTS

This work was supported by the Institute for Basic Science (IBS-R033).

AUTHOR DECLARATIONS

Conflict of Interest

The authors have no conflicts to disclose.

Author Contributions

Hosung Ki: Conceptualization (equal); Data curation (lead); Formal analysis (lead); Investigation (lead); Methodology (lead); Visualization (equal); Writing – original draft (equal); Writing – review & editing (equal). Jain Gu: Investigation (supporting); Visualization (equal); Writing – original draft (equal); Writing – review & editing (equal). Yongjun Cha: Investigation (supporting); Visualization (equal); Writing – original draft (equal); Writing – review & editing (equal). Kyung Won Lee: Investigation (supporting); Visualization (equal); Writing – original draft (equal); Writing – review & editing (equal). Hyotcherl Ihee: Conceptualization (equal); Funding acquisition (lead); Supervision (lead); Validation (equal); Writing – original draft (equal); Writing – review & editing (equal).

DATA AVAILABILITY

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

  • 1. Ihee H., “ Visualizing solution-phase reaction dynamics with time-resolved x-ray liquidography,” Acc. Chem. Res. 42, 356–366 (2009). 10.1021/ar800168v [DOI] [PubMed] [Google Scholar]
  • 2. Ihee H., Wulff M., Kim J., and Adachi S., “ Ultrafast x-ray scattering: Structural dynamics from diatomic to protein molecules,” Int. Rev. Phys. Chem. 29, 453–520 (2010). 10.1080/0144235X.2010.498938 [DOI] [Google Scholar]
  • 3. Kim K. H., Kim J., Lee J. H., and Ihee H., “ Topical review: Molecular reaction and solvation visualized by time-resolved x-ray solution scattering: Structure, dynamics, and their solvent dependence,” Struct. Dyn. 1, 011301 (2014). 10.1063/1.4865234 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Kim J., Kim K. H., Oang K. Y., Lee J. H., Hong K., Cho H., Huse N., Schoenlein R. W., Kim T. K., and Ihee H., “ Tracking reaction dynamics in solution by pump–probe x-ray absorption spectroscopy and x-ray liquidography (solution scattering),” Chem. Commun. 52, 3734–3749 (2016). 10.1039/C5CC08949B [DOI] [PubMed] [Google Scholar]
  • 5. Ki H., Oang K. Y., Kim J., and Ihee H., “ Ultrafast x-ray crystallography and liquidography,” Annu. Rev. Phys. Chem. 68, 473–497 (2017). 10.1146/annurev-physchem-052516-050851 [DOI] [PubMed] [Google Scholar]
  • 6. Neutze R., Wouts R., Techert S., Davidsson J., Kocsis M., Kirrander A., Schotte F., and Wulff M., “ Visualizing photochemical dynamics in solution through picosecond x-ray scattering,” Phys. Rev. Lett. 87, 195508 (2001). 10.1103/PhysRevLett.87.195508 [DOI] [PubMed] [Google Scholar]
  • 7. Davidsson J., Poulsen J., Cammarata M., Georgiou P., Wouts R., Katona G., Jacobson F., Plech A., Wulff M., Nyman G., and Neutze R., “ Structural determination of a transient isomer of CH2I2 by picosecond x-ray diffraction,” Phys. Rev. Lett. 94, 245503 (2005). 10.1103/PhysRevLett.94.245503 [DOI] [Google Scholar]
  • 8. Plech A., Kotaidis V., Lorenc M., and Wulff M., “ Thermal dynamics in laser excited metal nanoparticles,” Chem. Phys. Lett. 401, 565–569 (2005). 10.1016/j.cplett.2004.11.072 [DOI] [Google Scholar]
  • 9. Lee J. H., Kim J., Cammarata M., Kong Q., Kim K. H., Choi J., Kim T. K., Wulff M., and Ihee H., “ Transient x-ray diffraction reveals global and major reaction pathways for the photolysis of iodoform in solution,” Angew. Chem., Int. Ed. 47, 1047–1050 (2008). 10.1002/anie.200704150 [DOI] [PubMed] [Google Scholar]
  • 10. Lee J. H., Kim T. K., Kim J., Kong Q., Cammarata M., Lorenc M., Wulff M., and Ihee H., “ Capturing transient structures in the elimination reaction of haloalkane in solution by transient x-ray diffraction,” J. Am. Chem. Soc. 130, 5834–5835 (2008). 10.1021/ja710267u [DOI] [PubMed] [Google Scholar]
  • 11. Andersson M., Malmerberg E., Westenhoff S., Katona G., Cammarata M., Wohri A. B., Johansson L. C., Ewald F., Eklund M., Wulff M., Davidsson J., and Neutze R., “ Structural dynamics of light-driven proton pumps,” Structure 17, 1265–1275 (2009). 10.1016/j.str.2009.07.007 [DOI] [PubMed] [Google Scholar]
  • 12. Christensen M., Haldrup K., Bechgaard K., Feidenhans'l R., Kong Q. Y., Cammarata M., Lo Russo M., Wulff M., Harrit N., and Nielsen M. M., “ Time-resolved x-ray scattering of an electronically excited state in solution. Structure of the 3A2u state of tetrakis-μ-pyrophosphitodiplatinate(II),” J. Am. Chem. Soc. 131, 502–508 (2009). 10.1021/ja804485d [DOI] [PubMed] [Google Scholar]
  • 13. Kong Q., Lee J. H., Kim K. H., Kim J., Wulff M., Ihee H., and Koch M. H. J., “ Ultrafast x-ray solution scattering reveals different reaction pathways in the photolysis of triruthenium dodecacarbonyl (Ru3(CO)12) after ultraviolet and visible excitation,” J. Am. Chem. Soc. 132, 2600–2607 (2010). 10.1021/ja9097548 [DOI] [PubMed] [Google Scholar]
  • 14. Ramachandran P. L., Lovett J. E., Carl P. J., Cammarata M., Lee J. H., Jung Y. O., Ihee H., Timmel C. R., and van Thor J. J., “ The short-lived signaling state of the photoactive yellow protein photoreceptor revealed by combined structural probes,” J. Am. Chem. Soc. 133, 9395–9404 (2011). 10.1021/ja200617t [DOI] [PubMed] [Google Scholar]
  • 15. Kim K. H., Muniyappan S., Oang K. Y., Kim J. G., Nozawa S., Sato T., Koshihara S. Y., Henning R., Kosheleva I., Ki H., Kim Y., Kim T. W., Kim J., Adachi S., and Ihee H., “ Direct observation of cooperative protein structural dynamics of homodimeric hemoglobin from 100 ps to 10 ms with pump-probe x-ray solution scattering,” J. Am. Chem. Soc. 134, 7001–7008 (2012). 10.1021/ja210856v [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Kim T. W., Lee J. H., Choi J., Kim K. H., van Wilderen L. J., Guerin L., Kim Y., Jung Y. O., Yang C., Kim J., Wulff M., van Thor J. J., and Ihee H., “ Protein structural dynamics of photoactive yellow protein in solution revealed by pump-probe x-ray solution scattering,” J. Am. Chem. Soc. 134, 3145–3153 (2012). 10.1021/ja210435n [DOI] [PubMed] [Google Scholar]
  • 17. Kim K. H., Lee J. H., Kim J., Nozawa S., Sato T., Tomita A., Ichiyanagi K., Ki H., Kim J., Adachi S., and Ihee H., “ Solvent-dependent molecular structure of ionic species directly measured by ultrafast x-ray solution scattering,” Phys. Rev. Lett. 110, 165505 (2013). 10.1103/PhysRevLett.110.165505 [DOI] [PubMed] [Google Scholar]
  • 18. Lee J. H., Wulff M., Bratos S., Petersen J., Guerin L., Leicknam J.-C., Cammarata M., Kong Q., Kim J., Møller K. B., and Ihee H., “ Filming the birth of molecules and accompanying solvent rearrangement,” J. Am. Chem. Soc. 135, 3255–3261 (2013). 10.1021/ja312513w [DOI] [PubMed] [Google Scholar]
  • 19. Takala H., Bjorling A., Berntsson O., Lehtivuori H., Niebling S., Hoernke M., Kosheleva I., Henning R., Menzel A., Ihalainen J. A., and Westenhoff S., “ Signal amplification and transduction in phytochrome photosensors,” Nature 509, 245–248 (2014). 10.1038/nature13310 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Ahn C. W., Ki H., Kim J., Kim J., Park S., Lee Y., Kim K. H., Kong Q., Moon J., Pedersen M. N., Wulff M., and Ihee H., “ Direct observation of a transiently formed isomer during iodoform photolysis in solution by time-resolved x-ray liquidography,” J. Phys. Chem. Lett. 9, 647–653 (2018). 10.1021/acs.jpclett.7b03125 [DOI] [PubMed] [Google Scholar]
  • 21. Leshchev D., Harlang T. C. B., Fredin L. A., Khakhulin D., Liu Y., Biasin E., Laursen M. G., Newby G. E., Haldrup K., Nielsen M. M., Wärnmark K., Sundström V., Persson P., Kjaer K. S., and Wulff M., “ Tracking the picosecond deactivation dynamics of a photoexcited iron carbene complex by time-resolved x-ray scattering,” Chem. Sci. 9, 405–414 (2018). 10.1039/C7SC02815F [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Khakhulin D., Lawson Daku L. M., Leshchev D., Newby G. E., Jarenmark M., Bressler C., Wulff M., and Canton S. E., “ Visualizing the coordination-spheres of photoexcited transition metal complexes with ultrafast hard x-rays,” Phys. Chem. Chem. Phys. 21, 9277–9284 (2019). 10.1039/C9CP01263J [DOI] [PubMed] [Google Scholar]
  • 23. Kong Q., Khakhulin D., Shkrob I. A., Lee J. H., Zhang X., Kim J., Kim K. H., Jo J., Kim J., Kang J., Pham V.-T., Jennings G., Kurtz C., Spence R., Chen L. X., Wulff M., and Ihee H., “ Solvent-dependent complex reaction pathways of bromoform revealed by time-resolved x-ray solution scattering and x-ray transient absorption spectroscopy,” Struct. Dyn. 6, 064902 (2019). 10.1063/1.5132968 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Leshchev D., Khakhulin D., Newby G., Ki H., Ihee H., and Wulff M., “ Sub-nanosecond secondary geminate recombination in mercury halides HgX2 (X = I, Br) investigated by time-resolved x-ray scattering,” J. Chem. Phys. 151, 054310 (2019). 10.1063/1.5096422 [DOI] [Google Scholar]
  • 25. Ki H., Park S., Eom S., Gu J., Kim S., Kim C., Ahn C. W., Choi M., Ahn S., Ahn D. S., Choi J., Baik M. H., and Ihee H., “ Gold nanoparticle formation via x-ray radiolysis investigated with time-resolved x-ray liquidography,” Int. J. Mol. Sci. 21, 7125 (2020). 10.3390/ijms21197125 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Kim T. W., Lee S. J., Jo J., Kim J. G., Ki H., Kim C. W., Cho K. H., Choi J., Lee J. H., Wulff M., Rhee Y. M., and Ihee H., “ Protein folding from heterogeneous unfolded state revealed by time-resolved x-ray solution scattering,” Proc. Natl. Acad. Sci. U. S. A. 117, 14996–15005 (2020). 10.1073/pnas.1913442117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Lee S. J., Kim Y., Kim T. W., Yang C., Thamilselvan K., Jeong H., Hyun J., and Ihee H., “ Reversible molecular motional switch based on circular photoactive protein oligomers exhibits unexpected photo-induced contraction,” Cell Rep. Phys. Sci. 2, 100512 (2021). 10.1016/j.xcrp.2021.100512 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Jeong H., Ki H., Kim J. G., Kim J., Lee Y., and Ihee H., “ Sensitivity of time-resolved diffraction data to changes in internuclear distances and atomic positions,” Bull. Korean Chem. Soc. 43, 376–390 (2022). 10.1002/bkcs.12494 [DOI] [Google Scholar]
  • 29. Arnlund D., Johansson L. C., Wickstrand C., Barty A., Williams G. J., Malmerberg E., Davidsson J., Milathianaki D., DePonte D. P., Shoeman R. L., Wang D. J., James D., Katona G., Westenhoff S., White T. A., Aquila A., Bari S., Berntsen P., Bogan M., van Driel T. B., Doak R. B., Kjaer K. S., Frank M., Fromme R., Grotjohann I., Henning R., Hunter M. S., Kirian R. A., Kosheleva I., Kupitz C., Liang M. N., Martin A. V., Nielsen M. M., Messerschmidt M., Seibert M. M., Sjohamn J., Stellato F., Weierstall U., Zatsepin N. A., Spence J. C. H., Fromme P., Schlichting I., Boutet S., Groenhof G., Chapman H. N., and Neutze R., “ Visualizing a protein quake with time-resolved x-ray scattering at a free-electron laser,” Nat. Methods 11, 923–926 (2014). 10.1038/nmeth.3067 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Kim K. H., Kim J. G., Nozawa S., Sato T., Oang K. Y., Kim T. W., Ki H., Jo J., Park S., Song C., Sato T., Ogawa K., Togashi T., Tono K., Yabashi M., Ishikawa T., Kim J., Ryoo R., Kim J., Ihee H., and Adachi S., “ Direct observation of bond formation in solution with femtosecond x-ray scattering,” Nature 518, 385–389 (2015). 10.1038/nature14163 [DOI] [PubMed] [Google Scholar]
  • 31. Levantino M., Schiro G., Lemke H. T., Cottone G., Glownia J. M., Zhu D. L., Chollet M., Ihee H., Cupane A., and Cammarata M., “ Ultrafast myoglobin structural dynamics observed with an x-ray free-electron laser,” Nat. Commun. 6, 6772 (2015). 10.1038/ncomms7772 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Biasin E., van Driel T. B., Kjaer K. S., Dohn A. O., Christensen M., Harlang T., Chabera P., Liu Y., Uhlig J., Papai M., Nemeth Z., Hartsock R., Liang W., Zhang J., Alonso-Mori R., Chollet M., Glownia J. M., Nelson S., Sokaras D., Assefa T. A., Britz A., Galler A., Gawelda W., Bressler C., Gaffney K. J., Lemke H. T., Moller K. B., Nielsen M. M., Sundstrom V., Vanko G., Warnmark K., Canton S. E., and Haldrup K., “ Femtosecond x-ray scattering study of ultrafast photoinduced structural dynamics in solvated [Co(terpy)2]2+,” Phys. Rev. Lett. 117, 013002 (2016). 10.1103/PhysRevLett.117.013002 [DOI] [PubMed] [Google Scholar]
  • 33. Kim K. H., Kim J. G., Oang K. Y., Kim T. W., Ki H., Jo J., Kim J., Sato T., Nozawa S., Adachi S., and Ihee H., “ Femtosecond x-ray solution scattering reveals that bond formation mechanism of a gold trimer complex is independent of excitation wavelength,” Struct. Dyn. 3, 043209 (2016). 10.1063/1.4948516 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Kim J. G., Nozawa S., Kim H., Choi E. H., Sato T., Kim T. W., Kim K. H., Ki H., Kim J., Choi M., Lee Y., Heo J., Oang K. Y., Ichiyanagi K., Fukaya R., Lee J. H., Park J., Eom I., Chun S. H., Kim S., Kim M., Katayama T., Togashi T., Owada S., Yabashi M., Lee S. J., Lee S., Ahn C. W., Ahn D.-S., Moon J., Choi S., Kim J., Joo T., Kim J., Adachi S., and Ihee H., “ Mapping the emergence of molecular vibrations mediating bond formation,” Nature 582, 520–524 (2020). 10.1038/s41586-020-2417-3 [DOI] [PubMed] [Google Scholar]
  • 35. Biasin E., Fox Z. W., Andersen A., Ledbetter K., Kjaer K. S., Alonso-Mori R., Carlstad J. M., Chollet M., Gaynor J. D., Glownia J. M., Hong K., Kroll T., Lee J. H., Liekhus-Schmaltz C., Reinhard M., Sokaras D., Zhang Y., Doumy G., March A. M., Southworth S. H., Mukamel S., Gaffney K. J., Schoenlein R. W., Govind N., Cordones A. A., and Khalil M., “ Direct observation of coherent femtosecond solvent reorganization coupled to intramolecular electron transfer,” Nat. Chem. 13, 343–349 (2021). 10.1038/s41557-020-00629-3 [DOI] [PubMed] [Google Scholar]
  • 36. Choi E. H., Kim J. G., Kim J., Ki H., Lee Y., Lee S., Yoon K., Kim J., Kim J., and Ihee H., “ Filming ultrafast roaming-mediated isomerization of bismuth triiodide in solution,” Nat. Commun. 12, 4732 (2021). 10.1038/s41467-021-25070-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Ki H., Choi S., Kim J., Choi E. H., Lee S., Lee Y., Yoon K., Ahn C. W., Ahn D.-S., Lee J. H., Park J., Eom I., Kim M., Chun S. H., Kim J., Ihee H., and Kim J., “ Optical Kerr effect of liquid acetonitrile probed by femtosecond time-resolved x-ray liquidography,” J. Am. Chem. Soc. 143, 14261–14273 (2021). 10.1021/jacs.1c06088 [DOI] [PubMed] [Google Scholar]
  • 38. Lee Y., Kim J. G., Lee S. J., Muniyappan S., Kim T. W., Ki H., Kim H., Jo J., Yun S. R., Lee H., Lee K. W., Kim S. O., Cammarata M., and Ihee H., “ Ultrafast coherent motion and helix rearrangement of homodimeric hemoglobin visualized with femtosecond x-ray solution scattering,” Nat. Commun. 12, 3677 (2021). 10.1038/s41467-021-23947-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Kim J. G., Choi E. H., Lee Y., and Ihee H., “ Femtosecond x-ray liquidography visualizes wavepacket trajectories in multidimensional nuclear coordinates for a bimolecular reaction,” Acc. Chem. Res. 54, 1685–1698 (2021). 10.1021/acs.accounts.0c00812 [DOI] [PubMed] [Google Scholar]
  • 40. Svergun D. I., “ Restoring low resolution structure of biological macromolecules from solution scattering using simulated annealing,” Biophys. J. 76, 2879–2886 (1999). 10.1016/S0006-3495(99)77443-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Svergun D., “ Advanced solution scattering data analysis methods and their applications,” J. Appl. Crystallogr. 33, 530–534 (2000). 10.1107/S0021889800001370 [DOI] [Google Scholar]
  • 42. Svergun D. I., Petoukhov M. V., and Koch M. H. J., “ Determination of domain structure of proteins from x-ray solution scattering,” Biophys. J. 80, 2946–2953 (2001). 10.1016/S0006-3495(01)76260-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Ihee H., Lorenc M., Kim T. K., Kong Q. Y., Cammarata M., Lee J. H., Bratos S., and Wulff M., “ Ultrafast x-ray diffraction of transient molecular structures in solution,” Science 309, 1223–1227 (2005). 10.1126/science.1114782 [DOI] [PubMed] [Google Scholar]
  • 44. Panjkovich A. and Svergun D. I., “ Deciphering conformational transitions of proteins by small angle x-ray scattering and normal mode analysis,” Phys. Chem. Chem. Phys. 18, 5707–5719 (2016). 10.1039/C5CP04540A [DOI] [PubMed] [Google Scholar]
  • 45. Oang K. Y., Yang C., Muniyappan S., Kim J., and Ihee H., “ SVD-aided pseudo principal-component analysis: A new method to speed up and improve determination of the optimum kinetic model from time-resolved data,” Struct. Dyn. 4, 044013 (2017). 10.1063/1.4979854 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Franke D., Jeffries C. M., and Svergun D. I., “ Machine learning methods for x-ray scattering data analysis from biomacromolecular solutions,” Biophys. J. 114, 2485–2492 (2018). 10.1016/j.bpj.2018.04.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Ki H., Lee Y., Choi E. H., Lee S., and Ihee H., “ SVD-aided non-orthogonal decomposition (SANOD) method to exploit prior knowledge of spectral components in the analysis of time-resolved data,” Struct. Dyn. 6, 024303 (2019). 10.1063/1.5085864 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Choi M., Kim J. G., Muniyappan S., Kim H., Kim T. W., Lee Y., Lee S. J., Kim S. O., and Ihee H., “ Effect of the abolition of intersubunit salt bridges on allosteric protein structural dynamics,” Chem. Sci. 12, 8207–8217 (2021). 10.1039/D1SC01207J [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Kong Q. Y., Lee J. H., Plech A., Wulff M., Ihee H., and Koch M. H. J., “ Ultrafast x-ray solution scattering reveals an unknown reaction intermediate in the photolysis of [Ru-3(CO)12],” Angew. Chem., Int. Ed. 47, 5550–5553 (2008). 10.1002/anie.200801153 [DOI] [PubMed] [Google Scholar]
  • 50. Kjaer K. S., v Driel T. B., Kehres J., Haldrup K., Khakhulin D., Bechgaard K., Cammarata M., Wulff M., Sørensen T. J., and Nielsen M. M., “ Introducing a standard method for experimental determination of the solvent response in laser pump, x-ray probe time-resolved wide-angle x-ray scattering experiments on system in solution,” Phys. Chem. Chem. Phys. 15, 15003–15016 (2013). 10.1039/C3CP50751C [DOI] [PubMed] [Google Scholar]
  • 51. Cammarata M., Lorenc M., Kim T. K., Lee J. H., Kong Q. Y., Pontecorvo E., Lo Russo M., Schiro G., Cupane A., Wulff M., and Ihee H., “ Impulsive solvent heating probed by picosecond x-ray diffraction,” J. Chem. Phys. 124, 124504 (2006). 10.1063/1.2176617 [DOI] [PubMed] [Google Scholar]
  • 52. Bierlaire M., Toint P. L., and Tuyttens D., “ On iterative algorithms for linear least squares problems with bound constraints,” Linear Algebra Appl. 143, 111–143 (1991). 10.1016/0024-3795(91)90009-L [DOI] [Google Scholar]
  • 53. Björck Å., “ Solving linear least squares problems by Gram-Schmidt orthogonalization,” BIT Numer. Math. 7, 1–21 (1967). 10.1007/BF01934122 [DOI] [Google Scholar]
  • 54. Golub G. H., Manneback P., and Toint P. L., “ A comparison between some direct and iterative methods for certain large scale geodetic least squares problems,” SIAM J. Sci. Stat. Comput. 7, 799–816 (1986). 10.1137/0907053 [DOI] [Google Scholar]
  • 55. Golub G. H. and Reinsch C., “ Singular value decomposition and least squares solutions,” Numer. Math. 14, 403–420 (1970). 10.1007/BF02163027 [DOI] [Google Scholar]
  • 56. Schmidt M., Rajagopal S., Ren Z., and Moffat K., “ Application of singular value decomposition to the analysis of time-resolved macromolecular x-ray data,” Biophys. J. 84, 2112–2129 (2003). 10.1016/S0006-3495(03)75018-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Jung Y. O., Lee J. H., Kim J., Schmidt M., Moffat K., Srajer V., and Ihee H., “ Volume-conserving trans-cis isomerization pathways in photoactive yellow protein visualized by picosecond x-ray crystallography,” Nat. Chem. 5, 212–220 (2013). 10.1038/nchem.1565 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Strang G., Introduction to Linear Algebra, 5th ed. ( Cambridge University Press, 2021), pp. 206–213. [Google Scholar]
  • 59. Meyer C. D., Matrix Analysis and Applied Linear Algebra ( SIAM, 2000), p. 439. [Google Scholar]
  • 60. Salkind N. J., Encyclopedia of Measurement and Statistics ( SAGE Publications, 2006), pp. 530–536. [Google Scholar]
  • 61. Hogben L., Handbook of Linear Algebra ( CRC Press, 2007), p. 86. [Google Scholar]
  • 62. Golub G., “ Numerical methods for solving linear least squares problems,” Numer. Math. 7, 206–216 (1965). 10.1007/BF01436075 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

See the supplementary material for detailed discussions on various topics, including the mathematical background for the decomposition of SADSk into SADSk, and SADSk, , the appearance of horizontal stripes in the contour plot of the PEPC-treated data, the determination of coefficients gk and hk during structural analysis, and a comparison of the PEPC method and the nodal-point method.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.


Articles from Structural Dynamics are provided here courtesy of American Institute of Physics

RESOURCES