Skip to main content
Nature Communications logoLink to Nature Communications
. 2025 Nov 26;16:10522. doi: 10.1038/s41467-025-65514-4

Surface structure of water from soft X-ray second harmonic generation

David J Hoffman 1,, Shane W Devlin 2,3,4, Douglas Garratt 1, Sasawat Jamnuch 5,6, Jacob A Spies 2,7, Bailey R Nebgen 2,7, Daniel Schacher 4, Alexandria Do 8, Franky Bernal 2,9, Erika J Riffe 2,9, Kristjan Kunnus 1, Christina Y Hampton 1, Joseph Duris 1, David Cesar 1, Nicholas Sudar 1, Georgi L Dakovski 1, Walter S Drisdell 9, Keith V Lawler 4, Agostino Marinelli 1, Michael W Zuerch 2,7, Richard J Saykally 2,9, Craig P Schwartz 4,, Tod A Pascal 8,, Jake D Koralek 1,
PMCID: PMC12658203  PMID: 41298383

Abstract

The microscopic structure of water’s surface is crucial to many natural and industrial processes, but studying its hydrogen bond (H-bond) network directly remains challenging due to the required interfacial sensitivity of experimental techniques. By leveraging advances in flat liquid sheet microjets and terawatt-scale attosecond soft X-ray pulses from the LCLS X-ray free electron laser, we employed soft X-ray second harmonic generation (SXSHG) spectroscopy to examine the liquid water/vapor interface. SXSHG combines the elemental selectivity of X-ray spectroscopies with the surface selectivity of SHG and gives access to the electronic structure of interfacial species. Here, we show the SXSHG spectrum differs from bulk water’s X-ray absorption, with its peak shifted several eV, indicating a vastly different electronic environment at the interface as compared to the bulk. First-principles electronic structure calculations show the signal is highly sensitive to H-bond interactions, such as water molecules accepting a single H-bond, which are surface abundant.

Subject terms: Chemical physics, Surface spectroscopy, Density functional theory, Nonlinear optics


Water-vapor interfaces have been studied with many techniques, yet open questions persist about their electronic and molecular structure. Here, the authors demonstrate the application of soft x-ray second harmonic generation to study the water surface by leveraging attosecond pulses at the LCLS and a flat liquid sheet microjet, providing insights on the H-bond structure.

Introduction

Water and its interfaces are ubiquitous in physical, chemical, and biological processes14. Water’s tetrahedral hydrogen bond (H-bond) network leads to the unique macroscopic properties of its liquid phase, such as its anomalously high boiling point and density5,6, however the H-bond network is necessarily different at the surface due to the lack of H-bonding partners. Due to its fundamental importance, the surface of water has been extensively studied by a wide array of experimental techniques, ranging from measurements of macroscopic properties such as surface tension7 to probes with finite penetration depth, such as photoemission spectroscopy8,9, particle scattering10, and various spectral decomposition and reflectivity-based techniques1113. The development of nonlinear surface-selective spectroscopies, such as UV-visible second harmonic generation (SHG) and vibrational sum frequency generation (SFG) have provided some structural information about water interfaces. However, the interpretation of the resulting complex spectra is challenging and remains the subject of much debate1419. Therefore, despite the pervasiveness of aqueous interfaces and the many tools developed to study them, there are still substantial open questions about their electronic and molecular structure and how they connect to important macroscopic phenomena.

Soft X-ray second harmonic generation (SXSHG) applies the surface sensitivity of SHG to element-specific X-ray transitions, and can probe electronic valence states not easily accessible with UV-visible spectroscopies2026. As SXSHG almost-exclusively relies on the intense X-ray pulses produced by X-ray free electron lasers (XFELs), it is a relatively new technique that is still quickly developing. The foundational work on graphite slabs over a range of thicknesses demonstrated the possibility of the measurement and the surface-selectivity of the measurement24. A later study on a boron-Parylene N junction showed the sensitivity to buried interfaces as well as the ability to extract intermolecular distances with Angstrom precision21. Work on LiNbO3 has also demonstrated the polarization-dependence of the signal as well as element-dependent signatures from the Li K-edge and Nb N-edge22.

The initial states in soft X-ray spectroscopies like SXSHG comprise tightly bound core electrons, which cause the corresponding transitions to be almost entirely dominated by atom-dependent binding energies that are separated by 10–100 s of eV. In the case of water, soft X-ray absorption spectroscopy (XAS) of the oxygen K-edge is well-established as a sensitive probe of its hydrogen bonding network, and has been the subject of several reviews27,28. This sensitivity arises primarily because of the strong transitions from the core oxygen 1 s orbitals and the lowest-lying unoccupied valence orbitals, which have σ* character and are localized on the hydrogen atoms. The water XAS spectrum is then particularly sensitive to the H-bond donation character of the probed water molecules, and suggests that the SXSHG spectrum can provide similar information for the interfacial water molecules.

SXSHG is a nonlinear process, where two soft X-ray photons are absorbed and a new photon with twice the incident energy is emitted (shown schematically in Fig. 1, inset). The resulting SHG signal has a characteristic quadratic intensity dependence with the incident beam’s intensity:

ISHG2ω=χeff22ω2I0ω2 1

Fig. 1. Schematic depiction of the SXSHG experiment.

Fig. 1

Attosecond soft X-ray pulses are produced by the XFEL. The pulses intrinsically contain some photons in the second harmonic region. The pulse energy is measured non-invasively by a nitrogen gas attenuator (GMD). The pulse is focused by a pair of Kirkpatrick-Baez mirrors and impinges a water sheet jet at a 70° angle of incidence. The fundamental and harmonic from the XFEL pulse are partially absorbed by the bulk water and SHG is generated at the surfaces. The pulse passes through an aluminum filter to preferentially attenuate the much brighter fundamental. The fundamental and harmonic are spectrally resolved on the detector. Inset: Schematic energy level diagram of the SXSHG process.

Critically, χeff(2), the effective second-order nonlinear susceptibility at a given photon energy, is symmetry forbidden in centrosymmetric media within the dipole approximation29. Practically, in the case of liquid water, this means that the only observed SHG signals should arise from interfaces, which inherently break inversion symmetry. Previous experimental and theoretical work on solid samples has demonstrated that 98% of the SXSHG signal is generated from the three atomic layers nearest to the interface24, and similar probe depths are expected in the case of liquid water.

Despite its purported advantages, several factors make SXSHG experiments on water particularly challenging. First, nonlinear X-ray processes require extreme levels of photon intensity and coherence, which are currently only available from XFELs. In the present work, X-ray pulses were produced using the X-ray Laser-Enhanced Attosecond Pulses (XLEAP) technique, which enables the XFEL to produce sub-femtosecond pulses with greater peak intensities than are possible with the standard spontaneously amplified stimulated emission processes commonly used at XFELs30. These pulses were further amplified to TW-scale peak power using superradiant amplification31. Second, the strong attenuation of soft X-rays in both air and water requires SXSHG measurements to be performed in a vacuum environment. Here, we employed a flat liquid sheet microjet, which provides a constantly-refreshing liquid target with optically flat liquid interfaces open to the vacuum environment3236. The submicron sheet is thin enough to detect both the fundamental and harmonic signals in transmission geometry, which greatly simplifies the experimental design by allowing for alignment of the spectrometer without the liquid target, minimizing the impacts of possible jet fluctuations, and enabling for the simultaneous measurement of the XAS and SXSHG from the same sample.

With these advances, we measure and report the SXSHG spectrum of the water/vapor interface across the oxygen K-edge (~540 eV). While the measured signal was dominated by background photons from the XFEL source and competing intensity-dependent transient absorption (TA) effects in the water sample, covariance analysis of the harmonic (~1080 eV) and fundamental (~540 eV) regions of the spectra measured at each set photon energy revealed the SXSHG signal and permitted subtraction of the background as a function of intensity-dependence. The SXSHG signal was only observed above the oxygen K-edge, consistent with resonant enhancement expected from the sample. The peak of the full SXSHG spectrum appeared several eV higher than the peak of the bulk water XAS, differentiating the SXSHG signal from the bulk measurement. These results were supported with first-principles electronic structure calculations, which also captured the measured blue-shift in the SXSHG spectrum compared to the XAS. The simulated SXSHG response of the interfacial water molecules was found to be particularly sensitive to the number of acceptor H-bonds, making this technique a direct probe of the broken H-bond structure of interfacial water analogous to, but distinct from, XAS. Using these calculations, it was determined that the major contributors to the full SXSHG spectrum were water molecules accepting a single H-bond that are significantly more populous at the water surface than in the bulk and agree with state-of-the-art simulations of the liquid water surface.

Results

Soft X-ray second harmonic generation spectroscopy

The experimental configuration is shown in a schematic diagram of the SXSHG experiment is shown in Fig. 1 and described in the “Materials and Methods”. Briefly, attosecond X-ray pulses with pulse energies of 100 s of μJ were tuned across the oxygen K-edge region from ~520–560 eV in 5 eV steps and were focused using a pair of Kirkpatrick-Baez mirrors to a ~10 μm FWHM spot on the liquid sheet target. The pulse energy was measured non-invasively shot-by-shot using a gas monitor detector (GMD), which we will use as a proxy for intensity throughout this work. For each photon energy, the fundamental and harmonic were measured simultaneously in the transmission geometry on a shot-by-shot basis. The pulses were spectrally resolved to monitor both the harmonic and fundamental spectral regions simultaneously. The XFEL produced a substantial background in the harmonic region that was about four orders of magnitude weaker than the fundamental. Spectral datasets for each XFEL configuration were then taken with (jet-in) and without (jet-out) the liquid sheet in the beam to enable background subtraction of the XFEL harmonic, which additionally enables XAS measurements of the liquid jet across the fundamental region.

The intrinsic shot-to-shot fluctuations of the XFEL pulse energy were used to generate the intensity dependence for identifying nonlinear signals. The pulse energy varies stochastically from <100 to >500 μJ with moderate spectral changes due to shot-by-shot variation in the electron beam31. Jet-in and jet-out data were binned by pulse energy and the jet-out data was subtracted from the jet-in data after accounting for absorption through the water sample. An example of intensity-dependent difference signals in the harmonic region are shown in Fig. 2B for a dataset taken at 550 eV (above the oxygen K-edge). The XFEL harmonic and squared fundamental spectra are also shown as blue and red dashed curves, respectively. The squared fundamental spectrum indicates where an SHG signal should appear if |χ(2)|2 is flat across the spectral region. The complicated behavior of the intensity-dependent spectra was assigned to three distinct processes as illustrated in Fig. 2A, C. Linear absorption and an unexpected TA process that is linear in intensity (Supplementary Fig. 2) both reduce the counts across the entire spectral region, while the SHG increases the counts only in the region that roughly corresponds to the squared fundamental profile. We associate the observed TA features with intrapulse ionization of the water sample, because characteristic water ionization spectral features were also observed in the fundamental region oxygen K-edge XAS37. The induced harmonic absorption could then be potentially explained by an increased effective electron density per the Drude model38.

Fig. 2. Intensity-dependent processes in the harmonic region.

Fig. 2

A The harmonic produced by the XFEL is attenuated by intrinsic linear absorption through the water jet and a transient absorption (TA) induced by the fundamental. The SHG process produces additional counts in the harmonic region. B Difference signals for 550 eV at selected binned fundamental pulse energies (μJ). Colored shaded areas represent standard errors for the signal between three datasets. The blue and red dashed curves are the XFEL harmonic and squared fundamental spectra respectively. C Depiction of the intensity-dependent difference signals using the features in (A), reproducing (B). D Experimental covariance difference map for 550 eV showing intensity-dependent features. E Modeled best-fit covariance map with the intensity-dependent features from (A) indicated. The correlated positive SHG feature and negative TA background are apparent. Color bar magnitude is arbitrary but shared.

The decomposition of the difference signal in Fig. 2B into TA and SHG components is supported through the use of well-established covariance mapping techniques39. The experimental covariance map for the 550 eV dataset is shown in Fig. 2D and a modeled map is shown in Fig. 2E generated from the jet-out data and the intensity-dependent features illustrated in Fig. 2A. These covariance maps are 2D contour plots of the covariance of the detector pixels, which indicate how the intensities measured at different photon energies dispersed on the detector vary with respect to each other after background subtraction of intrinsic correlations from the light source. Blue features correspond to a negative correlation (i.e., more fundamental counts correlate to fewer harmonic counts), while red features correspond to a positive correlation (more fundamental counts correlate to more harmonic counts). The x- and y-axes correspond to the fundamental and harmonic regions, respectively, and the white curves along each axis are the corresponding (jet-out) average spectra. The proposed TA signal then results in a broad uncorrelated blue feature, as the TA signal must be nonresonant across the harmonic region and the absorption across the fundamental is similarly flat above the oxygen K-edge, although the measured behavior of the TA depends on the fundamental and harmonic pulse profiles. By contrast, the SHG process will produce a signal that is correlated along the diagonal line corresponding to the harmonic frequency being twice the fundamental frequency.

The best-fit parameters for the TA signals as well as the |χ(2)|2 spectral response were determined by a simultaneous fitting to the experimental covariance map and the isolated binned SHG signals (see “Materials and Methods” for a detailed description). The covariance map (Fig. 2E) produced from this relatively straightforward model accurately reproduces the major qualitative features of the experimental data, although it misses some of the fine structure in both signals. Figure 3A shows the resulting intensity-dependent isolated SHG signals from the same best fit model. The isolated signal was positive and increasing with fundamental intensity, and was in the expected spectral location (red dashed curve). The integrated intensity-dependent signal was then plotted against the integrated fundamental (Fig. 3B). The best fit curve (Eq. 1) gives the value of |χ(2)|2 and demonstrates the quadratic intensity dependence of the isolated SHG signal.

Fig. 3. Intensity and spectral dependence of the isolated SXSHG signal.

Fig. 3

A Background-subtracted intensity-dependent SHG signals from the data shown in Fig. 2B. B Intensity of the integrated isolated SHG signal vs. the fundamental. Quadratic fit shown. Error bars represent standard error between three measurements. C Full spectral dependence of the nonlinear response |χ(2)|2 vs. the mean XFEL photon energy. SHG signal only appears well above the O K-edge from simultaneously measured XAS (blue curve). Y-error bars from statistical error and estimated systematic error. X-error bars derived from the FWHM of the XFEL fundamental spectra.

The full SXSHG spectrum was then generated by tuning the XFEL photon energy across the oxygen K-edge in 5 eV steps. The ten collected datasets were all processed in the same manner as described above, wherein the |χ(2)|2 response was determined by the simultaneous fitting of the covariance maps and the binned isolated SHG signals. Best fit |χ(2)|2 parameters are shown for each dataset in Fig. 3C, plotted against the simultaneously measured XAS of bulk water (blue curve), illustrating the onset of measured |χ(2)|2 signal. The y-error bars in Fig. 3C capture the statistical error as well as estimated systematic error in modeling the TA background process, detailed in “Materials and Methods”. A clear edge is apparent in the |χ(2)|2 spectrum within several eV of the oxygen K-edge measured by XAS. Covariance maps and difference signals below the K-edge only show evidence of TA features (in both the harmonic and fundamental spectral regions). Surprisingly, the onset of the |χ(2)|2 signal does not occur at the peak of the XAS spectrum (~540 eV) but is blue-shifted by ~5 eV. The full dataset then suggests a |χ(2)|2 spectral response for the water surface that is distinct from the bulk water XAS.

First principles simulations

Insights into the origin of the observed SXSHG signal were obtained using first principles simulations of the SHG response from perturbation theory within density functional theory (DFT). The methods used are analogous to those in prior SXSHG studies2024,40 (as detailed in the Materials and Methods), and are briefly outlined here. First, we obtained ensemble averaged H-bond structures from classical molecular dynamics (MD) simulations using the many-body polarizable water model (MB-pol)41, which has been rigorously validated to simultaneously capture both the bulk42 and interfacial43 physics of water at ambient conditions. We then determined the water mass-density distribution as a function of distance from the instantaneous water/vapor interface (Supplementary Fig. 6B)44, and quantified the H-bond characteristics of interfacial molecules by calculating the potential of mean force (Supplementary Fig. 6C)45. The populations of different major species as distinguished by number of H-bond acceptors and donors (each ranging from zero to two) are shown in Fig. 4A and illustrated in Fig. 4B. SXSHG spectra were calculated for each unique H-bond configuration (Fig. 4C, minor configurations can be found in Supplementary Fig. 7D), sampled from an MD simulation of a smaller simulation cell for computational feasibility. The full simulated signal is generated on a per-atom basis, allowing for the contributions from particular molecules to be isolated.

Fig. 4. First-principles simulations of SXSHG spectra.

Fig. 4

A Population analysis of water molecules in the bulk (purple) and first interfacial layer (green), grouped by number of acceptor (A) and donor (D) H-bonds from MD calculations. B Schematic depictions of the major water H-bond configurations. C Simulated SXSHG spectra for different H-bond configurations that contribute significantly to the total spectrum. Intensities are all to scale, vertically offset for clarity (dashed lines indicate respective zeros). D TD-DFT calculations of the excited states of an isolated water molecule. Valence states are shown in red and highly excited SXSHG states are shown in gray, plotted at half energy. The SXSHG manifolds (orbital isosurfaces inset) are blue shifted relative to the valence states. E The resulting simulated spectrum (purple) is compared to experiment (black) from Fig. 3C. The population in (A) are used to weight the spectra of the various H-bonding configurations in (C).

The simulated SXSHG spectra were found to be particularly sensitive to the number of acceptor H-bonds. This stands in contrast to linear XAS measurements, which have been shown to be sensitive to the degree of donor H-bonds due to the low-lying water valence orbitals being localized on the hydrogen atoms, but not particularly sensitive to the number of acceptor H-bonds28. In particular, interfacial single acceptor/no donor species (A1D0, blue trace in Fig. 4C) presents several SXSHG features at ~532 eV, 540 eV and a particularly intense main peak at 544 eV. Conversely, the calculated spectrum of the A2D2 species (red trace in Fig. 4C), the dominant constituent of bulk water, is largely featureless over the calculated energy range. The spectra of each H-bond donor/acceptor species were then weighted based on the interfacial population thermodynamics at 277 K to match the expected experimental temperature due to evaporative cooling in vacuum46 to obtain a computational SXSHG spectrum. Figure 4E shows that the simulated spectrum is in excellent agreement with the experimental measurements. Additional simulations also show the SXSHG signal results from broken symmetry at the interface, as the corresponding SXSHG response of a bulk water system was found to be 50x less intense even with the limited simulation cell size (Supplementary Fig. 7A). These calculations inherently include bulk-active quadrupole contributions as well (see “Materials and Methods”). Combined with previous experimental and theoretical work on other samples showing thickness independence24 and polarization-dependence22 consistent with dipole selection rules, these simulations support our claim of surface selectivity.

Discussion

The experimentally observed blue shift of ~5 eV in the surface SXSHG spectrum relative to the bulk XAS was also captured by our first-principles SXSHG simulations (Supplementary Fig. 7C), but these simulations do not provide an orbital-level description. To better understand this effect, we performed high-level quantum chemistry calculations of the natural transition orbitals resulting from the overlap of the 1 s core electron with the various SXSHG excited states (corresponding to excitonic virtual states and the process depicted by the blue arrow in the schematic SHG energy level diagram in Fig. 1). These calculations, performed in the single molecule limit, revealed a manifold of core-excited states at twice 539 eV (sx1), which are weakly dipole allowed, and another manifold of states at twice 543 eV (sx2) with much stronger oscillator strengths. Simulations of AxDy species and their first solvation shell waters revealed similar manifolds for all configurations, with larger oscillator strength correlating with smaller exciton size47 within this energy range. These manifolds and corresponding molecular orbitals for an isolated molecule are shown in Fig. 4D (gray trace) relative to the major water molecule valence transitions (red trace). Note that the gas phase XAS appears at lower energy than the liquid phase XAS27,28. The half-energy sx1 and sx2 states are also several eV blue-shifted relative to the valence transitions, which is suggestive of an orbital overlap explanation for this effect. However, these calculations do not consider the transition between the valence and sx1/sx2 manifolds. The sx2 manifold of states was found to have intramolecular charge transfer character, as determined by the transition charge metric48, with large oscillator strengths due to strong dipole induced-dipole couplings. This manifold of states for both the single molecule and clusters is also particularly sensitive to molecular asymmetry (Supplementary Fig. 8), which is known to occur in liquid water in the presence of broken H-bonding49,50. These states may provide a partial explanation for the sensitivity to acceptor H-bonds as well, as the 4a1 state localized on the hydrogen atoms have poor spectral overlap with the sx1/sx2 manifolds, and potentially better overlap with higher energy orbitals with more oxygen character.

Our MD simulations revealed an interface that is decorated with broken H-bond species. The single-acceptor (A1Dx) species were both greatly enhanced at the surface relative to the bulk and produce the largest signal per molecule. The single-acceptor-single-donor (A1D1, green trace in Fig. 4C) species make up ~20% of the surface molecules with three times their bulk prevalence, and it is the primary contributor to the 544 eV main peak and a post-edge peak in the SXSHG spectrum at 550 eV. The single acceptor, no donor (A1D0, blue trace in Fig. 4C) species represents ~3% of the surface water molecules (four times their bulk prevalence) but has an outsized influence on the main SXSHG peak at 544 eV and the smaller peak at 536.5 eV.

Notably, the SXSHG spectrum of the tetrahedral double-acceptor-double-donor species (A2D2, red trace in Fig. 4C) has only a weak response at 550 eV due to the high degree of symmetry, and while this species is the majority component of the bulk, about half as many (~35%) of the surface waters adopt this configuration. In general, the weak response from the more symmetric species necessitates information from other more established techniques to fully describe the liquid surface, such as vibrational SFG. The weak SXSHG activity of the A2D2 species provides an interesting contrast to the vibrational SFG data, as the OH stretch of individual species provide a strong response in the SFG which is minimized in the final spectrum due to phase cancellations due to the opposing orientations of molecules near the surface19,5153. Here, the per-species response is reduced, possibly due to the valence molecular orbitals gaining more s-character when tetrahedrally bonded, as has been seen in simulations of the water XAS spectrum54. This may reduce either the one or two-photon cross-section in the symmetric A2D2 species.

Thus, when considering the calculated spectra and population analysis, we conclude that the main peak in the full SXSHG spectrum encodes the signature of broken, single acceptor H-bonding. This result corroborates the sensitivity of using oxygen 1 s electronic transitions to probe hydrogen bonding with high selectivity, and shows that SXSHG can complement optical and infrared nonlinear methods in characterizing the water surface to enable a full understanding of the liquid surface.

In summary, we demonstrated SXSHG as a probe of the water/vapor interface by combining intense attosecond XFEL pulses with flat liquid sheet microjets. The experimental spectrum was interpreted by complementary first-principles electronic structure calculations of water molecules in distinct H-bond environments, and weighting the simulated spectra by the prevalence of each H-bond configuration from extensive MD simulations. Overall, we find that the major contributors to the full SXSHG spectrum are water molecules accepting a single H-bond, which are enhanced at the interface in agreement with state-of-the-art MD simulations. This sensitivity to H-bond acceptors was an unexpected result, as the bulk XAS is known to be more sensitive to H-bond donors.

While in this current work, the conclusions that we are able to draw are limited due to the low signal-to-noise of the SXSHG feature above the XFEL background, there are a number of strategies that could be pursued to improve the measurement. Selective absorptive and spatial filters can be employed to preferentially remove the XFEL harmonic before reaching the sample target, or a reflective geometry could be employed to minimize the harmonic background by utilizing Brewster’s angle. Target thickness, angle-of-incidence, and X-ray polarization dependence could also be examined in future work to experimentally demonstrate surface selectivity.

Most significantly, the commissioning of new high repetition rate XFELs such as LCLS-II will enable the collection of SXSHG data at tens of kHz instead of the 120 Hz used in this study. We expect this to help evaluate and validate the various water models/potentials used in the literature for simulating the vapor/water interface. In addition to future measurements on neat water, it will be possible to study a variety of scientifically and industrially interesting systems, including the interfacial response of water due to the presence of small molecules and ions, the electrical double layer, and the solvation environment of biomolecules. Using the orders-of-magnitude statistical improvements provided by next generation XFELs, high quality SXSHG data is expected to provide a rigorous test of theoretical models and further refine our knowledge of the water surface and more complex interfaces and environments.

Methods

Experimental setup

The experiments were carried out at the chemRIXS end station of the Linac Coherent Light Source (LCLS) XFEL at SLAC National Accelerator Laboratory. The attosecond pulses were optimized for peak intensity, with several hundred μJ of energy per pulse, bandwidths of ~8 eV, and nominal durations of several hundred attoseconds31. Jet-in and jet-out datasets were taken with ~100,000 shots each, corresponding to 15 min of acquisition.

The water target was a liquid sheet that was produced using a microfluidic chip with a converging channel geometry (Micronit Micro 1, flow rate 2.2 mL/min). The liquid sheet is free flowing at high enough velocity (~10 m/s) to ensure a fresh target for each X-ray pulse at 120 Hz. Based on the white-light thin film interference pattern produced by the sheet55 and the observed XAS, the sheet thickness in the measured region is ~500 nm. To produce SHG from a liquid surface, the electric field of the incident light must have a component normal to the surface29. The sheet was rotated to a 70° angle of incidence (relative to sheet normal) with the X-ray pulses having p-polarization, giving an effective optical path length of 1.5 μm. Measurements were made in-vacuum with a chamber pressure of ~1 mTorr.

The transmitted pulses were first attenuated using 4.2 μm of aluminum filters which preferentially absorbed the fundamental. They were spectrally resolved using a variable line spacing grating spectrometer in first order (resolving power ~1300) and measured with a CCD detector (Andor Newton_SO) that was able to monitor both the harmonic and fundamental spectral regions simultaneously for each XFEL shot at 120 Hz (Supplementary Fig. 1). There is a broad scattered light background which scales linearly with the pulse energy, so an estimated scalar background value on a shot-by-shot basis was determined from the average of the high energy side of the detector which was then subtracted from the entire spectrum.

The scattered light background can be seen to also have some oscillatory features which scale with pulse energy. This background is shown in Supplementary Fig. 1C. While the counts from the background (typically < 5 per pixel) are too small by nearly an order of magnitude to produce the main observed intensity-dependent features in the harmonic region, they likely provide some distortion to the edges of the XFEL harmonic. In addition to the XFEL fundamental and second harmonic, there are smaller peaks around 675 and 875 eV which are smaller scattered fundamental signals. The stronger peak appearing at 675 eV was used to correct for detector saturation in the main fundamental peak on very strong shots. Finally, the second-order harmonic peak (which overlaps the fundamental peak) was estimated from the first-order harmonic peak and subtracted from the fundamental on a shot-by-shot basis, which improves the calculated XAS and covariance maps in the strongly absorbing spectral regions. The attenuation of the harmonic peak through a series of aluminum filters was found to match literature values56, so there is no major fundamental component to the observed harmonic peak.

Histograms of pulse energies as measured by the GMD for six consecutive runs are shown in Supplementary Fig. 1D, showing the machine behavior is consistent over these timescales. Average spectra of the fundamental and harmonic for each GMD bin are shown in Supplementary Fig. 1E and 1F, respectively. There are moderate changes to the spectrum with pulse energy, especially in the lowest quartile, but the higher energy pulses are fairly consistent. The full analysis described below uses these intensity-dependent spectra in the calculation of the TA and SHG contributions to the signal.

Intensity-dependent signals analysis

Because of the poor correlation between the XFEL fundamental and harmonic intensities, the only filtering that was done was the removal of dropped XFEL shots from the dataset. For the fundamental intensity-dependent analysis, the data set was binned according to the GMD pulse energy measurement, which is upstream of the interaction point and is equivalent between the jet-in and jet-out datasets57. The binned jet-in and jet-out datasets could then be used to analyze intensity-dependent features in both spectral regions.

The full, average spectrum will be notated as I0(Ω) and I(Ω) for the jet-out and jet-in harmonic spectra. The harmonic spectra as binned on the integrated fundamental spectra, s, will be given as I0(Ω|s) and I(Ω|s). Similarly, the full fundamental spectra will be given as I0(ω) and I(ω) and the spectra binned on the integrated harmonic spectra, S, will be given as I0(ω|S) and I(ω|S).

For calculating the difference signals shown in Fig. 2B of the main text, the binned jet-out data was first scaled by the average transmission through the jet at each photon energy:

TΩ=I(Ω)/I0Ω 2

Where T represents the transmission at harmonic frequency Ω. T(Ω)I0(Ω|s) was subtracted from each corresponding jet-in dataset I(Ω|s) to yield the difference signal.

From the model of the harmonic signals presented in Fig. 2A, the observed signal at a given photon energy in the harmonic region depends on the magnitude of the TA signal and the magnitude of the SHG signal:

IΩs=T0Ω+MΩs(I0Ωs+ISHGΩs) 3

Where T0 is the linear transmission of the harmonic through the sheet in the absence of any TA effect and M is the magnitude of the TA effect (experimentally found to vary across the harmonic region), and ISHG is the SHG signal at the given frequency and fundamental intensity.

From the observed jet-in and jet-out dataset, the measured SHG signal for a given bin is:

ISHGΩs=IΩsT0Ω+MΩsI0Ωs 4

This form follows from the fact that the observed SHG is generated almost entirely from the sheet surface facing the beam and is therefore attenuated in the same way as XFEL harmonic. As the fundamental is strongly attenuated by the sheet (OD > 1 at all energies we find SHG signal), there is a greater than 100-fold reduction in the SHG signal generated at the back interface. Therefore any SHG produced by the back interface has a negligible contribution on the measured signals. The isolated SHG signal was then integrated across the spectral region for determination of the effective χ(2) response.

Initial guesses for the TA parameters were generated by using Eq. 3 and assuming ISHG is negligible. The transmission for each GMD bin could then be fit to a line to yield an estimate for T0 and M which were used for the full fitting with the covariance maps. Examples of these linear fits are shown in Supplementary Fig. 2A at the spectral regions shown in Supplementary Fig. 2B. This technique would underestimate the magnitude of the TA response in the presence of an SHG signal, which necessitates the more complex covariance map fitting. Intensity-dependent features are also apparent in the fundamental region. While these are also most likely caused by radiolysis from the fundamental pulse, it was found to be useful for the covariance analysis to correlate the magnitude of the TA features in the fundamental with the integrated XFEL harmonic:

IωS=t0(ω)+m(ω)SI0ωS 5

The values of t0 and m could then be estimated from a linear regression of the data binned to the integrated XFEL harmonic, S, as discussed in the previous section.

Covariance mapping

The covariance mapping39 visualizes the covariance matrix as a contour plot. For our purposes, we principally care about the correlations between the harmonic and fundamental regions of the detector. Each element of the covariance matrix is the covariance between the measured counts on two pixels. For the jet-out data, the covariance between a pixel in the fundamental region and a pixel in the harmonic region is given by:

covI0(ω),I0(Ω)=E[I0(ω)I0(Ω)]E[I0(ω)]E[I0(Ω)] 6

Where E[…] indicates the expectation value. This measurement captures the intrinsic correlations between the fundamental and XFEL harmonic. For the jet-in data, we can use Eqs. 3 and 5 to get:

covIω,IΩ=covt0ω+mωSI0ω,T0Ω+MΩs(I0Ω+ISHG) 7
cov(I(ω),I(Ω))=t0ωT0ΩcovI0ω,I0Ω+t0ωT0ΩcovI0ω,ISHG+T0ΩmωcovSI0ω,I0Ω+ISHG+t0ωMΩcovI0ω,s(I0Ω+ISHG)+mωMΩcovSI0ω,s(I0Ω+ISHG) 8

Equation 8 follows from 7 by the bilinearity of covariance. Similar calculations can be done to decompose the correlations between two pixels in the fundamental region or two pixels in the harmonic region. From Eq. 8, we can see that the covariance has contributions from, in order: the intrinsic correlation from the fundamental and XFEL harmonic, the correlation between the fundamental and the SHG signal, the XFEL harmonic and fundamental TA, the fundamental and the harmonic TA, as well as the covariance between the various TA signals and the SHG (comparably small). The dominant contributions from the intrinsic correlations can then be subtracted using the jet-out covariance data multiplied by the linear transmission of the harmonic and fundamental at that energy, which yields only the intensity-dependent contributions. Neglecting higher-order cross terms in m, M, and ISHG, the difference maps, D, are then:

DIω,IΩ=covIω,IΩt0ωT0ΩcovI0ω,I0Ω 9
DIω,IΩt0ωT0ΩcovI0ω,ISHG+T0ΩmωcovSI0ω,I0Ω+t0ωMΩcovI0ω,sI0Ω 10

These covariance difference maps with the intrinsic correlations suppressed are the maps which are shown in the main text and analyzed below. Raw jet-in and jet-out covariance maps for this spectral region can be seen in Supplementary Fig. 3 for the 550 eV dataset highlighted in the main text.

Experimentally, the jet-out covariance matrix can then be multiplied element-wise with the outer product of the jet transmission to get the full covariance difference matrix. The same estimated transmission matrix was used for the experimental data and the modeled data (described below) to minimize the errors in fitting the data.

Modeling the SHG response

While a well-behaved transform-limited pulse in a medium with a uniform χ(2) response will result in the SHG signal being the convolution of the fundamental spectrum, a poorly behaved pulse results in the SHG response resembling the square of the fundamental profile. These two limiting cases present differently in the covariance maps and binned data, so the response has to be modeled. An empirical definition of ISHG was employed:

ISHG2ω=χeff2ω2I0ωΔI0ω+Δg(Δ)dΔ 11

Where the effective nonlinear susceptibility is only dependent on the output photon frequency and the function g(Δ) is chosen to interpolate between the two limiting cases of interest for our SHG signal.

For computational efficiency, a boxcar function was used for g:

g(Δ)=1,ΔΔm0,Δ>Δm 12

Where Δm is the range of frequencies which can interact in a χ(2) process. The best fit choice for Δm was determined to be about ~1.5 eV (7 pixels on the detector) and was used for the covariance model fitting.

Empirically it was found that the nonlinear response had a spectral profile which cut off on the red side of the spectrum (e.g., Fig. 3A). As the spectral response was consistent across all of the potential SHG signal datasets examined, it is likely more reflective of the XFEL source than the nonlinear properties of the water sample. To incorporate the observed spectral drop-off, the nonlinear spectral response was then modeled phenomenologically as a logistic function:

χeffΩ2=χ021+expb(ΩΩ0) 13

Where χ02 is the magnitude of the nonlinear response, b is the logistic growth rate (curve steepness) and Ω0 is the midpoint of the function.

Data fitting

The TA signal parameters and the derived SHG parameters were determined with a simultaneous fitting of the experimental covariance maps and the binned intensity-dependent features. Broadly, parameters controlling the frequency-dependent values of the linear transmission (t0, T0) and TA features (m, M) were input and used to calculate the binned SHG signal based on Eq. 4. The binned SHG response was compared to the calculated SHG response using inputted values for the SHG spectral response (Δm, Ω0, b) to determine the magnitude of the response, χ02. A simulated jet-in dataset that was generated from the jet-out dataset and parameters for the various intensity-dependent processes indicated in Eq. 10, which was then used to calculate a model covariance map. The goodness of fit was determined by a composite parameter using the residual of the experimental covariance map to the model map and the residual of the binned SHG signal to the calculated binned SHG response.

As was discussed above and illustrated in Supplementary Fig. 2, estimates for t0, m, T0, and M can be obtained from linear fits to the binned data. The spectral response to the fundamental TA m(ω) was assumed to be broadly spectrally accurate, and only a global scale factor and offset was used for fitting. As the harmonic TA signal is the most uncertain and has the greatest impact on the derived SHG signal, M(Ω) was modeled as a five-point cubic spline equally spaced in Ω.

The new estimates for t0, m, T0, and M were then used to obtain the binned isolated SHG signal per Eq. 4. The binned SHG signal was used to determine the magnitude of χ02 by integrating the overall signal in each bin and taking a linear fit between the experimental (Eq. 4) and calculated (Eq. 11) integrated SHG signals with a y-intercept of zero. From here, the transmission and TA for both regions and the full nonlinear spectral response are used to calculate the modeled jet-in data from which the modeled covariance map and the intensity-dependent difference spectra (isolated SHG signals) can be obtained.

The goodness of fit was determined by simultaneously minimizing the squared residual of the experimental and modeled covariance difference maps and the squared residual of the experimental and modeled intensity-dependent isolated SHG signals. The overall composite function that was minimized was:

F=1Rcov2+1Rbin2+log(rbin2) 14

Where Rcov2 and Rbin2 are weighted coefficients of determination for the covariance map fit and binned SHG signal fits respectively and rbin2 is the squared residual of the binned fits.

The fit results were still found to suffer from systematic errors, particularly in the range of bins considered in the determination of T0 and χ02 and the integration bounds considered for determining χ02. To attempt to quantify this additional uncertainty, for each dataset 13 different permutations of the bin and integration bounds choices were considered. These systematic errors were found to be comparable or larger to the raw statistical errors in many cases and are included in the error bars of the linear transmission, TA fits, and fits of the nonlinear spectral response, including the nonlinear susceptibilities reported in Fig. 3.

The experimental and resulting best fit covariance map, the binned isolated SHG signal, and the quadratic fit for each dataset are presented in Supplementary Fig. 4. Error bars in binned signals are the standard deviation between the three experimental runs. The R-squared values used for fitting in Eq. 14 are shown in Supplementary Fig. 5. The covariance map fits were fair to good with R-squared values above 0.5. The binned spectra fit and intensity-dependent quadratic fits show similarly fair-to-good fits above the K-edge, but SHG fits were disfavored relative to the no-signal null case for all of the datasets below the K-edge.

The intensity-dependent datasets containing candidate SHG signals were also fit to a power law of the form:

y=Axp 15

Which yielded fits of similar quality to the quadratic fits (Supplementary Fig. 5B, red circles). The best fit power law exponents, p, are shown in Supplementary Fig. 5C. The resulting best fit power laws were scattered between 1.5 and 2.5, consistent with quadratic behavior and similar to what is seen in conventional lab based optical SHG measurements.

Molecular dynamics simulations

Here, we employed the recently developed MB-pol(2023) potential41 and the LAMMPS58 MD simulation engine for all our simulations. To date, the MB-pol many-body water potential of Paesani and coworkers4143,5961 is widely considered to be amongst the most accurate, transferable and predictive, able to simultaneously reproduce the bulk and interfacial structure and thermodynamics of the liquid, vapor and solid phases. To start, we generated a unit-cell consisting of 432 water molecules, in a cubic box of length 18.77 Å, packed to minimize the interaction energy by applying the continuous configurational Boltzmann biased Monte Carlo method62,63. We then equilibrated the bulk liquid using our standard procedure64: after an initial conjugate gradient energy minimization to a root mean square (RMS) force of 10−5 kcal/(mol/Å), we heated the system from 0 K to the desired temperature (298 K and 277 K) over 100 ps in the constant volume, constant temperature (canonical or NVT) ensemble with a Nose-Hoover chain of 3 thermostats with a relaxation time of 0.25 ps. The system was propagated forward in time using a velocity-Verlet algorithm with an integration time step of 0.5 fs. van der Waals and short range electrostatic interactions were calculated explicitly, with a real space cutoff of 0.9 nm, while the long-range electrostatics were calculated in reciprocal space using the particle−mesh Ewald (PME) method, as implemented in the helPME library65,66, with a convergence tolerance of 10−5 kcal/mol. The many-body energies, forces and stresses were calculated in LAMMPS by means of the fix mbx and pair mbx functionality which allows for seamless interfacing with the MBX software67.

After initial NVT equilibration, we resolved any stresses in the system by means of 1 ns of constant pressure, constant temperature (NPT) simulations using the Andersen barostat (pressure relaxation constant of 1 ps). The equations of motion used are those of Shinoda et al.68, which combine the hydrostatic equations of Martyna et al.69 with the strain energy proposed by Parrinello and Rahman70. The time integration schemes closely follow the time-reversible measure-preserving Verlet integrators derived by Tuckerman et al.71 During the last 500 ps of the 1 ns NPT simulation, we calculated the average cell lengths and linearly adjusted the final NPT simulation cell to the averages, over a further 100 ps of dynamics. Finally, we performed production dynamics for an additional 2 ns of NVT dynamics. As a figure of merit, the calculated density of water at ambient conditions (298 K, 1 atm) was 0.997 g/cm3.

The final snapshot of our bulk simulations was used as an initial configuration for our vacuum/water simulations. Here, we inflated the simulation cell in the z-direction by 10 nm and centered the liquid slab within the cell, thus allowing 5 nm of vacuum at each surface. The top and bottom of the simulation box were bounded by a purely repulsive wall, using the fix wall/harmonic functionality in LAMMPS, for stable simulations in the rare event that molecules evaporated from the surface and approached the unit cell z-boundaries. After initial heating, we performed a 2D-NVT simulation for 4 ns. In all cases, the slab was found to be stable over the entire MD simulation. All interface simulation parameters were the same as the bulk simulations, except for the application of the 2D PME method and the 2D slab corrections of Yeh and Berkowitz72 with a further 2.0 z factor, in order to eliminate spurious interactions between the two surfaces.

To validate our vapor/water simulations, we calculated the interfacial surface tension (IFT), using the rigorous statistical mechanical formulism first proposed by Tolman73, and later developed more fully by Kirkwood and Buff (KB)74:

γ=12pz12paz+pbzdz 16

where p is the component of the stress tensor perpendicular to the surface (z-axis in our coordinate system), while p||a and p||b are the parallel (x- and y-axis) components. The general idea behind this KB method is that in the bulk liquid, the parallel and perpendicular stress components are equal and cancel, while at or near the interface, 2p > p||. Thus Eq. 16 allows the IFT to be obtained from MD calculations by a simple integration of the components of the pressure. The calculated IFT using this approach was 69.5 ± 1.3 mJ/m2 at 298 K and 72.6 ± 2.5 mJ/m2 at 277 K. While the calculated IFT is slightly underpredicted compared to experiments (72.0 and 75.1 mJ/m2 respectively), the calculated slope of the IFT with decreasing temperature of 1.045 mJ/m2/K is in excellent agreement with the value calculated from the experimental steam tables75 of 1.042 mJ/m2/K, suggesting that the surface entropy, and by extension the interfacial structure, of MB-pol(2023) is representative of the experimental reality.

Mass density distribution and interface definition

We employ a rigorous definition of the water interface based on the instantaneous interface definition advanced by Willard and Chandler44. Here the interface is defined by a time-dependent density field ρ(r,t), constructed from Gaussian functions located at the water molecules’ center of mass ri(t):

ρr,t=iN/32πξ2321erritξ2 17

where ξ = 0.24 nm (~ the diameter of a water molecule) is the Gaussian width, representing a system-dependent coarse-graining length.

From Eq. 17 we can define the instantaneous interface at a given time t as the 2D manifold s(t) = r, for which the density field is constant. In the current work, we take this constant to be half the bulk density, i.e., the Gibbs dividing surface. Once defined, we apply a smoothing function, by means of a cubic interpolation scheme, to ensure a continuous instantaneous interface definition. Then the position of each water molecule was taken as the closest contact point to each of the two instantaneous interfaces, discretized along the z-axis in 0.1 Å bins. An example of the resulting profiles is given in Supplementary Fig. 6B. There we find that while the planar interface shows the usual hyperbolic tangent dependence with distance, there is instead significant oscillations in the density according to the instantaneous interface definition. Moreover, we are able to unambiguously define the 1st interface layer, as the region extending 0.67 nm into the liquid from the point of vanishing density.

Hydrogen bond definition and analysis

We employed a parameter free, radial-angle joint distribution function g(R,β) (shown schematically in Supplementary Fig. 6C), which is the normalized probability of observing an acceptor-donor pair in a volume element between R and R + dR, and an acceptor-donor-hydrogen angle between β and dβ:

gR,β=Pr(R,β)/QQ=NN1V2πsinβdβR2dR 18

where Q is the expected probability if the system was noninteracting76, N is the number of oxygen donor/acceptor atoms, and V is the volume. We then calculate the potential of mean force (PMF) W(R,β) as:

WR,β=kTlngR,β 19

Notably, for consideration of the vapor/water interface, we determined the <Vinterface> and <Ninterface> from the mass-density profile of the instantaneous interface in the previous section.

We defined the H-bonded state as an equipotential region of the 2D PMF surface that passes through the saddle point, and encircles the minimum, while all other combinations are considered to be broken H-bonds, or in the extreme case, free OH-bonds (i.e., water molecules not participating in any H-bonds). Practically, this amounts to first constructing g(R,β) and numerically determining B(R,β,Wcut): the bounding region encircling a minima with energies within the cutoff Wcut, numerically. Specifically, Wcut was determined self-consistently as the saddle point where the overlap between two or more minima B(R, β, Wcut) is exactly 1. In the current case of MB-pol(2023), we determined that for the bulk liquid at 298 K and 1 atm, Wcut = 1.22 kT, which we show graphically as the shaded region in Supplementary Fig. 6C. The H-bonding statistics were then obtained by reanalyzing the MD trajectory and selecting only the water H-bonds within B(R,β,Wcut = 1.22 kT).

For theoretical estimates of the average number of H-bonds per water molecule <nHB > , we decomposed the H-bond configurations for the bulk and interface layer into the 9 Acceptor(A)/Donor(D) configurations AxDy, where x, y ∈ [0, 1, 2]. As a figure of merit, we calculated an average number of H-bonds of <nHB ≥ 3.55 ± 0.12 for our bulk simulations at 298 K, which reduces to <nHB ≥ 2.92 ± 0.45 in the 1st interfacial layer, defined based on our instantaneous interface calculations described above. The percentage of the various H-bonding configurations in the bulk and at the interface at 298 K and 277 K are given in Supplementary Table 1.

X-ray absorption spectra simulation

Oxygen K-edge spectra were calculated by exciting each oxygen atom in each snapshot individually using constrained-occupancy DFT calculations employing the PBE GGA functional77. Plane–wave pseudopotential calculations using ultrasoft pseudopotentials78 were performed using the PWSCF code within the Quantum-ESPRESSO package79. We used a kinetic energy cut-off for electronic wave functions of 25 Ry and a density cut-off of 200 Ry. The core-excited Kohn–Sham eigenspectrum was generated using the XCH approach80. Based on a numerically converged self-consistent charge density, we generated the unoccupied states for our XAS calculations non-self-consistently, sufficiently sampling the first Brillouin zone with a 2 × 2 × 2 uniform k-point grid, employing an efficient implementation of the Shirley interpolation scheme81 generalized to handle ultrasoft pseudopotentials82. Matrix elements were evaluated within the PAW frozen-core approximation83. Core-excited ultrasoft pseudopotentials and corresponding atomic orbitals were generated with the Vanderbilt code78. Each computed transition was convolved with a 0.2 eV Gaussian function to produce continuous spectra.

The use of pseudopotentials in our calculations means that we lose the absolute reference state from which we can base our linear excitation energies, as would be present in an all-electron calculation. Thus, each spectra needed to be properly calibrated for unambiguous comparisons, especially in systems with varying number of molecules, total charge, or different cell sizes. As in our previous work84, we here apply two independent calibration schemes. First, we apply to the formation energy formulism of Prendergast and coworkers to self-consistently align our spectra to a common reference85. Previous work has shown that this approach is accurate in predicting the main peak position of various lithium compounds to within 0.1 eV of the experiments86. We then applied an absolute energy shift, by matching the simulated Oxygen K-edge white-line of O2 to the experiments, which amounted to a shift of +535.5 eV applied to all our simulated spectra.

Nonlinear SXSHG simulations

To demonstrate the second harmonic susceptibility of the water interface, first principles simulations of the SHG were performed. X-ray simulations become prohibitively expensive as the system size increases, which limits the absolute size of the model systems. Here, we used simulation cells in a slab geometry with dimensions 10 × 10 × 30 Å and the total number of atoms ranging from 90–110 depending on the surface configuration. 20 Å of vacuum was included to represent the water/vacuum interface and to avoid periodic image interactions. These smaller unit cells were then propagated forward in time using our approach outlined above, with harmonic restraints applied to the internal structure and center of mass position of the particular H-bond configuration (i.e., the AxDy species and its first solvation shell). This allowed for the equilibration of the water structure around the AxDy configuration, which we found lead to rapidly convergent electronic-structure calculations. Next, we performed excited state calculations using velocity-gauge real-time time-dependent density functional theory (RT-TDDFT) as implemented in a modified version of the SIESTA package40, with a double-ζ with polarization atomic natural orbital basis set used to model electronic structures. One of the advantages of performing RT-TDDFT in the velocity gauge is that the method provides a full multipole expansion via the applied external vector potential and the resultant time-dependent current density. In this sense, all contributions, dipole, quadrupole, etc., are included, albeit at a nontrivial computational cost, meaning the method is also sensitive to bulk-active contributions to the signal.

Soft X-ray transitions were obtained by including the oxygen 1 s orbital such that the solutions of Kohn-Sham DFT contained core electrons. The system was sampled at the gamma point, with a plane wave energy cutoff of 400 Ry. As SIESTA uses the real-space atomic natural orbital basis rather than plane waves, this cutoff corresponds to the size of the real-space grid that the relevant objects are represented on in analogy to how dense such a grid would have to be for a corresponding plane wave basis.

The second-order susceptibility of the water surface was obtained by propagating an intense laser pulse through the system and calculating the resulting current. A perturbative pulse of 521 eV was used with the timestep of 0.0055 a.u., corresponding to ~0.2661 attoseconds. The resulting non-equilibrium electron density was propagated under the full pulse duration of 1 fs to obtain the second harmonic response of the system. The SHG response of the water surface was extracted following the semi-slab approach established in our previous works2024,87.

To extract the magnitude of the second harmonic response, we performed SHG calculations with pulse intensities from 1.0 × 1012 to 7.5 × 1017 W/cm2. The current density of the system due to perturbation can be described as a Taylor series expansion of field strength:

j=χ(0)+χ(1)Eω+χ(2)Eω2+ 20

The resulting response at 2ω was obtained from numerical integration and fit to a quadratic in the intensity. It has been shown that the total χ(2) response can be expressed as an outer sum of the initially occupied states88, which are here the individual O 1 s orbitals. This allows the SXSHG response to be calculated for each water molecule, which allows the contributions to the full SXSHG spectrum from different H-bond configurations to be determined.

We found that the X-ray SHG is dominated by the topmost water molecules at the interface, with a signal that is more than an order of magnitude larger than the corresponding simulated spectra in the bulk (3D geometry), even with the limitation of the small number of water molecules included in the simulation. A representative calculated current, across the fundamental and harmonic for the 3D bulk and 2D slab geometries are shown in Supplementary Fig. 7A. A plot of the 2ω response as a function of field strength is demonstrated in Supplementary Fig. 7B. A comparison of the full simulated SXSHG spectrum to the simulated XAS is shown in Fig. Supplementary Fig. 7C, showing the experimental shift is replicated. The simulated SXSHG spectra for each major H-bond species was shown in Fig. 4C. The remaining minor species (surface prevalence of <3% in Supplementary Table 1) are shown in Supplementary Fig. 7D.

Quantum chemistry calculations

To try and develop an understanding of the possible states being accessed with second harmonic excitation, first principles spin-unrestricted linear response TD-DFT calculations were performed, using the B3LYP functional89 within the Q-Chem 6.2 electronic structure package90. These simulations used a reduced excitation space91 of just the oxygen 1 s electrons. Basis set polarization at the 5-ζ level within the Dunning correlation consistent basis sets92 (i.e., cc-pV5Z) was required to have sufficient density of states at and above twice the O K-edge to determine the possible second harmonic excited states. Initial TD-DFT simulations were performed on only a single water molecule to calculate all possible TD-DFT excited states. Within this methodology, the lowest lying O 1 s excited state with a non-negligible oscillator strength (i.e., the K-edge) was found to be around 518.9 eV. Thus, the excited states considered as potentially accessible at the second harmonic began at ~1040 eV and are listed in Supplementary Table 2. To visualize these states, their first pair of natural transition orbitals were determined, along with a corresponding cube file. Although the calculations are spin-unrestricted, the spin-up and spin-down natural transition orbitals were identical, and only the spin-up versions are presented here.

Supplementary information

Acknowledgements

D.J.H. acknowledges helpful discussions with L. Young, T.J.A. Wolf, and J.P. Cryan. T.A.P. acknowledges fruitful discussions from V.A.P. and L.J.P. Use of the Linac Coherent Light Source (LCLS), SLAC National Accelerator Laboratory, is supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences under Contract No. DE-AC02-76SF00515. This research used resources of the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility supported by the Office of Science of the US DOE, under contract no. DE-AC02-05CH11231, using NERSC award BES-ERCAP0033755. This work also used the Expanse supercomputer at the San Diego Supercomputing Center through allocations DMR190106 and PHY210131 from the Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support (ACCESS) program, which is supported by NSF Grant No. 2138259, No. 2138286, No. 2138307, No. 2137603, and No. 2138296. D.J.H. and J.D.K. were supported by the Department of Energy, Laboratory Directed Research and Development program at SLAC National Accelerator Laboratory, under contract DE-AC02-76SF00515. M.W.Z. acknowledges funding by the Department of Energy (DE-SC0024123). M.W.Z., T.A.P., J.D.K., and R.J.S. acknowledges funding from the UC Office of the President within the Multicampus Research Programs and Initiatives (Grant No. M21PL3263). W.S.D. was supported by the U.S. Department of Energy, Office of Science, Basic Energy Sciences, Chemical Sciences, Geosciences, and Biosciences Division under Contract DE-AC02-05CH11231, FWP number FP00014920. S.W.D., D.S., K.V.L., and C.P.S. were supported by the U.S. Department of Energy, Office of Science, Basic Energy Sciences, Chemical Sciences, Geosciences, and Biosciences Division under Contract DE-AC02-05CH11231. F.B., S.W.D., and R.J.S. were supported by the Department of Energy, Office of Basic Energy Sciences, through the Chemical Sciences Division at the Lawrence Berkeley National Laboratory under contract #CH403503. S.J. and A.D. were supported by the US DOE BES grant no. DE-SC0023503. S.J. acknowledges the financial assistance award 70NANB23H111 from U.S. Department of Commerce, National Institute of Standards and Technology. B.R.N. acknowledges funding from the National Science Foundation Graduate Research Fellowship Program (Grant No. DGE 1752814). J.A.S. acknowledges support from the Arnold O. Beckman Postdoctoral Fellowship Program. E.J.R. acknowledges support from CALSOLV/RESOLV-Bochum and the UC Office of the President within the Multicampus Research Programs and Initiatives. T.A.P. acknowledges support from the Sloan Research Foundation.

Author contributions

C.P.S., W.S.D., R.J.S., M.W.Z., and J.D.K. originated the project concept. D.J.H., S.W.D., D.G., J.A.S., B.R.N., F.B., E.J.R., K.K., C.Y.H., G.L.D., C.P.S., M.W.Z., and J.D.K. executed the experiment and collected data. J.D., D.C., N.S., and A.M. coordinated attosecond pulse delivery. D.J.H. and S.W.D. provided sample characterization. D.J.H., S.W.D., D.G., and J.D.K. analyzed experimental data. S.J., D.S., A.D., K.V.L., and T.A.P. provided theoretical modeling. D.J.H., K.V.L., T.A.P., C.P.S., M.W.Z., and J.D.K. wrote the manuscript with contributions from all authors.

Peer review

Peer review information

Nature Communications thanks Lixue Shi and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Data availability

The experimental data as shown in the main text and Supplementary figures, the TDDFT input and data files, and molecular dynamics simulation input and data files are available on Zenodo93.

Code availability

Custom code used for TDDFT and molecular dynamics simulations can be found in the open-access repository on Zenodo93.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

David J. Hoffman, Email: djhoff@slac.stanford.edu

Craig P. Schwartz, Email: craig.schwartz@unlv.edu

Tod A. Pascal, Email: tpascal@ucsd.edu

Jake D. Koralek, Email: koralek@slac.stanford.edu

Supplementary information

The online version contains supplementary material available at 10.1038/s41467-025-65514-4.

References

  • 1.Björneholm, O. et al. Water at Interfaces. Chem. Rev.116, 7698–7726 (2016). [DOI] [PubMed] [Google Scholar]
  • 2.Zhong, J. et al. Atmospheric spectroscopy and photochemistry at environmental water interfaces. Annu. Rev. Phys. Chem.70, 45–69 (2019). [DOI] [PubMed] [Google Scholar]
  • 3.Chanda, A. & Fokin, V. V. Organic synthesis “on water”. Chem. Rev.109, 725–748 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Laage, D., Elsaesser, T. & Hynes, J. T. Water dynamics in the hydration shells of biomolecules. Chem. Rev.117, 10694–10725 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Smith, J. D. et al. Unified description of temperature-dependent hydrogen-bond rearrangements in liquid water. Proc. Natl. Acad. Sci. USA102, 14171–14174 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Ball, P. Water as an active constituent in cell biology. Chem. Rev.108, 74–108 (2008). [DOI] [PubMed] [Google Scholar]
  • 7.Navascues, G. Liquid surfaces: theory of surface tension. Rep. Prog. Phys.42, 1131–1186 (1979). [Google Scholar]
  • 8.Winter, B. & Faubel, M. Photoemission from liquid aqueous solutions. Chem. Rev.106, 1176–1211 (2006). [DOI] [PubMed] [Google Scholar]
  • 9.Wilson, K. R. et al. Investigation of volatile liquid surfaces by synchrotron X-ray spectroscopy of liquid microjets. Rev. Sci. Instrum.75, 725–736 (2004). [Google Scholar]
  • 10.Faubel, M., Schlemmer, S. & Toennies, J. P. A molecular beam study of the evaporation of water from a liquid jet. Z. Phys. D Mol. Clust.10, 269–277 (1988). [Google Scholar]
  • 11.Braslau, A. et al. Surface roughness of water measured by X-ray reflectivity. Phys. Rev. Lett.54, 114–117 (1985). [DOI] [PubMed] [Google Scholar]
  • 12.Thrane, L., Jacobsen, R. H., Uhd Jepsen, P. & Keiding, S. R. THz reflection spectroscopy of liquid water. Chem. Phys. Lett.240, 330–333 (1995). [Google Scholar]
  • 13.Shi, L. et al. Water structure and electric fields at the interface of oil droplets. Nature640, 87–93 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Ramos, T. N. & Champagne, B. Investigation of the second harmonic generation at the water–vacuum interface by using multi-scale modeling methods. ChemistryOpen12, e202200045 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Nihonyanagi, S., Yamaguchi, S. & Tahara, T. Ultrafast dynamics at water interfaces studied by vibrational sum frequency generation spectroscopy. Chem. Rev.117, 10665–10693 (2017). [DOI] [PubMed] [Google Scholar]
  • 16.Dalstein, L., Chiang, K.-Y. & Wen, Y.-C. Direct quantification of water surface charge by phase-sensitive second harmonic spectroscopy. J. Phys. Chem. Lett.10, 5200–5205 (2019). [DOI] [PubMed] [Google Scholar]
  • 17.Geiger, F. M. Second harmonic generation, sum frequency generation, and χ (3): dissecting environmental interfaces with a nonlinear optical Swiss Army knife. Annu. Rev. Phys. Chem.60, 61–83 (2009). [DOI] [PubMed] [Google Scholar]
  • 18.Matsuzaki, K., Nihonyanagi, S., Yamaguchi, S., Nagata, T. & Tahara, T. Quadrupolar mechanism for vibrational sum frequency generation at air/liquid interfaces: theory and experiment. J. Chem. Phys.151, 064701 (2019). [Google Scholar]
  • 19.Litman, Y., Chiang, K.-Y., Seki, T., Nagata, Y. & Bonn, M. Surface stratification determines the interfacial water structure of simple electrolyte solutions. Nat. Chem.16, 644–650 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Berger, E. et al. Extreme ultraviolet second harmonic generation spectroscopy in a polar metal. Nano Lett.21, 6095–6101 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Schwartz, C. P. et al. Angstrom-resolved interfacial structure in buried organic-inorganic junctions. Phys. Rev. Lett.127, 096801 (2021). [DOI] [PubMed] [Google Scholar]
  • 22.Uzundal, C. B. et al. Polarization-resolved extreme-ultraviolet second-harmonic generation from LiNbO3. Phys. Rev. Lett.127, 237402 (2021). [DOI] [PubMed] [Google Scholar]
  • 23.Woodahl, C. et al. Probing lithium mobility at a solid electrolyte surface. Nat. Mat.22, 848–852 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lam, R. K. et al. Soft X-ray second harmonic generation as an interfacial probe. Phys. Rev. Lett.120, 23901 (2018). [DOI] [PubMed] [Google Scholar]
  • 25.Yamamoto, S. et al. Element selectivity in second-harmonic generation of GaFeO3 by a soft-X-ray free-electron laser. Phys. Rev. Lett.120, 223902 (2018). [DOI] [PubMed] [Google Scholar]
  • 26.Zong, A., Nebgen, B. R., Lin, S.-C., Spies, J. A. & Zuerch, M. Emerging ultrafast techniques for studying quantum materials. Nat. Rev. Mater.8, 224–240 (2023). [Google Scholar]
  • 27.Smith, J. W. & Saykally, R. J. Soft X-ray absorption spectroscopy of liquids and solutions. Chem. Rev.117, 13909–13934 (2017). [DOI] [PubMed] [Google Scholar]
  • 28.Fransson, T. et al. X-ray and electron spectroscopy of water. Chem. Rev.116, 7551–7569 (2016). [DOI] [PubMed] [Google Scholar]
  • 29.Mizrahi, V. & Sipe, J. E. Phenomenological treatment of surface second-harmonic generation. J. Opt. Soc. Am. B5, 660 (1988). [Google Scholar]
  • 30.Duris, J. et al. Tunable isolated attosecond X-ray pulses with gigawatt peak power from a free-electron laser. Nat. Photonics14, 30–36 (2020). [Google Scholar]
  • 31.Franz, P. et al. Terawatt-scale attosecond X-ray pulses from a cascaded superradiant free-electron laser. Nat. Photonics18, 698–703 (2024). [Google Scholar]
  • 32.Ekimova, M., Quevedo, W., Faubel, M., Wernet, P. & Nibbering, E. T. J. A liquid flatjet system for solution phase soft-x-ray spectroscopy. Struct. Dyn.2, 054301 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Galinis, G. et al. Micrometer-thickness liquid sheet jets flowing in vacuum. Rev. Sci. Instrum.88, 083117 (2017). [DOI] [PubMed] [Google Scholar]
  • 34.Koralek, J. D. et al. Generation and characterization of ultrathin free-flowing liquid sheets. Nat. Comm.9, 1353 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Crissman, C. J. et al. Sub-micron thick liquid sheets produced by isotropically etched glass nozzles. Lab Chip22, 1365–1373 (2022). [DOI] [PubMed] [Google Scholar]
  • 36.Hoffman, D. J. et al. Microfluidic liquid sheets as large-area targets for high repetition XFELs. Front. Mol. Biosci.9, 1–15 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Loh, Z.-H. et al. Observation of the fastest chemical processes in the radiolysis of water. Science367, 179–182 (2020). [DOI] [PubMed] [Google Scholar]
  • 38.Spiller, E. Soft X-ray Optics (SPIE Optical Engineering Press, 1994).
  • 39.Frasinski, L. J. Covariance mapping techniques. J. Phys. B Mol. Opt. Phys.49, 152004 (2016). [Google Scholar]
  • 40.Pemmaraju, C. D. et al. Velocity-gauge real-time TDDFT within a numerical atomic orbital basis set. Comput. Phys. Commun.226, 30–38 (2018). [Google Scholar]
  • 41.Zhu, X., Riera, M., Bull-Vulpe, E. F. & Paesani, F. MB-pol(2023): sub-chemical accuracy for water simulations from the gas to the liquid phase. J. Chem. Theory Comput.19, 3551–3566 (2023). [DOI] [PubMed] [Google Scholar]
  • 42.Bore, S. L. & Paesani, F. Realistic phase diagram of water from “first principles” data-driven quantum simulations. Nat. Comm.14, 3349 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Reddy, S. K. et al. On the accuracy of the MB-pol many-body potential for water: Interaction energies, vibrational frequencies, and classical thermodynamic and dynamical properties from clusters to liquid water and ice. J. Chem. Phys.145, 194504 (2016). [DOI] [PubMed] [Google Scholar]
  • 44.Willard, A. P. & Chandler, D. Instantaneous Liquid Interfaces. J. Phys. Chem. B114, 1954–1958 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Kumar, R., Schmidt, J. R. & Skinner, J. L. Hydrogen bonding definitions and dynamics in liquid water. J. Chem. Phys.126, 204107 (2007). [DOI] [PubMed] [Google Scholar]
  • 46.Buttersack, T. et al. Imaging temperature and thickness of thin planar liquid water jets in vacuum. Struct. Dyn.10, 034901 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Bäppler, S. A., Plasser, F., Wormit, M. & Dreuw, A. Exciton analysis of many-body wave functions: Bridging the gap between the quasiparticle and molecular orbital pictures. Phys. Rev. A90, 052521 (2014). [Google Scholar]
  • 48.do Monte, S. A. et al. Quantification of the ionic character of multiconfigurational wave functions: the Q at diagnostic. J. Phys. Chem. A127, 9842–9852 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Kühne, T. D. & Khaliullin, R. Z. Electronic signature of the instantaneous asymmetry in the first coordination shell of liquid water. Nat. Commun.4, 1450 (2013). [DOI] [PubMed] [Google Scholar]
  • 50.Elgabarty, H. & Kühne, T. D. Tumbling with a limp: local asymmetry in water’s hydrogen bond network and its consequences. Phys. Chem. Chem. Phys.22, 10397–10411 (2020). [DOI] [PubMed] [Google Scholar]
  • 51.Ni, Y. & Skinner, J. L. Communication: Vibrational sum-frequency spectrum of the air-water interface, revisited. J. Chem. Phys.145, 031103 (2016). [DOI] [PubMed] [Google Scholar]
  • 52.Medders, G. R. & Paesani, F. Dissecting the molecular structure of the air/water interface from quantum simulations of the sum-frequency generation spectrum. J. Am. Chem. Soc.138, 3912–3919 (2016). [DOI] [PubMed] [Google Scholar]
  • 53.Nihonyanagi, S. et al. Accurate determination of complex χ(2) spectrum of the air/water interface. J. Chem. Phys.143, 124707 (2015). [DOI] [PubMed] [Google Scholar]
  • 54.Cavalleri, M., Ogasawara, H., Pettersson, L. G. M. & Nilsson, A. The interpretation of X-ray absorption spectra of water and ice. Chem. Phys. Lett.364, 363–370 (2002). [Google Scholar]
  • 55.Hoffman, D. J. et al. Liquid heterostructures: generation of liquid–liquid interfaces in free-flowing liquid sheets. Langmuir38, 12822–12832 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Henke, B. L., Gullikson, E. M. & Davis, J. C. X-ray interactions: photoabsorption, scattering, transmission, and reflection at E = 50-30,000 eV, Z = 1-92. Data Nucl. Data Tables54, 181–342 (1993). [Google Scholar]
  • 57.Tiedtke, K. et al. Gas detectors for X-ray lasers. J. Appl. Phys.103, 094511 (2008). [Google Scholar]
  • 58.Thompson, A. P. et al. LAMMPS-a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales. Comput. Phys. Commun.271, 108171 (2022). [Google Scholar]
  • 59.Muniz, M. C. et al. Vapor–liquid equilibrium of water with the MB-pol many-body potential. J. Chem. Phys.154, 211103 (2021). [DOI] [PubMed] [Google Scholar]
  • 60.Gartner III, T. E. et al. Anomalies and local structure of liquid water from boiling to the supercooled regime as predicted by the many-body MB-pol model. J. Phys. Chem. Lett.13, 3652–3658 (2022). [DOI] [PubMed] [Google Scholar]
  • 61.Reddy, S. K., Moberg, D. R., Straight, S. C. & Paesani, F. Temperature-dependent vibrational spectra and structure of liquid water from classical and quantum simulations with the MB-pol potential energy function. J. Chem. Phys.147, 244504 (2017). [DOI] [PubMed] [Google Scholar]
  • 62.Maiti, P. K., Çaǧın, T., Wang, G. & Goddard, W. A. Structure of PAMAM dendrimers: generations 1 through 11. Macromolecules37, 6236–6254 (2004). [Google Scholar]
  • 63.Cagin, T., Wang, G., Martin, R., Breen, N. & Goddard III, W. A. Molecular modelling of dendrimers for nanoscale applications. Nanotechnology11, 77 (2000). [Google Scholar]
  • 64.Pascal, T. A., Lin, S.-T. & Goddard III, W. A. Thermodynamics of liquids: standard molar entropies and heat capacities of common solvents from 2PT molecular dynamics. Phys. Chem. Chem. Phys.13, 169–181 (2011). [DOI] [PubMed] [Google Scholar]
  • 65.Simmonett, A. C. & Brooks, B. R. Analytical Hessians for Ewald and particle mesh Ewald electrostatics. J. Chem. Phys.154, 104101 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Simmonett, A. C. & Brooks, B. R. A compression strategy for particle mesh Ewald theory. J. Chem. Phys.154, 054112 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.The Paesani Research Group. MBX: a many-body energy and force calculator for data-driven many-body simulations. Accessed 1 June 2024. https://paesanigroup.ucsd.edu/software/mbx.html. [DOI] [PMC free article] [PubMed]
  • 68.Shinoda, W., Shiga, M. & Mikami, M. Rapid estimation of elastic constants by molecular dynamics simulation under constant stress. Phys. Rev. B69, 134103 (2004).
  • 69.Martyna, G. J., Tobias, D. J. & Klein, M. L. Constant pressure molecular dynamics algorithms. J. Chem. Phys.101, 4177–4189 (1994). [Google Scholar]
  • 70.Parrinello, M. & Rahman, A. Polymorphic transitions in single crystals: a new molecular dynamics method. J. Appl. Phys.52, 7182–7190 (1981). [Google Scholar]
  • 71.Tuckerman, M. E., Alejandre, J., López-Rendón, R., Jochim, A. L. & Martyna, G. J. A Liouville-operator derived measure-preserving integrator for molecular dynamics simulations in the isothermal–isobaric ensemble. J. Phys. A Math. Gen.39, 5629 (2006). [Google Scholar]
  • 72.Yeh, I. C. & Berkowitz, M. L. Ewald summation for systems with slab geometry. J. Chem. Phys.111, 3155–3162 (1999). [Google Scholar]
  • 73.Tolman, R. C. Consideration of the Gibbs theory of surface tension. J. Chem. Phys.16, 758–774 (1948). [Google Scholar]
  • 74.Kirkwood, J. G. & Buff, F. P. The statistical mechanical theory of surface tension. J. Chem. Phys.17, 338–343 (1949). [Google Scholar]
  • 75.Vargaftik, N., Volkov, B. & Voljak, L. International tables of the surface tension of water. J. Phys. Chem. Ref. Data12, 817–820 (1983). [Google Scholar]
  • 76.Allen, M. P. & Tildesley, D. J. Computer Simulation of Liquids (Oxford University Press, 2017).
  • 77.Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett.77, 3865–3868 (1996). [DOI] [PubMed] [Google Scholar]
  • 78.Vanderbilt, D. Soft self-consistent pseudopotentials in a generalized eigenvalue formalism. Phys. Rev. B41, 7892–7895 (1990). [DOI] [PubMed] [Google Scholar]
  • 79.Giannozzi, P. et al. QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials. J. Phys.: Condens. Matter21, 395502 (2009). [DOI] [PubMed] [Google Scholar]
  • 80.Prendergast, D. & Galli, G. X-ray absorption spectra of water from first principles calculations. Phys. Rev. Lett.96, 215502 (2006). [DOI] [PubMed] [Google Scholar]
  • 81.Shirley, E. L. Optimal basis sets for detailed Brillouin-zone integrations. Phys. Rev. B54, 16464 (1996). [DOI] [PubMed] [Google Scholar]
  • 82.Prendergast, D. & Louie, S. G. Bloch-state-based interpolation: An efficient generalization of the Shirley approach to interpolating electronic structure. Phys. Rev. B80, 235126 (2009).
  • 83.Taillefumier, M., Cabaret, D., Flank, A.-M. & Mauri, F. X-ray absorption near-edge structure calculations with the pseudopotentials: application to the K edge in diamond and α-quartz. Phys. Rev. B66, 195107 (2002). [Google Scholar]
  • 84.Kumar, K. et al. Active states during the reduction of CO2 by a MoS2 electrocatalyst. J. Phys. Chem. Lett.14, 3222–3229 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Jiang, P. et al. Experimental and theoretical investigation of the electronic structure of Cu2O and CuO thin films on Cu (110) using X-ray photoelectron and absorption spectroscopy. J. Chem. Phys.138, 024704 (2013). [DOI] [PubMed] [Google Scholar]
  • 86.Pascal, T. A. et al. Liquid sulfur impregnation of microporous carbon accelerated by nanoscale interfacial effects. Nano Lett.17, 2517–2523 (2017). [DOI] [PubMed] [Google Scholar]
  • 87.Helk, T. et al. Table-top extreme ultraviolet second harmonic generation. Sci. Adv.7, 6–10 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Sharma, S. & Ambrosch-Draxl, C. Second-harmonic optical response from first principles. Phys. Scr.2004, 128 (2004). [Google Scholar]
  • 89.Becke, A. D. Density-functional thermochemistry. III. The role of exact exchange. J. Chem. Phys.98, 5648–5652 (1993). [Google Scholar]
  • 90.Epifanovsky, E. et al. Software for the frontiers of quantum chemistry: an overview of developments in the Q-Chem 5 package. J. Chem. Phys.155, 084801 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Besley, N. A. Calculation of the electronic spectra of molecules in solution and on surfaces. Chem. Phys. Lett.390, 124–129 (2004). [Google Scholar]
  • 92.Dunning, T. H. Jr Gaussian basis sets for use in correlated molecular calculations. I. The atoms boron through neon and hydrogen. J. Chem. Phys.90, 1007–1023 (1989). [Google Scholar]
  • 93.Hoffman, D. J. et al. Surface structure of water from soft X-ray second harmonic generation. Zenodo, 10.5281/zenodo.17230045 (2025). [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The experimental data as shown in the main text and Supplementary figures, the TDDFT input and data files, and molecular dynamics simulation input and data files are available on Zenodo93.

Custom code used for TDDFT and molecular dynamics simulations can be found in the open-access repository on Zenodo93.


Articles from Nature Communications are provided here courtesy of Nature Publishing Group

RESOURCES