Abstract
Formation of a transient [myoglobin (Mb), cytochrome b5 (cyt b5)] complex is required for the reductive repair of inactive ferri-Mb to its functional ferro-Mb state. The [Mb, cyt b5] complex exhibits dynamic docking (DD), with its cyt b5 partner in rapid exchange at multiple sites on the Mb surface. A triple mutant (Mb(3M)) was designed as part of efforts to shift the electron-transfer process to the simple docking (SD) regime, in which reactive binding occurs at a restricted, reactive region on the Mb surface that dominates the docked ensemble.1 An electrostatically-guided Brownian dynamics (BD) docking protocol was used to generate an initial ensemble of reactive configurations of the complex between unrelaxed partners. This ensemble samples a broad and diverse array of heme-heme distances and orientations. These configurations seeded all-atom constrained molecular dynamics simulations (MD) to generate relaxed complexes for the calculation of electron tunneling matrix elements (TDA) through tunneling-pathway analysis. This procedure for generating an ensemble of relaxed complexes combines the ability of BD calculations to sample the large variety of available conformations and interprotein distances, with the ability of MD to generate the atomic level information, especially regarding the structure of water molecules at the protein-protein interface, that defines electron-tunneling pathways. We used the calculated TDA values to compute ET rates for the [Mb(wt), cyt b5] complex and for the complex with a mutant that has a binding free energy strengthened by three D/E → K charge-reversal mutations, [Mb(3M), cyt b5]. The calculated rate constants are in agreement with the measured values, and the mutant complex ensemble has many more geometries with higher TDA values than does the wild-type Mb complex. Interestingly, water plays a double role in this electron-transfer system, lowering the tunneling barrier as well as inducing protein interface remodeling that screens the repulsion between the negatively-charged proprionates of the two hemes.
1 Introduction
Factors that control unimolecular electron transfer (ET) rates within proteins have been explored both theoretically2,3 and experimentally,4 with remarkable agreement emerging.5 Intermolecular ET reactions between transiently docked partners, although common in nature, are far more challenging to study and to understand because interprotein ET involves an ensemble of docked configurations with differing reactivities, affinities and extent of interfacial hydration than unimolecular reactions. 6,7,1,8,9 Two limiting energy landscapes (Fig 1) have been found for interprotein ET: i) ‘dynamic docking’ (DD), where reactive configurations are few in number and geometrically distinct from the majority of bound but non-reactive configurations; ii) ‘simple docking’ (SD), where dominant binding configurations are also strongly ET-active.1,10
It has been posited that the formation of a transient complex between myoglobin (Mb) and cytochrome b5 (cyt b5) in myocytes produces the reductive repair of the inactive ferri-Mb to its functional ferro-Mb state.11,12,13 Mb(wt) has a low formal negative charge (q = −0.25e at pH 7, 20°C), and its association with cyt b5, also with a large net negative charge (q = −5.7e at pH 7, 20°C), is primarily stabilized by weak, electrostatic interactions between complementary charged residues (Ka ~ 103 M−1 at pH 7), and the complex is paradigmatic of dynamic docking. Recently, a systematic approach for re-designing the protein-protein interface was developed in which optimizing the placement of charges on the surfaces of either (or both) protein partners shifted the energy landscape of the complex from DD toward SD.1,14 Application of the Brownian dynamics (BD) computational docking procedure to the [Mb, cyt b5] pair identified three residues on the front face of Mb (residues D44, D60 and E85) as the most promising mutation sites for stabilizing the complex electrostatically and for thus increasing its ET reactivity (Fig 2). Measurements (ms-timescale) of photoinitiated ET within the cyt b5 complex of the Zn-deuteroporphyrin IX (ZnD)-substituted Mb(D44K/D60K/E85K) triple mutant (ZnMb(3M)), designed in this way, indicate that these mutations do indeed convert the complex from the DD regime toward the SD regime. Even more remarkably, the ZnMb(+8) derivative prepared by substituting a di-ester ZnD-heme into the Mb(3M) protein, showed ultrafast inter-protein ET quenching and charge recombination in transient absorption experiments; the median quenching and recombination rate constants, kf = 2.1×109 s−1 and kb = 4.3×1010 s−1, respectively, approach some of those found within the photosynthetic reaction center. 14
What are the factors that contribute to the increase in the ET rate constants for the Mb(3M) mutant relative to ZnMb(wt)? BD simulations suggested that one significant contributor is the localization of binding sites for cyt b5 on the front face of Mb, near the heme edge. Beratan and coworkers addressed hydration and water structuring at protein-protein interfaces as another possible influence on interprotein ET in studies of ET self-exchange between ferri- and ferro-cyt b5.15 Because the structure of the [cyt b5, cyt b5] dimer is not known, Lin et al. examined an array of reactant starting configurations, and computed DA couplings (TDA) for MD sampled geometries.
We describe here a combination of Brownian dynamics coarse-grained docking structure sampling followed by the finer-grained MD sampling approach for protein-protein interfaces of Lin et al.15 to compute electronic couplings. An electrostatically-guided Brownian dynamics (BD) docking protocol was used to seed the all-atom constrained MD, and to generate an ensemble of equilibrated complexes that sample a broad and diverse array of heme-heme distances and orientations. This protocol generates the atomic level information, especially the position of water molecules at the protein-protein interface, necessary for the calculation of inter-protein electron tunneling matrix elements (TDA) using tunneling pathway analysis. We use these structures as a starting point to take a close look at the composition of the coupling pathways, and to examine the effects of hydration on the mechanism of ET between Mb and cyt b5. The approach has been applied to the [Mb(WT), cyt b5] complex and to the complex with a mutant that has binding strengthened by three D/E → K charge-reversal mutations, [Mb(3M), cyt b5].
2 Computational Details
2.1 Brownian Dynamics
Brownian dynamics (BD) analysis of protein-protein docking used the Linux version of the MacroDox program.16,17,18 Coordinates for ferri-Mb (1YMB.pdb)19 and ferri-cyt b5 (1CYO.pdb)20 were used as input for the BD simulations; the charge on the iron ion of Mb was changed to 2+, corresponding to the charge of Zn2+ in ZnMb. A structural model for the Mb(D44K, D60K, E85K) triple mutant, Mb(3M), was generated using the coordinates of ferric-Mb(horse) and the Mutate module in MacroDox. As in previous BD analyses of the electrostatically-driven docking of Fe2+Mb with Fe3+b5, simulations were performed with parameters that match the experimental ones (293K, pH 7 and μ = 18 mM).1, 14, 21
For each pair of proteins, trajectories are computed with the cyt b5 center of mass (COM) initially positioned at a random location on a sphere (70Å radius with the COM of the Mb targetat its origin). Each trajectory terminates when the distance between the COM of cyt b5 and the COM of Mb (dCOM) exceeds 200Å. A trajectory is classified as a ‘hit’ if the prescribed reaction criterion distance is achieved at any point along the trajectory. The coordinates computed for a hit correspond to the configuration along its trajectory where the reaction criterion distance is smallest. ‘Docking profiles’ for the hits resulting from 104 trajectories were created by plotting the cyt b5 center-of-mass in the Mb coordinate reference frame.
For each protein-protein pair, docking profiles were generated for two complementary reaction criteria: i) center-of-mass (COM) binding criterion (Fig 2, right), where a successful hit is tallied if the distance between the COM of the partners comes within the sum of the protein radii (~39Å); ii) meso-carbon reactive criterion (Fig 2, left), where a hit occurs if a trajectory leads to a short distance (< 25Å) between any pair of heme meso-C atoms, one from each protein. The hits from the COM criterion describe an array of probable ‘binding’ geometries, while the hits from the meso-C criterion describe an array of ET ‘reactive’ geometries.
Three strategies were used to select representative geometries from the ensemble of BD hits as input for MD computations. First, we used the analysis module within MacroDox to compute the potential energy of stabilization (V), reaction distance (drxn) and Fe-Fe separation (dFeFe) for all of the hits found in each BD simulation.22 We then sorted the hits from each simulation according to each of these descriptors and chose the top five geometries from each sorting criterion. (Note that for the COM simulations, drxn corresponds to the distance between the COM of the partners, while for the meso-C analysis drxn is the shortest distance between any pair of heme meso-C atoms of the partners.) Second, for each BD simulation, we chose four geometries from among those with dFeFe nearest the maxima in the dFeFe histograms. Finally, for the [Mb(wt), cyt b5] pair, we selected the four configurations from the COM simulation with the shortest distance (< 4.0Å) between one O-atom of a Mb heme carboxylate and one O-atom of a cyt b5 heme carboxylate. The total number of initially selected geometries was 34 for Mb(3M) (19 from the COM simulation and 15 from the meso-C simulation) and 33 for Mb(wt) (23 from the COM simulation and 10 from the meso-C simulation). As shown previously, 1,14 configurations from the Mb(3M) simulations are more stable than configurations from the Mb(wt) simulations (Fig S1). This approach produces a diverse set of starting geometries, (Fig 3, Table S1) with dFeFe values that vary from 14.6Å to 32.6Å. Configurations with much larger metal-metal distances can be found, but these are present in low numbers, have low affinity, and are expected to contribute little to the overall ET reactivity.
2.2 Molecular Dynamics
For each initial geometry (obtained from a BD simulation), the histidine positions and protonation states at pH 6 were modeled using WhatIF.23 The heme carboxylates were held in the de-protonated state, as would be expected in a water environment. Hydrogen atoms were added to the complexes using the HBUILD function of CHARMM.24 Water molecules were added to the protein cavities using Dowser,25 and a water box with a minimal shell thickness of 15Å around the interprotein complex was constructed using the SOLVATE function in VMD (μ = 0.05 M NaCl). 26
As the relative atomic positions within each protein are fixed during the BD steps, classical MD simulations were performed to relax the orientations of the side-chains within the putative protein-protein interface of each BD pose. The sampled BD geometries represent the full diversity of geometries contained in the full array (see section 3.1). Keeping the protein backbone frozen during an MD simulations allows sampling around each of these BD-derived geometries. Full sampling of the interprotein potential surface, specifically unfreezing of the backbone and running all-atoms MD, is beyond the scope of this manuscript. All MD simulations were performed using CHARMM, version 35b2.24 In all MD steps, the protein backbone coordinates were fixed while the coordinates of the solvent and protein side-chains were equilibrated using the CHARMM27 force field27 and explicit TIP3P water.28 First, the system was subjected to three equilibration steps: i) hydrogen atom minimization, ii) optimization of the positions of the water and ions with a constant pressure and temperature (CPT) MD simulation (time step 1fs for 150ps), and iii) optimization of the side-chain positions using a frozen backbone (NVT) MD simulation with a box size determined from the CPT run. The 200ps NVT runs were performed at 300K using periodic boundary conditions and 1fs time steps. Finally, 200ps MD production runs were performed with an NVT ensemble (300K, periodic boundary conditions, 1fs time step, and particle mesh Ewald (PME) full electrostatics. Snapshots from the production runs were collected every 2ps, and the last 51 snapshots were used for the tunnelling pathway analysis.
2.3 Tunnelling pathway analysis
For the snapshots from each initial BD geometry, the electronic coupling (TDA) was evaluated with the Pathway model29 with the VMD Pathways plug-in.30 In the Pathway model:
Eq. 1 |
Here ε(C) is the covalent bond decay factor, and ε(TS) is the through-space decay factor. A is the pre-factor, with units of eV. Throughout this manuscript A=1eV. In this manuscript, there is no explicit term for hydrogen-bonds, which are described as a combination of covalent and through-space steps. The donor was defined as the C, N and central metal atoms in the Mb heme, while the acceptor was defined as the Fe atom in the cyt b5 heme.
3. Results and Discussion
To study ET between Mb and cyt b5, where the structure of the ensemble of reacting complexes is not known, we begin by using BD to generate an ensemble of possible reactive geometries. As the electrostatic potentials of the partners are utilized to guide the BD docking, the resulting ensemble includes only the stabilized docking geometries that also satisfy the ET reaction criterion implemented in the BD analysis (Fig 2). The ET reaction criteria is either the center-of-mass (COM) distance criterion, where a hit is tallied if the distance between the COM of the partners comes within the sum of their radii or the meso-carbon distance criterion, where a hit is tallied if the distance between any heme meso-C atom on one protein and of its partner is < 25Å. It is our hypothesis that this electrostatic filtering of the accessible conformational space generates an adequate distribution of geometries that can be used to understand the measured ET reactivity differences between the DD-type [Mb(wt), cyt b5] and SD-type [Mb(3M), cyt b5] complexes.
3.1 BD geometries
The docking profiles from the meso-C (left) and COM (right) BD simulations are shown for [Mb(wt), cyt b5] and [Mb(3M), cyt b5] in Fig 2, along with the corresponding histograms describing the donor-acceptor distance distributions (dFeFe) for the hits. The meso-C criterion for a BD hit requires that, during the BD trajectory, cyt b5 passes through a configuration of the [Mb, cyt b5] pair where efficient ET is possible (based on initial distance analysis). Nonetheless, when cyt b5 docks with Mb(wt), the hits are distributed roughly uniformly over almost the entire Mb surface, spanning a wide range of Fe-Fe distances (e.g., 15.9Å < dFeFe < 31.6Å), a pattern that typifies the DD interaction landscape. The simulation for the [Mb(3M), cyt b5] complex has a similar range of Fe-Fe distances (Table S1), but Fig 2 shows that the majority of hits now cluster near the exposed Mb heme edge. The profile for the mutant suggests that this complex approaches the SD energy landscape, although there are still numerous ET-active configurations.
To describe and compare structural variations within the DD-type Mb(wt) and SD-type Mb(3M) ensembles, we have computed dFeFe histograms for each of the docking profiles. The dFeFe histograms for Mb(wt) and Mb(3M) in Fig 2 are dramatically different. First, the majority of the Mb(3M) hits have very short Fe-Fe distances (dFeFe < 20Å), while the majority of the Mb(wt) hits have much longer Fe-Fe distances (20Å < dFeFe < 35Å). Second, the histogram for the Mb(3M) simulation is quite symmetric, with dmax = 15.3Å and w1/2 = 0.2Å. In contrast, the Mb(wt) distribution is bimodal. Moreover, the distance maxima for both the major (dmax = 21.0Å) and minor (dmax = 27.5Å) peaks are significantly greater than the dmax = 15.3Å for the Mb(3M) ensemble, and the widths at half maxima for both peaks are roughly 10-fold larger (w1/2 = 2.0Å, 2.3Å) than the w1/2 value for the Mb(3M) ensemble.
The COM criteria for a BD ‘hit’ is intended to complement the meso-C criteria by identifying those BD trajectories in which cyt b5 has passed through a bound configuration of the [Mb, cyt b5] pair, without selection for efficient ET. Correspondingly, the spread in dFeFe is larger for the simulations using the COM criterion (Fig 2, right) than for the simulations using the meso-C criterion (Fig 2, left). Nonetheless, the differences between the Mb(wt) and Mb(3M) docking profiles are similar to those observed with the meso-C criterion: the hits cluster near the Mb heme edge, and the majority of Mb(3M) ensemble geometries have a dFeFe that is small in comparison to the majority of the geometries in the Mb(wt) ensemble. Details of the distributions of structures within the two ensembles are, of course, different for the meso-C and COM criteria. In particular, the fits of the dFeFe histograms for both complexes are bimodal when the meso-C criterion is used. The major peak for the Mb(3M) ensemble (dmax 21Å; w1/2 = 1.0 Å) is found at a shorter Fe-Fe distance than the broad major peak for the Mb(wt) ensemble (dmax 25.6Å; w1/2 = 3.2 Å); both the Mb(wt) and Mb(3M) ensembles have a weak, broad, minor peak (dmax ~ 30 Å; w1/2 = 8 Å).
A plot of dCOM as a function of dFeFe for all of the configurations found with the meso-C and COM BD simulations for both the [Mb(wt), cyt b5] and [Mb(3M), cyt b5] complexes (Fig 3, upper) produces a comprehensive, coarse-grained view of the [Mb, cyt b5] complex. This combined ensemble of docked structures is diverse, with a broad range of donor-acceptor distances (dFeFe) found even for the Mb(3M) variant, as noted above. Most importantly, there is extensive overlap of poses found in the BD among the four simulations, indicating that there are numerous geometries that are common to both the DD-type complex with Mb(wt) and the better defined complex with Mb(3M). This observation corroborates our assumption of using BD simulations to describes the DD binding and mutation-induced changes in binding and reactivity, showing that the surface mutations enhance the probability of obtaining reactive conformations, rather than creating new ones.
The structures of the ET partner proteins are fixed during a BD simulation and the protein interfaces in the BD docked configurations are neither relaxed nor hydrated. As a starting point for assessing the influence of hydration and side-chain relaxation on the ET coupling pathways (see below), we selected configurations from each coarse-grained BD ensemble as input for constrained MD simulations (Colored symbols in the upper panel of Fig 3). As described in the Computational Methods section, choosing starting structures for MD from the meso-C and COM BD ensemble, we first sorted the hits in each component ensemble in four ways: based on (i) dFeFe; (ii) reaction distance (dCOM or meso-C distance, depending on the reaction criterion); (iii) binding potential energy; and (iv) deviation of dFeFe from the maximum in the dFeFe histogram. We then selected the top structures from each sorted array. In addition, we chose the configurations from the Mb(wt) COM simulation that had the shortest propionate oxygen to propionate oxygen separation across the interface. As can be seen in the upper panel of Fig 3, the 66 starting geometries selected in this way are indeed representative of the structural diversity within the full coarse-grained ensemble.
The creation of the coarse-grained ensemble provided an opportunity to visualize the variation and diversity among configurations of the complex as a function of the donor-acceptor distance. We partitioned the chosen starting geometries into shells of radius dFeFe = 15, 21, and 32Å (and width of ~1A), and superimposed the coordinates of cyt b5 for each configuration in that shell onto the Mb structure (Fig 3, lower). In the shell at dFeFe ~15Å (Fig 3B), which represents the most reactive geometries (the primary focus of this study), the cyt b5 molecules are clustered around the heme edge. Nonetheless, these configurations exhibit a wide range of cyt b5 orientations: the dihedral angle between the partner heme normals spans the full 360° range. Most importantly, this shell includes one or more conformations from each of the four contributing BD simulations, supporting the assumption that the differences in observed rate constants for the Mb(wt) and Mb(3M) complexes result from the different probabilities of attaining such reactive configurations. The shell with dFeFe = 21Å (Fig 3C) also includes configurations from all four BD simulations and, as expected, contains a wide range of cyt b5 orientations. However, this figure shows that the three surface mutations redirect the cyt b5 docking location, so that the cyt b5 molecules cluster over different regions of the Mb surface in wt vs. 3M. The cyt b5 molecules cluster near the solvent-exposed heme edge of Mb(3M), while the cyt b5 molecules cluster nearer the CD corner of Mb(wt). The dFeFe = 32Å shell (Fig 3D) contains only configurations from the Mb(wt) simulations, and the docked cyt b5 molecules non-specifically on the Mb surface.
3.2 Calculated Pathway couplings
The members of the ensemble of starting configurations derived from the Mb(wt) and Mb(3M) BD simulations were hydrated as described in the Materials and Methods Section, and constrained (frozen backbone) MD computations were performed to examine the influence of side-chain and solvent mobility on the ET pathways and their couplings (Eq. 1). For each initial BD pose, 51 snapshots were collected from the constrained MD simulations. The strongest coupling pathway linking Fe in cyt b5 and the C, N, and central metal atoms of the Mb heme porphyrin core was computed for each snapshot.
For each starting BD geometry, the natural log of the mean-squared Pathway coupling constant for the strongest-coupling pathway, averaged over all 50 snapshots, is plotted as a function of the donor-acceptor distance in Fig 4. Overall, <TDA2> varies approximately exponentially with dFeFe. The exponents obtained for the Mb(wt) data (β = 1.28Å−, R2=0.964) and the Mb(3M) data (β = 1.36Å−, R2=0.955) are similar, giving an overall value of β = 1.3 Å−1, (R2 = 0.963) is Interestingly, these values are all midway between those reported for water-mediated tunneling (β~1.5-1.6 Å−1 from the Gray31,32 and Cave33 groups) and protein-mediated tunneling (β~1.0-1.4 Å−1). 31,32
Water molecules lower the tunneling barrier (compared to empty space) and enable protein conformational changes: in all cases, <TDA2> for the relaxed, hydrated snapshots obtained from MD is larger than the corresponding value calculated for the un-relaxed, dehydrated starting geometry (0TDA2) (Table S2). Indeed, the un-relaxed 0TDA2 drop approximately exponentially with distance, (Fig 4, gray line; Fig S2) with an exponent (β0 ~ 1.45 Å−1, R2 = 0.954), a value that is not only larger than β for the hydrated/relaxed configurations, but also approaches the range of values for water-mediated tunneling. This highlights the importance of including the motion of water molecules and interfacial side-chain residues when computing electron tunneling interactions in transient complexes. The MD simulations that begin with BD-derived starting geometries change the interactions via the protein-protein.
The structural heterogeneity of the BD ensemble, illustrated in Figs 2 and 3, lead to a large variation in <TDA2> for the corresponding MD trajectories. For example, even within the set of configurations from the d =15Å shell, <TDA2> varies by ~20-fold for the various BD starting geometries. In contrast, for the MD trajectory associated with a single BD starting geometry, the calculated standard deviation σ(<TDA2>) for the MD snapshots along the trajectory is quite small (σ ~ 0.1 <TDA2>), indicating that all MD snapshots from a single BD pose have similar ET pathways.
3.3 Comparisons to measured ET rates
For the [Mb(wt), cyt b5] and [Mb(3M), cyt b5] complexes, rate constants measured by transient absorption spectroscopy were reported previously for the T→I ET reaction in Scheme 1 where T is the triplet [3ZnDMb, Fe3+b5] excited state, I is the charge separated [ZnD+Mb, Fe2+b5] intermediate state, S is the singlet [1ZnDMb, Fe3+b5] excited state and G is the [ZnDMb, Fe3+b5] ground state: 3kET = 7.8×103 s−1 and 1.0×106 s−1, respectively.1 In addition, the rate for the S→I ET in Scheme 1 was measured with the [ZnD-dmeMb(3M), cyt b5] pair, (1kET = 2.1×109 s−1) 14 showing that the charge separated intermediate can originate from either the singlet or the triplet state, depending on whether the distance between the donor and acceptor is short or long; configurations with small dZnFe react primarily via the singlet ET photocycle, while singlet states produced in configurations with longer dZnFe, undergo rapid intersystem crossing to their triplet state (1kD = 2.5×108 s−1) 14 and then react via the triplet ET route. The I→G charge recombination ET rate constant is also expected to depend on the starting geometry. More specifically, configurations with much shorter dZnFe should undergo more rapid charge recombination than configurations with longer dZnFe. The geometries of the reactive configuration, the intersystem crossing rate, and the relative lifetimes of the singlet and triplet excited states dictate the partitioning between the two routes. It is our hypothesis that the differences in the measured rates for the SD-type Mb(3M) and DD-type Mb(wt) complexes can be attributed to differences in the distributions of reacting geometries with short and long dZnFe.
To test this idea and the conclusion that the experimental rates are dominated by ET rather than Forster energy transfer, we computed ET rate constants for this complex using the high-temperature, non-adiabatic ET rate equation (Eq 2).34
Eq 2 |
Here, <TDA2> is the mean-squared donor-acceptor electronic coupling, λ is the reorganization energy, ΔG is the reaction free energy, kB is Boltzmann’s constant, ħ is Planck’s constant divided by 2π, and T is the temperature.
Using Eq 2 to compute ET rates for a transient protein-protein complex that can sample many reactive configurations is challenging, as <TDA2>, λ and ΔG are geometry-dependent. Since the donor and acceptor are identical in the Mb(wt) and Mb(3M) structures, the driving forces (ΔG) for the S→I and T→I charge separation reactions can be approximated as being independent of the protein-protein pose: ΔG(S→I) = −1.13eV and ΔG(T→I) = −0.9eV (see the SI for ref 14) By sampling a large number of starting geometries, we have evaluated the distance dependence of the mean-squared electronic coupling for the ensemble. The distance-dependence of the reorganization energy (λ) was evaluated using the Marcus two sphere model,35
Eq 3 |
where Δe = 1 is the charge transferred from donor to acceptor, r is the donor-acceptor distance, εop = 2 is the optical dielectric constant. The static dielectric constant (εs)35 and the effective radii of the donor and acceptor hemes (a1 and a2) are correlated and in practice are set semi-empirically. Choosing, for example, εs =10 and a1 = a2 = 3.9Å, gives λ=1.10eV for r = dFeFe = 15.3Å, the peak in the dFeFe histogram for the Mb(3M) simulation, in agreement with reorganization energies reported for interprotein heme-heme ET. 36,37, 38 Although more sophisticated approaches for computing λ have been developed,39 applying these methods to such a large set of geometries is not practical. Within the range of distances, dFeFe, found in the BD simulations, λ ≈ ΔG(S→I) ≈ ΔG(T→I), and thus both the S→I and T→I charge separation reactions are nearly activationless (Table 1). As a result, the ET rate constant is merely proportional to <TDA2>. Fig 4 shows semi-log plots of kET (right axis) and <TDA2> (left axis) as a function of dFeFe for the starting geometries from the four BD simulations. For simplicity, we use A=1eV as the pre-exponential electronic factor for all geometries.
Table 1.
Mb | dFe-Fe (Å) |
<TDA2 (eV2a) |
λ (eV) |
ΔG (eV) |
1kET calculated (s−1) |
1kET measured (s−1)b,c |
---|---|---|---|---|---|---|
Mb(3M) | 15.3 | 1.63 ×10−6 | 1.10 | −1.13 | 2.5×1010 | 2.1×109 |
Mb(wt) | 21.0 | 1.06 ×10−9 | 1.10 | −1.13 | 1.6×107 | ≤ 2.5×108 |
<TDA2> pre-factor is equal to 1.0eV2 40
1kET represents the S→I ET rate at the maximum in the distribution from a stretched exponential fit. A bi-exponential fit to the same data gave k1 = 1.1×1010 s−1, k2 = 1.2×109 s−1 with an amplitude ratio, a1/a2 = 0.75.14
S→I ET is not observed with Mb(wt); the upper bound for 1kET is derived from the reported lifetime of the singlet state in ZnDMb(+8).14
First, consider the S→I ET reaction. Table 1 compares the experimentally measured S→I ET quenching rate constants and the computed ET rate constants at the distance maxima from the dFeFe histograms for the meso-C BD simulations (Fig 2, left) with the Mb(wt) and Mb(3M) variants: dFeFe = 15.3Å and 21.0Å for the [Mb(3M), cyt b5] and [Mb(wt), cyt b5] complexes, respectively. As the meso-C criterion selects for reactive geometries, those geometries with donor-acceptor separations near the maxima in these histograms presumably best represent the average ET-active structure.
For the Mb(3M) variant, this distance maximum, in combination with the value of β = 1.3 Å−1 derived from the data in Fig 4, gives <TDA2> = 1.63×10−6 eV2, λ = 1.10eV, and 1kET = 2.5×1010 s−1 (Table 1). The computed rate constant at this distance is within an order of magnitude of the median ET rate constant reported for the [ZnDMb(+8), cyt b] complex, 1kET·14 Indeed, configurations with dFeFe < 17Å are calculated to have 1kET greater than or equal to the measured1kET and all configurations with dFeFe < 19Å are calculated to have 1kET ≥ kisc = 1kD (dashed line, Fig 4). This is remarkable agreement, given that the decay curves are non-exponential, suggesting that the population is not homogeneous and slowly exchanging, with a wide distribution of rate constants. Indeed, it was reported that the same data could be equally-well described with a bi-exponential decay in which one component has a rate constant k1 > 1kET and the other component has a rate constant only slightly lower than 1kET.
For Mb(wt), S→I ET is not observed, presumably because there are too few configurations with a donor-acceptor distance short enough to exhibit ET that is faster than the intrinsic singlet excited state decay to the triplet excited and ground states. This is supported by the BD simulations, which have very few configurations with dFeFe< 20Å. Using the distance maximum characterizing the Mb(wt) ensemble (dFeFe = 21.0Å) and ignoring possible small perturbations in the driving force from surface mutations gives 1kET = 1.6×107 s−1, (Table 1) a value that precludes S→I ET because it is much smaller than the observed intersystem crossing rate, k = 2.5×108 s−1.14
The data in Fig 4 can be used along with the experimentally-measured 3kET rate constants to determine approximate reaction distances for the sub-ensembles within the [Mb(3M), cyt b5] and [Mb(wt), cyt b5] complexes that undergo T→I ET. The above discussion suggests that configurations undergoing intersystem crossing to the triplet must have ET rate constants less than kisc = 2.5×108 s−1. According to Fig 4 (dashed line), this implies a starting ensemble of complexes with dFeFe > 21Å. The experimentally-measured value of 3kET = 1.0×106 s−1 for the [Mb(3M), cyt b5] complex (kET = 1×106 s−1),1 and the reported driving force, ΔG(T→I) = −0.90eV corresponds to a slightly longer average dFeFe, ~ 23Å. Not surprisingly, a calculated average distance for the T→I ET reaction in the [Mb(wt), cyt b5] complex (3kET = 7.8×103 s−1)1 is even longer. In this case, with binding of cyt b5 over the full Mb(wt) surface, the measured ET rate constant is almost certainly limited by conformational motion (gating). For the Mb(3M) complex, however, the lower bound on the distance at which intersystem crossing becomes dominant is sufficiently close to the average distance estimated from the triplet ET rate constant that it remains to be established whether the same is true for the Mb(3M) complex.
3.4 Distributions and Structural Heterogeneity
Fig 4 shows a large variation in <TDA2> among the BD configurations within a dFeFe shell, which corresponds to an equivalently large variation in kET. For example, among the starting geometries within the d = 15Å shell, <TDA2> (and kET) varies ~20-fold (solid red box, Fig 4). Fig 4 also indicates that there are many configurations in which the protein partners have quite different relative orientations and distances, but roughly the same ET rate. For example, a rate of ~1010 s−1 is obtained for starting geometries in which dFeFe varies by ~5Å (dashed red box, Fig 4). A rather dramatic consequence of this structural heterogeneity is that the MD trajectory from the BD starting geometry with the smallest value of dFeFe (14.0 Å) has <TDA2> = 3.75 × 10−6 eV2 (corresponding to k = 5.8×1010 s−1), while the MD trajectory from the BD starting geometry with the largest <TDA2> value (1.41 × 10−5 eV2, corresponding to k = 2.2×1011 s−1) has dFeFe = 15.0Å, ~1Å longer.
The structural distribution, and corresponding TDA2 distribution, necessarily introduces a dynamic dimension to these interprotein ET reactions. When exchange among conformers is rapid relative to ET, the observed rate constant will scale with <TDA2> where the average is over the ensemble of structures. When interchange is slow, non-exponential kinetics results from the distribution of configurations. For the [Mb, cyt b5] system, the timescales for the S→I and T→I ET reactions differ by several orders of magnitude. Thus, the structural diversity of the transient [Mb, cyt b5] complex requires that interconversion dynamics will impact the two reactions differently.
The experimental time courses associated with singlet quenching of the 1ZnMb(3M) complex were analyzed using an asymmetric, stretched exponential distribution,14 which indicates that conformational interconversion is slow compared to ET (or competitive with ET). The time courses associated with the intra-complex T→I ET observed on the μs-timescale are exponential,1 indicating that here, in contrast, interconversion among conformers is rapid in comparison to the rate for T→I ET.
3.5 ET Pathways
Next, we examine the differing roles of water in the ET coupling pathways. We have calculated the change in solvent accessible surface area (SASA) for the bound configurations (ΔSASA), defined as the difference between the SASA of the isolated components and the SASA of each starting configuration of the complex. As expected for transient complexes, the surface that is buried at the interface of a complex is rather small for all of the BD starting geometries (ΔSASA varies from 200 - 800 Å2, corresponding to the release of ~20-80 bound waters from each protein). In all of the starting bound geometries, water molecules are in contact with the exposed heme propionates, regardless of the heme orientations. The number of such water molecules is small, ranging from 8 to 20, and tends to increase with dFe-Fe. In most of the MD snapshots, the majority of the contact water molecules is closer to the cyt b5 heme than to the Mb heme (Table S3), as qb5 is larger and more negative in comparison to qMb (qb5 = −5.72 and qMb = −0.25, at pH7). The protein side chains and bound waters relax during the MD trajectories, while the waters near the propionates reorient to solvate the charged carboxylates.
For each starting geometry, we identified the specific amino acids and water molecules that appear in the most strongly-coupled ET pathway for each snapshot along its MD trajectory. A semi-log plot of <TDA2> as a function of dFeFe (with the geometries colored according to whether or not a solvent molecule is found in the pathway of one or more snapshots) is shown in Fig S2. For half of the 66 starting geometries, water molecules are never found in the strongest coupling pathway, while for an equal number of starting geometries, one or more water molecules are present in the strongest coupling pathway for at least some snapshots. Interestingly, the appearance of water in a pathway does not correlate with dFeFe as it is just as likely that water appears in a pathway when dFeFe is short as when dFeFe is long.
Next, we tallied the number of snapshots in which each particular amino acid and water molecule appears in the strongest coupling pathway and compared the resulting composite pathway composition profiles for the starting geometries from the 15Å, 21Å, and 32Å shells in Fig 3B-D, noting in particular how the pathway composition varies between the geometries derived from the Mb(3M) and Mb(wt) BD simulations (red and blue symbols, respectively, in Fig 5, left). In the right panel of Fig 5, the pathway profiles for the 15Å and 21Å shells are mapped onto the structures of Mb and cyt b5 with amino acids colored according to the frequency with which each amino acid appears in the strongest coupling pathway.
All of the configurations in the dFe-Fe~15Å shell (Fig 3B) have very large TDA2 values and correspondingly large corresponding ET rate constants, indicating that these poses can undergo rapid S→I ET. According to the histograms for the meso-C simulations, such geometries are quite common for the Mb(3M), but not for Mb(wt).
The protein-protein interfaces for most of the Mb(3M) starting geometries in the dFe-Fe~15Å shell are compact, and the strongest coupling ET pathway involves direct through-space tunnelling between the Mb heme and the cyt b5 heme. For one starting configuration, a slight modification of this pathway (in which Pro 40 of cyt b5 mediates the tunnelling between the two hemes) was found. Both kinds of starting geometries are quite stable and their pathways are thus invariant for the duration of the MD trajectory. In none of these pathways is there enough space between the two hemes for water to participate in the dominant coupling pathway. Although there are waters that solvate the propionates, which presumably help to shield the heme-heme repulsion, they are too far from the ET path to lower the tunneling barrier.
Starting geometries from the Mb(wt) simulations in the dFe-Fe~15Å shell (Fig 3B, Fig 5), contrast with the case of geometries from Mb(3M), as there are no direct through-space tunneling pathways between the hemes. Most often, ET is mediated by water along with one or more amino acid residues (Gly 41, Pro 40 and/or the axial ligand, His39). Such pathways are dynamic, with both the amino acid and the solvent composition of the pathway varying along the MD trajectory. However, as the histograms from the Mb(wt) simulations indicate, very few configurations appear in the dFe-Fe~15Å shell, and they contribute little to the overall reactivity.
For the starting geometries from the dFe-Fe~21Å shell (Fig 3C, Fig 5), the amino acid and solvent composition of the strongest coupling pathways is still more complex, especially for the Mb(wt) starting geometries. Here, multiple water molecules are usually present. Configurations in this shell, of course, have smaller TDA2 than the more compact geometries from the dFe-Fe~15Å shell and ET rate constants that are less than the inter-system crossing rate constant. Consequently this shell cannot contribute significantly to S→I ET, but instead undergoes intersystem crossing followed by T→I ET.
Finally, the dFeFe~32Å shell (Fig 3D, Fig 5) contains configurations from only Mb(wt) starting geometries. Here, TDA2 is exceedingly small and the corresponding ET rate constant is very slow. Such configurations contribute negligibly to the overall ET rate. As the superposition of these starting geometries indicates (Fig 3D), these structures are quite diverse, and conversion to a more reactive form is likely required for them to undergo T→I ET. Not surprisingly, the side-chain positions and solvent compositions fluctuate significantly during the MD runs and multiple, diverse pathways are found.
4. Conclusion
We have described an approach for computing ET rates in a transiently bound, dynamic complex with a diverse ensemble of structures. We find that this approach can be applied in both the DD ([Mb(wt), cyt b5]) and SD ([Mb(3M), cyt b5]) regimes. Identifying poses relevant to the ET kinetics is computationally challenging for such complexes, as the ET rates span several orders of magnitude in time and the discrete structures comprising the reactive ensemble span a large range of donor-acceptor distances. The ET time scale for the [Mb(wt), cyt b5] ensemble is 3τ ~130 μs (3kET = 7.8×103 s−1), while for the [Mb(3M), cyt b5] ensemble, 1τ ~500 ps (1kET = 2.1×109 s−1) and 3τ ~1μs (3kET = 106 s−1). We used two different computational methods to span the length and time scales at play. BD was used to understand the interprotein complex geometries. Constrained all-atom MD was used to compute snapshots for each BD starting geometry. The Pathway model was used to examine the electron tunnelling path from donor to acceptor for each MD snapshot. The difference in the calculated distances in the BD ensembles was used to understand differences in the measured rates for the SD-type Mb(3M) and DD-type Mb(wt) complexes. For example, we have replicated the experimental ET rates for [ZnDMb(+8), cyt b5] S→I ET and explained why for Mb(wt) S→I ET cannot be measured on the experimental time scales. We have further predicted the reaction distance for the sub-ensemble that participates in the T→I ET, based on the measured 3kET. We have shown that <TDA2> decreases approximately exponentially with dFeFe, with an exponent (β~1.3 Å−1) midway between the reported values for water-mediated tunneling and protein-mediated tunneling, indicating that ET in the [Mb, cyt b5] complex involves a combination of water-mediated and protein-mediated tunnelling pathways.
At this point, it is interesting to compare the methodology and results presented here to previous work by Liang et al.,21 which studied the same system using a Monte Carlo (MC) sampling framework. They biased their sample toward the most strongly coupled complexes, while here bound interprotein complexes with a large range of couplings are examined, thus sampling a more inclusive population of the complexes. In the Monte Carlo method, the structures of the partners were kept rigid, while the current study allows for flexible, functional docking by performing an MD optimization of each BD starting geometry. Another difference is that the earlier study did not treat waters explicitly.
Interestingly, we have found a dual role for water in this protein-protein ET system. Water lowers the tunneling barrier and also induces protein interfacial remodeling. Both factors are required for finding qualitative agreement between theoretical and experimental ET rates. Water also enables the two proteins to dock and to form the ET active interprotein complex, by screening repulsion between heme propionates. These observations highlight the importance of including the concerted motion of solvent molecules and interfacial side-chains when studying interactions at protein-protein interfaces. B D simulations model the overall ensemble properties, while all-atom simulations are necessary to describe remodeling of the protein-protein interface and the formation of the associated tunneling pathways.
Supplementary Material
Acknowledgements
We gratefully acknowledge NIH funding of this research: HL063203 (BMH) and GM48043 (DNB).
Footnotes
Electronic Supplementary Information (ESI) available: [details of any supplementary information available should be included here].
references
- 1.Nocek JM, Knutson AK, Xiong P, Petlakh Co N, Hoffman BM. J. Am. Chem. Soc. 2010;132:6165–6175. doi: 10.1021/ja100499j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Bendall D. Protein Electron Transfer. Garland Science. 1996 [Google Scholar]
- 3.Beratan DN, Skourtis SS, Balabin IA, Balaeff A, Keinan S, Venkatramani R, Xiao D. Acc. Chem. Res. 2009;42:1669–1678. doi: 10.1021/ar900123t. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Gray HB, Winkler JR. Biochim. Biophys. Acta. 2010;1797:1563–1572. doi: 10.1016/j.bbabio.2010.05.001. [DOI] [PubMed] [Google Scholar]
- 5.Prytkova TR, Kurnikov IV, Beratan DN. Science. 2007;315:622–625. doi: 10.1126/science.1134862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ubbink M. FEBS Lett. 2009;583:1060–1066. doi: 10.1016/j.febslet.2009.02.046. [DOI] [PubMed] [Google Scholar]
- 7.Crowley PB, Carrondo MA. Proteins: Str. Fun. Bio. 2004;55:603–612. doi: 10.1002/prot.20043. [DOI] [PubMed] [Google Scholar]
- 8.Migliore A, Corni S, Di Felice R, Molinari E. J. Phys. Chem. B. 2007;111:3774–3781. doi: 10.1021/jp068773i. [DOI] [PubMed] [Google Scholar]
- 9.Leys D, Scrutton NS. Curr. Op. Struct. Bio. 2004;14:642–647. doi: 10.1016/j.sbi.2004.10.002. [DOI] [PubMed] [Google Scholar]
- 10.Hoffman BM, Celis LM, Cull DA, Patel AD, Seifert JL, Wheeler KE, Wang J, Yao J, Kurnikov IV, Nocek JM. PNAS. 2005;102:3564–3569. doi: 10.1073/pnas.0408767102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hagler L, Coppes RI, Jr, Herman RH. J. Biol. Chem. 1979;254:6505–6514. [PubMed] [Google Scholar]
- 12.Bekhit AED, Faustman C. Meat Sci. 2005;71:407–439. doi: 10.1016/j.meatsci.2005.04.032. [DOI] [PubMed] [Google Scholar]
- 13.Livingston DJ, McLachlan SJ, La Mar GN, Brown WD. J. Biol. Chem. 1985;260:15699–15707. [PubMed] [Google Scholar]
- 14.Xiong P, Nocek JM, Vura-Weis J, Lockard JV, Wasielewski MR, Hoffman BM. Science. 2010;330:1075–1078. doi: 10.1126/science.1197054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Lin J, Balabin IA, Beratan DN. Science. 2005;310:1311–1313. doi: 10.1126/science.1118316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Northrup SH, Boles JO, Reynolds JCL. Science. 1988;241:70. doi: 10.1126/science.2838904. [DOI] [PubMed] [Google Scholar]
- 17.Northrup SH, Thomasson KA, Miller CM, Barker PD, Eltis LD, Guillemette JG, Mauk AG. Biochem. 1993;32:6623. doi: 10.1021/bi00077a014. [DOI] [PubMed] [Google Scholar]
- 18.Northrup SH, Reynolds JCL, Miller CM, Forrest KJ, Boles JO. J. Am. Chem. Soc. 1986;108:8162–8170. [Google Scholar]
- 19.Evans SV, Brayer GD. J. Mol. Bio. 1990;213:885–897. doi: 10.1016/S0022-2836(05)80270-0. [DOI] [PubMed] [Google Scholar]
- 20.Durley RCE, Mathews FS. Acta Cryst. D. 1996;52:65–76. doi: 10.1107/S0907444995007827. [DOI] [PubMed] [Google Scholar]
- 21.Liang Z-X, Kurnikov IV, Nocek JM, Mauk AG, Beratan DN, Hoffman BM. J. Am. Chem. Soc. 2004;126:2785–2798. doi: 10.1021/ja038163l. [DOI] [PubMed] [Google Scholar]
- 22.Northrup SH, Pear MR, Lee CY, McCammon JA, Karplus M. PNAS. 1982;79:4035–4039. doi: 10.1073/pnas.79.13.4035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Vriend G. J. Mol. Graph. 1990;8:52–56. doi: 10.1016/0263-7855(90)80070-v. [DOI] [PubMed] [Google Scholar]
- 24.Brooks BR, Brooks CL, Mackerell AD, Nilsson L, Petrella RJ, Roux B, Won Y, Archontis G, Bartels C, Boresch S, Caflisch A, Caves L, Cui Q, Dinner AR, Feig M, Fischer S, Gao J, Hodoscek M, Im W, Kuczera K, Lazaridis T, Ma J, Ovchinnikov V, Paci E, Pastor RW, Post CB, Pu JZ, Schaefer M, Tidor B, Venable RM, Woodcock HL, Wu X, Yang W, York DM, Karplus M. J. Comp. Chem. 2009;30:1545–1614. doi: 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Zhang L, Hermans J. Proteins: Struc. Func. Gen. 1996;24:433–438. doi: 10.1002/(SICI)1097-0134(199604)24:4<433::AID-PROT3>3.0.CO;2-F. [DOI] [PubMed] [Google Scholar]
- 26.Humphrey W, Dalke A, Schulten K. J. Molec. Graphics. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
- 27.MacKerell AD, Banavali N, Foloppe N. Biopolymers. 2000;56:257–265. doi: 10.1002/1097-0282(2000)56:4<257::AID-BIP10029>3.0.CO;2-W. [DOI] [PubMed] [Google Scholar]
- 28.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. J. Chem. Phys. 1983;79:926–935. [Google Scholar]
- 29.Beratan D, Betts J, Onuchic J. Science. 1991;252:1285–1288. doi: 10.1126/science.1656523. [DOI] [PubMed] [Google Scholar]
- 30.Balabin IA, Hu X, Beratan DN. J. Comp. Chem. 2012;33:906–910. doi: 10.1002/jcc.22927. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ponce A, Gray HB, Winkler JR. J. Am. Chem. Soc. 2000;122:8187. [Google Scholar]
- 32.Gray HB, Winkler JR. PNAS. 2005;102:3534. doi: 10.1073/pnas.0408029102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Miller NE, Wander MC, Cave RJ. J. Phys. Chem. A. 1999;103:1084–1093. [Google Scholar]
- 34.Marcus R, Sutin N. Biochim. Biophys. Acta. 1985;811:265–322. [Google Scholar]
- 35.Sharp KA. Biophys J. 1998;74:1241–1250. doi: 10.1016/S0006-3495(98)77838-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Andrew SM, Thomasson KA, Northrup SH. J. Am. Chem. Soc. 1993;115:5516–5521. [Google Scholar]
- 37.Venturoli G, Drepper F, Williams JC, Allen JP, Lin X, Mathis P. Biophys. J. 1998;74:3226–3240. doi: 10.1016/s0006-3495(98)78029-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Engstrom G, Xiao K, Yu C-A, Yu L, Durham B, Millett F. J. Bio. Chem. 2002;277:31072–31078. doi: 10.1074/jbc.M202594200. [DOI] [PubMed] [Google Scholar]
- 39.Blumberger J. Phys. Chem. Chem. Phys. 2008;10:5651–5667. doi: 10.1039/b807444e. [DOI] [PubMed] [Google Scholar]
- 40.Smith DMA, Rosso KM, Dupuis M, Valiev M, Straatsma TP. J. Phys. Chem. B. 2006;110:15582–15588. doi: 10.1021/jp057068r. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.