Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Jun 30.
Published in final edited form as: J Comput Chem. 2024 Mar 13;45(17):1493–1504. doi: 10.1002/jcc.27344

β-sheets mediate the conformational change and allosteric signal transmission between the AsLOV2 termini

Sian Xiao a,*, Mayar Tarek Ibrahim a, Gennady M Verkhivker b, Brian D Zoltowski a, Peng Tao a,*
PMCID: PMC11076166  NIHMSID: NIHMS1976865  PMID: 38476039

Abstract

Avena sativa phototropin 1 light-oxygen-voltage 2 domain (AsLOV2) is a model protein of Per-Arnt-Sim (PAS) superfamily, characterized by conformational changes in response to external environmental stimuli. This conformational change begins with the unfolding of the N-terminal A’α helix in the dark state followed by the unfolding of the C-terminal Jα helix. The light state is characterized by the unfolded termini and the subsequent modifications in hydrogen bond patterns. In this photoreceptor, β-sheets are identified as crucial components for mediating allosteric signal transmission between the two termini. Through combined experimental and computational investigations, the Hβ and Iβ strands are recognized as the most critical and influential β-sheets in AsLOV2’s allosteric mechanism. To elucidate the role of these β-sheets, we introduced 13 distinct mutations (F490L, N492A, L493A, F494L, H495L, L496F, Q497A, R500A, F509L, Q513A, L514A, D515V, and T517V) and conducted comprehensive molecular dynamics simulations. In-depth hydrogen bond analyses emphasized the role of two hydrogen bonds, Asn482-Leu453 and Gln479-Val520, in the observed distinct behaviors of L493A, L496F, Q497A, and D515V mutants. This illustrates the role of β-sheets in the transmission of the allosteric signal upon the photoactivation of the light state.

Graphical Abstract

Avena sativa phototropin 1 light-oxygen-voltage 2 domain (AsLOV2) undergoes significant conformational alterations in response to light, initiating with the unfolding of the N- and C-terminal helices. The research underscores the pivotal function of the Hβ and Iβ strands in mediating conformational change and allosteric signal transmission. Through extensive mutations and molecular dynamics simulations, the research dissects the influence of specific β-sheets on the allosteric mechanism, highlighting the essentiality of particular hydrogen bonds in this process. The findings contribute to a deeper understanding of the structural dynamics underpinning light-induced signal transduction in photoreceptors.

graphic file with name nihms-1976865-f0001.jpg

Introduction

The Per-Arnt-Sim (PAS) superfamily is widely present across the kingdoms of life, members of which are often found in pathways that regulate responses to environmental changes, including the circadian response pathway.1,2 As a subset of the PAS superfamily, the light, oxygen, or voltage sensing (LOV) domains are small sensor modules found in proteins that respond to environmental signals including light, redox potential, and oxygen, regulating various biological processes. Avena sativa phototropin 1 light-oxygen-voltage 2 domain (AsLOV2) serves as a model protein to study LOV function and allostery. It contains a flavin mononucleotide (FMN) cofactor located in the center of the PAS fold. A large α-helical region of the AsLOV2 C-terminal to the PAS fold is termed the Jα helix (Figure 1).3 AsLOV2 undergoes an allosteric activation from its resting dark state upon irradiation with blue light. A covalent bond forms between Cys450 side chain in the PAS fold and C(4a) of the FMN, resulting in the formation of cysteinyl-flavin adduct, referred to as the light state.4 This light activation also leads to a significant conformational change in the overall protein structure, including the unfolding of the Jα helix. The reversion from the light state to the dark state for LOV domains is generally spontaneous and starts when irradiation ceases. This process includes the thiol bond scission and a conformational change back to the dark state, occurring within seconds to hours, depending on the type of LOV domain.5 With its characteristic conformational changes upon light activation, AsLOV2 has been extensively studied as a model of designing optogenetic switches.68

Figure 1.

Figure 1.

AsLOV2 protein structure (PDB ID: 2V1B).

Past studies have identified the termini as the location of the most characteristic conformational changes in AsLOV2, including the undocking and unfolding of the Jα helix at carboxyl terminal, and the unfolding of the A’α helix at the amino terminal9. However, the complete signaling propagation pathway remains elusive, with β-sheets being determined to play an important role in this process 2,10,11. Zayner et al. carried out an extensive mutational analysis of AsLOV2 domain to study the relation between the sequence of the protein and its function, including 13 distinct mutations on Hβ sheet and Iβ sheet (F490L, N492A, L493A, F494L, H495L, L496F, Q497A, R500A, F509L, Q513A, L514A, D515V, and T517V)12. Photocycle time and conformational change of these mutations are analyzed but no obvious correlation exists between these two properties12. However, the ample experimental observations related to multiple mutations provide an opportunity for computational studies focusing on the allosteric mechanism underlying the activation process of the AsLOV2 domain.

The crystal structures are mostly limited to proteins in their equilibrium states. Therefore, little information of the light activation of AsLOV2 could be obtained directly from its crystal structures. Molecular dynamics (MD) simulations can capture the behavior of proteins in full atomic detail and at very fine temporal resolution, serving as important tools for understanding the physical basis of protein structure and function13,14. In this study, extensive MD simulations were conducted on the selected mutations of AsLOV2 to investigate its allosteric mechanisms, focusing on the role of Hβ and Iβ sheets. Combining simulation analysis with the Markov state model1517, we identified some intriguing mutant behaviors potentially due changes of the hydrogen bond pattern.

Materials and Methods

Molecular dynamics simulations

Initial AsLOV2 structures were sourced from the Protein Data Bank (PDB IDs: dark structure - 2V1A, light structure - 2V1B)18. In the light state, a covalent bond between C(4a) of FMN and Cys450 of AsLOV2 was introduced, along with the 13 mutations detailed in previous section.

Simulation systems were prepared with the CHARMM19 program package (version c41b1), using force field parameters for the FMN from a previous work20. Protonation states of titratable residues were set at neutral pH, and protein systems were solvated in TIP3P21 water boxes, each with a 10 Å buffer zone. Sodium and chloride ions were added to each system to neutralize and to maintain an ionic strength of approximately 0.15 M. Structures underwent initial minimization using the steep descent method for 200 steps, followed by adopted basis Newton-Raphson minimization for 1000 steps. Subsequently, 24 picosecond (ps) simulations raised the temperature from 0K to 300K. Systems were subjected to 10 nanoseconds (ns) isothermal−isobaric (NPT) equilibrations at 300 K (equilibrium simulation), followed by 1 microsecond (μs) canonical (NVT) simulations (production simulation) at the same temperature. Snapshots of product simulations were saved every 100 ps. Three independent simulations were conducted for the wild type (WT) AsLOV2 in dark and light states, and for the light states of the 13 selected mutants.

The SHAKE22 constraint was applied in all simulations to maintain the length of bonds involving hydrogen atoms. During the simulation, the water molecules were kept rigid, with fixed bond lengths and angles. The long-range electrostatic interactions were accounted for using the particle mesh Ewald summation method23. The nonbonding interactions were cut off at 12 Å. The simulation was conducted using OpenMM24 (version 7.6.0).

Simulation analysis

Root-mean-square fluctuation (RMSF) evaluates the flexibility of individual residues. RMSFi quantifies the fluctuation of residue i around its average position ri¯ over T simulation frames. RMSFi is calculated as:

RMSFi=1Tt=1Tritri¯2 (Eq. 1)

where rit is the coordinate for residue i in frame t.

Hydrogen bonds are identified using the Baker-Hubbard method, which sets cutoffs for distances between hydrogen atoms and acceptors θ and angles between donors, hydrogens, and acceptors r25. The method considers NH and OH as donors, and O and N as acceptors. The Hydrogen bonds analysis was carried out using MDTraj26 (version 1.9.7). The following criterion and the occurrence fractions during simulations were employed for validation of the hydrogen bonds:

θ>120°andr<2.5Å (Eq. 2)

The Dictionary of Protein Secondary Structure (DSSP)27 algorithm assigns standard secondary structure annotations to amino acid residues of a protein based on three-dimensional structure. It classifies residues into α-helices, β-sheets, turns, and bridges, using hydrogen bonding patterns. The DSSP calculation is conducted by MDTraj using implementation of DSSP-2.2.0, giving the secondary structure assignments of each residue in each frame.

Dynamic cross correlation (DCC) analysis evaluates MD simulations trajectories to identify correlative motions between residues or residue groups within a molecule, crucial for understanding molecular function and interaction. It involves calculating correlation coefficients between the movements of pairs of atoms over the course of a simulation, revealing how the motion of one part of a molecule is coupled to another. The DCC between the pth and qth atoms is defined by the following equation,

DCCp,q=ΔrptΔrqttΔrpt2tΔrqt2t (Eq. 3)

where rpt denotes the vector of the pth atom’s coordinates as a function of time t, t means the time ensemble average, and Δrpt=rptrptt. The similar notation applies for the qth atom.

Essential dynamics of proteins distinguish between two configurational subspaces: the “essential” subspace, characterized by a few degrees of freedom and anharmonic motion that encompasses major positional fluctuations relevant to protein function, and the “physically constrained” subspace, where motion follows a narrow Gaussian distribution, indicating only local fluctuations.28 Essential dynamics analysis is performed using ProDy package29,30 (version 2.4.0).

Markov State Model

Examining MD simulations provides valuable insights into the conformational landscape and dynamic movements of biomolecules. However, the complexity and high dimensionality of data from these simulations can obscure the crucial dynamic signatures31. To address this challenge and improve conformational landscape analysis, time-lagged Independent Component Analysis (t-ICA) is employed for dimensionality reduction, projecting extensive MD simulation data onto a more intuitive and interpretable low-dimensional space.

The t-ICA method identifies the slowest degrees of freedom, preserving kinetic information in the MD trajectories. It finds coordinates of maximal autocorrelation at a specific lag time, making it an excellent choice for dimensionality reduction in MD data transformation for Markov state model construction.3234

After obtaining the low-dimensional representation from t-ICA analysis, the k-means clustering method is applied to the projected low-dimensional space. Clusters identified in this analysis are referred to as microstates, with each simulation frame assigned to a microstate. The next step tests the kinetical relevance of these microstates and identify an appropriate lag time τ. Relaxation timescales, or implied timescales, are examined against various lag time to generate a model that satisfies the Markov assumption. Upon determining an appropriate lag time, microstates that can transit within the relaxation timescale are clustered together as one macrostate. Perron-Cluster Cluster Analysis was used to identify macrostates, i.e., kinetically separate equilibrium states35,36. Further theoretical details are available in a previous study37. t-ICA and Markov state model building were conducted using the PyEMMA package38 (version 2.5.12).

Results

AsLOV2 Wild Type

The RMSF of alpha Carbons (Cα) were calculated for the WT AsLOV2 dark and light states. The RMSF values reported in Figure2A represent averages from three independent simulations. RMSF based on each independent simulation of each state is provided in the Supporting Information (Figure S1). Enhanced fluctuations in the light state, observed in both A’α and Jα helices compared to the dark state during simulations, indicate increased flexibility in these regions upon light activation and formation of the FMN-CYS450 covalent bond. Additionally, RMSF elevation is observed in the loop connecting the Hβ sheet to the Iβ sheet, the terminal residues of the Iβ sheet, and the loop between the Iβ sheet and Jα helix. Mean RMSF values for individual residues in the A’α and Jα helices, and Hβ and Iβ sheets, were calculated for further comparison (Figure 2B). The A’α helix at N-terminal shows the highest mean RMSF values among the selected secondary structures in both states. Furthermore, the A’α helix has a higher mean RMSF value in the light state than the dark state, indicating heightened mobility upon light exposure. The Jα helix follows this trend but to a smaller degree possibly due to its larger size. The Hβ and Iβ sheets have lower mean RMSF values, with a slight elevation in the light state, denoting notable flexibility changes with light activation. This supports the conclusion that light activation induces a state of increased flexibility in specific regions of the protein, potentially facilitating its functional conformational changes.

Figure 2.

Figure 2.

Root-mean-square fluctuations (RMSF) analysis of wild type (WT) of AsLOV2 dark and light states. (A) RMSF of WT AsLOV2 dark state (blue) and light state (green). The plotted RMSF is the averaged value over three independent simulations of each configuration. Four shadowed regions indicate the A’α helix (purple), Hβ sheet (green), Iβ sheet (yellow), and Jα helix (red), respectively. (B) Mean RMSF values of residues in four secondary structures for WT AsLOV2 dark state (blue) and light state (orange), respectively.

The undocking and unfolding of both A’α and Jα helices represent signature conformational changes when transitioning from the dark state to the light state of AsLOV2. The preservation of secondary structures in WT AsLOV2 during simulations of both states is accordingly investigated. The ratio of residues recognized as secondary structure and the total number of residues is calculated for each structure, based on the simulation data for each state (Figure 3). For example, if a helix contains ten residues and eight are recognized with helical character using DSSP analysis in a given simulation frame, the preservation ration for this helix in that frame is 0.8. This is calculated and averaged for all frames. A significant decrease of the preservation ration of the A’α helix suggests its unfolding during the simulation. Ratio analysis at the residue level for A’α helix is presented in Figure S2 and Table S1. In contrast, the Jα helix displays a stable helical ratio, with no significant changes between the two states. This suggests that the unfolding timescale of Jα helix exceeds the scope of conventional MD simulations. The Hβ and Iβ sheets exhibit a slight and moderate decrease in their sheet ratios, respectively. Given the typically stable nature of β-sheets, these observed decreases could be indicative of their involvement in the protein’s response to light activation, potentially playing a role in signal transduction. Combining with the RMSF analysis, these observations shed more light onto the protein conformation and dynamics change in response to light activation.

Figure 3.

Figure 3.

Average preservations of secondary structures for WT AsLOV2 dark state (blue) and light state (orange) in four key secondary structures (A’α helix, Jα helix, Hβ sheet, and Iβ sheet).

The dynamical correlation within the dark and light states of WT AsLOV2 is represented by the dynamic cross-correlation matrices (DCCMs) (Figure 4). For clarity, a discrete color scale with a segmentation threshold of 0.4 is used in the plots presented in Figure 4 to accentuate the correlation contrasts between residues. The discrete color scale distinctly marks highly correlated or anti-correlated regions. DCCM plots using a continuous color scale are available in the Supporting Information (Figure S3). In Figure 4A, deep red hues along the diagonal indicate strong positive correlation among neighboring residues, and similarly, robust correlations are noted among anti-parallel β-sheets. With light activation, the overall correlation structure is retained. Notable shifts in the Jα helix’s correlation patterns, particularly with anti-parallel β-sheet regions and the Fα helix emerge (Figure 4B), suggesting a potential undocking tendency, where the Jα helix exhibits a distinct movement away from the core structure in response to the light activation. This tendency supports the light-induced undocking behavior of Jα helix, even though the undocking of the Jα helix was not observed in the current study.

Figure 4.

Figure 4.

Discrete average dynamic cross-correlation matrix for WT AsLOV2 in (A) dark state and (B) light state over three parallel simulations. For clarity, a discretized color scale with a segmentation threshold of 0.4 is used.

AsLOV2 Light States Mutants

Simulations of the selected AsLOV2 mutants in the light states were performed to compare their impacts on the dynamics of the AsLOV2 domain. The comparative analysis focuses on the four key secondary structures of AsLOV2: A’α and Jα helices, Hβ and Iβ sheets. Average RMSF values for these four secondary structures across all light states are plotted in Figure 5. The RMSF plot for each mutant is shown in Figure S4. The comparison of average RMSF values over the whole AsLOV2 domain for each light state is shown in Figure S5. The RMSF values of A’α helix vary significantly among all light states under investigation (Figure 5A). Large error bars suggest significant variability among the simulations. This could also be attributed to the short length of the A’α helix and its inherent flexibility given its location at the flexible N-terminal region. In contrast, the Hβ sheet, Iβ sheet, and Jα helix of most mutants exhibit RMSF values close to the WT value, indicating minimal impact from the corresponding mutants on protein flexibility. L493A emerges as an outlier with elevated RMSF values across the entire protein (Figure S5), suggesting a significant increase in flexibility due to the mutation. Additionally, the D515V mutation on the Iβ sheet uniquely lowers the mean RMSF of the Iβ sheet, suggesting localized structural stabilization. These specific changes in dynamics necessitate additional investigation to elucidate the mutants’ impact on the protein’s behavior and interactions.

Figure 5.

Figure 5.

Average RMSF values over residues of WT and mutants in light state for (A) A’α helix, (B) Jα helix, (C) Hβ sheet, and (D) Iβ sheet.

Preservation of the four secondary structures in the light states of AsLOV2 WT and mutants is calculated and plotted for comparison (Figure 6). Consistency is observed in the Hβ sheet and Jα helix across all mutants, with the proportion of these structures close to that of the WT. The A’α helix, on the other hand, exhibits increased helical content in the light state among most mutants compared to the WT, yet still lower than the WT dark state values, suggesting a light-dependent partial unfolding or destabilization. While the A’α helix ratios show a general increase, the elongated error bars signal considerable variability, complicating definitive conclusions about the mutants’ influence. A general but not precisely negative correlation between helix ratio and RMSF underscores a correlated relationship between the flexibility and unfolding behavior. Unlike a previous study focusing on mutants located on A’α helix itself, it is difficult to ascertain definitive conclusions about Hβ sheet and Iβ sheet mutants’ influence on the unfolding behavior of A’α helix. The Iβ sheet displays interesting responses to Q497A and D515V. Compared to other mutants, Q497A exhibits a higher Iβ sheet RMSF (Figure 5D) and a correspondingly lower Iβ sheet ratio (Figure 6D), suggesting an increase in flexibility. Conversely, D515V shows a lower Iβ sheet RMSF and a higher Iβ sheet ratio, implying enhanced stability. Therefore, Q497A and D515V present as prime candidates for further investigation to deepen our understanding of their impact on the Iβ sheet’s dynamics.

Figure 6.

Figure 6.

Average preservations of secondary structures for WT and mutants in light state for (A) A’α helix, (B) Jα helix, (C) Hβ sheet, and (D) Iβ sheet.

Dynamic cross-correlation analysis reveals consistent negative correlation between the Jα helix and the adjacent anti-parallel β-sheet region in various mutants. A subset of mutants demonstrates DCCM patterns remarkably akin to the WT with mutant R500A as an example (Figure 7A). Intriguingly, in several mutants including L496F (Figure 7B), a novel trend emerges wherein the Fα helix exhibits increased negative correlations with spatially distant secondary structures: Aβ and Bβ sheets, as well as the Cα and Dα helices, despite being separated by the Eα helix and FMN. These mutants exhibit an uptick in negative correlations between inter-secondary structures, such as the loops connecting Fα helix and Gβ sheet and the loop connecting Gβ sheet and Hβ sheet. In F509L mutant, the C-terminal displays stronger negative correlations with the distant cluster comprising the Aβ and Bβ sheets as well as the Cα and Dα helices (Figure 7C). An extreme case is presented by L493A, in which the cluster of Aβ and Bβ sheets, and the Cα and Dα helices, shows pronounced positive intra-group correlations. The anti-parallel β-sheets also display strong positive correlations with other β-sheets. Meanwhile, the Jα helix and the C-terminal reveal more negative correlations with these groups but increased positive correlations with the A’α helix and Fα helix located in the same periphery (Figure 7D).

Figure 7.

Figure 7.

Representative discrete average dynamic cross-correlation matrix for light state (A) R500A, (B) L496F, (C) F509L, and (D) L493A.

In each light state system of WT and mutants, the top 10 essential modes are extracted, with mean square-fluctuations of each residue calculated and plotted (Figure 8). The most significant contributions to these modes arise from regions including the N-terminal with A’α helix, loops between Hβ sheet and Iβ sheet, between Iβ sheet and Jα helix, and the C-terminal with Jα helix. Most mutants exhibit higher flexibility comparing to WT, especially at the loop between Hβ sheet and Iβ sheet, consistent with the RMSF results. L493A, H495L, and F509L show some extra essential movements at Cα helix (residues 431 to 439, spatially close to A’α helix) and the loop between Gβ sheet and Hβ sheet (residues 483 to 489, spatially close to Cα helix). Several mutants, including F490L, L493A, L496F, Q497A, F509L show higher flexibility at the loop between Iβ sheet and Jα helix. Especially, L493A extends the movement flexibility from this loop to the Jα helix.

Figure 8.

Figure 8.

Mean square-fluctuations for top 10 essential modes for each system in light state.

Dimensionality Reduction Analysis and Markov State Model

Simulations of all WT and mutant light states were combined and analyzed using t-ICA, with each state projected onto a two-dimensional (2D) space defined by the two vectors: IC 1 and IC 2. The mutants, especially L493A and L514, display significantly different distributions (Figure 9A). The overall population from the combined simulations is shown in Figure 9B. While most mutant simulations mainly resemble the WT distribution (depicted as a high-density light-yellow region) with only slight deviation, mutants such as L493A, H495L, L496F, F509L, and L514A show pronounced deviations, exploring a much larger conformational space.

Figure 9.

Figure 9.

Projection of light states simulation onto two-dimensional space based on t-ICA. (A) Distribution of each individual systems. (B) Overall population of combined light states simulations.

Markov state model (MSM) was used to reveal dynamic patterns by kinetically clustering the conformations. Conformations are divided into 50 microstates via k-means clustering analysis within the t-ICA 2D space. Top 15 implied timescales, calculated with lag times ranging from 0.1 to 30 ns (Figure S6), showed convergence after about 20 ns, which was chosen as the lag time for MSM construction. The timescale differentiation resulted in the definition of seven distinct macrostates, with PCCA sequentially used for kinetic clustering.

All seven macrostates and their populations are depicted in Figure 10A and Figure 10B, respectively. Macrostates 2 and 1 exhibit the most dominant occupancies at 78.72% and 11.92%, respectively. The WT light state occupies both macrostates 1 and 2, mirroring the patterns shown in Figure 9B, where these states are the most populated. Beyond macrostate 2, mutants such as L493A, H495L, L496F, F509L, and L514A distinctly occupy macrostates 3 through 7 (Figure S7). This variation in occupancy reveals unique dynamic behaviors, highlighting the diverse impact of these mutations on the conformational landscape.

Figure 10.

Figure 10.

Markov state model of AsLOV2 light states. (A) The macrostates represented on the t-ICA 2D space. (B) Relative population composition of the macrostates.

Discussion

Integrating findings from experimental analyses with computational data, we selected four mutants for an in-depth investigation from a pool of thirteen. This selection includes three mutants located on the Hβ sheet (L493A, L496F, and Q497A) and one on the Iβ sheet (D515V), based on comprehensive simulation dynamics, conformation exploration, and experimental results. Our study aims to uncover the underlying reasons for the distinct behaviors exhibited by these specific mutants. First, we extract the correlation values from DCCM for residues surrounding the mutant sites. The criteria for defining surrounding residues are that in the first frame of WT or mutant production runs, any residue with at least one atom within 3 Å to any atom in the mutant site (Figure 11). The Baker-Hubbard method is used to identify hydrogen bonds in the selected region appearing at least 50% of the total frames across three simulations.

Figure 11.

Figure 11.

The comparison of correlation between the mutated residue and its adjacent residues in the mutant and in WT for (A) L493A, (B) L496F, (C) Q497A, and (D) D515V.

To further investigate and compare the dynamical behavior of these four mutants relative to the WT light state, we analyzed the hydrogen bonds (Table S2 and Figure 12). We found that all four mutants lack the hydrogen bond between Thr407 and Ala452 present in the WT. Thr407 was identified to be essential for the unfolding mechanism of A’α helix15. The absence of this hydrogen bond is suggested to be the main cause of the high fluctuations observed in the N-terminal.

Figure 12.

Figure 12.

The unique hydrogen bonds patterns formed with the most disruptive mutants compared to WT for (A) L493A hydrogen bonds, (B) L496F hydrogen bonds, (C) Q497A hydrogen bonds, and (D) D515V hydrogen bonds.

Compared to WT, we observe several missing hydrogen bonds between flexible secondary structures in the L493A mutant. Among these, two missing hydrogen bonds between Iβ sheet and A’α helix, one between Ser431-O and GLys413-O and one between Ser433-OG and Glu412-O, could lead to the increased flexibility of the A’α helix. Another two missing hydrogen bonds between Iβ sheet and loop connecting Iβ sheet and Jα helix, one between Gln479-NE2 and Val520-O and one between Gln479-OE1 and Val520-N, may lead to the increased flexibility of the Jα helix. There is another missing hydrogen bond between Gβ sheet and Eα helix between Ser486-OG and Glu439-OE2. On the other hand, the L493A mutant has more hydrogen bonds within secondary structures or positive correlated regions than the WT. As shown in the local correlation patterns, residue Ala493 shows a more negative correlation with Jα helix but a more positive correlation with Gβ sheet compared to Leu493 (Figure 11A). As a result, L493A exhibits a higher flexibility with high fluctuations and the amplified correlation patterns (Figure 12A).

For the L496F mutant, Gln479-NE2 with Val520-O and Gln479-OE1 with Val520-N are missing compared to the WT, leading to a more flexible Jα helix. The hydrogen bond between Asn482-ND2 on the Gβ sheet and Leu453-O on the Eα helix is also absent. This leads to a more flexible Eα helix and Fα helix, and it is observed that the correlation between Phe496 to Fα helix are less positive in the mutant (Figure 11B). These absent hydrogen bonds might result from the steric hindrance of the larger Phenylalanine side chain (Figure 12B), which could contribute to the different distribution of L493A and L496F mutants from the WT and other mutants in the t-ICA 2D space (Figure 9A).

For the Q497A mutant, the two hydrogen bonds between Gln497 and Ile510 remain. In addition, the Cys450-FMN adduct has more hydrogen bonds associated with Arg451, Arg460, and Arg464 on Fα helix (Figure 12C). These hydrogen bonds may help the Q497A mutant maintain the local correlation, protein flexibility and dynamics similar to the WT (Figure 11C). Similar changes in the hydrogen bonds patterns of the Cys450-FMN core were observed in D515V mutant. This can be attributed that Asn482 form an extra hydrogen bond in both mutants with Leu453, compared to other mutants (Figure 12D). This hydrogen bond is disrupting the hydrogen bonds of Asn482 with FMN core in both Q497A and D515V, explaining the drastic changes in the hydrogen bond patterns observed in the FMN core.

For the D515V mutant, we observe an increased number of hydrogen bonds, particularly two hydrogen bonds near residue Val515: one between Leu493-N and Leu514-O, and one between Gly516-N and Trp491-O. As comparison, the WT exhibits one hydrogen bond between Leu514-N and Ala493-O. In this mutant, residues Leu493 and Leu514, located on two adjacent anti-parallel β-sheets, are likely to have stronger correlation with each other due to these hydrogen bonds. Similarly, the hydrogen bond between Gly516 and Trp491 also helps to maintain or strengthen the positive correlation comparing to the WT especially given the fact that residue Gly516 is at the end of the Iβ sheet and behave more similarly to as a loop. The correlations between the mutated residue Val515 and residues 490–492 are more positive than in the WT (Figure 11D). As a result, the Iβ sheet, especially the connecting region to the Jα helix, becomes more constrained, leading to reduced flexibility and increased higher Iβ sheet ratio. It is possible when Aspartic Acid changes to Valine at position 515, the negative side chain becomes neutral and the electrostatic attractions from surroundings (opposite to residues 490–492) mainly becomes steric effect and contribute to the attractions of residues 490–492.

Conclusion

Extensive molecular dynamics simulations were conducted for the wild type and 13 different mutations introduced into the Hβ and Iβ sheets of AsLOV2 to investigate the role of β-sheets in the protein’s allosteric mechanism. RMSF, DCCM, and MSM analyses indicated that four mutations accounted for most of the changes. The L493A mutant exhibits the highest fluctuations across various regions of the protein. The L493A and L496F mutants are among those exploring the most conformational space, likely due to the missing hydrogen bond between Val520 and Gln479. The Q497A and D515V mutants alter the hydrogen bond patterns with the FMN molecule significantly more than other mutations, likely due to the missing hydrogen bond between Asn482 and Leu453.

Supplementary Material

Supinfo

Acknowledgments

Computational time was provided by the Southern Methodist University’s Center for Research Computing.

Funding

Research reported in this article was supported by the National Institute of General Medical Sciences of the National Institutes of Health under Award No. R15GM122013.

Abbreviations

MD

molecular dynamics

RMSF

root-mean-square fluctuations

DSSP

dictionary of protein secondary structure

DCCM

dynamic cross-correlation matrix

Footnotes

Declaration of Interests

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Data Availability Statement

The processed results presented or mentioned in this study can be found in Zenodo repository at https://doi.org/10.5281/zenodo.10247350.

Reference

  • (1).McIntosh BE; Hogenesch JB; Bradfield CA Mammalian Per-Arnt-Sim Proteins in Environmental Adaptation. Annu. Rev. Physiol 2010, 72 (1), 625–645. 10.1146/annurev-physiol-021909-135922. [DOI] [PubMed] [Google Scholar]
  • (2).Möglich A; Ayers RA; Moffat K Structure and Signaling Mechanism of Per-ARNT-Sim Domains. Structure 2009, 17 (10), 1282–1294. 10.1016/j.str.2009.08.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (3).Lungu OI; Hallett RA; Choi EJ; Aiken MJ; Hahn KM; Kuhlman B Designing Photoswitchable Peptides Using the AsLOV2 Domain. Chem. Biol 2012, 19 (4), 507–517. 10.1016/j.chembiol.2012.02.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (4).Crosson S; Moffat K Structure of a Flavin-Binding Plant Photoreceptor Domain: Insights into Light-Mediated Signal Transduction. Proc. Natl. Acad. Sci 2001, 98 (6), 2995–3000. 10.1073/pnas.051520298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (5).Zoltowski BD; Vaccaro B; Crane BR Mechanism-Based Tuning of a LOV Domain Photoreceptor. Nat. Chem. Biol 2009, 5 (11), 827–834. 10.1038/nchembio.210. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (6).Pudasaini A; El-Arab KK; Zoltowski BD LOV-Based Optogenetic Devices: Light-Driven Modules to Impart Photoregulated Control of Cellular Signaling. Front. Mol. Biosci 2015, 2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (7).Strickland D; Moffat K; Sosnick TR Light-Activated DNA Binding in a Designed Allosteric Protein. Proc. Natl. Acad. Sci 2008, 105 (31), 10709–10714. 10.1073/pnas.0709610105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (8).Strickland D; Yao X; Gawlak G; Rosen MK; Gardner KH; Sosnick TR Rationally Improving LOV Domain–Based Photoswitches. Nat. Methods 2010, 7 (8), 623–626. 10.1038/nmeth.1473. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (9).Zayner JP; Antoniou C; Sosnick TR The Amino-Terminal Helix Modulates Light-Activated Conformational Changes in AsLOV2. J. Mol. Biol 2012, 419 (1), 61–74. 10.1016/j.jmb.2012.02.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (10).Nash AI; Ko W-H; Harper SM; Gardner KH A Conserved Glutamine Plays a Central Role in LOV Domain Signal Transmission and Its Duration. Biochemistry 2008, 47 (52), 13842–13849. 10.1021/bi801430e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (11).Yamamoto A; Iwata T; Sato Y; Matsuoka D; Tokutomi S; Kandori H Light Signal Transduction Pathway from Flavin Chromophore to the Jα Helix of Arabidopsis Phototropin1. Biophys. J 2009, 96 (7), 2771–2778. 10.1016/j.bpj.2008.12.3924. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (12).Zayner JP; Antoniou C; French AR; Hause RJ; Sosnick TR Investigating Models of Protein Function and Allostery With a Widespread Mutational Analysis of a Light-Activated Protein. Biophys. J 2013, 105 (4), 1027–1036. 10.1016/j.bpj.2013.07.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (13).Hollingsworth SA; Dror RO Molecular Dynamics Simulation for All. Neuron 2018, 99 (6), 1129–1143. 10.1016/j.neuron.2018.08.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (14).Karplus M; McCammon JA Molecular Dynamics Simulations of Biomolecules. Nat. Struct. Biol 2002, 9 (9), 646–652. 10.1038/nsb0902-646. [DOI] [PubMed] [Google Scholar]
  • (15).Ibrahim MT; Trozzi F; Tao P Dynamics of Hydrogen Bonds in the Secondary Structures of Allosteric Protein Avena Sativa Phototropin 1. Comput. Struct. Biotechnol. J 2022, 20, 50–64. 10.1016/j.csbj.2021.11.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (16).Xiao S; Verkhivker GM; Tao P Machine Learning and Protein Allostery. Trends Biochem. Sci 2023, 48 (4), 375–390. 10.1016/j.tibs.2022.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (17).Verkhivker G; Alshahrani M; Gupta G; Xiao S; Tao P From Deep Mutational Mapping of Allosteric Protein Landscapes to Deep Learning of Allostery and Hidden Allosteric Sites: Zooming in on “Allosteric Intersection” of Biochemical and Big Data Approaches. Int. J. Mol. Sci 2023, 24 (9), 7747. 10.3390/ijms24097747. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (18).Halavaty AS; Moffat K N- and C-Terminal Flanking Regions Modulate Light-Induced Signal Transduction in the LOV2 Domain of the Blue Light Sensor Phototropin 1 from Avena Sativa,. Biochemistry 2007, 46 (49), 14001–14009. 10.1021/bi701543e. [DOI] [PubMed] [Google Scholar]
  • (19).Brooks BR; Brooks CL III; Mackerell AD Jr.; Nilsson L; Petrella RJ; Roux B; Won Y; Archontis G; Bartels C; Boresch S; Caflisch A; Caves L; Cui Q; Dinner AR; Feig M; Fischer S; Gao J; Hodoscek M; Im W; Kuczera K; Lazaridis T; Ma J; Ovchinnikov V; Paci E; Pastor RW; Post CB; Pu JZ; Schaefer M; Tidor B; Venable RM; Woodcock HL; Wu X; Yang W; York DM; Karplus M CHARMM: The Biomolecular Simulation Program. J. Comput. Chem 2009, 30 (10), 1545–1614. 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (20).Freddolino PL; Gardner KH; Schulten K Signaling Mechanisms of LOV Domains: New Insights from Molecular Dynamics Studies. Photochem. Photobiol. Sci 2013, 12 (7), 1158–1170. 10.1039/c3pp25400c. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (21).Jorgensen WL; Chandrasekhar J; Madura JD; Impey RW; Klein ML Comparison of Simple Potential Functions for Simulating Liquid Water. J. Chem. Phys 1983, 79 (2), 926–935. 10.1063/1.445869. [DOI] [Google Scholar]
  • (22).Ryckaert J-P; Ciccotti G; Berendsen HJC Numerical Integration of the Cartesian Equations of Motion of a System with Constraints: Molecular Dynamics of n-Alkanes. J. Comput. Phys 1977, 23 (3), 327–341. 10.1016/0021-9991(77)90098-5. [DOI] [Google Scholar]
  • (23).Darden T; York D; Pedersen L Particle Mesh Ewald: An N⋅log(N) Method for Ewald Sums in Large Systems. J. Chem. Phys 1993, 98 (12), 10089–10092. 10.1063/1.464397. [DOI] [Google Scholar]
  • (24).Eastman P; Swails J; Chodera JD; McGibbon RT; Zhao Y; Beauchamp KA; Wang L-P; Simmonett AC; Harrigan MP; Stern CD; Wiewiora RP; Brooks BR; Pande VS OpenMM 7: Rapid Development of High Performance Algorithms for Molecular Dynamics. PLOS Comput. Biol 2017, 13 (7), e1005659. 10.1371/journal.pcbi.1005659. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (25).Baker EN; Hubbard RE Hydrogen Bonding in Globular Proteins. Prog. Biophys. Mol. Biol 1984, 44 (2), 97–179. 10.1016/0079-6107(84)90007–5. [DOI] [PubMed] [Google Scholar]
  • (26).McGibbon RT; Beauchamp KA; Harrigan MP; Klein C; Swails JM; Hernández CX; Schwantes CR; Wang L-P; Lane TJ; Pande VS MDTraj: A Modern Open Library for the Analysis of Molecular Dynamics Trajectories. Biophys. J 2015, 109 (8), 1528–1532. 10.1016/j.bpj.2015.08.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (27).Kabsch W; Sander C Dictionary of Protein Secondary Structure: Pattern Recognition of Hydrogen-Bonded and Geometrical Features. Biopolymers 1983, 22 (12), 2577–2637. 10.1002/bip.360221211. [DOI] [PubMed] [Google Scholar]
  • (28).Amadei A; Linssen ABM; Berendsen HJC Essential Dynamics of Proteins. Proteins Struct. Funct. Bioinforma 1993, 17 (4), 412–425. 10.1002/prot.340170408. [DOI] [PubMed] [Google Scholar]
  • (29).Bakan A; Meireles LM; Bahar I ProDy: Protein Dynamics Inferred from Theory and Experiments. Bioinformatics 2011, 27 (11), 1575–1577. 10.1093/bioinformatics/btr168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (30).Zhang S; Krieger JM; Zhang Y; Kaya C; Kaynak B; Mikulska-Ruminska K; Doruker P; Li H; Bahar I ProDy 2.0: Increased Scale and Scope after 10 Years of Protein Dynamics Modelling with Python. Bioinformatics 2021, 37 (20), 3657–3659. 10.1093/bioinformatics/btab187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (31).Xiao S; Song Z; Tian H; Tao P Assessments of Variational Autoencoder in Protein Conformation Exploration. J. Comput. Biophys. Chem 2023, 22 (04), 489–501. 10.1142/S2737416523500217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (32).Pérez-Hernández G; Paul F; Giorgino T; De Fabritiis G; Noé F Identification of Slow Molecular Order Parameters for Markov Model Construction. J. Chem. Phys 2013, 139 (1), 015102. 10.1063/1.4811489. [DOI] [PubMed] [Google Scholar]
  • (33).Noé F; Clementi C Kinetic Distance and Kinetic Maps from Molecular Dynamics Simulation. J. Chem. Theory Comput 2015, 11 (10), 5002–5011. 10.1021/acs.jctc.5b00553. [DOI] [PubMed] [Google Scholar]
  • (34).Molgedey L; Schuster HG Separation of a Mixture of Independent Signals Using Time Delayed Correlations. Phys. Rev. Lett 1994, 72 (23), 3634–3637. 10.1103/PhysRevLett.72.3634. [DOI] [PubMed] [Google Scholar]
  • (35).Röblitz S; Weber M Fuzzy Spectral Clustering by PCCA+: Application to Markov State Models and Data Classification. Adv. Data Anal. Classif 2013, 7 (2), 147–179. 10.1007/s11634-013-0134-6. [DOI] [Google Scholar]
  • (36).Trendelkamp-Schroer B; Wu H; Paul F; Noé F Estimation and Uncertainty of Reversible Markov Models. J. Chem. Phys 2015, 143 (17), 174101. 10.1063/1.4934536. [DOI] [PubMed] [Google Scholar]
  • (37).Xiao S; Alshahrani M; Gupta G; Tao P; Verkhivker G Markov State Models and Perturbation-Based Approaches Reveal Distinct Dynamic Signatures and Hidden Allosteric Pockets in the Emerging SARS-Cov-2 Spike Omicron Variant Complexes with the Host Receptor: The Interplay of Dynamics and Convergent Evolution Modulates Allostery and Functional Mechanisms. J. Chem. Inf. Model 2023, 63 (16), 5272–5296. 10.1021/acs.jcim.3c00778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (38).Scherer MK; Trendelkamp-Schroer B; Paul F; Pérez-Hernández G; Hoffmann M; Plattner N; Wehmeyer C; Prinz J-H; Noé F PyEMMA 2: A Software Package for Estimation, Validation, and Analysis of Markov Models. J. Chem. Theory Comput 2015, 11 (11), 5525–5542. 10.1021/acs.jctc.5b00743. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supinfo

Data Availability Statement

The processed results presented or mentioned in this study can be found in Zenodo repository at https://doi.org/10.5281/zenodo.10247350.

RESOURCES