Skip to main content
PLOS One logoLink to PLOS One
. 2024 May 15;19(5):e0302398. doi: 10.1371/journal.pone.0302398

Computational investigation of cis-1,4-polyisoprene binding to the latex-clearing protein LcpK30

Aziana Abu Hassan 1, Marko Hanževački 1,¤,*, Anca Pordea 1,*
Editor: Sushil Mishra2
PMCID: PMC11095694  PMID: 38748648

Abstract

Latex clearing proteins (Lcps) catalyze the oxidative cleavage of the C = C bonds in cis-1,4-polyisoprene (natural rubber), producing oligomeric compounds that can be repurposed to other materials. The active catalytic site of Lcps is buried inside the protein structure, thus raising the question of how the large hydrophobic rubber chains can access the catalytic center. To improve our understanding of hydrophobic polymeric substrate binding to Lcps and subsequent catalysis, we investigated the interaction of a substrate model containing ten carbon-carbon double bonds with the structurally characterized LcpK30, using multiple computational tools. Prediction of the putative tunnels and cavities in the LcpK30 structure, using CAVER-Pymol plugin 3.0.3, fpocket and Molecular Dynamic (MD) simulations provided valuable insights on how substrate enters from the surface to the buried active site. Two dominant tunnels were discovered that provided feasible routes for substrate binding, and the presence of two hydrophobic pockets was predicted near the heme cofactor. The larger of these pockets is likely to accommodate the substrate and to determine the size distribution of the oligomers. Protein-ligand docking was carried out using GOLD software to predict the conformations and interactions of the substrate within the protein active site. Deeper insight into the protein-substrate interactions, including close-contacts, binding energies and potential cleavage sites in the cis-1,4-polyisoprene, were obtained from MD simulations. Our findings provide further justification that the protein-substrate complexation in LcpK30 is mainly driven by the hydrophobic interactions accompanied by mutual conformational changes of both molecules. Two potential binding modes were identified, with the substrate in either extended or folded conformations. Whilst binding in the extended conformation was most favorable, the folded conformation suggested a preference for cleavage of a central double bond, leading to a preference for oligomers with 5 to 6 C = C bonds. The results provide insight into further enzyme engineering studies to improve catalytic activity and diversify the substrate and product scope of Lcps.

Introduction

Rubber materials are essential in everyday life, with more than 25 million metric tons of natural and synthetic rubber being produced and consumed yearly [1]. This results in a high amount of rubber waste, which is challenging to recycle or decompose, thus creating a serious environmental concern once the material reaches the end of life. An economically and environmentally sustainable strategy for rubber waste disposal is yet to be established. The degradation of rubber polymers mainly involves oxidative cleavage of C = C bonds in the polymer backbone and has been reported with organic or organometallic catalysts, which are limited by the use of rare metals (e.g. Grubbs catalyst) and harsh reagents [2]. An attractive alternative is enzymatic rubber degradation using oxidative enzymes. Rubber oxygenases (RoxA and RoxB), isolated from Gram-negative bacteria and latex clearing proteins (Lcps), isolated mostly from Gram-positive bacteria, have been shown to degrade both natural and synthetic cis-1,4-polyisoprene, with the latter enzymes being more extensively studied [3]. The products are functionalized oligoisoprenoids carrying a carbonyl group (ketone and aldehyde) at each end (Fig 1a). This cleavage preserves the polyisoprene structure and gives access to lower molecular weight products, which can be either further metabolized or used as materials with potentially interesting properties [4]. Thus, rubber degrading enzymes offer exciting perspectives for polymer degradation within a circular economy and have been extensively studied in the past few years.

Fig 1. Crystal structure and cis-1,4-polyisoprene cleavage reaction catalyzed by LcpK30.

Fig 1

(a) Oxidative cleavage of cis-1,4-polyisoprene catalyzed by LcpK30. (b) Crystal structure of LcpK30 (PDB ID: 5O1L) with a close view of heme in the active site..Coordination of an imidazole molecule to the distal site of heme fixes the protein structure in its open conformation (PDB: 5O1L), while the coordination by the sidechain of Lys167 forces the closed state of the enzyme (PDB: 5O1M).

A range of Lcps have been identified from several organisms [59] and were shown to be b-type cytochromes with a non-covalently bound heme cofactor. The crystal structure of LcpK30 has been determined to possess a globin-like fold, showing the heme to be coordinated to His198 (Fig 1b) [10]. The oxidative cleavage of polyisoprene by Lcp occurs in an endo-type mechanism and results in a mixture of oligoisoprenoids with different lengths, from C20 to C65 (3 to 12 isoprene units) [3, 1113]. A defined distribution of degradation products is observed, with a preference for 5–6 (for Lcp1VH2) [14] or 4–7 (for LcpK30) [1, 8, 15] isoprene repetitive units and lower concentrations of smaller and larger products. Quantum mechanics/molecular mechanics (QM/MM) calculations suggested that the addition of the heme-bound dioxygen to the C = C bond triggers its cleavage via a dioxetane intermediate [16]. This mechanism is similar to two other heme-based dioxygenases, indoleamine and tryptophan 2,3-dioxygenases.

Structural and spectroscopic analysis of LcpK30 suggested the existence of two conformational states, closed and open states (Fig 1b) [10]. In the open state, a continuous hydrophobic substrate channel passes next to the heme group and allows the substrate to access the active site, whilst, in the closed state, the channel is blocked by coordination of the heme to Lys167. This difference is most visible in the conformation of the active site helix (also known as helix E) [10], comprised of residues 160–176. The helix is more structured in the closed state compared to the open state, in which this helix is split into two fragments (S1 Fig). Most enzymes that react with polymeric substrates, such as PETases (with PET substrates) and lytic polysaccharide monooxygenases, LPMOs (with polysaccharide substrates) contain exposed active sites, which allow easy contact with the polymer chains of the substrate [17, 18]. Intriguingly, the catalytic heme cofactor in Lcps is buried within a hydrophobic pocket, which raises the question of how the insoluble hydrophobic rubber chains can access the active site.

Despite an increased understanding of the structure and catalytic mechanism of Lcps, a useful process for enzymatic rubber degradation has not been achieved. The enzymatic degradation rates remain low, and the range of oligoisoprenoid products is too heterogeneous to be convenient for applications as materials. Furthermore, the enzymatic degradation is limited to cis-1,4-polyisoprene rubber, whilst the degradation of other diene rubbers remains largely underexplored. These challenges could potentially be addressed through protein engineering approaches, which require an understanding of the interaction between substrate and protein.

In this study, we used a computational modelling approach starting from the LcpK30 crystal structure in open conformation to understand the interaction between LcpK30 and its flexible polyisoprene substrate. Firstly, the potential access pathways of cis-1,4-polyisoprene to the active site of LcpK30 were explored in the static structure using the CAVER PyMol plugin 3.0.3 and fpocket, to identify relevant hydrophobic tunnels and pockets that are large enough to accommodate the substrate [19]. Dynamic simulations further investigated tunnel availability and flexibility. We then carried out supramolecular docking of a C50 substrate model containing 10 repeating isoprene units (C50H82), which was deemed sufficiently large to provide valuable insights into the location, conformations and interaction of the polymer chains with the enzyme. Given the challenge to obtain precise information from the docking of long alk(en)yl chains without specific pharmacophores, we used molecular dynamics (MD) simulations to provide a better understanding of the stability and the nature of the predicted substrate poses and conformational changes of the protein.

Results and discussion

Tunnel availability and conformational flexibility of LcpK30 without substrate bound

Putative tunnels were initially identified within the static crystal structure of LcpK30, to obtain a general overview of potentially important pathways for the substrate access from the surface to the active site. This was done with the CAVER-PyMOL plugin, which provides a graphical interface for setting up the calculation and allows interactive visualization of tunnels or channels in protein static structures [20]. We used both the open (5O1L) and the closed state conformations (5O1M), however only the former led to tunnel identification using the default parameters (see Methods). A total of four tunnels were identified, and their lengths, radii, and bottlenecks (narrowest parts of the tunnels) were determined (see S1 Table and S2 Fig for details on the static tunnel features). Tunnels 1 and 2 had the highest throughput and, thus, the highest probability to be used as routes for the transport of substrates (Fig 2a and S1 Table). Near the heme cofactor in the active site, these tunnels were lined up by residues known to be of high importance for LcpK30 activity: Glu148, Arg164, Lys167 and Thr168 [10, 12]. Other, more surface-exposed residues interacting with the tunnels included Trp82, Thr83, Arg84, Ala151, Val152, Gly157, Gly158, Ala159, Asp163, Ile165, Ala166, Ala169, Arg170, Leu171, Asp174, His184, Gly185, Ser186, Val189, Thr190, Lys193, Thr194, Val197, His198, Thr230, Glu392, Gly393, Arg394, Arg395, Ile396, Ala397, Ile398, Asp399, and Pro401. The radius of putative tunnels ranged between approximately 1 Å and 2.8 Å, with the bottleneck (0.96 Å) located near the heme and the wider radius found towards the outer surface of the protein.

Fig 2. Static tunnel mapping and pocket identification in the LcpK30 open structure without substrate.

Fig 2

(a) Two dominant tunnels identified by CAVER-Pymol plugin 3.0.3 (tunnel 1 in blue and tunnel 2 in green); (b) five detected pockets or cavities closest to heme in the LcpK30 static X-ray structure, identified by automated geometry-based fpocket tool. The pockets are pocket 1 (olive), pocket 3 (cyan), pocket 4 (orange), pocket 9 (magenta) and pocket 12 (yellow).

The average radii throughout both tunnels were smaller than the estimated width of the cis-1,4-polyisoprene substrate molecule (S2c Fig), which suggested that significant conformational changes of the protein are required to accommodate the hydrophobic chains of the large polymeric substrates through this gateway. Alternatively, these tunnels could also be suited for transporting smaller oxygen to the heme. We observed a significant difference in the length of the tunnels from the heme starting point to the protein surface, with tunnel 2 being about twice as long compared to tunnel 1. Whilst a shorter tunnel such as tunnel 1 may generally give quicker access of substrates to the active site, in this case, where the substrate is a long polymer, the longer tunnel 2 may provide more opportunity for protein-substrate interaction. Furthermore, tunnel 2 has a wider opening (2.8 Å) at the protein surface (Glu392, Gly393, Arg394, Arg395, Ile396) which agrees well with the estimated thickness of cis-1,4-polyisoprene and which could represent a potential gateway for the substrate access to the active site.

The important pockets (protein cavities) in the enzyme in the open state were also predicted using an automated geometry-based fpocket tool that decomposes a 3D protein into Voronoi polyhedrals (see Methods for details). A total of 24 pockets were detected throughout LcpK30. However, only five pockets (pocket 1, 3, 4, 9 and 12) were considered relevant for substrate binding due to their near proximity to heme and the active site (Fig 2b). Out of these five, two pockets were especially interesting since they are located at a distal site directly above the heme cofactor: a larger pocket 3, with a pocket volume of 173.52 Å3 and a smaller pocket 12, with a volume of 77.21 Å3. These two pockets also had a relatively high hydrophobicity score, indicating that they could accommodate the lipophilic cis-1,4-polyisoprene ligand in its bound state. Interestingly, these cavities were also occupied by imidazole molecules in the crystal structure. The residues surrounding these two pockets are Ser138, Ser142, Glu148, Val152, Arg164, Lys167, Thr168, Leu171, Thr230, Ser233 and Leu234, and include the same residues lining up the tunnels identified earlier using CAVER-PyMOL plugin. Taken together, the information about the tunnels and pockets calculated on the X-ray structure of LcpK30, provided a better picture on possible entry pathways that lead from the surface into the hydrophobic active site of the protein interior and gave an indication of the space where ligand might bind.

Since the intrinsically dynamic nature of the protein could largely influence the pathways and cavities through the protein, we carried out extensive MD simulations of LcpK30 to obtain more information, specifically on the conformational changes that could influence binding of the substrate to LcpK30. We assumed that polyisoprene binding occurs in the catalytically active state of LcpK30 and therefore we prepared an LcpK30 model starting from the open state structure, in which the co-crystalized imidazole molecule was replaced with O2 bound to the heme cofactor and where the Lys167 chain (not heme bound) was fully protonated, which we were able to simulate during relatively long simulation timescales. For comparison, we also modelled the closed-like state system, prepared from the closed state structure conformation, but where the coordination of Lys167 to heme was removed and replaced with O2 coordination, whilst Lys167 was fully protonated.

Relatively low C-alpha backbone RMSD were obtained over the course of 500 ns MD simulations, for both open and closed-like conformations (see S3a Fig for the open conformation). This suggested that the enzyme with no substrate bound did not undergo large structural changes compared to the reference X-ray structures of the open and closed states. The RMS fluctuation analysis of the C-alpha atoms revealed conformational changes of the open state protein linked to residues 155–160 from the loop near heme and residues 392–396 at the C-terminus (S3b Fig). Similar fluctuations were also found in the enzyme from the experimental crystallographic B-factor (S4 Fig). We further characterized these conformational changes as the loop rotation containing Arg394, which occasionally replaced Lys167, forming the salt bridge with the carboxylate group of heme during the simulation. Taking into account these dynamic features of the protein, we hypothesize that the structural adjustment of LcpK30 could facilitate the entry of substrate and exit of oligoisoprenoid products through the gateway comprised of C-terminus loop 392–396 and flexible loop 155–160.

Principal component analysis (PCA) taking into account backbone cartesian coordinates was used to compare open and closed states of the enzyme with no substrate. A clear separation of the two states along the PC1 was observed, suggesting that major conformational changes must occur that would allow the transition between closed (positive PC1 values) and open (negative PC1 values) conformations (Fig 3a). Furthermore, the normal mode displacements (Fig 3b) along the PC1 were characterised as conformational changes that predominantly involve helix E residues 160–176, which also contains the central Lys167. Namely, in the open conformation Lys167 is exposed towards the protein surface interacting with the propionates of the heme cofactor, in contrast to the closed-like conformation where it is rotated into the protein interior interacting with O2 and Glu148. On the other hand, displacements along the PC2 are associated with the flexibility of the C-terminus loop 392–397 ranging from fully closed (negative PC2 values) to a fully open (positive PC2 values) conformation (Fig 3b).

Fig 3. Principal component analysis (PCA) of MD simulations of open and closed-like states of LcpK30 without substrate.

Fig 3

PCA was performed considering cartesian coordinates of protein backbone atoms (N, Cα, C and O). (a) MD snapshots projected on the first two principal components. Blue and red circles represent MD snapshots belonging to open and closed-like states, respectively. (b) Normal mode displacement vectors associated with the first two principal components showing only motions longer than 2 Å as porcupine in both directions. The backbone is shown as cartoon representation and colored by the mobility where lower and higher flexibility is depicted in blue and red, respectively.

Despite the closed-like state having identical amino acid protonation states and the same species present in the active site as the open state (fully protonated Lys167 and O2 bound to heme, a transition between the open and the closed-like states was not observed during the simulation, possibly because of a large barrier separating these two states, which simply could not be sampled using conventional MD simulations at the available timescales. This implies that the opening of the enzyme is a complex and controlled process that happens on longer timescales and involves large conformational changes of the helix E and Lys167 and might even be coupled with the approach and the interaction with the polyisoprene substrate.

Given the enzyme flexibility required to accommodate the substrate, we sought to improve the static tunnel predictions by employing the dynamical approach implemented in CAVER Analyst 2, using MD snapshots instead of static crystal structures. The first 5 identified tunnel clusters ranked based on priority (calculated by averaging tunnel throughputs over all snapshots) were analyzed, for MD simulations with both the open and closed-like conformations and the details about the features of the dynamic tunnel clusters are shown in Table 1. Open state tunnel clusters had overall larger average and maximum bottleneck radius, shorter length, smaller curvature and higher throughput than closed state ones, which ultimately makes them more favorable for substrate binding. Furthermore, the priority of the tunnel clusters calculated based on the appearance frequency during the MD simulations was much higher in the open (14–49%) compared to the closed (4–13%) state. Interestingly, when comparing the spatial distribution of timeless tunnel cluster pathways, we could clearly see how the central Lys167 plays an important role in separating tunnels into two wider superclusters in the open state (Fig 4a, clusters 1 and 3 forming the green supercluster and clusters 2 and 5 forming the blue supercluster). This is in contrast with the rotated Lys167 conformation forcing the appearance of one narrow supercluster in the closed-like state (S5 Fig, clusters 1,3,4 and 5 forming the limon supercluster).

Table 1. Features of first 5 dominant dynamical tunnel clusters in LcpK30 open and closed-like states.

ID Average bottleneck radius (Å) Std. dev. Maximum bottleneck radius (Å) Average length (Å) Std. dev. Average curvature Std. dev. Average throughput Std. dev.
Open state
1 1.36 0.21 1.98 13.79 3.01 1.34 0.16 0.64 0.09
2 1.13 0.18 1.87 10.68 2.71 1.18 0.10 0.60 0.07
3 1.33 0.21 1.87 19.43 3.51 1.28 0.10 0.56 0.09
4 1.18 0.17 1.62 20.43 3.29 1.38 0.17 0.46 0.06
5 1.37 0.21 1.77 10.72 2.44 1.20 0.14 0.66 0.08
Closed-like state
1 1.01 0.08 1.21 14.92 4.13 1.31 0.20 0.48 0.11
2 1.00 0.08 1.19 19.06 4.65 1.60 0.34 0.39 0.08
3 0.99 0.07 1.20 17.40 4.72 1.44 0.33 0.42 0.09
4 0.98 0.07 1.16 23.52 5.09 1.74 0.31 0.35 0.09
5 0.98 0.07 1.20 24.88 4.84 1.82 0.33 0.30 0.08

Calculations were performed on 1500 snapshots from MD simulations of open (upper) and closed-like (lower) states of LcpK30.

Fig 4. Dynamical tunnel mapping and pocket identification in the LcpK30 open structure without substrate.

Fig 4

(a) Dynamical tunnel calculations with CAVER Analyst 2 using MD simulations of LcpK30 in open conformation. First 5 dominant tunnel clusters were calculated on 1500 snapshots from MD simulations of the open state. Two superclusters are formed, colored in green (comprised of clusters 1 and 3) and blue (comprised of clusters 2 and 5). Clustering displays the center lines for all tunnels computed for all snapshots at once. Center lines are colored according to their related clusters. (b) Pockets detected closest to heme calculated on 1500 snapshots from MD simulations of LcpK30 in the open state.

The two dominant favorable tunnel superclusters obtained with the open state were similar to tunnels 1 and 2 initially calculated in the static X-ray structure (Fig 1), but with a slightly larger average and maximum bottleneck radii. Both superclusters were lined by predominantly hydrophobic residues as found in the case of static tunnels. This suggests that structural changes of LcpK30 do not largely influence the ranking and features of the putative pathways for substrate access. Interestingly, we also observed a new branched pathway connecting the heme cofactor in the protein interior with the surface through the tunnels surrounded by Glu148, Ile145, Asp226, Val229 and Thr230 (cluster 4 in Fig 4a). Due to the salt bridge between Arg147 and the neighboring Glu52 and Asp226, cluster 4 splits from its main route into two pathways. Because of its smaller average bottleneck radius, this cluster might be important for the transport of smaller products after the cleavage or O2 into and from the active site. We also noted that such a narrow tunnel could only be observed in the static structure after decreasing a minimum probe radius.

The analysis of potential protein cavities using the snapshots from MD simulations also confirmed the persistence of two hydrophobic pockets that could accommodate ligands in the active site near heme, both with slightly increased average volumes compared to the static structure. The larger pocket 3 directly interacts with heme and has an average volume of 182.08 Å3. A smaller pocket 12 has an average volume of 116.97 Å3 and is buried deeper in the protein interior (Fig 4b).

Interestingly, despite removing crystal waters from the crystal structure before the simulation, we observed a water network occupying the tunnels and both hydrophobic pockets above heme. Shortly after the relaxation, several buried protein sites, including the active site, were solvated, which was confirmed by obtaining a similar water occupancy from MD simulations as in the crystal structure. Moreover, we observed increased hydration of the tunnels in the open state compared to poorly hydrated tunnels in the closed state, again suggesting that these tunnels are more feasible in the open structure. This was also supported by the radial distribution function (RDF) of water oxygen atoms around the heme-bound O2 indicating that the probability of water molecules approaching into the hydrophobic active site of LcpK30 is higher in the open conformation (S6 Fig).

Taken together, these results suggested an easier binding of larger substrates into the open compared to the closed conformation and indicated the existence of hydrophobic tunnels that provide access of the polymeric substrate to the active site. Next, we used docking of a polyisoprene substrate model to the open state structure to gain an understanding of the binding mode of the substrate to LcpK30.

Molecular docking of the cis-1,4-polyisoprene substrate into LcpK30

Computational modelling of the interaction between flexible oligomeric ligands and their protein receptors is challenging. Dynamic modelling would be the best approach to account for the flexibility and obtain robust results [21, 22]. Modelling of the interaction between a polyisoprene-type substrate and LcpK30 has been previously reported in an elegant study by Zhang and Liu, in which they carried out docking of a short oligomer with four repeating units into the binding site near the heme cofactor containing a bound dioxygen [16]. In their work, the authors primarily focused on the QM/MM calculations of the LcpK30 catalytic mechanism featuring the oxidative cleavage of the C = C bond. However, such a short substrate model does not provide the full information about the interactions occurring further away from the active site, for example within the tunnels and hydrophobic pockets, that could additionally influence its binding and the catalysis outcome. On the other hand, the natural substrate of LcpK30 is a long polymer consisting of more than 10,000 repeating isoprene units, which would be too complex for standard docking into protein [23]. The major oligomers produced in the reaction with LcpK30 typically range from C20 to C35 (4 to 7 repeating isoprene units). Therefore, we reasoned that a substrate model with 10 repeating isoprene units (C50H82, Fig 5) was sufficiently large to provide valuable insights into the interaction of the polymer substrate with the enzyme. In an initial approach, we used docking to get an approximate indication of the binding of this substrate model to LcpK30, which we further refined with molecular dynamics simulations (see next section).

Fig 5. Substrate model with 10 repeating isoprene units (C50H82) used in this work.

Fig 5

Our docking calculations were carried out in the GOLD software, using two different scoring functions, ChemPLP and ChemScore, which differ in the calculation of the fitness scores (see Methods). These scoring functions calculate docking scores by employing different empirical energy terms, such as van der Waals (VDW) energy, electrostatics, hydrogen bond, desolvation, entropy and hydrophobicity, to yield a dimensionless fitness score, which is optimised for ligand binding site prediction over binding affinity prediction. Hence, the fitness score provides an indication of the quality of the docking position. A higher (positive) fitness score indicates a higher probability that a ligand will bind to a protein in the given docking position, whilst a negative score indicates poor binding [24]. The crystal structure of LcpK30 in its open conformation was used as the protein receptor, where a dioxygen molecule was manually coordinated to the Fe(II) in heme. The binding site included all atoms within 10 Å of radius from the central iron atom in heme, which also contained the residues reported to be part of the active site of LcpK30 (Arg164, Lys167, Thr168 and His198) [12]. In the initial docking approach, the substrate was fully flexible while the protein was kept rigid. However, by using this rigid docking approach, we found that even the highest ranked poses had a very low, or even negative fitness score, indicating that significant steric clashes occurred between protein and substrate (S7 Fig and S2 Table). Therefore this docking approach was unsuitable to accurately describe the interaction between LcpK30 and the polyisoprene substrate.

To improve docking quality, induced fit (flexible) docking was carried out, accounting for flexibility of the receptor by allowing a full torsional flexibility of 10 amino acid sidechains in the active site of LcpK30 (see Methods for details and Table 2 for docking results). Increasing the size of the defined binding site from 10 Å to 15 Å was tested but was not suitable; although higher fitness scores were obtained, the oligomeric substrate tended to dock preferentially at the surface of the protein, without interaction with the heme and the active site of LcpK30 (S3 Table and S8 Fig). The 10 structures obtained using induced fit docking with each of the ChemPLP and ChemScore protocols were significantly different from those obtained with rigid docking and showed higher fitness scores. To obtain further information on their orientation for catalysis, we analyzed the proximity of the C = C bond to the distal oxygen and the polymer conformation based on its span, its chain terminus position, and the position of the cleaved double bond (Table 2). In most docked poses, the distance between the C = C bond and the distal oxygen was below 5 Å, indicating that these structures could indeed represent pre-reactive states. Some exceptions were observed (ChemPLP docking pose rank 7; ChemScore docking poses 6 and 8), where the substrate was further away from the heme.

Table 2. Structural features of the best docking poses obtained from induced fit docking within GOLD.

Docking solutions are ranked based on the fitness score from highest to lowest.

Rank Fitness Score C = C closest to Fe−O2 complexa Distance of C = C to the oxygen atom (Å)b Ligand conformation
-H2C-C = = C-CH3
ChemPLP docking protocol
1 78.94 6 5 5.3 Folded
2 67.25 5 5.2 4 Folded
3 64.8 3 4.3 3.5 Extended
4 40.51 7 5.9 5.1 Folded
5 29.74 9 3.8 2.5 Extended
6 21.37 8 2.2 2.6 Extended
7 19.99 9 5.9 7 Extended
8 -13.14 3 2.6 2.4 Extended
9 -42.16 4 5 5.4 Folded
10 -60.12 5 3.2 2.9 Extended
ChemScore docking protocol
1 32.65 6 5.8 4.5 Folded
2 29.29 8 3.3 3.8 Extended
3 18.16 4 4 3.1 Extended
4 13.57 3 3 2.9 Extended
5 12.1 3 2.9 2.6 Extended
6 5.8 7 8.3 9 Extended
7 5.44 8 3.4 3.1 Extended
8 -2.2 1 7.8 7.2 Extended
9 -3.92 3 3.7 3.3 Folded
10 -6.98 8 4.1 3.9 Extended

a the first C = C unit number was counted from the end terminal with two methyl group R-C = C-(CH3)2 –see Fig 5.

b the distance was measured from each C atom in the double bond to the distal O atom of the Fe-O2 complex.

Overall, we found two prominent docked conformations of the oligomeric substrate, which we characterized as extended and folded. An extended conformation was defined as a structure in which one terminus of the oligomeric substrate model occupied the hydrophobic cavity within the protein interior near to heme (consistent with pocket 3 in our previous finding on the identification of tunnels and pockets, Fig 2b), interacting with the residues above the heme at a distal site, whilst the other terminus was exposed to solvent at the surface of the protein. A folded conformation was defined where both termini were found towards the surface of the protein (S9 Fig). In both these conformations, a C = C bond was found in the proximity of the distal oxygen, thus increasing the possibility of the reaction. The poses that were further away from the heme had an extended-like conformation. Interestingly, the higher docking scores using both ChemPLP and ChemScore functions were obtained with folded conformations. This probably occurred due to reduced steric clashes between atoms of neighboring residues in the protein structure, which increased the fitness scoring function for the folded conformation.

PLIP analysis of the substrate-LcpK30 contacts revealed mostly hydrophobic interactions. Three residues, Lys167, Leu171 and Ile396 were involved in interactions with all docked conformations, with Ala159 involved in interactions with most docked conformations obtained with both ChemPLP and ChemScore (S4 and S5 Tables and S10 Fig). Residue Lys167 has previously been reported to serve as a gating mechanism that opens and closes the entrance to the hydrophobic channel of the enzyme active site [10]. Our docking results infer that this residue might also play an important role in stabilizing protein-substrate complexes. Furthermore, only the extended docked conformations interacted with the Glu148 side chain, located further inside the hydrophobic channel above the heme and previously suggested to play a function in fine-tuning the active pocket to accommodate the substrate for reaction [16]. Earlier work on mutation of Glu148 to alanine, histidine, and glutamine revealed that the glutamate residue affected the specific activity of the enzyme but showed no significant differences in biochemical and biophysical properties, as well as no change in oligoisoprenoid product distribution [10]. Given this, we hypothesize that the extended substrate conformation is more representative of a catalytically relevant pose of the substrate within LcpK30. However, it cannot be excluded that the cleavage occurs with the substrate in the alternative, folded conformation, given the overall higher docking scores and flexibility of the system.

Molecular dynamics simulation of LcpK30 in the presence of cis-1,4-polyisoprene.

We further explored the influence of substrate binding on the conformation of LcpK30 by performing MD simulations of the enzyme in the presence of the docked oligomeric substrate C50H82. We carried out simulations for all top 10 ranked solutions obtained with each of the ChemPLP and ChemScore fitness functions (20 poses in total).

Compared to the case without the substrate, relatively larger RMSD values of the enzyme C-alpha atoms were observed in the presence of the substrate, sometimes even above 2 Å, especially in the case of the poses in the extended conformation that bind deeper within the protein interior, such as docking solutions 10 and 3 from ChemPLP and ChemScore, respectively (Fig 6). From the RMSF calculations of the C-alpha atoms during the simulation of LcpK30 with the substrate bound, we observed overall very similar fluctuations as in the case of the enzyme without the substrate, which could indicate that these are characteristic for the protein in solution irrespective of whether substrate is present or not. However, when comparing the high RMSD docking solutions with the structure without the substrate, we characterized an increased fluctuation of protein residues 145–154 from a short helix interacting with the substrate bound in the hydrophobic pocket near the heme cofactor. Moreover, larger fluctuations were also observed for the gateway loop residues 155–160 and the helix E residues 160–176, containing the flexible Lys167 (see also S11 and S12 Figs for a visualization of RMSF values on the backbone of LcpK30).

Fig 6. RMSD and RMSF profiles for LcpK30 in the open state with cis-1,4-polyisoprene (C50H82) substrate model bound.

Fig 6

C-alpha RMSD (left) and RMSF (right) were calculated during 100 ns MD simulations, starting from 10 docking poses obtained with (a) ChemPLP and (b) ChemScore. All values are calculated from the reference X-ray structure of the enzyme. Residues 29–49 from N-terminus were omitted from the analysis to minimize the noise.

The PCA analysis combining MD snapshots of the enzyme with and without substrate showed that compared to the empty enzyme, the presence of substrate slightly changed the conformation of the enzyme, irrespective of the extended or folded substrate binding mode (S13 Fig). The bound and unbound states were not separated along only one principal component, instead displacements along both PC1 and PC2 were observed. Normal mode displacements along the PC1 included the loop 155–160, helix E 160–176 and C-terminal helix Z2 297–314, whilst the displacement along the PC2 mostly localised at the C-terminal loop residues 392–397. As observed from the RMSD analysis, in some cases where the substrate binds deeper in the active site, the conformation of the enzyme bound to substrate was significantly different from the empty enzyme (for example in the case of extended substrate conformations 10 from ChemPLP and 3 from ChemScore). Furthermore, significant structural changes of the enzyme, especially C-terminus loop 392–397, were observed in case of simulations with the folded substrate conformations 1 from both ChemPLP and ChemScore.

The effects of the bound substrate on the tunnels leading to the active site were investigated next, by calculating the dynamical tunnel clusters within the complex between LcpK30 and cis-1,4-polyisoprene in extended and folded conformations, originating from both ChemPLP and ChemScore poses. The calculated features of the first 10 dominant clusters demonstrated that although throughputs were relatively high, and the priority of the individual tunnel clusters was very broad ranging from 9–44%, these tunnels had overall similar average and maximum bottleneck radius as in the case of LcpK30 in the closed-like state and without substrate bound (Fig 7 and Table 3, compare with Fig 4 and Table 1). However, the spatial distribution of tunnel clusters was similar to the open conformation without substrate, with two clearly separated superclusters due to the presence of the salt bridge between propionates of the heme cofactor and Lys167. All these results indicated that although mutual conformational changes of the enzyme and the substrate occurred (as supported by the PCA analysis) which influenced the throughput of the tunnels, a bound polyisoprene substrate strongly interacted with the enzyme within the characterized tunnels, which caused the narrowing of the bottlenecks.

Fig 7. Dynamical tunnel mapping and pocket identification in the LcpK30 open structure with bound substrate.

Fig 7

First 10 dominant tunnel clusters were calculated on 1000 snapshots from MD simulations of the open state LcpK30 with cis-1,4-polyisoprene substrate bound, using CAVER Analyst 2.

Table 3. Features of first 10 dominant tunnel clusters in LcpK30 bound to substrate.

ID Average bottleneck radius (Å) Std. dev. Maximum bottleneck radius (Å) Average length (Å) Std. dev. Average curvature Std. dev. Average throughput Std. dev.
1.08 0.08 1.21 16.05 4.28 1.49 0.28 0.67 0.11 1.08
1.05 0.08 1.21 11.90 3.17 1.39 0.20 0.66 0.12 1.05
1.09 0.07 1.21 6.73 3.11 1.29 0.21 0.80 0.07 1.09
1.06 0.08 1.21 25.88 3.73 1.38 0.13 0.49 0.10 1.06
1.05 0.08 1.21 26.25 3.28 1.50 0.20 0.49 0.09 1.05
1.06 0.08 1.21 23.15 3.70 1.38 0.19 0.52 0.10 1.06
1.07 0.07 1.21 11.80 3.15 1.40 0.24 0.72 0.07 1.07
1.02 0.09 1.21 18.56 3.76 1.43 0.21 0.49 0.11 1.02
1.07 0.08 1.21 9.07 3.57 1.71 0.55 0.73 0.12 1.07
1.02 0.08 1.21 24.41 3.53 1.61 0.19 0.40 0.09 1.02

Calculations were performed on 1000 snapshots from MD simulations of open state LcpK30 with cis-1,4-polyisoprene substrate bound. The tunnels were calculated starting from central Fe atom of heme and excluding O2 and substrate.

Following the protein dynamics, we investigated the conformations of the substrate bound to the enzyme during MD simulations, to gain more insight into the stability of the complex initially obtained with docking. While we observed minor fluctuations of the polymeric chain that binds deeper into the protein and tightly interacts with the heme cofactor, no significant conformational changes of the substrate occurred during 100 ns simulations, compared to the initial structures obtained with docking. Solvent-exposed regions experienced more flexibility (Fig 8), which could indicate that the interactions with the enzyme position and stabilize the flexible substrate in a suitable conformation for the reaction with the heme-bound oxygen.

Fig 8. Ensemble of 50 superimposed snapshots from 100 ns MD simulations of cis-1,4-polyisoprene near heme, obtained starting from the 20 top ranked docking poses.

Fig 8

a) Snapshots obtained from ChemPLP fitness function; b) Snapshots obtained from ChemScore fitness function. Trajectory smoothing window size was 5. Protein and hydrogens were omitted for the sake of clarity.

Overall, these results indicate that, for the substrate to bind in the hydrophobic pocket of the active site, subtle mutual structural rearrangements of the LcpK30 receptor, especially around the gateway and of the polymeric chains are crucial for the formation of the stable complex.

To investigate the substrate-enzyme interactions, we calculated close contacts occurring in MD simulations and extracted information about the specific residues that most frequently interact with different ligand conformations (S6 and S7 Tables). Based on the interaction patterns, the two main types of bound conformations observed with docking (extended and folded) were confirmed. Similar to the docking results, some substrates were found in an extended-like conformation, without interactions with the hydrophobic cavity. The simulation confirmed that the distinctive difference between the extended and folded conformations arises from the fact that ligands in the extended conformation can reach the buried hydrophobic protein cavity, which is not the case for folded structures. Both conformations readily interacted with heme and O2. The most frequent interactions and representative conformations of the substrate model bound in the active site of LcpK30 are shown in Fig 9.

Fig 9. Residues that frequently interact with the substrate during MD simulations.

Fig 9

(a) Close contacts between frequently occurring bound conformations of the substrate (extended and folded) and LcpK30, calculated from 100 ns MD simulations of 20 docking poses obtained with ChemPLP and ChemScore fitness function. The profile was constructed by averaging total fractions obtained for similar conformations. Conformations were clustered based on the contacts and the features from the visual inspection; (b) Representative snapshots extracted from MD simulations showing the main extended (ChemPLP pose 10 and ChemScore pose 5 shown as dark green and light green licorice, respectively) and folded (ChemScore pose 1 shown as magenta licorice) conformations and protein residues (shown as lines) that interact with the cis-1,4-polyisoprene substrate model bound to LcpK30 near heme (sphere representation). The three tunnels and the hydrophobic cavity are shown as blue, green, orange and yellow surface, respectively. The rest of the LcpK30 and hydrogen atoms are omitted for clarity.

Analyzing the frequent close contacts with the protein and the binding features of the extended and folded conformations of the cis-1,4-polyisoprene substrate bound to LcpK30 during MD simulations, we observed that both conformations readily interacted with the two previously found putative protein tunnels. The main differences between the two binding modes arose from the fact that only the substrate in the extended conformation could reach the buried hydrophobic cavity, which allowed it to favorably interact with the hydrophobic residues in the protein interior and to position a double bond near the heme-bound oxygen. Interestingly, we also found extended conformations that bound even deeper inside the protein, allowing the substrate termini to interact with the narrow tunnel cluster 4. In such conformations, one part of the polymeric chain was bound in the active site while the other was exposed to the surface interacting with either of the wider superclusters (blue and green in Fig 4, clusters 1, 2, 3 or 5). In contrast, the substrate in the folded conformation did not reach the hydrophobic pocket but rather coiled around Lys167 and the flexible C-terminus loop comprised of residues 392–397, with a central double bond positioned near the heme active site in an orientation favorable for catalysis.

To expand our investigation, we explored dominant energy factors that drive the cis-1,4-polyisoprene binding to LcpK30, by calculating van der Waals and electrostatic energy terms with linear interaction energy [25] on structures of the complex from MD simulations of all docking solutions from ChemPLP and ChemScore (Fig 10). Overall, we found that EvdW dominates the total interaction energy with 94% contribution over Eele, which contributes only 6%. Furthermore, the binding energy was calculated with the MM/GBSA method [26] using snapshots from MD simulations and treating the solvation implicitly. Interestingly, the calculated binding energies show similar trends as observed for the total interaction energies. Whilst most docking solutions, irrespective of the conformation, have similar binding energy, structures 9 and 10 from the ChemPLP fitness score have significantly higher or lower binding energy, respectively. We further analyzed the relationship between interaction energies, and therefore binding affinities, and the total number of close contacts between the substrate and LcpK30 during MD simulations. We found that the binding affinity decreases when there are too few or too many close contacts. For example, the conformer with the most favorable binding affinity had 9 frequent contacts with the protein residues, while the conformer with the least favorable binding had only two contacts less. However, the conformation that achieved the least interactions with the protein (5 close contacts) had less favorable binding energy than the structure that had the most interactions (15 contacts). Moreover, we observed that the substrates in the extended conformation typically achieved 9 or more frequent contacts with the protein residues, while this number is less than 9 close contacts in the case of substrates that bind in the folded conformation (S6 and S7 Tables). It is important to mention that the calculated binding energies do not correlate with the docking fitness scores, which highlights the importance of the flexibility of the system and the advantage of exploring different bound conformations of a ligand with molecular simulations.

Fig 10. Linear interaction energy analysis between the ligand and LcpK30.

Fig 10

Calculations were performed on 100 ns MD simulations of 10 docking poses obtained with (a) ChemPLP and (b) ChemScore fitness function. The Eele is the electrostatic energy (orange circles) and EvdW is van der Waals energy (blue circles). The total energy Etot (green circles) is calculated as a sum of Eele and EvdW. The binding energy ΔEbind (yellow circles) was calculated on 50 snapshots from MD simulations with the MM/GBSA method.

The cleavage sites in the cis-1,4-polyisoprene substrate model were predicted by constructing the histograms of distances calculated between distal oxygen atom from heme-O2 cofactor and each of the ten C = C double bonds of the substrate during MD simulations. The near-attack-conformations of the substrate, in which the C = C bonds are frequently found around or below 4 Å from the O2, were considered suitable for the oxidative cleavage by LcpK30 (S14 and S15 Figs). We used this information to construct the oxidative cleavage frequency profile by assigning the total number of potential cleavage events to a specific C = C bond based on their distance from the reactive O2. Fig 11 shows that the central C = C double bonds (numbered 5 and 6, see Fig 5) of the substrate were more frequently found in the near vicinity of heme-O2 and therefore more prone to oxidative cleavage into two oligomers of 4 and 5 isoprene units. This can partially explain the previously reported experimentally observed distribution of oligomers produced by LcpK30 which shows a preference for products containing 4–7 intact isoprene units [1, 8, 15]). Furthermore, 4 out of the 6 folded conformations of the substrate had central C = C bonds in a favorable position for cleavage. On the other hand, the oxidative cleavage of the substrates in the extended conformation seemed to be more diverse and preferably occurred at the C = C bonds 3, 4 and 8 (S14 and S15 Figs).

Fig 11. Cleavage frequency profile.

Fig 11

Cleavage frequency profile constructed from distance histograms indicating a total count of each of the ten C = C double bonds (coloured differently) from the cis-1,4-polyisoprene substrate that are found in the near vicinity (below 4 Å) of the distal oxygen atom in the heme-bound O2 molecule. The profile is calculated from 100 ns MD simulations of LcpK30 in the presence of the substrate for all 20 docking poses obtained with ChemPLP and ChemScore fitness functions.

Conclusion

Knowledge about pathways that lead from the surface to the active site is important to understand enzymatic catalysis. Frequently, the residues comprising these pathways actively transport substrates and products between the active site and bulk solvent, which is particularly important for enzymes with buried active sites [2729]. In this work, docking and molecular dynamic simulations provided a deeper understanding of the interaction between the cis-1,4-polyisoprene and the enzyme latex clearing protein, including a detailed characterization of the tunnels and hydrophobic pockets in LcpK30. Using the CAVER tunnel identification and the automated fpocket tools within the static LcpK30 structure, as well as MD simulation analyses, we characterized potential entry and exit pathways, as well as hydrophobic pockets for the binding of oligomeric cis-1,4-polyisoprene chains within the enzyme. A range of residues interacting with the substrate were identified and comprised Ser138, Ser142, Glu148, Val152, Arg164, Lys167, Thr168, Leu171, Thr230, Ser233 and Leu234. We suggest that these residues represent suitable targets for mutagenesis aimed at improving enzymatic degradation rates and/or substrate and oligoisoprenoid product range of LcpK30.

Computational docking of a large substrate model with 10 repeating isoprene units (C50H82), using two different scoring functions within the GOLD program, revealed two main bound conformations: extended and folded. Residues Lys167, Leu171, Ile396 and Ala159 were involved in the protein-substrate interactions in most of the docking solutions, and we suggest they play an important role in stabilizing protein-substrate complexes. Only the extended substrate conformations interacted with residue Glu148, which was previously shown to affect enzyme activity. Therefore, we suggest that the extended conformation is more representative of a catalytically relevant substrate pose within the enzyme. Further analysis using MD simulations revealed additional close contacts between the substrate in different conformations and the enzyme. This analysis also confirmed that hydrophobic (van der Waals) interactions were mostly responsible for mutual subtle conformational changes of both LcpK30 and the polymeric substrate upon binding. Furthermore, an oxidative cleavage frequency profile was constructed.

Taken together, the tunnel and pocket predictions, docking results and MD simulations suggest that the substrate enters via a gateway comprised of flexible loop 155–160 that extends to the helix 160–176 and the C-terminus residues 392–397 which could also serve as the potential exit pathway for the oligoisoprenoid products (Fig 12). A hydrophobic pocket exists above the heme, where the substrate binds, and which is likely to determine the size distribution of the oligomers. An optimal number of hydrophobic interactions favors better substrate binding, which suggests the substrate needs to bind deeper into the protein cavity for the reaction to occur and may explain the oligomer sizes obtained during cleavage. We hypothesize that the most favorable substrate binding occurs in an extended conformation, with one terminus within the hydrophobic pocket and the other terminus at the surface of the protein. A second binding mode with the substrate in a folded conformation also occurred, with a central double bond in proximity to the heme-bound oxygen, leading to a preference for oligomers with 5 to 6 C = C bonds, which can partially explain previously reported experimental results showing cleavage into mainly 4–7 intact isoprene units.

Fig 12. Proposed gateway for the substrate entry into LcpK30.

Fig 12

The gateway residues were proposed based on the results from static and dynamic modelling of LcpK30 with cis-1,4-polyisoprene ligand. The representative snapshots of (a) extended and (b) folded conformation of cis-1,4-polyisoprene in complex with LcpK30 highlighting their interaction with the putative tunnels and the hydrophobic protein cavity. The O2 bound on heme was omitted for the sake of clarity.

Whilst experimental evidence is required to confirm the substrate binding modes, the results presented here provide valuable hypotheses on how latex clearing proteins can incorporate and cleave oligomeric substrates and suggest suitable targets for mutagenesis aimed at improving enzymatic degradation rates. Furthermore, the thorough computational investigation on the structure, interaction, and catalytic mechanism of LcpK30 discussed in this work will provide deeper insight into the understanding of enzymatic catalysis with other large, flexible and hydrophobic macromolecules.

Computational methods

Protein structure preparation

The high-resolution crystal structures of the LcpK30 enzyme in the open and closed conformations were obtained from the RCSB Protein Data Bank (PDB ID: 5O1L for open and 5O1M for closed) [10]. Although the LcpK30 is a biologically active dimer, only one monomer (chain A) was retained for all calculations and modelling because the interface between the two monomers is relatively far from the active site. The corresponding heme cofactor was also retained, while all other co-crystallized species including imidazole, 2,3-butanediol and 1,2-ethanediol molecule and waters were removed prior to all calculations.

Protein tunnel calculations

Putative tunnels were calculated with the CAVER-Pymol plugin 3.0.3 [19]. The hydrogen atoms were added to the protein structure using Chimera 1.16, considering the H-bonds network [30]. The initial starting point for the tunnel search was specified by the central iron atom of the heme prosthetic group. The search was performed using a minimum probe radius of 0.9 Å. Default parameters for the maximum distance of 3 Å, desired radius of 5 Å, shell depth of 4 Å, and shell radius of 3 Å were used [19]. The clustering of tunnels was performed by the average-link hierarchical Murtagh algorithm based on the calculated matrix of pairwise tunnel distances. The clustering threshold was set to a default value of 3.5. Each atom of the protein was approximated by 12 spheres. Calculations of tunnels on snapshots from MD simulations was carried out in CAVER Analyst 2 using default parameters [31].

Protein pocket calculations

Simple pocket detection was performed with the open source protein pocket (cavity) detection algorithm based on Voronoi tessellation known as fpocket [32]. The detection of pockets on MD trajectories of LcpK30 was carried out with mdpocket considering protein and heme atoms only [33]. Default values of parameters to describe pockets were used. Namely, the minimum and the maximum radius that an alpha sphere might have in a binding pocket was 3.4 Å and 6.2 Å, respectively. The pairwise single linkage clustering method with the Euclidean distance measure was used for the clustering algorithm. Each pocket contained at least 15 alpha spheres and pockets volume were calculated using a Monte-Carlo algorithm.

Protein-ligand docking

The GOLD program (version 2022.2.0) was used for the molecular docking protocols, with the default parameters as defined within the program [24, 34]. The protonation states of the titratable residues were assigned using the H++ server [35]. To simulate the catalytically active state of the protein, the imidazole molecule coordinated on the iron of heme was replaced with O2 that was manually added. The 2D structure of the C50 cis-1,4-polyisoprene ligand model was constructed using ChemDraw 18.2 (Perkin Elmer, MA, USA) and the 3D structure with all hydrogen atoms included was obtained using Chem3D 18.2 (Perkin Elmer, MA, USA).

The ChemPLP and ChemScore fitness functions available in GOLD were used [34]. These scoring functions were selected because they both perform well for lipophilic binding sites and accurately model the steric complementarity between protein and ligand. All docking runs were performed with default parameters using both rigid and flexible docking. The binding site was defined as all protein residues whose atoms are within 10 or 15 Å from the central iron of heme. Ligand flexibility was enabled during all docking runs. To include flexibility of the active site, induced fit (flexible) docking was carried out, where the side chains of all residues that have been identified as important to the catalytic activity of LcpK30 (Glu148, Arg164, Lys167, Thr168) were specified as fully flexible [12]. Several other residues which were found in the pathway identified from the static tunnel calculations using CAVER (Ser142, Ile145, Val152, Leu171, Thr230 and Leu234) were also defined as flexible. These side chains were allowed to rotate freely during docking to vary over the range -180 to +180 rotatable torsion degree. The protein-ligand interaction was analyzed using Protein-Ligand Interaction Profiler (PLIP) program [36].

Molecular dynamics simulations

The protonation states of the titratable residues at the physiological conditions were assigned using the H++ server [35]. The enzyme coordinates were taken from the crystal structure of the open (PDB ID: 5O1L) conformation of LcpK30, chain A [10]. The classical ff19SB [37] force field parameters were assigned to the standard residues, while the bonded model was used to describe the oxyheme cofactor and the coordinating His198. The cofactor was parametrized with the metal center parameter builder (MCPB) protocol [38]. To generate bonded force field parameters of the heme group with MCPB.py, we used a crystal structure of LcpK30 in the open state (PDB: 5O1L) [10] in which the central iron is coordinated by four equatorial nitrogen atoms from the protoporphyrin ring and one axial epsilon-nitrogen atom from the histidine sidechain of the enzyme. We manually included an oxygen molecule bound to the remaining iron site. We applied the B3LYP/6-31G(d) level of theory to carry out the geometry optimization for both the small and large models. Apart from the heme cofactor and O2 molecule, the small model had the sidechain of the coordinating histidine, while the large model also included the histidine backbone. The force constants were calculated for the optimized small model using the same method. Prior to the Merz-Kollman RESP partial charges calculation we performed minimization of the large model allowing the relaxation of hydrogen atoms and of O2 to the optimal positions. The +2 charge was defined for the iron atom, and the total charge of metal site was -2, while the metal site had a possibility to adopt multiple electronic structures. We tested different spin states and concluded that the bond parameters are closer to the experimental crystal structure of oxymyoglobin in the singlet spin state. The ESP charges, and consequently RESP charges, were similar both in the case of low and high spin systems. We employed the Seminario method to obtain force field parameters using the ff19SB/GAFF force field and ChgModB to perform the RESP charge fitting. All QM calculations were carried out with Gaussian 16 program [39].

To explore conformational dynamics of the enzyme and the interactions of the enzyme with the ligands, we simulated LcpK30 in the absence and in the presence of polyisoprene ligand C50 cis-1,4-polyisoprene with 10 C = C double bonds. We used all of the top 10 ranked solutions obtained with each of the ChemPLP and ChemScore fitness functions in the GOLD software (20 poses in total). The GAFF force field was used to describe bonded parameters of the ligands while the partial charges were calculated using the AM1-BCC method [40, 41]. We solvated the systems with OPC [35] water using the truncated octahedron box with each atom of the solute at least 10 Å away from the edge of water surrounding the solute. All systems were neutralized by adding Na+ counterions. The 12–6 set of parameters for monovalent atomic ions for the OPC water model were used [42]. All systems were minimized for a total of 10,000 steps, first 5,000 steps of the steepest descent method and the conjugate gradient for the remaining cycles. In the first relaxation step, the temperature of the system was gradually increased from the initial temperature of 0 to a target temperature of 300 K over 100 ps and kept constant for another 100 ps of the NVT simulation. We employed the Langevin thermostat with the collision frequency of 2 ps for the temperature control. In the second step of the relaxation, the system was propagated at 300 K over 100 ps of simulation time at a constant pressure (NPT), allowing the box to relax using the Berendsen barostat. The weak positional restraints (5 kcal mol-1 Å-2) were applied to the solute in the relaxation phase. This last step of relaxation included the removal of all restraints on the solute for 2 ns of simulation time at a constant pressure. Finally, for LcpK30 without the ligand, three independent production runs were carried out at 300 K and the constant pressure for at least 500 ns of simulation time. The production simulations of LcpK30 with the ligand were carried out for 100 ns each. All docking poses obtained from ChemPLP and ChemScore fitness functions were simulated. The Particle Mesh Ewald approach was used for treating long-range electrostatic interactions and the non-bonded cut-off was set to 10 Å. The SHAKE algorithm was used to constrain bonds involving hydrogen and a time step of 2 fs was used in all simulations. All MD simulations and the trajectory analysis were carried out in the AMBER20 program [43]. The visualization was performed in Pymol 2.5.4 [44] and VMD 1.9.3 [45].

Supporting information

S1 Fig. Crystal structure analysis.

Close view of the active site from the crystal structure of LcpK30 showing the helix 160–176 near heme in (a) open state (PDB: 5O1L) and (b) closed state (PDB: 5O1M).

(TIF)

pone.0302398.s001.tif (2.3MB, tif)
S2 Fig. Tunnel profiles for the dominant static tunnels.

Profiles were calculated using the LcpK30 crystal structure in the open state (PDB 5O1L) using the CAVER-Pymol plugin 3.0.3. (a) Variation of tunnel radius with length; (b) tunnel profile heatmaps; (c) distances between different atoms in the cis-1,4-polyisoprene chains, used to estimate maximum width of the chain in an extended conformation. The distances were measured between the indicated carbon atoms that represent the bond length (blue) and close contact atoms (red). The addition of C-H distances is anticipated to add max. 1.1 Å.

(TIF)

pone.0302398.s002.tif (4.1MB, tif)
S3 Fig. RMSD and RMSF profiles for LcpK30 in the open state.

(a) C-alpha RMSD and (b) RMSF values were calculated from 3 independent 500 ns MD simulations (black, red and green) of LcpK30 from the reference X-ray structure in the open state. Due to increased flexibility of the N-terminus, residues 29–49 were omitted from the analysis to minimize the noise. The RMSF values are shown per residue as average from all 3 trajectories.

(TIF)

pone.0302398.s003.tif (1.5MB, tif)
S4 Fig. Backbone flexibility of open state LcpK30 from MD simulations.

(a) Experimental B-factor from the X-ray structure and (b) C-alpha RMSF (Å) values from MD simulations plotted on the backbone of LcpK30. Due to large fluctuations of the N-terminus, residues 29–49 were omitted from the analysis of the MD trajectories for the sake of clarity. The heme cofactor is represented in spheres (orange = Fe, white = C, red = O, blue = N). The protein structures are depicted with the B-factor putty representation where the backbone is displayed as a tube with a diameter correlated to the experimental B-factor from the X-ray structure (a) or RMSF from MD simulations (b). Thicker tube indicates higher flexibility. The structures are coloured with a continuous scale that ranges from blue to red to indicate the backbone mobility, where blue is low, and red is high backbone flexibility.

(TIF)

pone.0302398.s004.tif (2.1MB, tif)
S5 Fig. Dynamical tunnel mapping and pocket identification in the LcpK30 closed-like structure.

First 5 dominant tunnel clusters were calculated with CAVER Analyst 2 on 1500 snapshots from MD simulations of the closed-like state. Clustering displays the centre lines for all tunnels computed for all snapshots at once. Centre lines are coloured according to their related clusters. Clusters 1, 3, 4 and 5 form one narrow supercluster, colored in limon.

(TIF)

pone.0302398.s005.tif (1.2MB, tif)
S6 Fig. Dynamic of water molecules during simulations of open and closed-like states of LcpK30.

Radial distribution function (RDF) of water oxygen atoms from O2 molecule bound to heme in (a) open and (b) closed LcpK30. The isosurface (isovalue = 100) showing the density of water oxygen atoms around the active site in (c) open and (d) closed LcpK30. Top view at the heme cofactor in the active site of LcpK30 showing water molecules as blue surface using the representative MD snapshots in (e) open and (f) closed LcpK30. The analysis was carried out on 6000 frames from 3 repeat MD simulations of open and closed-like LcpK30 states.

(TIF)

pone.0302398.s006.tif (3.1MB, tif)
S7 Fig. Superimposed conformations results from rigid docking of cis-1,4-polyisoprene C50H82 within LcpK30.

(a) Best 10 poses obtained using the ChemPLP scoring function; (b) best 10 poses obtained using ChemScore scoring function.

(TIF)

pone.0302398.s007.tif (2.4MB, tif)
S8 Fig. Effect of binding site size on docking.

Superimposed docking conformations results of C50H82 within a defined radius of (a) 10 Å and (b) 15 Å from the central iron atom in heme using ChemPLP flexible docking. Ligand is represented as yellow sticks, and the heme cofactor is represented in spheres (white = C, red = O, blue = N).

(TIF)

S9 Fig. Docking conformations of cis-1,4-polyisoprene (C50H82) near heme.

Docking conformations of cis-1,4-polyisoprene (C50H82) near heme. Ranking is by the fitness score from highest to lowest. a) Poses obtained with the ChemPLP fitness function; b) poses obtained with the ChemScore fitness function. Docking solutions were calculated with induced fit docking using a 10 Å binding site. Enzyme and hydrogens were omitted for the sake of clarity.

(TIF)

pone.0302398.s009.tif (2.7MB, tif)
S10 Fig. Protein-ligand interactions identified by PLIP analysis.

Protein-ligand interactions identified by PLIP analysis. Docked poses were obtained with ChemPLP (above) and ChemScore (below). (a) docking solutions ranked 1, representative of folded conformations and (b) docking solutions ranked 3, representative of extended conformations. Docking solutions were calculated with induced fit docking using a 10 Å binding site and ranked based on the fitness score from highest to lowest (see Table 1). The hydrophobic interactions between the ligand and interacting residues are shown by dashed lines. The three residues that appeared in all interactions are underlined. The residue that interacted only with the extended conformation is coloured in blue. The red coloured residue (Ala159) appeared in all interactions with poses obtained using ChemScore and in most interactions with poses obtained using ChemPLP.

(TIF)

pone.0302398.s010.tif (7.6MB, tif)
S11 Fig. C-alpha RMSF calculated during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemPLP fitness function.

C-alpha RMSF calculated during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemPLP fitness function. All values are calculated from the reference X-ray structure of the enzyme and projected on the average structure of the enzyme. Due to increased flexibility, residues 29–49 from N-terminus were omitted from the analysis to minimise the noise. The substrate is omitted for the sake of clarity. The protein structures are depicted with the putty representation where the backbone is displayed as a tube with a diameter correlated to the RMSF from MD simulations (thicker tube indicates higher RMSF). The structures are coloured with continuous scale that ranges from blue to red to indicate the backbone mobility, where blue is low and red is high backbone flexibility.

(TIF)

pone.0302398.s011.tif (4.8MB, tif)
S12 Fig. C-alpha RMSF calculated during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemScore fitness function.

C-alpha RMSF calculated during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemScore fitness function. All values are calculated from the reference X-ray structure of the enzyme and projected on the average structure of the enzyme. Due to increased flexibility, residues 29–49 from N-terminus were omitted from the analysis to minimise the noise. The substrate is omitted for the sake of clarity. The protein structures are depicted with the putty representation where the backbone is displayed as a tube with a diameter correlated to the RMSF from MD simulations (thicker tube indicates higher RMSF). The structures are coloured with continuous scale that ranges from blue to red to indicate the backbone mobility, where blue is low and red is high backbone flexibility.

(TIF)

pone.0302398.s012.tif (4.8MB, tif)
S13 Fig. Principal component analysis (PCA) carried out on structures from MD simulations of open and closed LcpK30 with and without substrate.

PCA was performed considering cartesian coordinates of protein backbone atoms (N, Cα, C and O). (a) MD snapshots projected on the first two principal components. Blue and red circles represent MD snapshots belonging to open and closed-like states, respectively, orange circles represent MD snapshots belonging to the open state LcpK£0 in complex with the substrate. (b) Normal mode displacement vectors associated with the first two principal components showing only motions longer than 2 Å as porcupine in both directions. The backbone is coloured by the mobility where lower and higher flexibility is depicted in blue and red, respectively. Structures from MD simulations projected on the first two principal components considering open LcpK30 (grey) and LcpK30 in complex with substrate from (c) ChemPLP and (d) ChemScore docking poses.

(TIF)

pone.0302398.s013.tif (13.3MB, tif)
S14 Fig. Probability distribution of distance between distal oxygen of O2 and each C = C double bond for docking poses obtained with ChemPLP.

Calculations were during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemPLP fitness score. The cleaved bonds were the ones found closest to O2. The color of the C = C double bond (see scheme at the bottom) corresponds to the color of the probability histogram.

(TIF)

pone.0302398.s014.tif (1.8MB, tif)
S15 Fig. Probability distribution of distance between distal oxygen of O2 and each C = C double bond for docking poses obtained with ChemScore.

Calculations were during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemScore fitness score. The cleaved bonds were the ones found closest to O2. The color of the C = C double bond (see scheme at the bottom) corresponds to the color of the probability histogram.

(TIF)

pone.0302398.s015.tif (1.7MB, tif)
S1 Table. The features of dominant tunnels calculated using the LcpK30 crystal structure in the open state (PDB 5O1L) using the CAVER-Pymol plugin 3.0.3.

(PPTX)

pone.0302398.s016.pptx (53.8KB, pptx)
S2 Table. Docking solutions obtained from rigid docking within GOLD software.

Solutions are ranked based on the fitness score from highest to lowest.

(PPTX)

pone.0302398.s017.pptx (63.7KB, pptx)
S3 Table. Impact of the size of the defined protein binding site on the ChemPLP docking scores.

Docking solutions are obtained from flexible docking using the ChemPLP scoring function within the GOLD software. The docking was performed in a binding site defined as all atoms within 10 or 15 Å from the central iron atom in heme. The 10 docking solutions are ranked based on the fitness score from highest to lowest.

(PPTX)

pone.0302398.s018.pptx (47.9KB, pptx)
S4 Table. List of residues interacting with the docked cis-1,4-polyisoprene, extracted from PLIP analysis of 10 docking poses obtained using ChemPLP.

Docking solutions were ranked based on the fitness score from highest to lowest. Residues that interact with all / most poses are highlighted in grey and light grey, respectively.

(PPTX)

pone.0302398.s019.pptx (58.7KB, pptx)
S5 Table. List of residues interacting with the docked cis-1,4-polyisoprene, extracted from PLIP analysis of 10 docking poses obtained using ChemScore.

Docking solutions were ranked based on the fitness score from highest to lowest. Residues that interact with all / most poses are highlighted in grey and light grey, respectively.

(PPTX)

pone.0302398.s020.pptx (57.2KB, pptx)
S6 Table. Frequency of contacts between the substrate and LcpK30 calculated during 100 ns MD simulations of 10 docking poses obtained from ChemPLP fitness function.

Docking solutions were initially ranked based on the fitness score from highest to lowest. A cut-off of 5 Å was used and only contacts with total fraction more than 4.5% are listed. Ext = extended conformation; Fold = folded conformation.

(PPTX)

pone.0302398.s021.pptx (55.6KB, pptx)
S7 Table. Frequency of contacts between the substrate and LcpK30 calculated during 100 ns MD simulations of 10 docking poses obtained from ChemScore fitness function.

Docking solutions were initially ranked based on the fitness score from highest to lowest. A cut-off of 5 Å was used and only contacts with total fraction more than 4.5% are listed. Ext = extended conformation; Fold = folded conformation.

(PPTX)

pone.0302398.s022.pptx (55.3KB, pptx)

Acknowledgments

We would like to thank Prof. Derek Irvine for his contribution and useful discussion on project direction. We also gratefully acknowledge the support and access to the University of Nottingham High Performance Computing Facility.

Data Availability

Input and output information from the computational analysis including Caver and fpocket setups; GOLD docking proto-cols; additional analysis from MD simulations and full MD inputs; and details on protein–ligand docking results are freely available on Figshare: https://doi.org/10.6084/m9.figshare.22941149.

Funding Statement

The research was funded by the Malaysian Rubber Board, who provided a studentship for AAH and financial support for the project. We also gratefully acknowledge the support and access to the University of Nottingham High Performance Computing Facility. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Andler R. Bacterial and enzymatic degradation of poly(cis-1,4-isoprene) rubber: Novel biotechnological applications. Biotechnol Adv. 2020;44: 107606. doi: 10.1016/j.biotechadv.2020.107606 [DOI] [PubMed] [Google Scholar]
  • 2.Adjedje VKB, Schell E, Wolf YL, Laub A, Weissenborn MJ, Binder WH. Enzymatic degradation of synthetic polyisoprenes: Via surfactant-free polymer emulsification. Green Chemistry. 2021;23: 9433–9438. doi: 10.1039/d1gc03515k [DOI] [Google Scholar]
  • 3.Jendrossek D. Rubber oxygenases. Appl Microbiol Biotechnol. 2018; 125–142. doi: 10.1007/s00253-018-9453-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Soares FA, Steinbüchel A. Enzymatic and Chemical Approaches for Post-Polymerization Modifications of Diene Rubbers: Current state and Perspectives. Macromol Biosci. 2021;21: 1–19. doi: 10.1002/mabi.202100261 [DOI] [PubMed] [Google Scholar]
  • 5.Oetermann S, Jongsma R, Coenen A, Keller J, Steinbüchel A. LcpR VH2—regulating the expression of latex-clearing proteins in Gordonia polyisoprenivorans VH2. Microbiology (United Kingdom). 2019;165: 343–354. doi: 10.1099/mic.0.000755 [DOI] [PubMed] [Google Scholar]
  • 6.Hiessl S, Böse D, Oetermann S, Eggers J, Pietruszka J, Steinbüchel A. Latex Clearing Protein—an Oxygenase Cleaving Poly (cis -1,4-Isoprene) Rubber at the cis Double Bonds. Appl Environ Microbiol. 2014;80: 5231–5240. doi: 10.1128/AEM.01502-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Watcharakul S, Röther W, Birke J, Umsakul K, Hodgson B, Jendrossek D. Biochemical and spectroscopic characterization of purified Latex Clearing Protein (Lcp) from newly isolated rubber degrading Rhodococcus rhodochrous strain RPK1 reveals novel properties of Lcp. BMC Microbiol. 2016;16: 1–13. doi: 10.1186/s12866-016-0703-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Birke J, Jendrossek D. Rubber Oxygenase and Latex Clearing Protein Cleave Rubber to Different Products and Use Different Cleavage Mechanisms. Appl Environ Microbiol. 2014;80: 5012–5020. doi: 10.1128/AEM.01271-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Linh DV, Huong NL, Tabata M, Imai S, Iijima S, Kasai D, et al. Characterization and functional expression of a rubber degradation gene of a Nocardia degrader from a rubber-processing factory. J Biosci Bioeng. 2017;123: 412–418. doi: 10.1016/j.jbiosc.2016.11.012 [DOI] [PubMed] [Google Scholar]
  • 10.Ilcu L, Röther W, Birke J, Brausemann A, Einsle O, Jendrossek D. Structural and Functional Analysis of Latex Clearing Protein (Lcp) Provides Insight into the Enzymatic Cleavage of Rubber. Sci Rep. 2017;7: 1–11. doi: 10.1038/s41598-017-05268-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Röther W, Birke J, Jendrossek D. Assays for the Detection of Rubber Oxygenase Activities. Bio Protoc. 2017;7: 1–14. doi: 10.21769/BioProtoc.2188 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Röther W, Austen S, Birke J, Jendrossek D. Cleavage of Rubber by the Latex Clearing Protein (Lcp) of Streptomyces sp. Strain K30: Molecular Insights. Appl Environ Microbiol. 2016;82: 6593–6602. doi: 10.1128/AEM.02176-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Birke J, Röther W, Jendrossek D. Latex Clearing Protein (Lcp) of Streptomyces sp. Strain K30 Is a b -Type Cytochrome and Differs from Rubber Oxygenase A (RoxA) in Its Biophysical Properties. Appl Environ Microbiol. 2015;81: 3793–3799. doi: 10.1128/AEM.00275-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Andler R, Altenhoff AL, Mäsing F, Steinbüchel A. In vitro studies on the degradation of poly(cis-1,4-isoprene). Biotechnol Prog. 2018;34: 890–899. doi: 10.1002/btpr.2631 [DOI] [PubMed] [Google Scholar]
  • 15.Röther W, Birke J, Grond S, Beltran JM, Jendrossek D. Production of functionalized oligo-isoprenoids by enzymatic cleavage of rubber. Microb Biotechnol. 2017;10: 1426–1433. doi: 10.1111/1751-7915.12748 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Zhang S, Liu Y. Mechanical Insights into the Enzymatic Cleavage of Double C-C Bond in Poly(cis-1,4-isoprene) by the Latex Clearing Protein. Inorganic Chemistry. 2020. doi: 10.1021/acs.inorgchem.0c00726 [DOI] [PubMed] [Google Scholar]
  • 17.Joo S, Cho IJ, Seo H, Son HF, Sagong HY, Shin TJ, et al. Structural insight into molecular mechanism of poly(ethylene terephthalate) degradation. Nat Commun. 2018;9. doi: 10.1038/s41467-018-02881-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Ciano L, Paradisi A, Hemsworth GR, Tovborg M, Davies GJ, Walton PH. Insights from semi-oriented EPR spectroscopy studies into the interaction of lytic polysaccharide monooxygenases with cellulose. Dalton Transactions. 2020;49: 3413–3422. doi: 10.1039/c9dt04065j [DOI] [PubMed] [Google Scholar]
  • 19.Chovancova E, Pavelka A, Benes P, Strnad O, Brezovsky J, Kozlikova B, et al. CAVER 3.0: A tool for the analysis of transport pathways in dynamic protein structures. PLoS Comput Biol. 2012;8. doi: 10.1371/journal.pcbi.1002708 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Brezovsky J, Kozlikova B, Damborsky J. Computational analysis of protein tunnels and channels. Methods in Molecular Biology. 2018;1685: 25–42. doi: 10.1007/978-1-4939-7366-8_3 [DOI] [PubMed] [Google Scholar]
  • 21.Ciemny M, Kurcinski M, Kamel K, Kolinski A, Alam N, Schueler-Furman O, et al. Protein–peptide docking: opportunities and challenges. Drug Discov Today. 2018;23: 1530–1537. doi: 10.1016/j.drudis.2018.05.006 [DOI] [PubMed] [Google Scholar]
  • 22.Tsvetkov VB, Serbin V A. Molecular dynamics modeling the synthetic and biological polymers interactions pre-studied via docking: Anchors modified polyanions interference with the HIV-1 fusion mediator. J Comput Aided Mol Des. 2014;28: 647–673. doi: 10.1007/s10822-014-9749-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Tarachiwin L, Sakdapipanich JT, Tanaka Y. Relationship between particle size and molecular weight of rubber from hevea brasiliensis. Rubber Chemistry and Technology. 2005;78: 694–704. doi: 10.5254/1.3547907 [DOI] [Google Scholar]
  • 24.Liebeschuetz JW, Cole JC, Korb O. Pose prediction and virtual screening performance of GOLD scoring functions in a standardized test. J Comput Aided Mol Des. 2012;26: 737–748. doi: 10.1007/s10822-012-9551-4 [DOI] [PubMed] [Google Scholar]
  • 25.Aqvist J, Luzhkov VB, Brandsdal BO. Ligand binding affinities from MD simulations. Acc Chem Res. 2002;35: 358–365. doi: 10.1021/ar010014p [DOI] [PubMed] [Google Scholar]
  • 26.Genheden S, Ryde U. The MM/PBSA and MM/GBSA methods to estimate ligand-binding affinities. Expert Opinion on Drug Discovery. 2015. doi: 10.1517/17460441.2015.1032936 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Kaushik S, Marques SM, Khirsariya P, Paruch K, Libichova L, Brezovsky J, et al. Impact of the access tunnel engineering on catalysis is strictly ligand-specific. FEBS Journal. 2018;285: 1456–1476. doi: 10.1111/febs.14418 [DOI] [PubMed] [Google Scholar]
  • 28.Kokkonen P, Bednar D, Pinto G, Prokop Z, Damborsky J. Engineering enzyme access tunnels. Biotechnol Adv. 2019;37: 107386. doi: 10.1016/j.biotechadv.2019.04.008 [DOI] [PubMed] [Google Scholar]
  • 29.Lu Z, Li X, Zhang R, Yi L, Ma Y, Zhang G. Tunnel engineering to accelerate product release for better biomass-degrading abilities in lignocellulolytic enzymes. Biotechnol Biofuels. 2019;12: 1–9. doi: 10.1186/s13068-019-1616-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kutov DC, Katkova E V., Sulimov A V., Kondakova OA, Sulimov VB. Influence of the method of hydrogen atoms incorporation into the target protein on the protein-ligand binding energy. Bulletin of the South Ural State University, Series: Mathematical Modelling, Programming and Computer Software. 2017;10: 94–107. doi: 10.14529/mmp170308 [DOI] [Google Scholar]
  • 31.Jurcik A, Bednar D, Byska J, Marques SM, Furmanova K, Daniel L, et al. CAVER Analyst 2.0: Analysis and visualization of channels and tunnels in protein structures and molecular dynamics trajectories. Bioinformatics. 2018;34: 3586–3588. doi: 10.1093/bioinformatics/bty386 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Le Guilloux V, Schmidtke P, Tuffery P. Fpocket: An open source platform for ligand pocket detection. BMC Bioinformatics. 2009;10: 1–11. doi: 10.1186/1471-2105-10-168 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Schmidtke P, Bidon-chanal A, Luque FJ, Barril X. MDpocket: Open-source cavity detection and characterization on molecular dynamics trajectories. Bioinformatics. 2011;27: 3276–3285. doi: 10.1093/bioinformatics/btr550 [DOI] [PubMed] [Google Scholar]
  • 34.Sapundzhi F, Prodanova K, Lazarova M. Survey of the scoring functions for protein-ligand docking. AIP Conf Proc. 2019;2172: 1–6. doi: 10.1063/1.5133601 [DOI] [Google Scholar]
  • 35.Anandakrishnan R, Aguilar B, Onufriev A V. H++ 3.0: Automating pK prediction and the preparation of biomolecular structures for atomistic molecular modeling and simulations. Nucleic Acids Res. 2012;40: 537–541. doi: 10.1093/nar/gks375 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Adasme MF, Linnemann KL, Bolz SN, Kaiser F, Salentin S, Haupt VJ, et al. PLIP 2021: Expanding the scope of the protein-ligand interaction profiler to DNA and RNA. Nucleic Acids Res. 2021;49: W530–W534. doi: 10.1093/nar/gkab294 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Tian C, Kasavajhala K, Belfon KAA, Raguette L, Huang H, Migues AN, et al. Ff19SB: Amino-Acid-Specific Protein Backbone Parameters Trained against Quantum Mechanics Energy Surfaces in Solution. J Chem Theory Comput. 2020;16: 528–552. doi: 10.1021/acs.jctc.9b00591 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Li P, Merz KM. MCPB.py: A Python Based Metal Center Parameter Builder. J Chem Inf Model. 2016;56: 599–604. doi: 10.1021/acs.jcim.5b00674 [DOI] [PubMed] [Google Scholar]
  • 39.Gaussian 16, Revision C.01, M. J. Frisch, G. W. Trucks, H. B. Schlegel, G. E. Scuseria, M. A. Robb, J. R. Cheeseman, et al. No Title. Gaussian, Inc, Wallingford CT. 2016.
  • 40.Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and testing of a general Amber force field. J Comput Chem. 2004;25: 1157–1174. doi: 10.1002/jcc.20035 [DOI] [PubMed] [Google Scholar]
  • 41.Wang J, Wang W, Kollman PA, Case DA. Automatic atom type and bond type perception in molecular mechanical calculations. J Mol Graph Model. 2006;25: 247–260. doi: 10.1016/j.jmgm.2005.12.005 [DOI] [PubMed] [Google Scholar]
  • 42.Li P, Song LF, Merz KM. Systematic parameterization of monovalent ions employing the nonbonded model. J Chem Theory Comput. 2015;11: 1645–1657. doi: 10.1021/ct500918t [DOI] [PubMed] [Google Scholar]
  • 43.D.A. Case, K. Belfon, I.Y. Ben-Shalom, S.R. Brozell, D.S. Cerutti, T.E. Cheatham, et al. AMBER 2020. University of California, San Francisco. 2020.
  • 44.The PyMOL Molecular Graphics System, Version 2.5.4, Schrödinger LLC. No Title.
  • 45.Humphrey W, Dalke A, Schulten K. Sartorius products. J Mol Graph. 1996;14: 33–38. [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Sushil Mishra

Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

14 Sep 2023

PONE-D-23-23541Computational investigation of cis-1,4-polyisoprene binding to the latex clearing protein LcpK30PLOS ONE

Dear Dr. Pordea,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Oct 29 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Sushil Mishra, Ph.D

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that funding information should not appear in any section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript.

3. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: No

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: I Don't Know

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The studied enzymes, latex clearing proteins (Lcps) are interesting candidates for structural studies as they can cleave polymers despite having active catalytic site buried inside the protein structure and furthermore, they can be used to degrade of rubbers and potentially even synthetic polymers. Application of various computational tools, such as CAVER-Pymol plugin 3.0.3, fpocket and Molecular Dynamic (MD) simulations allowed the authors to analyse the structures of Lcps and substrate binding.

Comment 1

The authors compared the open and closed crystal structures of LcpK30. The open crystal structure of LcpK30 contains, apart from heme, an imidazole molecule in its active site. The open crystal structure in single state is thoroughly analysed by the authors, also considering the probability of acquisition of a polymer in the cavities. It is not confirmed, however, if the analysed protein would have the same open conformation during incorporation of a large oligomeric substrate, as it has while incorporating imidazole, which is a smaller aromatic molecule. In PET-degrading enzymes that have active site exposed on the surface, it is the binding site’s flexibility that most probably allows the enzyme to incorporate such polymer substrate for depolymerisation. The other example can be found in epoxide hydroxylases, the native long hydrophobic substrate was thought to enter through one tunnel and with proper positioning was reaching exit of the second tunnel. Both tunnels made a L-shape tunnel network. However, during MD study, it was shown, that two domains of the epoxide hydrolase can undergo small conformational changes, and instead of two isolated tunnel, groove can be observed (https://doi.org/10.1016/j.csbj.2021.10.042 ). It is thus possible that LcpK30 undergoes some strong conformational changes to allow easier accommodation of oligomeric substrate to the protein core where the active site is located. Such big conformational changes were not possible to be observed by the authors in the performed studies. The results of the docking studies using GOLD software of flexible model substrate to rigid protein suggest that the protein truly needs to undergo some conformational changes prior to incorporating the substrate, as the obtained poses had very low or even negative fitness score.

Comment 2

The authors identified putative tunnels within the crystal structure of LcpK30 only in the OPEN state, using the CAVER-PyMOL plugin. CAVER-PyMOL plugin allows to analyse tunnels in the macromolecule’s core using the geometric approach in single state. The results are strongly influenced by the chosen radius probe. The authors should probably also analyse LcpK30 in the CLOSED state for comparison using CAVER-PyMOL plugin. Moreover, since authors run MD simulations, it could be recommended to use either CAVER implementation for MD simulation analysis or AQUA-DUCT which can more precisely show the potential accessing paths. Authors need to take into consideration, that CAVER and specifically CAVER plugin is using radial probe for tunnels detection and analysis, therefore all asymmetric tunnels cannot be described properly. Instead some of the tunnels has to be merged together. Such problems can be avoided e.g. using water molecules as a small probes – this is especially important for bulky substrates, or substrates which are not globular.

Authors can see, what type of analysis is accessible in previously mentioned article https://doi.org/10.1016/j.csbj.2021.10.042 and comparison of geometry based approach (CAVER) with molecules tracking approach (AQUA-DUCT) can be found https://doi.org/10.1021/acs.jcim.2c00985 and https://doi.org/10.1371/journal.pcbi.1010119 In general, the analysis of tunnels in crystal structures only can be misleading. Since authors have MD simulations in hands, the analysis of accessibility to the active site can be done quickly with small effort only.

Comment 3

Plot in fig.3 a should not be a line plot, but a point plot.

Comment 4

In lines 504-507 and 567-570 the authors mention experimental results of distribution of oligomers produced by LcpK30. It is not clear however, what results they mean exactly (citation?) and what are the oligomers produced by LcpK30 and which part of the modelled substrate they constitute.

Comment 5

In lines 589-592 of the methods section, the authors mention that they remove water molecules from the crystal structure of LcpK30. Then in lines 677-680 they describe the solvation of protein with OPC water using the truncated octahedron box. Prior to solvation, the authors should add the crystal water as to ensure the correct water networks inside the protein core. In the case of holo system, only crystallographic waters producing clashes with the docked substrate, should have been removed. Such procedure is much safer, and the equilibration time is shorter. In case when whole buried water is removed from the interior, some cavities might initially be closed, and longer time is required to expand them to size accommodating water again. Authors can also used 3D-RISM and Placevent for system preparation. More details can be found in papers dedicated to mentioned methods or in review describing methods using water molecules in computational analysis (https://doi.org/10.1016/j.csbj.2020.02.001 ).

Overall, the study gives interesting insight into LcpK30 structure, tunnel and pocket network and their dynamics, and provides hypotheses on how Lcps can incorporate and cleave oligomeric substrates. The authors identified probable tunnels used for transport of the oligomeric substrates to the active site. However, there are doubts if analysis was done deep enough to provide true description of the substrate accessibility.

Article provide important findings, as the tunnel-lining residues are suitable targets for mutagenesis aimed at improving enzymatic degradation rates. The information on tunnels structure and dynamics mostly comes from analysis of open structure of LcpK30. However, the MD simulations should provide insight into much more opened structure. Authors has mentioned helix E as important one – on the second side of the hem, we can see C-terminal loop, which is stabilised by short helix (71-81). It is quite probable, that this sandwich can open more significantly. The second tunnel will remain separated by the loop. Authors has identified I396 as a one of the interacting residues – it is the one which separates two distinctive tunnels from CAVER analysis.

The authors could compare the closed structure as well and tunnels dynamics in the structures with bound substrate to more deeply understand what conformational changes the LcpK30 enzyme undergoes and how incorporation of a big oligomeric substrate influences tunnel dynamics. This was partly done by RMSD and RMSF analysis, but tunnel radius investigation in a enzyme-substrate complex could give insight into tunnel bottlenecks and residues that might interfere in the substrate/product transport.

I would like to underlay that since authors have run MD simulations the mentioned improvements are in their hands. However, the conclusions can differ substantially.

Reviewer #2: The manuscript by Hassan and co-workers brings an interesting approach to the investigation of the polyisoprene binding to latex-clearing proteins and indeed might represent a good contribution for a better understanding of the mechanisms behind the action of this type of protein. The submitted version of the manuscript, however, seems to show some flaws that don't permit ensuring publication. Some suggestions of improvements are listed below:

1) The english language is good but small typos and grammar issues were detected. I recommend a deep revision before resubmission;

2) I don't see need of an author summary. Unless this is a request from the journal I suggest deleting it;

3) I suggest no repeating colors in Figs. 2a and 2b. This can confuse the readers. Use different colors for tunnels and pockets;

4) The docking protocol is poor. There is no re-docking to validate the docking protocol neither an appropriate description about the number of runs, criteria to select the best poses, algorithm used, etc.... Also there is no experimental support for the binding of the ligand. Authors should use some experimental evidence that justify the poses chosen. If it's not possible, this should be clarified in the text and the speculative character of the work evidenced;

5) How many poses were selected for the MD simulations? this is not clear in the manuscript. Why to select folded and extended poses? There is no hint in the literature pointing to one or another? This should be discussed in the text;

6) Most docking programs state favorable docking scores as negative energy values. If GOLD is different this should be clarified in the text;

7) Many important results are shown as supplementary material. This makes it hard to follow the discussion. I recommend revising it and bringind back to the text the most relevant figures and tables.

8) Most of software and methods used are not properly cited in the manuscript;

9) Author mention binding energy calculations during the MD simulations but don't present any result.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 May 15;19(5):e0302398. doi: 10.1371/journal.pone.0302398.r002

Author response to Decision Letter 0


1 Feb 2024

Thank you for giving us the opportunity to respond to reviewers comments and to revise our manuscript. We carefully considered and addressed the editor’s and reviewers’ comments and include our response in the attached document "Response to reviewers". We thank the reviewers for their very insightful comments, which allowed us to improve the manuscript.

Journal Requirements

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

Response: we used the links above to format our documents and included all supplementary figures and tables as separate files, named and cited appropriately in the text.

2. Please note that funding information should not appear in any section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript.

Response: funding text was removed.

3. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

Response: the relevant accession number for all the data was provided as a link included in the Data Availability Statement at the end of the document. https://doi.org/10.6084/m9.figshare.22941149.

Reviewers' comments

We thank the reviewers for their very insightful comments, which allowed us to improve the manuscript. We addressed all comments below. We included additional data and explanations and we moved some of the data from the supplementary information into the main manuscript, as suggested by the reviewers.

Reviewer #1

The studied enzymes, latex clearing proteins (Lcps) are interesting candidates for structural studies as they can cleave polymers despite having active catalytic site buried inside the protein structure and furthermore, they can be used to degrade of rubbers and potentially even synthetic polymers. Application of various computational tools, such as CAVER-Pymol plugin 3.0.3, fpocket and Molecular Dynamic (MD) simulations allowed the authors to analyse the structures of Lcps and substrate binding.

Comment 1.1

The authors compared the open and closed crystal structures of LcpK30. The open crystal structure of LcpK30 contains, apart from heme, an imidazole molecule in its active site. The open crystal structure in single state is thoroughly analysed by the authors, also considering the probability of acquisition of a polymer in the cavities.

It is not confirmed, however, if the analysed protein would have the same open conformation during incorporation of a large oligomeric substrate, as it has while incorporating imidazole, which is a smaller aromatic molecule.

In PET-degrading enzymes that have active site exposed on the surface, it is the binding site’s flexibility that most probably allows the enzyme to incorporate such polymer substrate for depolymerisation. The other example can be found in epoxide hydroxylases, the native long hydrophobic substrate was thought to enter through one tunnel and with proper positioning was reaching exit of the second tunnel. Both tunnels made a L-shape tunnel network. However, during MD study, it was shown, that two domains of the epoxide hydrolase can undergo small conformational changes, and instead of two isolated tunnel, groove can be observed (https://doi.org/10.1016/j.csbj.2021.10.042 ). It is thus possible that LcpK30 undergoes some strong conformational changes to allow easier accommodation of oligomeric substrate to the protein core where the active site is located.

Such big conformational changes were not possible to be observed by the authors in the performed studies. The results of the docking studies using GOLD software of flexible model substrate to rigid protein suggest that the protein truly needs to undergo some conformational changes prior to incorporating the substrate, as the obtained poses had very low or even negative fitness score.

Response:

We thank the reviewer for this question because it allowed us to further discuss the importance of the conformational flexibility in LcpK30. To address this question, we carried out extensive 3 x 500 ns MD simulations starting from the closed-like conformation of the enzyme, prepared as described below. In addition to that, to explore conformational changes of the enzyme upon enzyme-substrate complex formation, we performed principal component analysis (PCA) taking into account backbone cartesian coordinates, using MD snapshots from both the open and the closed-like states of the empty enzyme (without substrate bound), as well as of the enzyme in complex with the polyisoprene substrate. New text and data was included in the manuscript, as described below.

Pages 8-10: we introduced changes in title, text and Fig 2 title to clarify differences between static and dynamic models.

Lines 189-196

“We assumed that polyisoprene binding occurs in the catalytically active state of LcpK30 and therefore we prepared an LcpK30 model starting from the open state structure, in which the co-crystalized imidazole molecule was replaced with O2 bound to the heme cofactor and where the Lys167 chain (not heme bound) was fully protonated, which we were able to simulate during relatively long simulation timescales. For comparison, we also modelled the closed-like state system, prepared from the closed state structure conformation, but where the coordination of Lys167 to heme was removed and replaced with O2 coordination, whilst Lys167 was fully protonated.”

We note that the crystal structure of the closed conformation (PDB 5O1M) most likely resembles the resting state of the enzyme, with neutral Lys167 (instead of O2) coordinating heme. We were not interested in simulating this state because the coordination of Lys167 to Fe would prevent binding of the catalytic O2 and its subsequent interaction with the substrate.

Lines 198-202

“Relatively low Calpha backbone RMSD were obtained over the course of 500 ns MD simulations, for both open and closed-like conformations (see S3a Fig for the open conformation). This suggested that the enzyme with no substrate bound did not undergo large structural changes compared to the reference X-ray structures of the open and closed states.”

Lines 212-244

“Principal component analysis (PCA) taking into account backbone cartesian coordinates was used to compare open and closed states of the enzyme with no substrate. A clear separation of the two states along the PC1 was observed, suggesting that major conformational changes must occur that would allow the transition between closed (positive PC1 values) and open (negative PC1 values) conformations (Fig 3a). Furthermore, the normal mode displacements (Fig 3b) along the PC1 were characterised as conformational changes that predominantly involve helix E residues 160-176, which also contains the central Lys167. Namely, in the open conformation Lys167 is exposed towards the protein surface interacting with the propionates of the heme cofactor, in contrast to the closed-like conformation where it is rotated into the protein interior interacting with O2 and Glu148. On the other hand, displacements along the PC2 are associated with the flexibility of the C-terminus loop 392-397 ranging from fully closed (negative PC2 values) to a fully open (positive PC2 values) conformation (Fig 3b).

Despite the closed-like state having identical amino acid protonation states and the same species present in the active site as the open state (fully protonated Lys167 and O2 bound to heme), a transition between the two states has not been observed, and the reason for that might be a large barrier separating these two states that simply could not be sampled using conventional MD simulations at the available timescales. This implies that the opening of the enzyme is a complex and controlled process that happens on longer timescales and involves large conformational changes of the helix E and Lys167 and might even be coupled with the approach and the interaction with the polyisoprene substrate.”

We also introduced a principal component analysis to compare the conformations of LcpK30 with and without bound substrate. For this, we combined the MD snapshots for the empty enzyme (no substrate, from analysis above) with MD snapshots of the enzyme in complex with the cis-1,4-polyisoprene chain (originating from all simulations of both ChemPLP and ChemScore docking solutions) into a unique PCA plot, now included in S13 Fig. Based on the PCA plots, we concluded the following:

Lines 437-451

“The PCA analysis combining MD snapshots of the enzyme with and without substrate showed that compared to the empty enzyme, the presence of substrate slightly changed the conformation of the enzyme, irrespective of the extended or folded substrate binding mode (S13 Fig). The bound and unbound states were not separated along only one principal component, instead displacements along both PC1 and PC2 were observed. Normal mode displacements along the PC1 included the loop 155-160, helix E 160-176 and C-terminal helix Z2 297-314, whilst the displacement along the PC2 mostly localised at the C-terminal loop residues 392-397. As observed from the RMSD analysis, in some cases where the substrate binds deeper in the active site the conformation of the enzyme bound to substrate was significantly different from the empty enzyme (for example in the case of extended substrate conformations 10 from ChemPLP and 3 from ChemScore). Furthermore, significant structural changes of the enzyme, especially C-terminus loop 392-397, were observed in case of simulations with the folded substrate conformations 1 from both ChemPLP and ChemScore.”

From this data, we conclude that conformational changes occur for the substrate-bound enzyme, compared to either open or closed state without substrate. Simulation of the closed-like state with substrate bound was not possible, because the substrate could not be accommodated within the structure. However, we hypothesise that the binding of the substrate is more likely to occur to the open state than to the closed state, because more space is available within the enzyme (see also answer below, related to tunnels in closed structure).

The reviewer also suggested that there might be a groove forming which would allow easier approach and binding of larger substrates into LcpK30, similar to the one reported for epoxide hydrolase, however we did not observe the formation of such groove from our simulations. We believe that in the case of LcpK30, the tunnel formation is most likely controlled by the conformation of central Lys167 from the helix E and its interaction with the heme propionate groups and by the conformation of the flexible C-terminus loop.

Comment 1.2

The authors identified putative tunnels within the crystal structure of LcpK30 only in the OPEN state, using the CAVER-PyMOL plugin. CAVER-PyMOL plugin allows to analyse tunnels in the macromolecule’s core using the geometric approach in single state. The results are strongly influenced by the chosen radius probe. The authors should probably also analyse LcpK30 in the CLOSED state for comparison using CAVER-PyMOL plugin. Moreover, since authors run MD simulations, it could be recommended to use either CAVER implementation for MD simulation analysis or AQUA-DUCT which can more precisely show the potential accessing paths. Authors need to take into consideration, that CAVER and specifically CAVER plugin is using radial probe for tunnels detection and analysis, therefore all asymmetric tunnels cannot be described properly. Instead some of the tunnels has to be merged together. Such problems can be avoided e.g. using water molecules as a small probes – this is especially important for bulky substrates, or substrates which are not globular. Authors can see, what type of analysis is accessible in previously mentioned article https://doi.org/10.1016/j.csbj.2021.10.042 and comparison of geometry based approach (CAVER) with molecules tracking approach (AQUA-DUCT) can be found https://doi.org/10.1021/acs.jcim.2c00985 and https://doi.org/10.1371/journal.pcbi.1010119 In general, the analysis of tunnels in crystal structures only can be misleading. Since authors have MD simulations in hands, the analysis of accessibility to the active site can be done quickly with small effort only.

Response:

No static tunnels were identified using CAVER-Pymol plugin 3.0.3 within the closed state structure (5O1M), using the same calculation parameters as with the open state structure. However, by decreasing the minimum probe radius of tunnel calculations from 0.9 to 0.6 Å, three tunnels were detected, with much lower throughput (0.2 Å) and bottleneck radius (0.6 Å).

We included a clarification at lines 132-133: “We used both the open (5O1L) and the closed state conformations (5O1M), however only the former led to tunnel identification using the default parameters (see Methods).”

We thank the reviewer for suggesting alternative programs for detecting and analysing tunnels from MD simulations. We are aware of the limitations of CAVER software when it comes to characterising wide and asymmetric tunnels, which arise from the fact that the individual tunnels are detected by constructing the Voronoi diagrams using simple spherical probes. However, we strongly believe that accurate tunnel predictions could be obtained by employing the dynamical approach implemented in CAVER Analyst 2 using MD snapshots instead of static crystal structures. In the previously submitted S5a Fig and S2 Table, we have already presented the dynamical tunnel calculations with CAVER Analyst 2 using our MD simulations of the enzyme in open conformation.

We have now moved the analysis of dynamical tunnel clusters for the open state enzyme from the supplementary information into the main manuscript (Table 1 and Fig 4a). In addition to this, we also analysed the features of dynamical tunnel clusters of the closed-like conformation and compared those to the tunnels obtained during MD simulations of open state (Table 1 and S5 Fig). New text and data was included in the manuscript, as described below.

Lines 245-290:

“Given the enzyme flexibility required to accommodate the substrate, we sought to improve the static tunnel predictions by employing the dynamical approach implemented in CAVER Analyst 2, using MD snapshots instead of static crystal structures. The first 5 identified tunnel clusters ranked based on priority (calculated by averaging tunnel throughputs over all snapshots) were analyzed, for MD simulations with both the open and closed-like conformations and the details about the features of the dynamic tunnel clusters are shown in Table 1. Open state tunnel clusters had overall larger average and maximum bottleneck radius, shorter length, smaller curvature and higher throughput than closed state ones, which ultimately makes them more favorable for substrate binding. Furthermore, the priority of the tunnel clusters calculated based on the appearance frequency during the MD simulations was much higher in the open (14-49%) compared to the closed (4-13%) state. Interestingly, when comparing the spatial distribution of timeless tunnel cluster pathways, we could clearly see how the central Lys167 plays an important role in separating tunnels into two wider superclusters in the open state (Fig 4a, clusters 1 and 3 forming the green supercluster and clusters 2 and 5 forming the blue super-cluster). This is in contrast with the rotated Lys167 conformation forcing the appearance of one narrow supercluster

Attachment

Submitted filename: Response to reviewers.docx

Decision Letter 1

Sushil Mishra

3 Apr 2024

Computational investigation of cis-1,4-polyisoprene binding to the latex clearing protein LcpK30

PONE-D-23-23541R1

Dear Dr. Pordea,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Sushil Mishra, Ph.D

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

None

Reviewers' comments:

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Crystal structure analysis.

    Close view of the active site from the crystal structure of LcpK30 showing the helix 160–176 near heme in (a) open state (PDB: 5O1L) and (b) closed state (PDB: 5O1M).

    (TIF)

    pone.0302398.s001.tif (2.3MB, tif)
    S2 Fig. Tunnel profiles for the dominant static tunnels.

    Profiles were calculated using the LcpK30 crystal structure in the open state (PDB 5O1L) using the CAVER-Pymol plugin 3.0.3. (a) Variation of tunnel radius with length; (b) tunnel profile heatmaps; (c) distances between different atoms in the cis-1,4-polyisoprene chains, used to estimate maximum width of the chain in an extended conformation. The distances were measured between the indicated carbon atoms that represent the bond length (blue) and close contact atoms (red). The addition of C-H distances is anticipated to add max. 1.1 Å.

    (TIF)

    pone.0302398.s002.tif (4.1MB, tif)
    S3 Fig. RMSD and RMSF profiles for LcpK30 in the open state.

    (a) C-alpha RMSD and (b) RMSF values were calculated from 3 independent 500 ns MD simulations (black, red and green) of LcpK30 from the reference X-ray structure in the open state. Due to increased flexibility of the N-terminus, residues 29–49 were omitted from the analysis to minimize the noise. The RMSF values are shown per residue as average from all 3 trajectories.

    (TIF)

    pone.0302398.s003.tif (1.5MB, tif)
    S4 Fig. Backbone flexibility of open state LcpK30 from MD simulations.

    (a) Experimental B-factor from the X-ray structure and (b) C-alpha RMSF (Å) values from MD simulations plotted on the backbone of LcpK30. Due to large fluctuations of the N-terminus, residues 29–49 were omitted from the analysis of the MD trajectories for the sake of clarity. The heme cofactor is represented in spheres (orange = Fe, white = C, red = O, blue = N). The protein structures are depicted with the B-factor putty representation where the backbone is displayed as a tube with a diameter correlated to the experimental B-factor from the X-ray structure (a) or RMSF from MD simulations (b). Thicker tube indicates higher flexibility. The structures are coloured with a continuous scale that ranges from blue to red to indicate the backbone mobility, where blue is low, and red is high backbone flexibility.

    (TIF)

    pone.0302398.s004.tif (2.1MB, tif)
    S5 Fig. Dynamical tunnel mapping and pocket identification in the LcpK30 closed-like structure.

    First 5 dominant tunnel clusters were calculated with CAVER Analyst 2 on 1500 snapshots from MD simulations of the closed-like state. Clustering displays the centre lines for all tunnels computed for all snapshots at once. Centre lines are coloured according to their related clusters. Clusters 1, 3, 4 and 5 form one narrow supercluster, colored in limon.

    (TIF)

    pone.0302398.s005.tif (1.2MB, tif)
    S6 Fig. Dynamic of water molecules during simulations of open and closed-like states of LcpK30.

    Radial distribution function (RDF) of water oxygen atoms from O2 molecule bound to heme in (a) open and (b) closed LcpK30. The isosurface (isovalue = 100) showing the density of water oxygen atoms around the active site in (c) open and (d) closed LcpK30. Top view at the heme cofactor in the active site of LcpK30 showing water molecules as blue surface using the representative MD snapshots in (e) open and (f) closed LcpK30. The analysis was carried out on 6000 frames from 3 repeat MD simulations of open and closed-like LcpK30 states.

    (TIF)

    pone.0302398.s006.tif (3.1MB, tif)
    S7 Fig. Superimposed conformations results from rigid docking of cis-1,4-polyisoprene C50H82 within LcpK30.

    (a) Best 10 poses obtained using the ChemPLP scoring function; (b) best 10 poses obtained using ChemScore scoring function.

    (TIF)

    pone.0302398.s007.tif (2.4MB, tif)
    S8 Fig. Effect of binding site size on docking.

    Superimposed docking conformations results of C50H82 within a defined radius of (a) 10 Å and (b) 15 Å from the central iron atom in heme using ChemPLP flexible docking. Ligand is represented as yellow sticks, and the heme cofactor is represented in spheres (white = C, red = O, blue = N).

    (TIF)

    S9 Fig. Docking conformations of cis-1,4-polyisoprene (C50H82) near heme.

    Docking conformations of cis-1,4-polyisoprene (C50H82) near heme. Ranking is by the fitness score from highest to lowest. a) Poses obtained with the ChemPLP fitness function; b) poses obtained with the ChemScore fitness function. Docking solutions were calculated with induced fit docking using a 10 Å binding site. Enzyme and hydrogens were omitted for the sake of clarity.

    (TIF)

    pone.0302398.s009.tif (2.7MB, tif)
    S10 Fig. Protein-ligand interactions identified by PLIP analysis.

    Protein-ligand interactions identified by PLIP analysis. Docked poses were obtained with ChemPLP (above) and ChemScore (below). (a) docking solutions ranked 1, representative of folded conformations and (b) docking solutions ranked 3, representative of extended conformations. Docking solutions were calculated with induced fit docking using a 10 Å binding site and ranked based on the fitness score from highest to lowest (see Table 1). The hydrophobic interactions between the ligand and interacting residues are shown by dashed lines. The three residues that appeared in all interactions are underlined. The residue that interacted only with the extended conformation is coloured in blue. The red coloured residue (Ala159) appeared in all interactions with poses obtained using ChemScore and in most interactions with poses obtained using ChemPLP.

    (TIF)

    pone.0302398.s010.tif (7.6MB, tif)
    S11 Fig. C-alpha RMSF calculated during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemPLP fitness function.

    C-alpha RMSF calculated during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemPLP fitness function. All values are calculated from the reference X-ray structure of the enzyme and projected on the average structure of the enzyme. Due to increased flexibility, residues 29–49 from N-terminus were omitted from the analysis to minimise the noise. The substrate is omitted for the sake of clarity. The protein structures are depicted with the putty representation where the backbone is displayed as a tube with a diameter correlated to the RMSF from MD simulations (thicker tube indicates higher RMSF). The structures are coloured with continuous scale that ranges from blue to red to indicate the backbone mobility, where blue is low and red is high backbone flexibility.

    (TIF)

    pone.0302398.s011.tif (4.8MB, tif)
    S12 Fig. C-alpha RMSF calculated during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemScore fitness function.

    C-alpha RMSF calculated during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemScore fitness function. All values are calculated from the reference X-ray structure of the enzyme and projected on the average structure of the enzyme. Due to increased flexibility, residues 29–49 from N-terminus were omitted from the analysis to minimise the noise. The substrate is omitted for the sake of clarity. The protein structures are depicted with the putty representation where the backbone is displayed as a tube with a diameter correlated to the RMSF from MD simulations (thicker tube indicates higher RMSF). The structures are coloured with continuous scale that ranges from blue to red to indicate the backbone mobility, where blue is low and red is high backbone flexibility.

    (TIF)

    pone.0302398.s012.tif (4.8MB, tif)
    S13 Fig. Principal component analysis (PCA) carried out on structures from MD simulations of open and closed LcpK30 with and without substrate.

    PCA was performed considering cartesian coordinates of protein backbone atoms (N, Cα, C and O). (a) MD snapshots projected on the first two principal components. Blue and red circles represent MD snapshots belonging to open and closed-like states, respectively, orange circles represent MD snapshots belonging to the open state LcpK£0 in complex with the substrate. (b) Normal mode displacement vectors associated with the first two principal components showing only motions longer than 2 Å as porcupine in both directions. The backbone is coloured by the mobility where lower and higher flexibility is depicted in blue and red, respectively. Structures from MD simulations projected on the first two principal components considering open LcpK30 (grey) and LcpK30 in complex with substrate from (c) ChemPLP and (d) ChemScore docking poses.

    (TIF)

    pone.0302398.s013.tif (13.3MB, tif)
    S14 Fig. Probability distribution of distance between distal oxygen of O2 and each C = C double bond for docking poses obtained with ChemPLP.

    Calculations were during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemPLP fitness score. The cleaved bonds were the ones found closest to O2. The color of the C = C double bond (see scheme at the bottom) corresponds to the color of the probability histogram.

    (TIF)

    pone.0302398.s014.tif (1.8MB, tif)
    S15 Fig. Probability distribution of distance between distal oxygen of O2 and each C = C double bond for docking poses obtained with ChemScore.

    Calculations were during 100 ns MD simulations of LcpK30 with cis-1,4-polyisoprene bound, starting from 10 docking poses obtained with ChemScore fitness score. The cleaved bonds were the ones found closest to O2. The color of the C = C double bond (see scheme at the bottom) corresponds to the color of the probability histogram.

    (TIF)

    pone.0302398.s015.tif (1.7MB, tif)
    S1 Table. The features of dominant tunnels calculated using the LcpK30 crystal structure in the open state (PDB 5O1L) using the CAVER-Pymol plugin 3.0.3.

    (PPTX)

    pone.0302398.s016.pptx (53.8KB, pptx)
    S2 Table. Docking solutions obtained from rigid docking within GOLD software.

    Solutions are ranked based on the fitness score from highest to lowest.

    (PPTX)

    pone.0302398.s017.pptx (63.7KB, pptx)
    S3 Table. Impact of the size of the defined protein binding site on the ChemPLP docking scores.

    Docking solutions are obtained from flexible docking using the ChemPLP scoring function within the GOLD software. The docking was performed in a binding site defined as all atoms within 10 or 15 Å from the central iron atom in heme. The 10 docking solutions are ranked based on the fitness score from highest to lowest.

    (PPTX)

    pone.0302398.s018.pptx (47.9KB, pptx)
    S4 Table. List of residues interacting with the docked cis-1,4-polyisoprene, extracted from PLIP analysis of 10 docking poses obtained using ChemPLP.

    Docking solutions were ranked based on the fitness score from highest to lowest. Residues that interact with all / most poses are highlighted in grey and light grey, respectively.

    (PPTX)

    pone.0302398.s019.pptx (58.7KB, pptx)
    S5 Table. List of residues interacting with the docked cis-1,4-polyisoprene, extracted from PLIP analysis of 10 docking poses obtained using ChemScore.

    Docking solutions were ranked based on the fitness score from highest to lowest. Residues that interact with all / most poses are highlighted in grey and light grey, respectively.

    (PPTX)

    pone.0302398.s020.pptx (57.2KB, pptx)
    S6 Table. Frequency of contacts between the substrate and LcpK30 calculated during 100 ns MD simulations of 10 docking poses obtained from ChemPLP fitness function.

    Docking solutions were initially ranked based on the fitness score from highest to lowest. A cut-off of 5 Å was used and only contacts with total fraction more than 4.5% are listed. Ext = extended conformation; Fold = folded conformation.

    (PPTX)

    pone.0302398.s021.pptx (55.6KB, pptx)
    S7 Table. Frequency of contacts between the substrate and LcpK30 calculated during 100 ns MD simulations of 10 docking poses obtained from ChemScore fitness function.

    Docking solutions were initially ranked based on the fitness score from highest to lowest. A cut-off of 5 Å was used and only contacts with total fraction more than 4.5% are listed. Ext = extended conformation; Fold = folded conformation.

    (PPTX)

    pone.0302398.s022.pptx (55.3KB, pptx)
    Attachment

    Submitted filename: Response to reviewers.docx

    Data Availability Statement

    Input and output information from the computational analysis including Caver and fpocket setups; GOLD docking proto-cols; additional analysis from MD simulations and full MD inputs; and details on protein–ligand docking results are freely available on Figshare: https://doi.org/10.6084/m9.figshare.22941149.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES