Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2026 Mar 31;16:10786. doi: 10.1038/s41598-026-44427-2

Investigating Escherichia coli Colicin E9 immunity protein interactions with DNA gyrase of Pseudomonas aeruginosa: advanced computational approach for developing novel antimicrobial strategies

Rihaf Alfaraj 1,, Fai Alkathiri 1, Rupesh Chikhale 2,
PMCID: PMC13039675  PMID: 41917062

Abstract

Protein-protein interactions (PPIs) are essential to numerous cellular processes, making a thorough investigation of these interactions crucial for a deeper understanding of molecular mechanisms. This research initiative aims to elucidate the complex relationship between the Colicin E9 immunity protein from Escherichia coli (E. coli) and DNA gyrase from Pseudomonas aeruginosa, a significant target for antimicrobial strategies. By inhibiting DNA gyrase, the immunity protein Colicin E9, produced by E. coli, allows the host to generate a protective protein that lowers the risk of self-toxicity from this interaction. Understanding this relationship can aid the development of antibacterial strategies to combat resistant infections. Molecular docking, molecular dynamics (MD) simulations, and binding energy analyses were integral to the computational approach used in this study. MD simulations were employed to assess the stability and dynamics of the protein complex after docking was conducted using ClusPro and LightDock. The MM-GBSA method was used to evaluate binding free energies and to characterise structural features through hydrogen-bond analysis to uncover key stabilising interactions. Several important residues were identified that help stabilise the interface between the two proteins: HIS40, MET27, LYS105, GLU44, ASP47, and three others. The ClusPro complex displayed impressive interactions, featuring between six and ten hydrogen bonds along with a binding free energy of ΔG = -3.88 kcal/mol, indicating a strong protein interaction. Despite some variations, both complexes maintained stable interfaces throughout the MD simulations, and DNA gyrase retained its structural integrity. This study concludes that the protein complex formed between the Colicin E9 immunity protein and DNA gyrase is stable, paving the way for future experiments and the development of new antimicrobial agents targeting DNA gyrase in P. aeruginosa infections.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-026-44427-2.

Keywords: Colicin E9 immunity protein, DNA gyrase, Protein-protein interactions, Molecular docking, Molecular dynamics

Subject terms: Biochemistry, Computational biology and bioinformatics, Drug discovery, Microbiology, Structural biology

Introduction

Protein-protein interactions (PPIs) are essential to a wide range of biological processes, including signal transduction, gene regulation, and enzymatic activities, thereby enhancing understanding of cellular mechanisms and the causes of various diseases13. In the context of global health, decoding the PPIs is vital for identifying novel pharmacological targets to combat multidrug-resistant (MDR) infections4. High-throughput screening methodologies aimed at investigating PPIs in both prokaryotic and eukaryotic proteomes have yielded compelling evidence that cells can be viewed as intricate assemblies of interconnected molecular complexes, whose foundational components are frequently described as ‘protein hubs’5. Bacteriocins, a diverse family of protein antibiotics, represent a sophisticated bacterial defense strategy. The category of protein antibiotics comprises an extensive and varied family of multidomain toxins that inflict lethal effects on specific Gram-negative bacterial species during intraspecies competition for essential resources, thereby emphasising their ecological significance6. In recent years, significant progress has been made in understanding the complex mechanisms by which potent toxins are absorbed by target cells7. In particular, identifying the various structures associated with different bacteriocin domains has been crucial for illuminating their functional dynamics8. Nevertheless, the structural biochemistry of intact bacteriocins remains poorly understood, prompting inquiries into how these structures may differ across bacterial species, an area that warrants further exploration9. Here, we examine the bacteriocin produced by Escherichia coli (E. coli) and the essential enzyme DNA gyrase found in Pseudomonas aeruginosa (P. aeruginosa), with the goal of elucidating the mechanisms behind their interactions and the resulting implications.

To ensure that the host organism possesses robust immunity to the detrimental effects of Colicin E9, a specific inhibitory protein, referred to as the Colicin E9 immunity protein or Im9, is synthesised in a coordinated manner alongside the colicin itself10. It is important to note that the complex formed between Colicin E9 and Im9 is ultimately secreted by the bacterial cell that produces it. Although it is broadly acknowledged within the scientific community that the Colicin E9-Im9 complex is likely to dissociate during its engagement with the target cell, given that the DNase enzyme is active within this complex11. Although the Colicin DNases (E2, E7, E8, and E9) share ~ 80% sequence similarity, their immunity proteins are highly divergent and exhibit strict cognate specificity; only the correct partner can provide total immunity12. However, it is critical to emphasise that, despite these similarities, only the cognate inhibitors can provide complete immunity to the enzymatic activity of the colicin DNases13. Colicins, which are plasmid-encoded toxins produced by Enterobacteriaceae, are generated in response to environmental stressors or nutrient scarcity through the SOS response14. To prevent self-toxicity, colicin-producing cells co-synthesize highly specific immunity proteins (e.g., Im9 for Colicin E9)15. This diminutive protein proficiently binds to the colicin, consequently inhibiting its nuclease activity. These small (~ 9.5 kDa) proteins bind to the 60 kDa DNase domain with extraordinary affinity, neutralizing enzymatic activity until the toxin is released into the environment. Consequently, in addition to the significant challenges that a nuclease colicin faces as it navigates the translocation of its enzymatic component across two membrane barriers to reach its target, these toxins are further burdened by the challenge of releasing their tightly bound immunity protein during this translocation21.

This study investigates the potential interactions between the Escherichia coli Im9 and the essential enzyme DNA gyrase from P. aeruginosa. P. aeruginosa is a known profound human pathogen characterized by its metabolic adaptability and extensive antibiotic resistance profiles16,17. Given that DNA gyrase is a validated target for antimicrobial therapy, exploring its interaction with Im9 may reveal new regulatory roles for bacteriocin components and provide a template for innovative therapeutic strategies against MDR pathogens18. Initially, computational mapping using the ScanNet server indicates that Im9 could interact directly with the active-site amino acids of DNA gyrase. By binding to the identified ‘hotspot’ residues, Im9 physically blocks the DNA gyrase from P. aeruginosa’s identified interacting pocket, preventing the substrate interactions necessary for DNA replication. Further, sophisticated algorithms were utilised in advanced protein-protein docking simulations to pinpoint potential interaction interfaces, thereby deepening understanding of the molecular mechanisms that regulate protein interactions1921. The docked complexes were subjected to a comprehensive analysis of their binding energies and interactions, which are recognised for their accuracy in molecular simulations. As a result, molecular dynamics (MD) simulations were used to further explore the most energetically stable complexes, assess their dynamic properties, and confirm the stability and integrity of their interaction interfaces. This integrated approach promotes a clearer understanding of the molecular interactions between the Colicin E9 immunity protein and DNA gyrase in P. aeruginosa, enabling more advanced experimental investigations and the strategic development of new antimicrobial agents.

Materials and methodology

Protein preparation

The three-dimensional (3D) structures of the target proteins were obtained from the Protein Data Bank (PDB)22, a vital repository for macromolecular structures. Proper preparation of macromolecules before the in-silico study is essential, as it provides the necessary data for further analyses, including molecular docking and protein-protein interaction studies23. This preparation enhances understanding of the biological roles of the proteins under study. PDB ID 6M1S2426 represents the DNA gyrase from P. aeruginosa (Chain A), while PDB ID 5EW526,27 refers to the Colicin E9-immunity protein (Chain E). These selected structures illustrate key molecular functions, such as DNA gyrase, which is essential for managing DNA topology, a process crucial for both replication and transcription. Likewise, in competitive microbial environments, the Colicin E9-Immunity protein confers immunity to bacteriocins. The integrity of the protein models was assessed through a comprehensive structural analysis using PyMol28, with an emphasis on locating missing residues, chain discontinuities, and other structural abnormalities. A gap of missing residues from positions 102 to 119 in the DNA gyrase (PDB ID: 6M1S) was found. The Colicin E9-Immunity protein (PDB ID: 5EW5) was found to have a complete structure without any defects. During the protein preparation process, any non-biological ions and water molecules were removed. It is important to note that the above non-biological components may introduce artefacts and reduce the accuracy of the analyses. The missing residues in the DNA gyrase structure were repaired using homology modelling29. Homology modelling was carried out using the SWISS-MODEL server34,35, an online web server that employs template-based modelling to reconstruct missing residues in the protein. The repaired structure was rigorously validated using the SAVES server30. SAVES database assesses the stereochemical integrity, Ramachandran plots, and structural fidelity. In order to optimise both the modelled structures, specifically to eliminate close contacts and reduce any structural overlaps, a short span of MD simulation lasting for 50 ns was performed and described in a subsequent section. This simulation is well-suited for fine-tuning the protein’s spatial arrangement and motion, thereby yielding more accurate samples of its shape.

MD simulation for structural optimisation

MD simulations were conducted on the crystal structure of the Colicin E9-immunity protein and on modelled structures of DNA gyrase to verify the optimisation of their structural integrity and stability. This step aimed to reach energetically favourable conformations to refine the structures for upcoming docking studies. The MD simulations of both proteins were carried out using GROMACS2024.331,32 for a 50 ns time span. The protein topology was generated using the all-atom CHARMM36 force field33. To replicate physiological conditions, each of the proteins was immersed in a cubic water box utilising the TIP3P (Transferable Intermolecular Potential with 3 Points) water model34. To neutralise the system, the required quantity of Na+/Cl ions was added. To remove the steric class and close contacts, energy minimisation was performed. To equilibrate the systems, both were equilibrated in NVT, followed by NPT, for 5 ns each. Furthermore, the production phase was carried out for 50 ns, utilising periodic boundary conditions to eliminate edge effects. Several parameters were computed from the finalised MD simulation trajectories to assess protein stability. These parameters include the protein backbone root-mean-square deviation (RMSD), root-mean-square fluctuation (RMSF), radius of gyration (RoG), and solvent-accessible surface area (SASA). The protein-protein docking analysis was conducted using the last conformations of both proteins at 50 ns, which were extracted from the trajectory.

Active site identification

Identifying active sites, specific regions on proteins where molecular interactions occur, can substantially improve the accuracy of docking position predictions while also offering vital insights into the functional dynamics of proteins35. Recognising these active sites is essential for understanding the mechanisms that regulate protein interactions and for enhancing the accuracy of protein-protein docking simulations, particularly when incorporating molecular biology insights into the evaluation process. Examining these interactions at the molecular level is critical to the development of drug design and treatment approaches. Identifying the precise active site reduces the search space for docking algorithms, thereby increasing the accuracy of protein-protein interaction predictions. In this context, DNA gyrase served as the receptor, while Colicine E9 served as the ligand. The active-site residues of the DNA gyrase protein were identified using the ScanNet server36, an advanced tool for accurately predicting functional sites within protein structures. The ScanNet al.gorithm combines structural and sequence information to effectively predict residues essential for binding and catalytic functions. The analysis input comprised the optimised DNA gyrase structure obtained from MD simulations. The server identified crucial residues in the binding region that are likely involved in ligand-protein interactions. The identified active-site residues will serve as the primary interaction region for the docking studies.

Protein-protein docking

To predict potential binding sites, molecular docking was utilised to illustrate the interaction between DNA gyrase and Colicin E9. The docking analyses relied on the final conformations of both proteins obtained from 50 ns MD simulation trajectories. It employed ClusPro 2.037 and LightDock38, reputable docking platforms, to investigate the protein-protein interactions. ClusPro’s docking workflow employed a rigid-body approach to generate thousands of docked conformations, which were then organised by energy score. The cluster with the highest score indicates the most favourable interaction, as determined by energy minimisation. ClusPro’s scoring function evaluates biologically relevant conformations by factoring in van der Waals forces, electrostatic interactions, and desolvation energy.

LightDock is a computational tool that simulates macromolecular interactions, with a focus on the flexibility required for protein-protein docking. This flexibility is essential for accurate prediction of protein complex structures, particularly because proteins undergo conformational changes upon binding. LightDock’s adaptable docking method is crucial for addressing the limitations of rigid-body assumptions common in traditional docking approaches. By permitting some flexibility in side chains and utilising swarm intelligence to explore the docking environment, LightDock improves docking models. The energy scores from the generated docking poses were used to rank the poses, with the top-ranked poses analysed for consistency with ClusPro predictions. By assessing the docking models generated by both tools, we identified key interaction residues, binding orientations, and energetically favourable complexes. To establish a foundation for future research on interaction dynamics, we selected the complex with the highest score for subsequent structural and functional analyses. Following molecular docking, the stabilising energy of the docked protein was explored using the PRODIGY tool39.

Molecular dynamics simulations of docked complexes

To enhance and confirm the docked complexes generated by ClusPro and LightDock, MD simulations were performed based on GROMACS2023.331,32, lasting 300 ns. The protein topology was generated using the CHARMM36 force field33. The docked complexes were set up for simulation by placing them in a cubic water box, ensuring a 10 Å buffer between the complexes and the box edges. The TIP3P34 water model was used to mimic the solvent environment. To neutralise the system and simulate physiological conditions, the required sodium (Na⁺) and chloride (Cl⁻) ions were added to achieve an ionic concentration of 0.15 M NaCl. To ensure structural integrity and thermodynamic stability, the solvated systems underwent a multi-stage refinement protocol. Initial configurations were subjected to rigorous energy minimization to resolve steric clashes and optimize local geometries, providing a stable foundation for dynamic simulations. This was followed by equilibration in two phases, such as NVT and NPT ensembles. Following equilibration, production molecular dynamics (MD) simulations were conducted for 300 ns under periodic boundary conditions. The stability of the complexes was assessed by analysing protein backbone RMSD, RMSF, RoG, SASA, and the number of hydrogen bonds throughout the simulations. Principal component analysis (PCA) of MD simulation trajectories was used to reduce dimensionality and identify significant collective motions within biomolecules. The protein flexibility, stability, and potential interactions between IM9 and DNA gyrase were explored.

Binding free energy calculation using the MM-GBSA approach

The Molecular Mechanics-Generalised Born Surface Area (MM-GBSA) methodology constitutes a dependable computational framework for assessing the binding free energy associated with protein-protein complexes40,41. This approach integrates molecular mechanics with continuum solvation models, specifically the Generalised Born (GB) and Surface Area (SA) models, to accurately evaluate binding energy42. The binding free energy quantifies the energetic change associated with the interaction between two proteins and is crucial for understanding their interactions. The GB model evaluates electrostatic interactions, whereas the SA model incorporates non-polar factors that contribute to the total binding energy. The binding free energy of Colicin E9 toward DNA gyrase was estimated using the gmx_MMPBSA package43 according to the MM-GBSA methodology. Also, the energetic contribution of binding-site residues was explored using per-residue decomposition energy.

Results and discussion

Protein preparation and structural optimisation

In the initial structural analysis of the considered DNA gyrase protein from P. aeruginosa, residues at positions 102–119 were found to be missing. To address the issue of DNA gyrase and to repair any missing atoms in the Colicin E9 immunity protein, both proteins were selected for homology modelling using the SWISS-MODEL tool44, utilising the same template structure as the original PDB entry. To check the closeness of the prediction, the crystal and modelled structures of DNA gyrase and Colicin E9 immunity protein were superimposed, as shown in Fig. 1. Both structures were closely aligned, with only slight deviations. From the superimposed structures, the RMSD was calculated to quantify structural similarity. The RMSD values were found to be 0.491 Å for DNA gyrase and 0.447 Å for the Colicin E9 immunity protein, respectively. The low RMSD values for both proteins indicate similar predictions with high accuracy. The local loop region of the broken part and modelled portion was superimposed (Fig. S1 in the Supplementary file), and RMSD was calculated. Low RMSD of 0.836 Å indicated the close prediction of the broken portion of the DNA gyrase protein. Furthermore, structural validation of the modelled DNA gyrase protein and Colicin E9 immunity protein was conducted to assess their quality and suitability for subsequent computational analyses, including the SAVES server and 50 ns of MD simulation.

Fig. 1.

Fig. 1

Illustrating the structural comparison between the crystal and modelled proteins. Red and green colours represent the crystal and modelled structure, respectively. (A) DNA gyrase, and (B) Colicin E9 immunity protein. Amino acids at the broken part of the DNA gyrase are labelled.

Validation using the SAVES server

The original crystal structures, along with the final frames of the 50 ns MD simulation trajectories for both proteins, were used to derive several statistical parameters from the SAVES server30, which are given in Table 1. Particularly, the Errat score, which evaluates the quality of non-bonded atomic interactions, indicated excellent structural integrity for both proteins. The modelled DNA gyrase structure had an ERRAT score of 94.3, which declined slightly to 92.0 after 50 ns of MD simulation, indicating minor structural adjustments during optimisation. In a similar manner, the Colicin E9 immunity protein started with an Errat score of 97.3, decreasing to 89.2 following simulation. The evaluation of the Ramachandran plot confirmed the stereochemical integrity of both models. For DNA gyrase, the crystal structure showed that 88.6% of residues were located in the most favourable regions. This figure increased to 89.8% after the MD simulation. At the same time, the proportion of residues in the additionally allowed areas decreased from 11.4% to 8.5%, while no residues were located in disallowed regions, signifying improved stereochemical stability. Regarding the Colicin E9 immunity protein, the most favourable regions increased from 91.8% to 94.5%, and residues within the additionally allowed areas diminished from 8.2% to 5.5% post-simulation. Also, for DNA gyrase, no residues were detected in the disallowed regions of the Colicin E9 immunity protein, indicating high-quality structural modelling.

Table 1.

Statistical metrics from the SAVES server for DNA gyrase and Colicin E9 immunity protein before and after 50 ns MD simulations.

Parameters DNA gyrase Colicin E9 immunity protein
Homology modelled Last frame from
MD simulation
Homology modelled Last frame from
MD simulation
Errat score 94.30 92.00 97.30 89.20

Residues in the most

favoured region

88.60% 89.80% 91.80% 94.50%

Residues in the

additional allowed

region

11.40% 8.50% 8.20% 5.50%

Residues in the

disallowed region

0.00% 0.00% 0.00% 0.00%

Validation through molecular dynamics simulation

A 50 ns MD simulation was conducted on the reconstructed DNA gyrase and Colicin E9 immunity proteins, using homology modelling to refine their structures and assess stability. These simulations were performed in a solvated environment with ionic conditions that reflect physiological states. The structural deviations were assessed by calculating backbone RMSD values for both proteins before and after the 50 ns MD simulations. The structural integrity of both DNA gyrase and Colicin E9 immunity protein was found throughout the simulations (Fig. 2A). No individual frame showed deviations greater than 0.3 nm. Furthermore, the patterns of amino acid fluctuations noted during the dynamic states displayed no irregularities (Fig. 2B). The rigidity and compactness of both proteins remained intact throughout the MD simulation (Fig. 2C). The changes in SASA for both proteins during the dynamic states indicated proper protein folding (Fig. 2D). The data from the MD simulations of both proteins reflected their stability and rigidity, as well as their robustness in dynamic states. Thus, the coordinates from the MD simulation were free of close contacts or overlaps, making them suitable for any in silico study.

Fig. 2.

Fig. 2

Statistical parameters from MD simulation of DNA gyrase (Green) and Colicin E9 immunity protein (Red). (A) Protein backbone RMSD, (B) RMSF, (C) Radius of gyration, and (D) Solvent accessible surface area.

To comprehend the positional displacement of DNA gyrase and Colicin E9 immunity protein, the final frame from the 50 ns MD simulation trajectories was superimposed onto the homology model structures, and the RMSD was calculated (Fig. 3). The RMSD upon superimposition was found to be 0.891 Å for DNA gyrase and 0.742 Å for Colicin E9 immunity protein, respectively. Although these values reflect minor positional displacements, they have significantly affected the structure’s quality, as assessed by the Ramachandran plot and other parameters in the preceding section.

Fig. 3.

Fig. 3

Illustrating the structural comparison between the modelled protein and the last frame from the MD simulation. Red and green colours denote the modelled and MD-simulated structures, respectively. (A) DNA gyrase, and (B) Colicin E9 immunity protein.

Active residue identification

Active-site residues in the DNA gyrase protein were identified using the ScanNet server36, a computational tool that predicts functionally important residues by analyzing structural and sequence features. Figure 4 presents the ScanNet output, highlighting the predicted active site areas, with colours indicating binding probabilities. Specifically, areas marked in blue indicate a binding probability below 0.5, suggesting a reduced likelihood of functional interaction, whereas regions coloured in red reflect residues with a binding probability exceeding 0.5, indicating a higher likelihood of binding. Using the criteria established for the ScanNet server, a comprehensive list of amino acids essential for effective binding interactions with incoming molecular entities was generated. The identified residues include ALA20, LYS23, ARG24, PRO25, GLY26, MET27, TYR28, GLY30, ASP31, THR32, ASP33, ASP34, ASP47, ILE50, ASP51, LEU54, ALA55, TYR57, ARG78, PRO81, GLY104, PHE106, ASP108, ASN109, THR110, TYR111, LYS112, VAL113, SER114, GLY115, GLY116, LEU117, HIS118, ILE187, LYS190, ARG191, ARG193, GLU194, LEU195, PHE197, LEU198, ASN199, SER200, TYR219, GLU220, and GLY221. The spatial distribution of these residues has been meticulously mapped onto the DNA gyrase protein structure, suggesting their role in forming the binding pocket critical for the protein’s catalytic activity and substrate interactions. The active site residues are distributed across diverse regions of the protein, including surface-exposed loops and conserved structural elements, consistent with recognized functional motifs. Future docking research on the Colicin E9 immunity protein will build upon this strategic arrangement, emphasizing the significance of these residues in regulating the protein’s functional profile. These findings not only enhance the understanding of DNA gyrase’s enzymatic function but also provide essential information for the informed development of inhibitors that specifically target this important bacterial enzyme.

Fig. 4.

Fig. 4

The modelled protein (left) and predicted active site of DNA gyrase (right) of P. aeruginosa were constructed through the ScanNet program based on a geometric deep learning model.

Protein-protein docking analyses obtained through ClusPro and LightDock

Protein-protein docking was conducted to elucidate the interaction between DNA gyrase and the Colicin E9 immunity protein. This investigation employed two complementary tools, ClusPro 2.041 and LightDock42, each utilising distinct docking algorithms to improve prediction reliability. For both tools, the top three docking models were chosen based on their scoring metrics and interaction energies. The leading models from each tool are illustrated in Fig. 5. ClusPro 2.0 generated multiple docking configurations via a clustering algorithm and evaluated them using multiple energy terms, including electrostatic interactions, hydrophobic effects, and van der Waals forces. Models #1, #2, and #3 (Fig. 5A) were selected for their cluster sizes and energy scores. We identified the crucial residues involved in the interaction, which align with the active-site residues suggested by the ScanNet server. Concurrently, LightDock generated docking models (Models #4, #5, and #6 shown in Fig. 5B) by assessing the proteins’ conformational flexibility during their interaction. The top three models were ranked based on their docking energy scores.

Fig. 5.

Fig. 5

Structural representation of the top three docking models from (A) ClusPro (Models #1 to #3) and (B) LightDock (Models #4 to #6), portraying the interaction between DNA Gyrase and Colicin E9 immunity protein.

Table 2 presents the comprehensive docking results for each model, including components from both ClusPro and LightDock. In particular, the ClusPro weighted score, by means of total interaction energy (E) is represented as a linear combination of different energy terms and expressed by the following expression.

graphic file with name d33e692.gif 1

Table 2.

Summary of docking results for ClusPro and LightDock models.

ClusPro
Model number Cluster Members Representative Weighted score
1 1 361 Center -344.43
Lowest Energy -467.88
2 2 126 Center -364.05
Lowest Energy -430.61
3 3 85 Center -347.55
Lowest Energy -366.14
LightDock
Model number Swarm Glowworm Scoring Top
4 125 186 23.765 1
5 159 169 23.685 2
6 31 132 23.423 3

Each term in this equation represents a specific biophysical interaction at the protein-protein interface, and their relevance is modulated by the weighting coefficients ωn. The weighting coefficients vary depending on the docking mode selected (e.g., Balanced, Electrostatic-favoured, Hydrophobic-favoured). The terms Erep ​and Eattr represent the repulsive and attractive components of the van der Waals potential, respectively, while setting their coefficients (ω1​ and ω2) to values less than 1.0, typically 0.40 in the balanced mode37. The Eelec term represents the electrostatic energy, calculated using a truncated and smoothed Coulombic expression. ClusPro’s balanced scoring scheme uses a significantly higher weight for ω3, specifically 600. The EDARS term represents the pairwise structure-based potential constructed using the Decoys as the Reference State (DARS) approach. DARS is essentially a desolvation potential that models the free energy change associated with the removal of water molecules from the interface upon binding. The DARS term is already scaled to the magnitude of protein-binding free energies, making ω4 = 1.0 the neutral, default weight37.

The results obtained with ClusPro are arranged into clusters, ranked by the number of members that represent docking poses specific to each cluster. In ClusPro, the most reliable prediction is usually the one with the largest member count. Cluster 1 has the highest number of Members (361), significantly more than Cluster 2 (126 Members), with a weighted score of -344.43 for the centre and − 467.88 for the lowest-energy representative, making it the most populated and energetically favourable cluster. It can also be observed that while Cluster 2 has a slightly “better” (lower) Center score (-364.05) than Cluster 1 (-344.43), Cluster 1 is still ranked as Model 1 because its population size is three times larger. In ClusPro, a large cluster indicates a broad, stable energy well, suggesting this docking orientation is the most likely to represent the native state.

Another docking program, LightDock, organizes docking results into swarms and glowworms, representing distinct clustering mechanisms employed in its flexible docking strategy. In LightDock, the score is represented as Luciferin, which is borrowed from the Glowworm Swarm Optimization (GSO) algorithm45. Luciferin encodes the fitness (energetic favourability) of a docking pose. A higher Luciferin value generally indicates a better, more energetically favourable docking pose. The algorithm drives Glowworms (representing potential poses) toward neighbours with higher luciferin levels. The amount of luciferin (l) for a specific Glowworm (i) at a given time step (t + 1) is calculated using the following expression.

graphic file with name d33e776.gif 2

Where, ‘Inline graphic’ is the previous value of luciferin, ‘Inline graphic’ is the decay constant (default: 0.4), which simulates the “fading” of light over time, ‘Inline graphic’ is the enhancement constant (default: 0.6), which determines how much the new energy evaluation affects the score45. Usually, LightDock divides the area around the receptor into hundreds of independent search regions called ‘Swarms’. In the present docking finding, Swarm values (125, 159, and 31) identify the specific geographic starting point on the receptor at which the winning model was discovered. The Glowworm values, found to be 186, 169, and 132, identify the specific point within the swarm that converged on the optimal pose. The Scoring value (top as 23.765) is the numerical representation of the pose’s fitness. Higher values typically indicate more favourable interactions in the LightDock framework. So, overall, each swarm is evaluated based on its score, which indicates the strength and quality of interactions within the predicted complexes. Swarm 1 leads with the highest score of 23.765, making it the top model, followed closely by Swarm 2 at 23.685 and Swarm 3 at 23.423. The quantity of glowworms indicates the sampling density for each swarm, with Swarm 1 having 186 glowworms, the highest amongst all swarms.

Binding energy and stabilising energy calculation

After successfully completing the molecular docking, the binding energies of the top three models from both ClusPro 2.0 and LightDock were analysed. Further, Gibbs free energy change (ΔG) in protein-protein complexes was estimated using the PRODIGY tool. PRODIGY uses structural properties, including the number and type of interfacial contacts, to predict binding affinities. The predicted and experimentally measured binding affinities demonstrated a Pearson correlation coefficient of 0.73. This approach is advantageous because it can accommodate diverse protein complexes while maintaining consistency across both rigid and flexible binding contexts. The computed ΔG values serve as quantitative indicators of binding affinity between protein pairs, suggesting stronger and more stable interactions. The ΔG for all six models is given in Table 3. All models showed high negative ΔG values, indicating a strong association between DNA gyrase and the Colicin E9 immunity protein. Significantly, Model #2 exhibited the highest binding affinity, with a ΔG of -10.80 kcal/mol, closely followed by Model #6, which had a ΔG value of -10.60 kcal/mol.

Table 3.

Binding energy predictions on ClusPro and LightDock complexes.

Tool Model ΔG (kcal/mol)
ClusPro 2.0 Model #1 -7.40
Model #2 -10.80
Model #3 -8.00
LightDock Model #4 -10.20
Model #5 -10.20
Model #6 -10.60

The stabilising energy for each of the six selected models was calculated using the PPCheck tool, with the results presented in Table 4. It is crucial to note that the stabilising energy serves as a fundamental parameter that provides significant insights into the thermodynamic stability of the complexes by assessing the roles of hydrophobic interactions, hydrogen bonds, salt bridges, and van der Waals forces. Among the various models evaluated, Model #6 demonstrated the highest affinity, achieving a stabilising energy of -65.23 kcal/mol, thereby emphasising its classification as the most stable and energetically advantageous configuration. Among the models produced by the ClusPro tool, Model #2 was recognised as the most stable, exhibiting a stabilising energy of -52.82 kcal/mol. Understanding the stability dynamics among complexes is imperative for identifying the most suitable complexes for subsequent MD simulations.

Table 4.

Stabilising energy predictions for the top three docking models from ClusPro 2.0 and LightDock.

Tool Model Energies (kcal/mol)
Electrostatic Van der Waals Total stabilizing
energy
ClusPro 2.0 Model #1 -5.67 -29.97 -35.645
Model #2 -5.62 -47.20 52.820
Model #3 -15.37 -35.03 -50.406
LightDock Model #4 5.42 -46.32 -58.760
Model #5 7.16 -46.71 -56.150
Model #6 -4.33 -43.19 -65.225

Selection of best complexes

The binding positions and interaction patterns of all models were evaluated and compared with the active-site amino acids identified by ScanNet through exploration. It was found that Models #1, #3, #4, and #5 from ClusPro and LightDock either failed to reach the active-site residues or were only partially close to them. Since the above models were unable to interact with the hotspot amino acid residues of DNA gyrase, these models were not considered for further analyses. Moreover, analyses of binding and stabilising energies clearly showed that Models #2 and #6 exhibited the highest affinity and stability between DNA gyrase and the Colicin E9 immunity protein. Thus, although Model #2 (from ClusPro) and Model #6 (from LightDock) produced slightly lower scores in docking analyses, they were still regarded as the best models for binding and stabilising energies, warranting further analysis. The overall interaction pattern in three-dimensional (3D) mode was explored and is shown in Fig. 6, indicating that Model #6 exhibited a larger interaction surface than Model #2, which displayed a more compact, focused interface. As shown in Fig. 6, the structural analysis of both complexes reveals key residues and interaction mechanisms that stabilise the protein-protein binding. A significant number of amino acid residues from both DNA gyrase and the Colicin E9 immunity protein were actively involved in the binding interactions, reinforcing the strong association between these two entities.

Fig. 6.

Fig. 6

Interaction interface visualisation of (A) Model #2, and (B) Model #6.

Protein-protein binding interaction analysis

The investigation of protein-protein interactions aimed to elucidate the critical residues and forces governing the binding interface of the docked complexes. In accordance with assessments of both stabilising energy and binding energy, Models #2 and #6 were designated as appropriate candidates for subsequent MD simulations, given their demonstrated superior stability and heightened binding affinity. To further clarify the interaction interface, the PDBsum tool was utilized to analyse the docked complexes, as illustrated in Fig. 7. The interaction interface of Model #2 was assessed to identify the essential residues and forces that stabilize the interaction. As shown in Fig. 6, the interface is marked by a multifaceted arrangement of hydrogen bonds, salt bridges, disulfide bonds, and non-covalent interactions, all of which are pivotal in reinforcing the overall stability and specificity of the molecular complex. Table 5 describes a meticulously assembled inventory of amino acids derived from both DNA gyrase and the Colicin E9 immunity protein that are critical to the binding interactions characterised in Models #2 and #6. A comprehensive list of binding-interacting amino acids is given in Table S1 (Supplementary file). It is evident that a broad range of amino acids from both DNA gyrase and the Colicin E9 immunity protein are involved in interactions facilitated by hydrogen bonds, ionic interactions, and hydrophobic forces. The results shown in Table 5; Fig. 7 underscore a significant association between DNA gyrase and the Colicin E9 immunity protein, suggesting that this association plays a vital role in the antibacterial efficacy of Colicin E9. These findings highlight that the interplay between the Colicin E9 immunity protein and DNA gyrase not only impedes bacterial growth but may also be further exploited by targeting DNA gyrase. This provides a robust basis for utilising DNA gyrase-dependent pathways in the development of antibacterial strategies. The potential of Colicin E9’s immunity protein to affect bacterial DNA, together with the weaknesses of DNA gyrase, accentuates its significance as an essential factor in antibacterial treatment, optimising DNA gyrase’s critical function in bacterial cell replication and survival.

Fig. 7.

Fig. 7

Representing the key residues by calculating the interaction between DNA Gyrase and Colicin E9 immunity protein. (A) Model #2 and (B) Model #6.

Table 5.

Key interactions at the binding interface of Models #2 and #6.

Interaction type Model #2 Model #6
Hydrogen bonds

GyrMET27-E9ASP60, GyrLYS190-E9ASN245,

GyrLYS190 - E9ALA25 ,GyrARG191-E9TYR54,

GyrARG191 -E9CYS23 ,GyrARG191 -E9ILE53

GyrARG193-E9GLU30, GyrGLU194 -E9TYR54,

GyrASN199 -E9TYR55, GyrTYR219 -E9GLU31

GyrARG24-E9GLU31, GyrGLY26-E9LYS35,

GyrMET27 – E9THR38, GyrGLY30-E9LYS35,

GyrASP47-E9LYS80, GyrGLY104 – E9SER50,

GyrASN199-E9LYS80, GyrARG191-E9SER6,

GyrARG191-E9HIS5, GyrGLU194-E9SER8

Salt bridges GyrASP47-E9LYS57, GyrARG193-E9GLU30

GyrARG24-E9GLU31, GyrASP31-E9LYS35,

GyrASP47-E9LYS80

Non- bonded /

hydrophobic interactions

GyrMET27-E9ASP60, GyrMET27-E9GLY59,

GyrTYR28-E9GLY59, GyrTYR28-E9GLU58,

GyrASP47-E9TYR5 ,GyrASP47-E9LYS57,

GyrASP47-E9ASP51, GyrILE50-E9TYR55,

GyrLY190-E9ASN24, GyrLYS190-E9ALA25,

GyrARG191-E9TYR54, GyrARG191-E9CYS23,

GyrARG193-E9GLU30, GyrGLU194-E9TYR54,

GyrGLU194-E9ALA25, GyrPHE197-E9VAL34,

GyrPHE197-E9GLU30, GyrPHE197-E9GLU31,

GyrLEU198-E9VAL37, GyrLEU198-E9TYR54,

GyrLEU198-E9TYR55, GyrASN199-E9TYR55,

GyrTYR219-E9GLU31

GyrARG24-E9GLU31, GyrGLY26-E9LYS35 ,

GyrMET27-E9THR38, GyrMET27-E9GLU31,

GyrGLY30-E9LYS35, GyrASP31-E9LYS35,

GyrASP47-E9LYS80, GyrASP51-E9PRO47,

GyrLEU54-E9ASN78 , GyrGLY104-E9PRO47,

GyrGLY104-E9GLU41, GyrGLY104-E9SER50,

GyrLYS105-E9GLU42 ,GyrARG191-E9HIS5,

GyrARG191-E9SER6

GyrGLU194-E9SER8, GyrLEU198-E9GLY79,

GyrLEU198-E9SER81, GyrASN199-E9LYS80

Gyr: DNA gyrase; E9: Colicin E9 immunity protein.

Molecular dynamics simulation

One essential technique that greatly advances the understanding of PPIs and their implications for antibacterial efficacy is MD simulation. Additionally, it facilitates exploration of mechanistic insights into the stability and binding affinities of protein-protein complexes. Additionally, MD simulations provide a comprehensive understanding of protein structure and function, which can inform the development of effective antibacterial agents. A detailed atomic-level analysis can uncover potential binding sites and mechanisms for groundbreaking antibacterial agents. This approach is especially crucial given the rise of antibiotic resistance, as it encourages the formulation of novel strategies to combat resistant bacterial strains. To explore the dynamic stability of DNA gyrase and Colicin E9 immunity protein complexes obtained from both ClusPro and LightDock tools, a long-range 300 ns MD simulation was conducted. Several statistical parameters from the MD simulation trajectories were calculated, and these are presented in Fig. 8.

Fig. 8.

Fig. 8

Comprehensive MD simulation analysis of the ClusPro and LightDock complexes over 300 ns. (A) Protein Backbone RMSD, (B) RMSF, (C) Radius of Gyration, (D) Solvent Accessible Surface Area, and (E) Interprotein hydrogen bonds.

Protein backbone RMSD

The protein backbone RMSD plot in Fig. 8A provides a comparative assessment of the structural integrity of the complexes formed between DNA Gyrase and the Colicin E9 immunity protein during a 300 ns MD simulation. RMSD quantifies the positional displacement of backbone atoms over time, thereby indicating structural deviations and equilibration processes in the complexes. The RMSD of DNA Gyrase (Model #2) remains comparatively low and stable during the entirety of the simulation, oscillating between 0.1 and 0.2 nm. In contrast, the RMSD of the Colicin E9 immunity protein (Model #2) exhibited greater fluctuations, attaining approximately 0.35 nm during the initial phases of simulation, particularly between 25 and 55 ns. This observation suggests that the Colicin E9 immunity protein underwent flexibility and conformational alterations before achieving a stable state. Following the initial equilibration, the RMSD stabilises at approximately 0.2 nm, and the protein interface adjusts to form a stable complex. The RMSD of DNA gyrase in Model #2 stays low and consistent throughout the simulation, fluctuating around 0.2 nm. This indicates that DNA gyrase may undergo slight conformational changes upon interaction with the Colicin E9 immunity protein in Model #6. In Model #6, the RMSD of the Colicin E9 immunity protein varies around 0.2 ns during the first 200 nanoseconds. After this initial phase, the deviations become somewhat larger than those in the Model #2 complex, suggesting that the Colicin E9 immunity protein exhibits some flexibility and undergoes conformational adjustments before stabilizing. Both complexes show stable RMSD profiles concerning DNA gyrase. Importantly, the Model #2 complex exhibits a more rigid and stable interaction, while the Model #6 complex permits greater conformational flexibility.

RMSF

The RMSF analysis provides a residue-specific assessment of the dynamic flexibility of protein complexes over the 300 ns MD simulation. RMSF quantifies the average atomic displacement of individual residues relative to their mean positions, thereby improving the identification of regions characterised by flexibility and stable binding interfaces. The amino acid residues of DNA gyrase show relatively low RMSF values in key regions, particularly in Models #2 and #6. This highlights the inherent stability of DNA gyrase during the MD simulation. In contrast, the RMSF for the Colicin E9 immunity protein in Model #6 exceeds 0.7 nm, spanning roughly 60 to 70 residues. It is crucial to emphasise that the maxima for residues 30 to 60 are notably more prominent, particularly within the context of Model #6, in which the RMSF values exceed 0.6 ns. These specific regions correspond to flexible loops and surface-exposed residues that are anticipated to undergo conformational changes during binding. The RMSF analysis reveals that DNA gyrase preserves a consistent structural conformation across both complexes, whereas the Colicin E9 immunity protein exhibits considerable flexibility, thereby enabling dynamic modulation at the binding interface. The pronounced fluctuations observed in Model #6 suggest that this framework encompasses a broader range of conformational sampling, whereas Model #2 exhibits a more rigid, compact binding configuration.

RoG

The RoG analysis quantitatively evaluates the compactness and structural integrity of protein complexes throughout the 300-ns MD simulation, as shown in Fig. 8C. The RoG parameters for DNA gyrase indicate significant stability, with a consistent value of approximately 1.75 nm across the complexes, as indicated by Models #2 and #6. This consistency indicates that DNA gyrase maintains its compact, folded conformation throughout the simulation, with no observable unfolding or conformational changes. The enduring stability of the RoG for DNA gyrase highlights its structural integrity and the minimal impact of interactions with the Colicin E9 immunity protein. On the other hand, the RoG values associated with the Colicin E9 immunity protein demonstrate significant variability, highlighting its intrinsic adaptability. Regarding the Model #2 complex, the RoG attains a stable configuration at approximately 1.25 nm following an initial period characterised by minor oscillations. Conversely, the Model #6 complex shows a modest increase in fluctuations during the preliminary stages of the simulation, especially in the initial 50 ns, indicating that the flexible docking technique enables examination of a wider range of initial conformations. Finally, the RoG converges to approximately 1.25 nm over time, indicating that the Colicin E9 immunity protein adopts a compact, stable structural conformation.

SASA

The solvent exposure analysis and DNA gyrase stability in conjunction with the Colicin E9 immunity protein (Chain B/C) during the 300 ns MD simulation were performed using the SASA parameter, as shown in Fig. 8D. For DNA gyrase, the SASA analysis indicates a stable value of approximately 120 nm² for both Model #2 and Model #6 complexes. This stability indicates that DNA gyrase maintains its compact, stable structure throughout the simulation, with only slight variations in the solvent-exposed regions. These results imply that the interaction with the Colicin E9 immunity protein has minimal effect on DNA gyrase’s overall conformation or stability. In contrast, the Colicin E9 immunity protein (Model #2) exhibits greater fluctuations in SASA than DNA gyrase, highlighting its dynamic nature. Within the Model #2 complex, the SASA stabilises around 60 nm², with only minimal variability. On the other hand, the Model #6 complex shows slightly reduced fluctuations during the initial phase of the simulation, particularly within the first 50 ns, before stabilising at a comparable value of 60 nm².

Hydrogen bond analysis

The hydrogen-bond analysis shows the interaction stability between DNA gyrase and the Colicin E9 immunity protein during the 300 ns MD simulation, as shown in Fig. 8E. In the Model #2 complex, the number of hydrogen bonds remains relatively stable, fluctuating between 6 and 10 bonds for most of the frames, with occasional peaks exceeding 10 bonds. The consistency of hydrogen bonds creates a strong, well-defined interaction interface, with only slight temporal variation. In contrast, the Model #6 complex shows greater variability in the number of hydrogen bonds, ranging from 3 to 10, and we can even notice occasional spikes during specific timeframes, such as around 90–150 ns. These fluctuations suggest that the Model #6 complex undergoes conformational adjustments at the binding interface, allowing for transient interactions and optimising the interaction over time. The Model #2 complex exhibits a more consistent and rigid hydrogen-bond network, reflecting a stable binding configuration that is less susceptible to dynamic fluctuations. Both complexes exhibit interesting peaks in hydrogen-bond counts. However, the peaks in the Model #2 complex are more pronounced, indicating a strong and stable binding interface. On the other hand, the Model #6 complex supports more dynamic interactions, which are useful for modelling various physiological scenarios.

Binding free energy analysis using the MM-GBSA approach and its limitations

The MM-GBSA evaluation (Table 6) determines the binding free energy for the protein-protein complexes, breaking it down into contributions from Electrostatic Energy (EEL), Van der Waals energy, and the total binding free energy (Δtotal). The results highlight notable differences in the interactions exhibited by Models #2 and #6. The Model #2 complex shows a binding free energy of -3.88 kcal/mol, which is a much better result compared to the binding free energy of -1.62 kcal/mol seen in the Model #6 complex. This improved binding energy for Model #2 mainly comes from its strong electrostatic contributions, adding up to 531.01 kcal/mol, along with a significant boost in Van der Waals interactions, measured at -10.70 kcal/mol. These elements suggest tighter packing and improved charge stabilisation at the interface of interaction. In the Model #6 complex, despite its less favourable binding energy of -1.62 kcal/mol, it still shows steady interactions. The electrostatic energy for Model #2 is 473.80 kcal/mol, playing a crucial role in stabilising binding, although it is slightly less effective than in the ClusPro complex. In Model #6, we see that the Van der Waals energy of -6.03 kcal/mol indicates a significant degree of stabilisation from hydrophobic and dispersion forces. Additionally, the standard deviations for both complexes indicate that our energy assessments are quite reliable, with Model #2 showing slightly greater variability in its electrostatic interactions (SD = 261.56 kcal/mol). This may be due to its more rigid docking approach.

Table 6.

Binding energy of Colicin E9 immunity protein towards DNA gyrase using the MM-GBSA approach.

Complex Energies (kcal/mol)
EEL Van der Waals Δtotal
ClusPro Complex (Model #2) 531.01 ± 261.56 -10.70 ± 16.75 -3.88 ± 7.36
LightDock Complex (Model #6) 473.80 ± 108.18 -6.03 ± 14.32 -1.62 ± 7.50

While MM-GBSA data provide a rapid assessment of the driving forces behind the Im9–DNAgyrase interaction, several caveats must be considered in the presented outcomes. Firstly, the reported values represent the effective binding enthalpy. However, to bind two proteins of interest, they must lose significant translational and rotational freedom (entropy loss), which herein does not include the solute configurational entropy, which typically imposes a significant penalty in Im9–DNA gyrase PPI associations. Secondly, the Generalized Born (GB) implicit-solvent model used in the present study might not fully capture the role of structural waters at the interface. Consequently, these values should be interpreted as a qualitative ranking of potential binding modes rather than as an absolute experimental free-energy estimate. Certainly, employing MM-GBSA in the present study neglects the significant configurational entropy loss inherent in PPI complex association; these results likely represent transient encounter states characteristic of the initial recognition phase between the non-cognate protein components and their bacterial targets.

Per residue decomposition energy

The energetic contributions of the active-site amino acid residues were explored using per-residue decomposition energy calculations. The total decomposition energy, along with its components, such as van der Waals and electrostatic energies, for the DNA gyrase active-site amino acid residues, is shown in Fig. 9. It was found that for Model #2 (Fig. 9A), except for one (ASP47), all the binding site amino acid residues, such as MET27, TYR28, ILE50, LEU54, LYS190, ARG191, ARG193, GLU194, LEU195, PHE197, LEU198, ASN199, TYR219, showed negative decomposition energy, which indicates that Colicin Im9 protein favorably exhibits binding affinity towards DNA gyrase through hydrogen bonds or van der Waals interactions. For Model #6 (Fig. 9B), ARG24, GLY26, MET27, GLY30, ASP31, ASP51, LEU54, and LEU198 showed negative per-residue binding energy, while ASP47, GLY104, ARG191, ASN199, and GLU194 exhibited positive decomposition energy. It is important to note that although a few residues had positive decomposition energy, their values were close to zero. Therefore, it is clear that Colicin Im9 showed significant affinity towards DNA gyrase in Model #6.

Fig. 9.

Fig. 9

Per residue decomposition energy of the DNA gyrase binding site residues.

Principal component analysis (PCA)

The PCA plot provides a comparative depiction of the conformational landscapes investigated during the MD simulations of the protein-protein complexes generated by ClusPro and LightDock. PCA effectively distils the intricate multidimensional atomic motions into principal components, thereby furnishing a more robust representation of structural variability and stability by encapsulating the predominant modes of motion throughout the simulation.

In Model No. 2 (Fig. 10A), the PCA plot reveals a broader and more dispersed array of conformations, indicating increased structural variability and a more comprehensive sampling of conformational space throughout the simulation. This extensive distribution indicates that the interaction interface investigated a diverse range of structural states, thereby reflecting more dynamic behaviour relative to the initial rigid-body docking methodology utilised in Model #2. The observed flexibility may be attributed to structural changes that occurred during the MD simulation.

Fig. 10.

Fig. 10

PCA plot showing the conformational space explored during MD simulations for (A) Model #2 and (B) Model #6.

The Model #6 complex illustrated in Fig. 10B exhibits a more compact and systematically structured conformation, indicating that structural variations were successfully minimised during the simulation. In a later stage of the MD simulation, it was observed that the contact interface became well stabilised, with only slight structural changes, as evidenced by denser aggregation. This observation unites nicely with Model #6’s flexible docking method, which enabled initial adjustments that led to a highly stable binding configuration. The differences observed between the two plots illustrate the relationship between structural flexibility and stability. The expansive distribution seen in Model #2 reflects enhanced conformational exploration, whereas the tighter clustering in Model #6 suggests improved stability and a convergence toward a favoured binding state. These outcomes emphasise the complementary features of the two docking techniques, with Model #2 illustrating a more dynamic interaction framework and Model #6 highlighting structural stabilisation within protein-protein interactions.

Overall, the PCA plots reveal distinct conformational behaviors in the two protein-protein complexes during 300-ns MD simulations, directly correlating with their binding stabilities. In Model #2, the broader and more dispersed PCA distribution indicates greater structural variability and extensive sampling of conformational space, reflecting a more dynamic interaction interface with increased flexibility. In contrast, Model #6 showed a more compact, systematically clustered PCA plot, indicating minimized structural variations, successful convergence toward a favored binding state, and improved overall stabilization at the interface after initial adjustments. This tighter clustering in Model #6, with a stabilizing energy of -65.23 kcal/mol, aligns with its superior long-term binding stability compared to the more flexible Model #2, despite Model #2’s slightly better MM-GBSA binding free energy. These PCA patterns thus complement energetic analyses by illustrating how conformational exploration influences complex rigidity and persistence.

Conclusion

This research presents a comprehensive computational analysis of the interactions between the Colicin E9 immunity protein and DNA Gyrase in E. coli and P. aeruginosa. It reveals critical aspects, including structural stability, binding kinetics, and fundamental interaction mechanisms. Using molecular docking, MD simulations, and binding energy evaluations, significant insights into the properties of the protein-protein interface and the stability of the resulting complexes were obtained. Critical interacting residues at the binding interface were identified, which include HIS40, MET27, LYS105, GLU44, ASP47, ASN199, LEU198, ARG191, and GLU194, all of which were pivotal in stabilising the interaction through hydrogen bonding, electrostatic interactions, and van der Waals forces. These residues created a precisely defined interaction network essential for inhibiting DNA Gyrase by the Colicin E9 immunity protein. The examination of hydrogen bonding revealed that both docking approaches, ClusPro and LightDock, yielded stable complexes, characterised by a significant number of hydrogen bonds that were consistently maintained throughout the simulation period. The complex denoted as Model #2 exhibited a greater and more stable number of hydrogen bonds, ranging from six to ten, thereby highlighting its rigid, well-defined interaction interface. Conversely, the complex associated with Model #6 displayed considerable variability, with hydrogen-bond counts oscillating between 3 and 10, indicating a more dynamic interaction interface capable of accommodating conformational alterations. Furthermore, the evaluation of binding free energy performed through MMGBSA substantiated Model #2 as the more stable of the two complexes, demonstrating a more favourable binding free energy (Δtotal = -3.88 kcal/mol) in comparison to the LightDock complex (Δtotal = -1.62 kcal/mol). This stability has been attributed to the increased electrostatic and van der Waals interactions observed in the ClusPro complex. Nevertheless, the Model #6 complex demonstrated the capability to capture a broader conformational space, thereby underscoring its flexibility in modelling dynamic interactions. Taken together, as Im9 neutralizes the nuclease activity of Colicin E9 to protect the host cell, as demonstrated in many different literatures, its high-affinity binding to DNA gyrase could suggest a conserved inhibitory mechanism that ‘silences’ the enzyme’s function. Such consequences may be supported by the observation of a consistent presence of 6 to 10 interprotein hydrogen bonds throughout the 300 ns simulation, which might indicate a rigid, well-defined interface. This stability suggests that once the Im9 protein engages with DNA gyrase, it forms a persistent complex capable of long-term enzymatic suppression. In conclusion, the findings of this study suggest that the thoroughly analysed protein complex formed between the Colicin E9 immunity protein and DNA gyrase represents a stable docking configuration, thus providing a solid foundation for subsequent experimental validation and the potential development of novel antimicrobial agents specifically targeting DNA gyrase in the context of P. aeruginosa infections.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1 (194KB, docx)

Acknowledgements

The authors wish to thank the College of Pharmacy at King Saud University in Riyadh, Saudi Arabia, for providing the facilities necessary for this research. The authors would also like to extend their appreciation to the Deanship of Scientific Research at King Saud University for funding this work through the ISPP Program (ISPP25-4).

Author contributions

Rihaf Alfaraj: Writing – review & editing, Writing – original draft, Supervision, Methodology, Investigation, Formal analysis, Conceptualisation. Fai Alkathiri: Writing – review & editing, Writing – original draft, Supervision, Investigation, Funding acquisition, Data curation, Conceptualisation. Rupesh Chikhale - Writing – review & editing, Writing – original draft, Supervision, Investigation, Funding acquisition, Data curation, Conceptualisation.

Funding

This research project was supported by the International Scientific Partnership Program ISPP (Grant ID: ISPP25-4), King Saud University, Riyadh, Saudi Arabia.

Data availability

The data supporting the findings of this study are available within the article.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Rihaf Alfaraj, Email: ralfaraj@ksu.edu.sa.

Rupesh Chikhale, Email: R.Chikhale@ucl.ac.uk.

References

  • 1.Westermarck, J., Ivaska, J. & Corthals, G. L. Identification of protein interactions involved in cellular signaling. Mol. Cell. Proteomics. 12, 1752–1763 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Chakrabarty, S., Wang, S., Roychowdhury, T., Ginsberg, S. D. & Chiosis, G. Introducing dysfunctional Protein-Protein Interactome (dfPPI) – a platform for systems-level protein-protein interaction (PPI) dysfunction investigation in disease. Curr. Opin. Struct. Biol.88, 102886 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Liu, X., Abad, L., Chatterjee, L., Cristea, I. M. & Varjosalo, M. Mapping protein–protein interactions by mass spectrometry. Mass. Spectrom. Rev.45, 69–106 (2026). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Fry, D. C. Targeting protein-protein interactions for drug discovery. Methods Mol. Biol.1278, 93–106 (2015). [DOI] [PubMed] [Google Scholar]
  • 5.Farooq, Q. U. A., Shaukat, Z., Aiman, S. & Li, C. H. Protein-protein interactions: methods, databases, and applications in virus-host study. World J. Virol.10, 288–300 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Darbandi, A. et al. Bacteriocins: properties and potential use as antimicrobials. J. Clin. Lab. Anal.36, e24093 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Sugrue, I., Ross, R. P. & Hill, C. Bacteriocin diversity, function, discovery and application as antimicrobials. Nat. Rev. Microbiol.22, 556–571 (2024). [DOI] [PMC free article] [PubMed]
  • 8.Arbulu, S. & Kjos, M. Revisiting the multifaceted roles of bacteriocins. Microb. Ecol.87, 1–14 (2024). [DOI] [PMC free article] [PubMed]
  • 9.Solis-Balandra, M. A. & Sanchez-Salas, J. L. Classification and multi-functional use of bacteriocins in health, biotechnology, and food industry. Antibiotics13, 666 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Li, W., Dennis, C. A., Moore, G. R., James, R. & Kleanthous, C. Protein-protein interaction specificity of Im9 for the endonuclease toxin colicin E9 defined by homologue-scanning mutagenesis. J. Biol. Chem.272, 22253–22258 (1997). [DOI] [PubMed] [Google Scholar]
  • 11.Robinson, R. A. Small force remodels a large protein to unleash its deadly potential. PLoS Biol.11, e1001486 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Cascales, E. et al. Colicin biology. Microbiol. Mol. Biol. Rev.71, 158–229 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wallis, R., Moore, G. R., James, R. & Kleanthous, C. Protein-protein interactions in colicin E9 DNase-immunity protein complexes. 1. Diffusion-controlled association and femtomolar binding for the cognate complex. Biochemistry34, 13743–13750 (1995). [DOI] [PubMed] [Google Scholar]
  • 14.Marković, K. G. et al. Colicins and microcins produced by enterobacteriaceae: characterization, mode of action, and putative applications. Int. J. Environ. Res. Public. Health19, 11825 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Keeble, A. H. et al. Experimental and computational analyses of the energetic basis for dual recognition of immunity proteins by Colicin endonucleases. J. Mol. Biol.379, 745–759 (2008). [DOI] [PubMed] [Google Scholar]
  • 16.Bayramoglu, B. et al. Bet-hedging in bacteriocin producing Escherichia coli populations: the single cell perspective. Sci. Rep.7, 1–10 (2017). [DOI] [PMC free article] [PubMed]
  • 17.Ambade, S. S., Gupta, V. K., Bhole, R. P., Khedekar, P. B. & Chikhale, R. V. A review on five and six-membered heterocyclic compounds targeting the penicillin-binding protein 2 (PBP2A) of methicillin-resistant Staphylococcus aureus (MRSA). Molecules . 28, 7008 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Dighe, S. N. & Collet, T. A. Recent advances in DNA gyrase-targeted antimicrobial agents. Eur. J. Med. Chem.199, 112326 (2020). [DOI] [PubMed] [Google Scholar]
  • 19.Vakser, I. A. Protein-protein docking: from interaction to interactome. Biophys. J.107, 1785 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Huang, S. Y. Exploring the potential of global protein–protein docking: an overview and critical assessment of current programs for automatic ab initio docking. Drug Discov Today. 20, 969–977 (2015). [DOI] [PubMed] [Google Scholar]
  • 21.Ghosh, I. et al. TLK1-mediated RAD54 phosphorylation spatio-temporally regulates Homologous Recombination Repair. Nucleic Acids Res.51, 8643–8662 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Burley, S. K. et al. Protein Data Bank: the single global archive for 3D macromolecular structure data. Nucleic Acids Res.47, D520–D528 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Chang, Y. et al. A guide to in silico drug design. Pharmaceutics15, 49 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.The DNA Gyrase B ATP binding domain of Pseudomonas aeruginosa in complex with compound 12o. https://doi.org/10.2210/PDB6M1S/PDB (2020).
  • 25.Hu, Y. et al. Discovery of pyrido[2,3- b]indole derivatives with gram-negative activity targeting both DNA gyrase and topoisomerase IV. J. Med. Chem.63, 9623–9649 (2020). [DOI] [PubMed] [Google Scholar]
  • 26.Klein, A. et al. Structural and biophysical analysis of nuclease protein antibiotics. Biochem. J.473, 2799–2812 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Crystal structure of Colicin E9 in complex with Its immunity protein Im9. 10.2210/PDB5EW5/PDB (2016).
  • 28.PyMOL. | pymol.org. https://pymol.org/2/ (2025).
  • 29.Muhammed, M. T. & Aki-Yalcin, E. Homology modeling in drug discovery: overview, current applications, and future perspectives. Chem. Biol. Drug Des.93, 12–20 (2019). [DOI] [PubMed] [Google Scholar]
  • 30.Protein Structure Validation using SAVES V6. .0 | Download Scientific Diagram. https://www.researchgate.net/figure/Protein-Structure-Validation-using-SAVES-V60_tbl1_377099795 (2025).
  • 31.Van Der Spoel, D. et al. Fast, flexible, and free. J. Comput. Chem. 26, 1701–1718 (2005). [DOI] [PubMed] [Google Scholar]
  • 32.Abraham, M. J. et al. High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX1–2, 19–25 (2015).
  • 33.Croitoru, A. et al. Additive CHARMM36 force field for nonstandard amino acids. J. Chem. Theory Comput.17, 3554–3570 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Mark, P. & Nilsson, L. Structure and dynamics of the TIP3P, SPC, and SPC/E water models at 298 K. J. Phys. Chem. A. 105, 9954–9960 (2001). [Google Scholar]
  • 35.Liao, J., Wang, Q., Wu, F. & Huang, Z. Silico methods for identification of potential active sites of therapeutic targets. Molecules27, 7103 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Tubiana, J., Schneidman-Duhovny, D. & Wolfson, H. J. ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction. Nat. Methods19, 730–739 (2022). [DOI] [PubMed]
  • 37.Kozakov, D. et al. The ClusPro web server for protein–protein docking. Nat. Protocols 12, 255–278 (2017). [DOI] [PMC free article] [PubMed]
  • 38.Roel-Touris, J. & Bonvin, A. M. J. J. LightDock goes information-driven. Bioinformatics36, 950–952 (2020). [DOI] [PMC free article] [PubMed]
  • 39.Xue, L. C., Rodrigues, J. P., Kastritis, P. L., Bonvin, A. M. & Vangone, A. PRODIGY: a web server for predicting the binding affinity of protein–protein complexes. Bioinformatics32, 3676–3678 (2016). [DOI] [PubMed]
  • 40.Genheden, S. & Ryde, U. The MM/PBSA and MM/GBSA methods to estimate ligand-binding affinities. Expert Opin. Drug Discov.10, 449–461 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Godschalk, F., Genheden, S., Söderhjelm, P. & Ryde, U. Comparison of MM/GBSA calculations based on explicit and implicit solvent simulations. Phys. Chem. Chem. Phys.15, 7731–7739 (2013). [DOI] [PubMed] [Google Scholar]
  • 42.Hou, T., Wang, J., Li, Y. & Wang, W. Assessing the performance of the MM/PBSA and MM/GBSA methods. 1. The accuracy of binding free energy calculations based on molecular dynamics simulations. J. Chem. Inf. Model.51, 69–82 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Valdés-Tresanco, M. S., Valdés-Tresanco, M. E., Valiente, P. A. & Moreno, E. Gmx_MMPBSA: a new tool to perform end-state free energy calculations with GROMACS. J. Chem. Theory Comput.17, 6281–6291 (2021). [DOI] [PubMed]
  • 44.Waterhouse, A. et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res.46, W296–W303 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Jiménez-García, B. et al. LightDock: a new multi-scale approach to protein–protein docking. Bioinformatics34, 49–55 (2018). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1 (194KB, docx)

Data Availability Statement

The data supporting the findings of this study are available within the article.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES