Skip to main content
Frontiers in Microbiology logoLink to Frontiers in Microbiology
. 2026 Apr 7;17:1798540. doi: 10.3389/fmicb.2026.1798540

A metal-aware library-expansion and virtual-screening workflow for Pseudomonas aeruginosa ATCC 15692 (PAO1) MraY identifies compound 5311309 with stable binding and mid-micromolar activity

Tao Shen 1, Lichen Zhang 1, Rui Chen 2, Shengnan Jia 3, He Zhengfu 1,*
PMCID: PMC13095821  PMID: 42022127

Abstract

Introduction

Pseudomonas aeruginosa lung infections are difficult to eradicate because of biofilms and rising resistance. MraY (MRAY_PSEAE), an essential membrane enzyme in peptidoglycan biosynthesis, depends on divalent metals (notably Mg2+); therefore, metal coordination is critical for reliable structure-based prioritization.

Methods

We built an explicit protein–ligand–metal model of MRAY_PSEAE and extracted pocket constraints from point-cloud geometry and polar/charged features. Candidates were generated through similarity-guided library expansion, combining PubChem 2D similarity retrieval with local rule-based BRICS enumeration. After canonicalization and 2D/property filtering, compounds were clustered by fingerprint similarity and prioritized through chemical-state plausibility, docking, Prime/MM-GBSA rescoring, and ligand strain filtering, followed by interaction analysis, 1-μs MD, and in silico ADMET.

Results

Three hits (6675, 2733768, 5311309) were prioritized. In microsecond MD, compound 5311309 showed the most stable pose and key hotspot retention. ADMET indicated a permeability/efflux bottleneck along with liver–kidney and irritation/sensitization alerts, whereas hERG and AMES signals were favorable. In growth assays, compound 5311309 showed an IC50 of approximately 15.1 μM and an MIC of 64 μM.

Conclusion

Metal-aware pocket modeling plus multi-level screening improved plausibility and ranking for this metal-dependent target, identifying compound 5311309 as a practical lead optimization starting point.

Keywords: 5311309, ADMET, Mg2+/metal site, MM-GBSA, molecular dynamics, MraY (MRAY_PSEAE), Pseudomonas aeruginosa ATCC 15692 (PAO1), structure-based drug design

1. Introduction

Pseudomonas aeruginosa is a leading cause of severe and persistent lung infections, ranging from ventilator-associated pneumonia in hospitalized patients to chronic airway colonization in cystic fibrosis (CF) (Qin et al., 2022). A hallmark of P. aeruginosa pulmonary disease is its propensity to form multicellular biofilms, which restrict antibiotic penetration, promote tolerant subpopulations, and complicate immune clearance—collectively making eradication difficult and relapse common (Moreau-Marquis et al., 2008; Oluyombo et al., 2019). The therapeutic challenge is amplified by accelerating antimicrobial resistance (AMR) (Ferrara et al., 2024). In its 2024 update of the Bacterial Priority Pathogens List (BPPL), the World Health Organization (WHO) continues to highlight carbapenem-resistant P. aeruginosa as a high-priority AMR threat, underscoring the ongoing need for new anti-infectives and non-cross-resistant mechanisms. These realities motivate renewed efforts to identify essential bacterial targets that are distinct from those addressed by existing antibiotic classes and that can enable rational, structure-guided inhibitor discovery.

Bacterial cell-wall biogenesis—particularly peptidoglycan assembly—remains one of the most validated, selective, and clinically actionable antibacterial pathways (Torrens and Cava, 2024). Within this pathway, phospho-MurNAc-pentapeptide translocase MraY catalyzes the first membrane-committed step: transfer of the phospho-MurNAc-pentapeptide moiety from UDP-MurNAc-pentapeptide to the lipid carrier undecaprenyl phosphate (C55-P), producing lipid I, an indispensable precursor for downstream peptidoglycan synthesis (Liu et al., 2016). In the UniProt nomenclature for P. aeruginosa, this enzyme is often referred to as MRAY_PSEAE (e.g., associated with accession Q9HVZ8 in curated listings). Because MraY operates at the interface of cytosolic nucleotide chemistry and membrane-embedded lipid processing, its inhibition can collapse cell-wall construction at an early choke point, offering strong potential for bactericidal intervention.

Importantly, MraY catalysis and substrate handling are tightly coupled to divalent metal ions, most notably Mg2+, which participate in stabilizing highly charged phosphate groups during transfer chemistry (Cao et al., 2025). A mechanistic and chemical–biology synthesis of the field explicitly describes MraY as performing the C55-P transfer in the presence of an Mg2+ cofactor, linking metal coordination directly to productive catalysis and inhibitor binding energetics (Nakaya et al., 2022). Consistent with this finding, multiple classes of nucleoside natural products (e.g., muraymycins, mureidomycins, liposidomycins/caprazamycins, and tunicamycins) have validated MraY as druggable; however, translating these complex scaffolds into clinically viable agents remains challenging (Mashalidis et al., 2019). For Gram-negative pathogens such as P. aeruginosa, additional constraints—including outer-membrane penetration, efflux, and the need to maintain potency in the membrane-proximal catalytic site—further increase the challenegesacc for successful lead discovery (Saxena et al., 2023).

Computational structure-based drug design (SBDD) provides a scalable route to explore chemical space, but metal-containing active sites remain a recurring failure mode for conventional docking and scoring (Xu et al., 2024). Metalloprotein docking is frequently complicated by three-body interactions among residues, metal ions, and ligands, along with strict coordination geometry requirements that are not always captured by standard force fields or empirical scoring functions (Wang et al., 2020). The community response has included specialized metal-aware docking and modeling approaches, reflecting a broad recognition that explicit treatment of metal coordination can substantially improve pose fidelity and screening utility (Zhang et al., 2026). For MraY—where a Mg2+-stabilized, phosphate-rich catalytic environment is central—neglecting metal physics can lead to distorted binding hypotheses and unreliable rank ordering, ultimately undermining downstream synthesis prioritization.

In parallel, deep generative models have rapidly evolved from 2D de novo design tools into 3D, pocket-conditioned frameworks capable of proposing ligand geometries consistent with target binding sites. Comprehensive reviews have documented how VAEs, GANs, autoregressive transformers, and diffusion models can explore vast chemical space while optimizing multi-objective criteria relevant to drug discovery (Cheng et al., 2021). More recently, equivariant diffusion models have demonstrated practical structure-based generation conditioned directly on protein pockets, enabling iterative generate–score–refine workflows in 3D (Schneuing et al., 2024). However, for metal-dependent targets, naïve pocket conditioning is often insufficient: the model must learn—or be constrained to respect—the coordination number, preferred donor atom types, and geometry-dependent energetics that govern Mg2+-mediated recognition.

Against this backdrop, this study focuses on MRAY_PSEAE (MraY) as a high-value antibacterial target for P. aeruginosa ATCC 15692 (PAO1) and establishes a metal-aware, structure-based candidate discovery workflow rather than a learned de novo generative model. The pocket metal ion is treated as a first-class structural feature by explicitly encoding the Mg2+ position and its coordination context and by coupling candidate sourcing (PubChem similarity retrieval plus BRICS rule-based enumeration) with metal-sensitive filtering and scoring to maintain physically plausible binding hypotheses. This framework is intended to provide synthetically actionable MraY inhibitor candidates and to serve as a practical template for metal-informed structure-based screening.

2. Methods

2.1. Protein–ligand complex input and ligand identification

The reference complex model was obtained from AlphaFill (Q9HVZ8). In this run, the model MRAY_PSEAE was used. The bound ligand residue name was either specified manually or inferred automatically as the non-protein, non-water, non-metal residue with the largest number of heavy atoms (minimum five heavy atoms).

2.2. Pocket definition and metal ion handling

The binding pocket was defined as all standard amino-acid residues with any heavy atom within 6.0 Å of any ligand heavy atom. All pocket heavy-atom coordinates were retained as a point cloud for fast distance-based interaction counting. Pocket donor and acceptor point sets were assigned using lightweight, residue-aware heuristics (e.g., Lys/Arg/His N atoms as donors; Asp/Glu O atoms as acceptors; Ser/Thr/Tyr O atoms as donors/acceptors). These descriptors were used for interaction summarization and downstream ranking rather than as docking-grade hydrogen-bond constraints.

Metal ions were identified as single-atom (non-water and non-amino-acid) residues whose element belonged to a curated set (e.g., Zn, Mg, Fe, Mn, Cu, Co, Ni, Ca, Na, and K) and whose distance to the ligand was within max(cutoff + 2 Å, 6 Å), i.e., 8 Å for a 6 Å pocket cutoff. Each metal was assigned a nominal charge (e.g., Ni2 + and Mg2+). In MRAY_PSEAE.pdb, three proximal metal ions (2 × Ni and 1 × Mg) were detected and retained during protein/receptor preparation, and the docking grid was generated with these ions present. Glide docking was then performed using standard settings, without additional metal-specific coordination constraints or custom metal-scoring terms (the screened ligands were metal-free organic molecules). Accordingly, “metal-aware” in this study refers to explicit metal-site representation in receptor preparation and downstream metal-aware scoring/analysis, rather than the use of a specialized Glide metal-coordination protocol.

2.3. PubChem similarity retrieval

If enabled, a 2D similarity search was performed via PubChem PUG REST (fastsimilarity_2d) using the reference SMILES, with a similarity threshold of 30/100 and a maximum of 1,000 records returned. Retrieved CIDs were converted to SMILES, canonicalized, filtered to single-fragment molecules, and deduplicated.

2.4. Local fragment-based enumeration (BRICS)

Two RDKit BRICS-based generators were applied. First, a single BRICS bond replacement can be achieved by substituting one fragment for an R-group library derived from diverse seed fragments. Second, BRICS recombination involves reassembling 2–3 fragments sampled from the reference BRICS fragments plus a small BRICS-labeled fragment pool. Candidates were sanitized, canonicalized, filtered into single-fragment molecules, and deduplicated.

2.5. Molecular similarity and property filtering

For each candidate SMILES, 2D similarity to the reference was computed as the maximum Tanimoto similarity across three fingerprints: Morgan fingerprint (radius 2, 2048 bits), RDKit topological fingerprint, and MACCS keys. Candidates with 2D similarity of < 0.02 were discarded.

Physicochemical properties were computed in RDKit, including molecular weight, logP, TPSA, HBD, HBA, rotatable bonds, ring count, and QED. Additional structural filters were applied: molecular weight 200–600 Da, logP −0.2–5.5, TPSA ≤ 160 Å2, minimum hetero atoms of ≥ 4, minimum hetero element types of ≥ 1, and a minimum functional-group count of ≥ 3 (functional groups were counted via SMARTS patterns). Lipinski rule-of-five violations were counted using standard thresholds (MW > 500, logP > 5, HBD > 5, HBA > 10); candidates with more than two rule-of-five violations were discarded. Candidates with QED < 0.30 were discarded.

2.6. 3D conformer generation and pose alignment

For candidates passing the 2D and property filters, 3D conformers were generated with RDKit ETKDGv3 (random seed = 0), followed by UFF geometry optimization (up to 200 iterations). Candidate conformers were aligned to the bound reference ligand pose using RDKit alignment (AlignMol) with an atom mapping derived either from an exact substructure match (fast path) or from an MCS between the candidate and the reference; at least three mapped atoms were required. The aligned RMSD was recorded as an auxiliary output.

2.7. Sequence alignment and secondary structure annotation

When comparing homologous proteins, pairwise global alignment was performed using Biopython’s PairwiseAligner (match score = 1, mismatch score = −0.5, gap open penalty = −10, gap extension penalty = −0.5). Secondary structure was assigned from DSSP when available (mkdssp or dssp); otherwise, a phi/psi-based fallback classified residues into helix (H), strand (E), or coil (C). Alignment plots were rendered with Matplotlib and exported at 600 dpi in an ESPript-like style for manuscript figures.

2.8. Pose and protonation-state handling for docking

During docking, alternative ligand protonation/tautomeric states were handled using the Schrödinger state-penalty workflow, and multiple poses were sampled for each state. For downstream ranking, compounds with state penalty > 0 were excluded, and one representative best-ranked pose per retained ligand was propagated to MM-GBSA, ligand strain analysis, and MD stimulations.

2.9. Molecular dynamics simulations in a membrane environment

MD simulations were performed in Desmond 2025–11 using the OPLS5 force field. Protein–ligand complexes were embedded in a POPC membrane, solvated with TIP3P water, neutralized, and set to 0.15 M KCl. After Desmond default membrane relaxation, 1.0 μs production runs were carried out in the NPT ensemble at 300 K and 1 bar (Nosé–Hoover thermostat; semi-isotropic MTK barostat), with PME electrostatics and a 9.0 Å short-range cutoff. Hydrogen-involving bonds were constrained using M-SHAKE, and trajectories were integrated with a 2-fs time step. No explicit distance/angle restraints were applied to enforce Mg2+/Ni2+ coordination. No explicit distance or angle restraints were applied to enforce Mg2+/Ni2+ coordination during production MD. Metal-associated interactions were inspected from trajectory interaction analyses; however, explicit metal–ligand distance/angle cutoffs were not used as hard selection criteria in the current study.

2.10. Concentration-dependent growth inhibition assay and IC50/MIC determination

Growth inhibition assays were performed using Pseudomonas aeruginosa ATCC 15692 (PAO1) in 96-well plates. Overnight cultures were diluted to a standardized starting inoculum and treated with a log-spaced concentration series at constant final dimethyl sulfoxide (DMSO; vehicle control), with five replicate wells per concentration. Endpoint OD600 was measured, background-corrected by subtracting the mean media-only blank signal, and then normalized to the blank-corrected vehicle control. High-concentration wells were visually inspected for precipitation; wells with visible precipitates were excluded from IC50 fitting. IC50 was obtained by four-parameter logistic fitting of log10 concentration–response data. Minimum inhibitory concentration (MIC) was defined as the lowest concentration with background-corrected OD600 of ≤ 0.05 and no detectable growth in all 5/5 replicates. This operational MIC readout was derived under our in-house assay conditions and does not represent a formal CLSI/EUCAST reference MIC determination.

3. Results and discussion

3.1. Structural characterization of the binding pocket and metal-site constraints for candidate screening

Motivated by the need to design metal-aware inhibitors for MRAY_PSEAE, we carried out multi-level characterization of the ligand-binding pocket to derive computable constraints for structure-conditioned generation and screening. We prioritized constraints that jointly capture spatial occupancy, polar and charged anchoring, and the local physicochemical environment surrounding the metal site, as these factors are expected to govern both posing feasibility and competitiveness against the reference binding mode.

We first constructed a protein–ligand–metal complex model for MRAY_PSEAE using AlphaFill (Figure 1A). AlphaFill enriches predicted protein models by transplanting missing ligands, cofactors, and metal ions from experimentally determined structures based on sequence and structural similarity, thereby providing a practical starting point that preserves the small-molecule and ion context for downstream modeling (Hekkelman et al., 2023). In the transferred model, both Mg2+ and Ni2+ were present. Consistent with the original MraY structural report (Chung, 2013 #582), we treated Mg2+ as the catalytically relevant divalent ion, whereas Ni2+ was considered crystallization-associated (loop E/HHH motif) and not a confirmed physiological cofactor. Building on this complex, we computed and visualized a ligand-associated electron density map within the pocket (Figure 1B). Electron density is routinely used in structural workflows to evaluate whether the data support ligand placement and to assess pose plausibility; thus, the density map provides an interpretable spatial prior for regions most compatible with ligand occupancy. A limitation is that explicit metal-coordination energetics were not parameterized during glide docking, which may reduce sensitivity to coordination-specific binding effects.

Figure 1.

Panel A shows a ribbon illustration of a protein structure with colored alpha helices and a bound ligand represented as a stick model. Panel B presents a close-up of the ligand-binding site, with atoms, hydrogen bonds, and surrounding protein environment highlighted. Panel C contains a three-dimensional scatter plot labeling pocket atoms, ligand atoms, and metal ions using different colored dots. Panel D displays a radar chart graphing protein pocket properties, including axes for hydrophobic, polar, positive, negative, aromatic, and other characteristics.

Multi-level characterization of the MRAY_PSEAE ligand binding pocket and metal site constraints. (A) Overall MRAY_PSEAE ligand metal ion complex model from AlphaFill. (B) Electron density map of the ligand in the pocket. (C) Pocket ligand point cloud with metal ions and interaction types. (D) Radar chart of pocket residue composition and ligand proximity.

To simplify the pocket description while retaining interaction-critical information, we represented the binding site as a pocket-ligand point cloud with explicit inclusion of metal ions (Figure 1C). This representation encodes key interaction types as discrete points and is consistent with established pocket comparison and modern 3D pocket modeling practices that treat pockets as atom clouds or point clouds (Hoffmann et al., 2010). The pocket scale comprised 279 atomic points, 36 donor points, 41 acceptor points, and 54 hydrophobic points, indicating that the site requires substantial hydrogen-bond complementarity together with hydrophobic cavity filling. The transferred metal neighborhood contained one Mg2+ site and two Ni2+ sites. The nearest ligand-to-metal distance was 0.86 Å, indicating an extremely close metal neighborhood likely to impose strong geometric and electrostatic constraints; however, given the likely crystallization origin of Ni2+, we did not treat Ni2+-specific chelation as a physiological requirement during ranking. In practice, candidates were prioritized based on metal-proximal geometric feasibility and electrostatic complementarity centered on the Mg2+-relevant catalytic region, while poses that relied primarily on Ni2+-specific contacts were interpreted conservatively.

Finally, Figure 1D plots metal electrostatic similarity against total score and shows a near-zero correlation (Pearson r = −0.080). This likely reflects the narrow dynamic range of metal_electrostatic_sim relative to pocket_score, causing the metal term to act as a modest adjustment rather than a primary ranking driver. Accordingly, uncertainty in Ni™ assignment is unlikely to be a major determinant of final ranking in the current workflow. If metal coordination and electronic effects are intended to be decisive, discriminability and interpretability could be improved by increasing the weight of the metal term, introducing explicit coordination-geometry constraints (donor atom type, distance, and angle), and adopting a more rigorous electrostatic treatment at the metal site where feasible. Future studies will include sensitivity analyses comparing Mg-only and Mg + Ni receptor models, with explicit monitoring of coordination distances and angles.

3.2. Candidate sourcing, 2D similarity control, and multi-objective prioritization

To maintain practical chemical accessibility while preserving comparability to the reference ligand, the candidate set was sourced primarily through PubChem 2D similarity retrieval. This strategy intentionally focused exploration on near-neighbor chemistry likely to be purchasable or synthetically tractable while accepting reduced scaffold novelty. Because PubChem similarity is fingerprint-based, it captures 2D neighborhood relationships but does not guarantee compatibility with the 3D, metal-proximal MRAY_PSEAE pocket.

As shown in Figure 2A, selected compounds were concentrated in a mid-similarity regime (0.322–0.731; median 0.478; IQR 0.435–0.524), consistent with thresholded analog retrieval. Functional-group profiling (Figure 2B) yielded counts of 3–13 (median 6; p90 10), with enrichment of hydroxyl/ether and carbonyl-related motifs. This pattern indicates that the retained set is chemically multifunctional rather than dominated by overly simple hydrophobic scaffolds.

Figure 2.

Panel A shows a histogram with a blue density curve of 2D similarity to a reference across multiple samples, peaking around 0.5 similarity. Panel B is a bar chart of functional group matches, with most counts between three and seven matches. Panel C is a grouped bar chart summarizing pocket interactions for the top twenty molecules; blue bars represent pocket contacts, orange bars are hydrogen bonds, and green bars are pocket clashes. Panel D displays a scatter plot of pocket score versus drug-likeness (QED), showing scattered points without a strong trend. Panel E is a scatter plot of total score versus metal electrostatic similarity, indicating a positive correlation.

Candidate source similarity constraints and multi-objective selection signals. (A) Distribution of 2D similarity to the reference ligand. (B) Distribution of functional group SMARTS match counts and overall functional group composition. (C) Interaction summary of top 20 molecules with pocket contacts, pocket H-bonds, and pocket clashes. (D) QED versus pocket score and the Pareto frontier used for shortlist selection. (E) Metal electrostatic similarity versus total score.

In the top 20 subset (Figure 2C), pocket contacts were the main contributor to ranking (34–72; median 45.5), whereas hydrogen bonds were relatively sparse (0–4; median 1). Steric clashes ranged from 0 to 11 (median 4.5), and pocket_score spanned 33–53 (median 36). These data indicate that shape/surface complementarity dominated score separation under the current setup, with hydrogen-bond terms providing secondary refinement.

Joint evaluation of pocket_score and QED (Figure 2D) showed no meaningful correlation (Pearson r = 0.021), supporting Pareto-based multi-objective triage rather than forced early collapse into a single scalar metric. Metal electrostatic similarity was also weakly associated with total score (Figure 2E; Pearson r = −0.080), suggesting that, in the current weighting scheme, the metal term acted mainly as a modest modifier. Consistent with this finding, uncertainty in Ni2+ assignment is unlikely to be the dominant driver of final ranking; future studies will include Mg2+-only versus Mg2++Ni2+ sensitivity analysis with explicit coordination-distance/angle monitoring.

3.3. Chemical-space compression using fingerprint clustering and tiered docking/energy screening

To reduce redundancy while preserving representative coverage of the 270-compound set, we performed Canvas fingerprint-similarity hierarchical clustering (Figures 3A,B). The Kelley criterion selected 17 clusters with clustering strain = 1.263, and the reordered similarity matrix showed a clear diagonal block structure, indicating strong within-cluster similarity and low inter-cluster similarity in the fingerprint space. Figure 3C summarizes 17 cluster representatives together with the reference ligand. The full list of generated and selected compounds is provided in Supplementary Table S1.

Figure 3.

Panel A shows a line graph of Kelley Penalty versus number of clusters, displaying a U-shaped curve with a minimum around 50 clusters. Panel B presents a heatmap labeled “Cluster Order” with a color scale from blue to red, indicating cluster relationships among approximately 260 items. Panel C displays chemical structure diagrams arranged in four rows. Panel D features a line graph titled “State Penalty” with blue and red segments based on index values. Panel E shows a similar plot for “Docking Score,” panel F for “MMGBSA dG Bind,” and panel G presents a scatter plot of “Ligand Strain Energy” versus “Compound ID,” with color-coded points.

Fingerprint similarity clustering and tiered docking energy screening of 270 candidates. (A) Kelley penalty versus number of clusters from Canvas. (B) Canvas fingerprint similarity matrix 270 × 270 reordered by cluster membership. (C) Representative structures with 18 compounds, including 17 cluster representatives and the MRAY_PSEAE reference ligand. (D) State penalty distribution. (E) Docking score comparison with the reference ligand. (F) Prime MM-GBSA comparison with the reference ligand. (G) Ligand strain energy comparison.

Candidates were then filtered through a tiered structure-based workflow (Figures 3DG). State-penalty filtering removed 175 compounds with state penalty > 0, retaining 95 candidates with more plausible protonation/tautomeric states under modeled conditions. Because DockingScore can include a state-related penalty contribution, this pre-filtering step reduced confounding due to low-probability ionization/tautomer states before rank comparison. At the docking stage, 21 candidates scored better than the reference ligand (reference DockingScore = −6.212; Figure 3E).

Prime MM-GBSA rescoring reduced this set to four candidates outperforming the reference baseline (reference MM-GBSA = −61.83 kcal/mol; Figure 3F). Ligand strain filtering further narrowed the shortlist to three compounds with lower bound-state strain than the reference (reference strain = 2.9446 kcal/mol): 6675, 2733768, and 5311309 (Figure 3G).

Because this is a charged, metal-dependent system, MM-GBSA values were interpreted strictly as comparative ranking signals under one consistent protocol rather than as absolute affinity estimates. Known limitations include approximate solvation/entropy treatment, sensitivity to protonation/ion assignment, and incomplete explicit treatment of metal-coordination energetics.

3.4. Binding-mode interpretation of prioritized hits and 1-μs MD stability assessment

To rationalize the final shortlist, we examined interaction diagrams for 6675, 2733768, and 5311309 (Figures 4AC) and then evaluated pose retention in 1-μs MD trajectories (Figures 4EI). Compound 6,675 adopted a multi-anchor mode with polar contacts involving Arg58, Lys133, and Asp198, together with Mg2+/Ni2+-associated contacts and broad hydrophobic packing (including Leu193, Phe264, Met321, Ala322, and Pro323).

Figure 4.

Panel A, B, and C display ligand interaction diagrams showing molecular structures with labeled amino acid residues and interaction arrows. Panels D to H contain molecular graphics depicting three-dimensional protein-ligand complexes with color-coded protein secondary structures, water molecules, and interaction highlights.

Binding mode analysis of prioritized hits and pose retention in 1.0 μs MD. (A) 2D interaction diagram of compound 6,675. (B) 2D interaction diagram of compound 2733768. (C) 2D interaction diagram of compound 5311309. (D,E) Representative 3D binding snapshots of 6,675 during 1.0 μs MD. (F,G) Representative 3D binding snapshots of 2733768 during 1.0 μs MD. (H,I) Representative 3D binding snapshots of compound 5311309 during 1.0 μs MD. Protein is shown as a cartoon ligand, waters as sticks, and metal ions as spheres. Dashed lines are direct or water-mediated contacts.

Compound 2733768 showed dual polar anchoring and strong core packing: a terminal hydroxyl–Lys119 interaction, a sulfonyl-centered polar/ionic network (including Ala56 and Arg58), a water-mediated support near Asn192, and metal-proximal contacts involving Mg2+ and Ni2+. The scaffold occupied a broad pocket region spanning both polar and hydrophobic subsites.

Compound 5311309 displayed a more focused carboxylate-centered anchor, with Lys119 ionic/H-bond support, Mg2+-proximal stabilization, and persistent polar reinforcement by Asn192, Leu193, and Asp195. Hydrophobic complementarity was concentrated near Phe264 and the Met321/Ala322/Pro323 flank.

Across 1-μs trajectories, compound 5311309 retained the most stable overall binding conformation, whereas 6675 and 2733768 exhibited larger conformational adjustments. The combined evidence from interaction and dynamics therefore prioritized compound 5311309 as the most stable binder among the three finalists under the current protocol.

3.5. Trajectory-level interaction hotspots and energetic stability of compound 5311309 versus the reference ligand

Using the trajectory-analysis workflow described in the section “Methods (Section 2.9),” we compared compound 5311309 with the reference ligand at residue resolution over 1 μs (Figures 5A,B). The two profiles shared dominant hotspots, indicating preservation of the same primary binding axis rather than site switching. Asp267 was the principal high-frequency node in both systems, with additional overlap at Asn192, Asp195, Ser125, Arg126, Arg58, and Val57. Recurrent hydrophobic engagement around Phe264 and retention of the Met321/Pro323 flank were also observed.

Figure 5.

Panel of seven scientific data visualizations. Bar charts A and B display interaction fractions for various amino acid residues, with multiple color-coded bars per residue. Line graph C shows MM/GBSA binding energy over time for two molecules, with the blue line consistently higher than the orange. Line graphs D through G illustrate the time evolution of four molecular properties: rGyr in blue, MoISA in orange, PSA in red, and SASA in green, each maintaining distinct, fluctuating profiles along the horizontal axis labeled in microseconds.

Trajectory-level interaction hotspots and stability of compound 5311309 versus the reference ligand over 1.0 μs. (A) Residue-wise interaction fraction profile for the reference ligand. (B) Residue-wise interaction fraction profile for compound 5311309. (C) MM-GBSA over time for the reference ligand and compound 5311309. X axis is time in μs, and y axis is MM-GBSA in kcal/mol. (D) Radius of gyration rGyr of compound 5311309 over time. (E) MolSA of compound 5311309 over time. (F) PSA of compound 5311309 over time. (G) SASA of compound 5311309 over time.

Despite this shared hotspot architecture, contact distribution differed. The reference ligand sampled a broader footprint across multiple pocket neighborhoods, whereas compound 5311309 showed a tighter interaction corridor from Lys119 via Ser125/Arg126 toward Asn192/Leu193/Asp195/Asp267, with limited peripheral sampling. This pattern is consistent with stronger focal anchoring and reduced exploratory motion.

Trajectory MM-GBSA profiles (Figure 5C) supported a more favorable energetic baseline for compound 5311309 throughout the full window. The reference ligand fluctuated approximately from −65 to −30 kcal/mol (most frequently −50 to −35), whereas compound 5311309 fluctuated from −82 to −55 kcal/mol (most frequently −72 to −60). Both traces showed local fluctuations without sustained drift, consistent with stable plateau-like sampling.

Ligand property trajectories for compound 5311309 (Figures 5DG) showed early adjustment followed by stabilization: slight increases in the radius of gyration and MolSA, larger but bounded SASA fluctuations with an upward tendency, and early rise-then-plateau behavior of polar-surface metrics. Torsion analysis indicated that most rotatable bonds remained within narrow windows, with only one torsion showing notable early variability before settling.

3.6. In silico ADMET profiling suggests a permeability/efflux bottleneck with focused liver–kidney and irritation/sensitization flags

To assess developability alongside binding performance, we summarized model-predicted ADMET outputs and treated them strictly as hypothesis-generating signals. Overall, the profile suggests a meaningful exposure bottleneck driven by permeability and transporter effects, together with distribution and safety risks (Hakkola et al., 2020). In particular, predicted high plasma protein binding with low unbound fraction and hepatobiliary risk signals limit confidence in translational potential at this stage. Therefore, we interpret compound 5311309 as an early optimization hit rather than a development-ready Gram-negative antibacterial candidate. Detailed ADMET data are provided in Supplementary Table S2.

For absorption, Caco-2 = −5.649 and MDCK = −5.06 are both predicted as high risk (Bittermann and Goss, 2017), whereas PAMPA is predicted to be positive (Gousiadou et al., 2023). This Caco-2/MDCK versus PAMPA divergence is compatible with a transporter-mediated component but remains hypothetical without experimental validation. Consistently, the P-gp inhibitor is predicted negative, and the P-gp substrate is predicted positive, which may reduce intracellular exposure through efflux (Rehman et al., 2025). For Gram-negative antibacterial development, these liabilities are especially consequential because efficacy depends on sufficient intracellular accumulation despite permeability barriers and active efflux. Predicted bioavailability thresholds are mixed (F20% and F30% favorable; F50% unfavorable), and HIA is predicted to be acceptable; these outputs should be regarded as preliminary rather than definitive in vivo evidence.

For distribution, PPB = 98.1% and Fu = 2.7% are predicted and flagged as high risk, which may reduce free circulating and infection-site exposure. The predicted VDss = −0.878 may be compatible with relatively limited tissue distribution. Poor BBB penetration is predicted, which may be acceptable for non-CNS indications (McCartan et al., 2023). Transporter predictions are mixed: OATP1B1/OATP1B3 are not strongly flagged, whereas BCRP and BSEP inhibition are predicted (Anabtawi et al., 2022). The BSEP signal warrants specific caution because of potential cholestatic liability and therefore represents a key translational risk hypothesis requiring dedicated follow-up (Ren et al., 2021).

For metabolism and excretion, most CYP inhibitor calls are predicted negative, while CYP2C9 and CYP3A4 substrate calls are predicted positive. This pattern may suggest metabolism through these pathways and potential sensitivity to inducers/inhibitors. HLM stability is predicted to be favorable. Predicted plasma clearance (CL = 1.287) is not in a high-clearance range, and predicted half-life (T1/2 = 1.128, model-specific units) may be short to moderate. These outputs should be viewed as prioritization cues for PK experiments rather than quantitative PK conclusions.

For toxicity, several early screens are predicted to be favorable, including low hERG-related risk (hERG = 0.104; 10 μM = 0.073) and favorable AMES/carcinogenicity predictions (AMES = 0.084; carcinogenicity = 0.048). Multiple cytotoxicity readouts are also predicted to be favorable. However, higher-risk predictions are observed for human hepatotoxicity (0.767), DILI (0.499), nephrotoxicity (0.724), skin sensitization (0.857), and eye irritation (0.918), with additional flags for ototoxicity (0.706) and respiratory toxicity (0.561). A genotoxicity score of 0.479, together with a favorable AMES, should be treated as an uncertain mixed signal requiring orthogonal testing (Monem et al., 2025). These toxicity outputs further support cautious translational interpretation.

Taken together, these in silico outputs define two immediate optimization priorities: improving intracellular exposure by mitigating permeability/efflux liabilities, and early de-risking of hepatobiliary/renal and irritation/sensitization signals while preserving favorable predicted hERG and mutagenicity profiles. Accordingly, the current ADMET results are presented as model-based risk hypotheses, and compound 5311309 is positioned as a lead-optimization starting point rather than a clinically translatable Gram-negative antibacterial candidate at this stage.

3.7. Concentration-dependent growth inhibition defines distinct IC50 and MIC thresholds for compound 5311309 and the positive control Tunicamycin

Compound 5311309 showed clear concentration-dependent inhibition of bacterial growth, as measured by endpoint OD600. At concentrations up to 8 μM, growth remained close to the vehicle level, whereas a steep decline was observed across ~10–20 μM on the log concentration scale. Four-parameter dose–response fitting yielded an IC50 of approximately 15.1 μM. In the same assay, the positive control Tunicamycin (Catalog No. T13229) displayed substantially stronger low-dose activity, with an IC50 of approximately 0.30 μM, indicating higher apparent potency than compound 5311309 under these conditions.

With further dose escalation, OD600 values approached baseline. Using the operational criteria applied in this study (OD600 ≤ 0.05 with no detectable growth in 5/5 replicates), the MIC was 64 μM for compound 5311309 and 128 μM for Tunicamycin. The separation between IC50 and MIC indicates that partial growth suppression occurs at lower concentrations, whereas a stringent no-growth endpoint can be reached only at higher concentrations. These data support follow-up work to clarify whether the dominant effect is bacteriostatic or bactericidal and how the phenotype depends on exposure duration and inoculum level; mechanistic interpretation also remains preliminary because direct intracellular MraY target engagement has not yet been demonstrated, and off-target effects (e.g., membrane perturbation or other pathways) cannot currently be excluded (see Figure 6).

Figure 6.

Scatter plot showing growth percentage versus concentration in micromolar on a logarithmic scale for two compounds, labeled 5311309 (blue circles) and Tunicamycin (orange squares). IC50 and MIC values are indicated with vertical dashed lines: 5311309 has IC50 at 14.6 micromolar and MIC at 64 micromolar; Tunicamycin has IC50 at 0.31 micromolar and MIC at 128 micromolar. Error bars represent standard deviation.

Dose–response growth inhibition of Pseudomonas aeruginosa by compound 5311309 and Tunicamycin.

4. Conclusion

This study establishes a metal-aware, pocket-conditioned discovery workflow tailored to MRAY_PSEAE, where Mg2+-dependent catalysis imposes strict geometric and electrostatic constraints that can confound standard docking/scoring. By integrating explicit metal-site representation, chemical-space compression, tiered energetic rescoring (including MM-GBSA and strain-energy control), and microsecond-scale MD validation, we efficiently reduced 270 candidates to three prioritized hits and identified compound 5311309 as the most dynamically stable binder. The overlap of interaction hotspots between compound 5311309 and the reference ligand supports a conserved binding axis within the catalytic pocket. Functionally, compound 5311309 produced clear, dose-dependent growth inhibition with an IC50 in the mid-μM range (~15.1 μM) and a higher MIC threshold (64 μM), motivating follow-up mechanistic assays (target engagement and bacteriostatic/bactericidal resolution). In silico ADMET profiling highlights two immediate optimization axes: (i) improving intracellular exposure by mitigating permeability/efflux limitations and (ii) early de-risking of predicted hepatic/renal and irritation/sensitization liabilities while maintaining favorable hERG and mutagenicity predictions. Collectively, these findings provide both a methodological template for metal-dependent antibacterial targets and a concrete chemical starting point for anti-P. aeruginosa lead development. A key limitation is that this study does not implement diffusion/VAE/GAN-based de novo generation; instead, it explores a near-neighbor chemical space via similarity-guided library expansion, which improves practical tractability but may constrain scaffold novelty. A further limitation is the absence of formal retrospective benchmarking with known MraY inhibitors/decoys. Future studies will include such benchmarking and orthogonal experimental validation.

Funding Statement

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by National Natural Science Foundation of China (grant number: 82103305).

Edited by: Vincent Zhang, University of Illinois at Urbana-Champaign, United States

Reviewed by: Anand Anbarasu, VIT University, India

Zhouyu Ning, Fudan University, China

Soharth Hasnat, Bangabandhu Sheikh Mujibur Rahman Agricultural University, Bangladesh

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

TS: Writing – original draft, Writing – review & editing. LZ: Writing – original draft, Writing – review & editing. RC: Software, Writing – original draft. SJ: Writing – original draft, Writing – review & editing. HZ: Writing – original draft, Writing – review & editing.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that Generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2026.1798540/full#supplementary-material

Data_Sheet_1.CSV (14.3KB, CSV)
Data_Sheet_2.CSV (92.4KB, CSV)

References

  1. Anabtawi N., Drabison T., Hu S., Sparreboom A., Talebi Z. (2022). The role of OATP1B1 and OATP1B3 transporter polymorphisms in drug disposition and response to anticancer drugs: a review of the recent literature. Expert Opin. Drug Metab. Toxicol. 18, 459–468. doi: 10.1080/17425255.2022.2113380, [DOI] [PubMed] [Google Scholar]
  2. Bittermann K., Goss K. U. (2017). Predicting apparent passive permeability of Caco-2 and MDCK cell-monolayers: a mechanistic model. PLoS One 12:e0190319. doi: 10.1371/journal.pone.0190319, [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Cao Y., Qiao W., Zhang C., Zhu M., Wu Q., Lou W.-Y., et al. (2025). Programmable enzyme catalysis based on multiscale confinements. Nat. Synth. 4, 1338–1348. doi: 10.1038/s44160-025-00898-1 [DOI] [Google Scholar]
  4. Cheng Y., Gong Y., Liu Y., Song B., Zou Q. (2021). Molecular design in drug discovery: a comprehensive review of deep generative models. Brief. Bioinform. 22:bbab344. doi: 10.1093/bib/bbab344, [DOI] [PubMed] [Google Scholar]
  5. Ferrara F., Castagna T., Pantolini B., Campanardi M. C., Roperti M., Grotto A., et al. (2024). The challenge of antimicrobial resistance (AMR): current status and future prospects. Naunyn Schmiedeberg's Arch. Pharmacol. 397, 9603–9615. doi: 10.1007/s00210-024-03318-x, [DOI] [PubMed] [Google Scholar]
  6. Gousiadou C., Doganis P., Sarimveis H. (2023). Development of artificial neural network models to predict the PAMPA effective permeability of new, orally administered drugs active against the coronavirus Sars-CoV-2. Netw Model Anal Health Inform Bioinform 12:16. doi: 10.1007/s13721-023-00410-9, [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Hakkola J., Hukkanen J., Turpeinen M., Pelkonen O. (2020). Inhibition and induction of CYP enzymes in humans: an update. Arch. Toxicol. 94, 3671–3722. doi: 10.1007/s00204-020-02936-7, [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Hekkelman M. L., De Vries I., Joosten R. P., Perrakis A. (2023). AlphaFill: enriching AlphaFold models with ligands and cofactors. Nat. Methods 20, 205–213. doi: 10.1038/s41592-022-01685-y, [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Hoffmann B., Zaslavskiy M., Vert J. P., Stoven V. (2010). A new protein binding pocket similarity measure based on comparison of clouds of atoms in 3D: application to ligand prediction. BMC Bioinformatics 11:99. doi: 10.1186/1471-2105-11-99, [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Liu Y., Rodrigues J. P., Bonvin A. M., Zaal E. A., Berkers C. R., Heger M., et al. (2016). New insight into the catalytic mechanism of bacterial MraY from enzyme kinetics and docking studies. J. Biol. Chem. 291, 15057–15068. doi: 10.1074/jbc.M116.717884, [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Mashalidis E. H., Kaeser B., Terasawa Y., Katsuyama A., Kwon D. Y., Lee K., et al. (2019). Chemical logic of MraY inhibition by antibacterial nucleoside natural products. Nat. Commun. 10:2917. doi: 10.1038/s41467-019-10957-9, [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Mccartan R., Khorkova O., Volmar C. H., Wahlestedt C. (2023). Nucleic acid-based therapeutics for the treatment of central nervous system disorders. Front. Genet. 14:1250276. doi: 10.3389/fgene.2023.1250276, [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Monem S., Abdel-Hamid A. H., Hassanien A. E. (2025). Drug toxicity prediction model based on enhanced graph neural network. Comput. Biol. Med. 185:109614. doi: 10.1016/j.compbiomed.2024.109614, [DOI] [PubMed] [Google Scholar]
  14. Moreau-Marquis S., Stanton B. A., O'toole G. A. (2008). Pseudomonas aeruginosa biofilm formation in the cystic fibrosis airway. Pulm. Pharmacol. Ther. 21, 595–599. doi: 10.1016/j.pupt.2007.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Nakaya T., Yabe M., Mashalidis E. H., Sato T., Yamamoto K., Hikiji Y., et al. (2022). Synthesis of macrocyclic nucleoside antibacterials and their interactions with MraY. Nat. Commun. 13:7575. doi: 10.1038/s41467-022-35227-z, [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Oluyombo O., Penfold C. N., Diggle S. P. (2019). Competition in biofilms between cystic fibrosis isolates of Pseudomonas aeruginosa is shaped by R-pyocins. MBio 10:e01828-18. doi: 10.1128/mBio.01828-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Qin S., Xiao W., Zhou C., Pu Q., Deng X., Lan L., et al. (2022). Pseudomonas aeruginosa: pathogenesis, virulence factors, antibiotic resistance, interaction with host, technology advances and emerging therapeutics. Signal Transduct. Target. Ther. 7:199. doi: 10.1038/s41392-022-01056-1, [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Rehman M. U., Chong K. T., Tayara H. (2025). Drugs inhibition prediction in P-gp enzyme: a comparative study of machine learning and graph neural network. Comput. Toxicol. 34:100344. [Google Scholar]
  19. Ren T., Pang L., Dai W., Wu S., Kong J. (2021). Regulatory mechanisms of the bile salt export pump (BSEP/ABCB11) and its role in related diseases. Clin. Res. Hepatol. Gastroenterol. 45:101641. doi: 10.1016/j.clinre.2021.101641, [DOI] [PubMed] [Google Scholar]
  20. Saxena D., Maitra R., Bormon R., Czekanska M., Meiers J., Titz A., et al. (2023). Tackling the outer membrane: facilitating compound entry into gram-negative bacterial pathogens. NPJ Antimicrob. Resist. 1:17. doi: 10.1038/s44259-023-00016-1, [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Schneuing A., Harris C., Du Y., Didi K., Jamasb A., Igashov I., et al. (2024). Structure-based drug design with equivariant diffusion models. Nat. Comput. Sci. 4, 899–909. doi: 10.1038/s43588-024-00737-x, [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Torrens G., Cava F. (2024). Mechanisms conferring bacterial cell wall variability and adaptivity. Biochem. Soc. Trans. 52, 1981–1993. doi: 10.1042/BST20230027, [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Wang K., Lyu N., Diao H., Jin S., Zeng T., Zhou Y., et al. (2020). GM-DockZn: a geometry matching-based docking algorithm for zinc proteins. Bioinformatics 36, 4004–4011. doi: 10.1093/bioinformatics/btaa292, [DOI] [PubMed] [Google Scholar]
  24. Xu C., Wu S., Liu P., Huang Y., Chen Y., Ding G., et al. (2024). Computational identification and analysis of CNP0269688 as a natural product inhibitor disrupting the interaction between the HIV matrix domain and tRNA. Front. Chem. 12:1450339. doi: 10.3389/fchem.2024.1450339, [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Zhang H., Zhang X., Su Q., Zheng Y., Jiang L., Zhu K., et al. (2026). MetalloDock: decoding metalloprotein-ligand interactions via physics-aware deep learning for metalloprotein drug discovery. J. Am. Chem. Soc. 148, 3086–3101. doi: 10.1021/jacs.5c15876, [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data_Sheet_1.CSV (14.3KB, CSV)
Data_Sheet_2.CSV (92.4KB, CSV)

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.


Articles from Frontiers in Microbiology are provided here courtesy of Frontiers Media SA

RESOURCES