Abstract
Mesothelin (MSLN) is a cell surface glycoprotein overexpressed in many solid tumors, which is known to interact with cancer antigen CA125/MUC16, promoting cancer cell adhesion and metastasis. MSLN has been used as a target of multiple antibody-based therapeutic strategies, but their efficacy remains limited, potentially due to inherent pharmacokinetics conferred by the structure of antibodies (~150 kDa). To provide an alternative targeting molecule, we engineered a small scaffold protein derived from the tenth domain of human fibronectin type III (Fn3, 12.8 kDa) to bind MSLN with nanomolar affinity as a theranostic agent for MSLN-positive cancers. In this study, we explored the Fn3-MSLN interaction site through computational modeling and experimentally validated the model through domain-level and fine epitope mapping. Fn3-MSLN binding was predicted by a consensus approach, comparing multiple protein-protein docking software, the deep-learning-based algorithm AlphaFold3, and performing molecular dynamics (MD) simulations. To validate the prediction, full-length MSLN, single MSLN domains, or combinations of domains were expressed on the yeast surface, and Fn3 binding to displayed MSLN domains was measured by flow cytometry. The employed algorithms predicted two distinct binding modes for Fn3. Overall, experimental data agreed with our in silico prediction resulting from the AlphaFold3 model, confirming that MSLN domains B and C are predominantly involved in the interaction.
Keywords: protein engineering, protein scaffold, protein docking, molecular dynamics simulations, yeast surface display, Fn3, targeted therapy, mesothelin
1. INTRODUCTION
Mesothelin (MSLN) is a surface glycoprotein overexpressed in numerous malignancies, including mesothelioma, ovarian, pancreatic, and lung cancers.1,2 MSLN is anchored on the cell surface by the C-terminus through a glycosylphosphatidylinositol (GPI) anchor. MSLN derives from the cleavage of a 68 kDa precursor (628 amino acids), encoded by the MSLN gene.3 The cleavage occurs at arginine 295 (Arg295) and leads to two peptides: soluble megakaryocyte potentiating factor (MPF, 31 kDa) released into the bloodstream and membrane-bound MSLN (40 kDa).1 MSLN can also be shed in blood, pleural and peritoneal fluid, in its soluble form, which is referred to as soluble mesothelin-related peptide (SMRP).4 In the mature form, membrane-bound MSLN has three predicted N-glycosylation sites (Asn388, Asn488, and Asn515).5 Despite its biological role remaining unclear,6 MSLN has been proposed as a promising cancer biomarker for diagnosis and therapy since it is aberrantly expressed in many solid tumors and has low or no expression in healthy tissues.7–10 Furthermore, MSLN was shown to bind mucin 16 (CA125/MUC16), an antigen overexpressed in various cancers.11 MSLN-MUC16 binding was found to promote cell adhesion and metastasis, revealing the potential tumorigenic role of MSLN.12–15 Therefore, there is sustained interest in MSLN as a target for diagnosis and therapy. In the last decade, multiple antibody-based approaches targeting MSLN were developed and tested in clinical trials.16–19
Recently, a crystal structure of MSLN was published by Zhan et al. (2023), showing that the MSLN membrane distal region (N-terminus) is rigid, whereas the C-terminus, bound to the cell membrane, is more flexible.20 The N-terminal region has been observed to bind CA125/MUC16 and a chimeric monoclonal antibody (mAb), known as MORAb-009 or amatuximab, that has been used to target MSLN for immunotherapy.21–24 The C-terminal region is closer to the cleavage site that leads to SMRP and is recognized by mAb 15B6, which was produced in recent studies to avoid being released from the cell during the process of MSLN shedding and SMRP release.18,20
Overall, clinical trials studying antibody-based strategies targeting MSLN for therapy resulted in poor efficacy, attributed to immunogenicity and an “on target/off tumor” effect due to binding circulating SMRP, and trials have yet to result in approved MSLN-targeting approaches.10,24,25 Since MSLN-positive cancers are generally solid tumors, the antibodies’ large size could limit the ligand penetration into the tumor, lowering the antibody efficacy.26 It has been observed that the anti-tumor efficiency of antibody-based strategies is increased in blood cancers, where the tumor cells, thus the target, are easily accessible.26
To provide an alternative targeting molecule for MSLN, we recently reported engineered MSLN-binding proteins based on the tenth type III domain of human fibronectin (Fn3).27,28 The Fn3 structure (~13 kDa, 98 amino acids) consists of a hydrophobic core that confers stability to the protein with a beta-sandwich structure, and three loops, which are the variable regions, amenable to diversification.29,30 In previous studies, Fn3 has been engineered to successfully recognize various targets, and preclinical and clinical studies have shown the potential of Fn3-based molecules for therapeutic and molecular imaging applications.31–37 We previously observed that the MSLN-binding Fn3 variants internalize and promote apoptosis in MSLN-positive ovarian cancer cells.27,28 However, we have not previously had knowledge regarding the protein-protein binding interface between engineered Fn3 variants and MSLN.
A detailed understanding of the molecular mechanism regulating protein-protein interactions requires an atomistic-level knowledge of the adducts formed between the binding partners. This information is usually gained by solving the three-dimensional structure of protein-protein complexes using experimental techniques, such as X-ray crystallography or nuclear magnetic resonance spectroscopy. However, such knowledge is often challenging to obtain for large biomolecular complexes. Therefore, in recent years, molecular modeling and simulations have contributed significantly to unraveling the molecular interactions occurring between biological macromolecules.38 More recently, deep-learning-based algorithms, such as AlphaFold3, have proved to accurately predict the three-dimensional structure of proteins and protein-protein complexes, revolutionizing the structural biology field.39
In silico methods, such as protein-protein docking, can predict the native (minimum energy) binding pose and the corresponding free energy involved. Moreover, they are useful in complementing the experimental techniques for determining their complex adducts and identifying interactions and key ‘hot spot’ residues, which are essential for drug discovery.40 However, reliable predictions remain challenging to obtain due to the substantial conformational space that these algorithms have to sample exhaustively and the limited accuracy of the employed scoring functions necessary to rank the binding poses.41 Performing μs-long force field-based MD simulations to relax the adducts predicted by docking simulations can further improve the accurate identification of native binding poses of protein-protein complexes. These could also be followed by the molecular mechanics-Poisson Boltzmann surface area (MM-PBSA) method to estimate their binding free energy (ΔGb) and refine the relative ranking of the docking poses.42
Throughout this present work, we report results with an Fn3 variant that was identified from five rounds of directed evolution to enhance affinity for MSLN, which we will refer to here as Fn3_5. Here, using a combination of protein-protein docking, machine-learning-based algorithms, and MD simulations, we report key structural information for the Fn3_5/MSLN complex and, thus, its binding interface. Next, we validate our in silico predictions through domain-level epitope mapping and yeast surface display. We determined the predicted Fn3_5-MSLN binding site and the most contributing interacting residues. Investigating the MSLN/Fn3_5 binding interaction can provide a foundation for understanding the biological effect of Fn3_5 on MSLN-expressing tumor cells and inform future rational engineering through point mutations to further improve the MSLN-Fn3 binding affinity.
2. RESULTS
2.1. In silico modeling of the Fn3_5/MSLN complex
To identify the putative three-dimensional structure of the Fn3_5/MSLN complex, we performed blind protein−protein docking calculations. First, the structure of Fn3_5 was obtained via homology modeling (see Methods section) and was subsequently relaxed by performing a 1 μs-long MD simulation (Figure S1). Due to the known limitations of docking algorithms in correctly predicting the native protein−protein binding interface, we have employed a consensus docking strategy, which has been successfully performed for similar systems.43 In particular, we selected three well-known protein-protein docking algorithms: ClusPro44, HDOCK45, and pyDock46. Remarkably, all three algorithms predicted a similar binding region for Fn3_5, located at the N-terminal part of MSLN (Figure 1A). This binding interface overlaps with the epitope of the MORAb-009 antibody (Figure S2), which was recently solved in its complex with MSLN.20
Figure 1. Top-ranking Fn3_5/MSLN binding models.

(A) Superposition of the top-ranking models obtained with ClusPro (pink), HDOCK (red), pyDock (cyan) software, and the AlphaFold server (blue). The mesothelin (MSLN) structure was divided into four domains, as reported in previous studies.20 (B) Binding free-energy (ΔGb, kcal/mol) of the MSLN/Fn3_5 complexes obtained from all the MD trajectories. The disordered C-terminal tail of Fn3_5 is the result of a hydrophilic portion of the Fn3_5 sequence that supports expression and solubility of engineered Fn3 variants.
While this work was under review, the new version of the deep-learning method for the prediction of three-dimensional biomolecular structures, AlphaFold3, was published and was made easily accessible to the scientific community through the AlphaFold Server (https://alphafoldserver.com/).39 Since AlphaFold3 performance is reported to be improved significantly compared to previous versions, particularly for protein-protein interactions, we have compared our consensus predictions obtained using canonical protein-protein docking algorithms and this new methodology. Interestingly, we obtained a new binding pose different from those predicted by ClusPro, HDOCK, and pyDock (Figure 1A). In particular, in the AlphaFold3 model, Fn3_5 interacts mainly with the B and C domains of MSLN.
Next, to relax the obtained models and refine the binding poses, we have performed μs-long MD simulations on the best poses obtained. All the investigated models rapidly reached structural convergence, with the one obtained by AlphaFold3 showing very small fluctuations (Figure S3). Then, we evaluated the free energy of binding (ΔGb) using the molecular mechanics-Poisson Boltzmann surface area (MM-PBSA) method.47 The AlphaFold3 model showed the strongest binding (Figure 1B), while the other three related poses show lower, although similar, binding energies, with the ClusPro model showing the lowest value. Due to the general consensus shown by the three docking algorithms, we decided to further analyze the ClusPro model along with the one obtained using AlphaFold3. Thus, we dissected the residues that mainly contributed to the binding at the protein-protein interface between MSLN and Fn3_5 by applying a per-residue decomposition to the three replicas of the ClusPro and AlphaFold3 models (in Table 1, the lowest energy model is shown, while in Tables S1 and S2, the per-residue decomposition of all three replicas is reported).
Table 1. Per-residue decomposition of binding free-energy (ΔGb) of the ClusPro and AlphaFold3 complexes.
These were obtained from the lowest energy replica of the ClusPro and AlphaFold3 docking poses and reported together with their per-residue decomposition analysis. Residues stabilizing the binding pose are marked in orange (positive ΔGb), yellow (ΔGb between −1.01 and −2.00 kcal/mol), light green (ΔGb between −2.01 and −3.00 kcal/mol), and dark green (ΔGb greater than −3.01 kcal/mol).
| ClusPro | |||||
|---|---|---|---|---|---|
| Domains | Mesothelin Residues | Per-residue decomposition | Loops | Fn3_5 residues | Per-residue decomposition |
| A | Ala341 | −1.61 ± 0.54 | BC | Ala29 | −1.67 ± 0.53 |
| Pro343 | −3.03 ± 0.60 | Leu30 | −5.94 ± 0.74 | ||
| Phe344 | −2.59 ± 0.76 | Tyr31 | −1.67 ± 0.76 | ||
| Tyr346 | −1.26 ± 0.43 | DE | Pro51 | −1.95 ± 0.38 | |
| B | Tyr374 | −3.23 ± 0.79 | Arg52 | −3.65 ± 1.13 | |
| Glu400 | −3.43 ± 1.36 | Tyr53 | −6.29 ± 0.64 | ||
| Lys403 | +1.91 ± 1.30 | Tyr54 | −2.15 ± 0.64 | ||
| Gly404 | −-2.31 ± 0.79 | FG | Arg77 | −4.86 ± 1.72 | |
| Glu406 | −1.82 ± 1.09 | ||||
| AlphaFold3 | |||||
| Domains | Mesothelin Residues | Per-residue decomposition | Loops | Fn3_5 residues | Per-residue decomposition |
| B | Glu406 | −1.14 ± 1.59 | BC | Tyr24 | −2.23 ± 1.47 |
| Met407 | −2.93 ± 0.81 | Leu30 | −3.19 ± 0.69 | ||
| Pro409 | −1.85 ± 0.72 | Ty31 | −6.91 ± 1.81 | ||
| Ala412 | −1.16 ± 0.42 | Framework | Phe48 | −1.35 ± 0.97 | |
| Ile415 | −2.09 ± 0.38 | Thr49 | −3.03 ± 2.40 | ||
| C | Ser443 | −2.48 ± 0.64 | Val50 | −1.35 ± 0.42 | |
| Leu444 | −1.17 ± 0.47 | DE | Pro51 | −4.52 ± 0.54 | |
| Ser445 | −4.09 ± 0.89 | Arg52 | −5.97 ± 1.55 | ||
| Pro446 | −2.20 ± 0.42 | Tyr53 | −4.66 ± 0.91 | ||
| Asp469 | +1.80 ± 1.75 | Leu55 | −2.49 ± 0.56 | ||
| Gln472 | −1.36 ± 0.48 | FG | Val75 | −1.20 ± 0.28 | |
| Arg77 | −1.56 ± 1.57 | ||||
| Asn79 | −2.34 ± 1.27 | ||||
In the ClusPro model, we noticed that the residues mostly contributing to the binding are located in domains A and B of MSLN and all three loops BC, DE, and FG of Fn3_5 (Figure 2A). In particular, some residues such as Pro343, Phe344, and Tyr374 of MSLN and Leu30, Tyr31 (loop BC), Pro51, Arg52, Tyr53, Tyr54 (loop DE), and Arg77 (loop FG) are consistently present in all the MD simulations (Table 1 and S1). Our simulation points toward a significant involvement of the loop DE of Fn3_5 in the binding. A closer view of the protein-protein binding interface is shown in Figure 2A. Here, we noticed the presence of persistent salt bridges between Arg52 and Arg77 of Fn3_5 and Glu400 and Glu406 of MSLN (Figure 2A). π-π stacking interactions performed by Phe344 and Tyr374 of MSLN and Tyr53 and Tyr54 of Fn3_5 also strongly contribute to the binding. Some of these residues are also part of the epitope recognized by the MORAb-009 antibody,20 which indirectly validates the proposed binding interface. On the other hand, the binding of the other two models (HDOCK and pyDock, Tables S3 and S4) appears to be less specific; due to an increased tilt of Fn3_5, the FG loop is less involved in the binding of MSLN in the HDOCK model and absent in the one obtained with pyDock. Also, in these cases, we observe a significant contribution of Loop DE of Fn3_5, especially in the pyDock model, which accounts for almost all the interactions. However, all of these simulations indicate the MSLN domains A and B as the most involved in Fn3_5 binding.
Figure 2. Residues of Fn3_5 most significantly contributing to binding.

The binding interface between Fn3_5 and MSLN in the A) ClusPro and B) AlphaFold3 binding poses are highlighted. In the insets are reported the residues most significantly contributing to the ΔGb. Salt bridges are highlighted using dashed blue lines. Fn3_5 is shown as pink/blue ribbons.
In the AlphaFold3 model, we observed that the MSLN Domain C is strongly involved in the binding with Fn3_5 (Figure 2B). In particular, Pro409, Ser443, Ser445, and Pro446 residues of MSLN and Leu30, Tyr31 (loop BC), Thr49 (framework region), Arg52, Tyr53, and Tyr54 (loop DE) of Fn3_5 are contributing to the binding in all MD simulations (Table 1 and S2).
We also noticed that the most involved Fn3 loop in the binding is DE, as observed in the ClusPro model, and in particular, Arg52 and Tyr53. A closer inspection of the binding interface shows that Arg52 is also involved in a salt bridge with Glu406 of MSLN. Another stabilizing hydrogen bond is formed by Ser445 of MSLN, and also the overall binding is stabilized by hydrophobic interactions of Pro409, Ile415, and Pro446 of MSLN and Tyr31, Pro51, and Tyr53 of Fn3_5 (Figure 2B). Therefore, this model, which has shown the lowest binding energy (Figure 1B), underlines a clear contribution of both domain B and C of MSLN in the binding with Fn3_5, also resulting in the lowest calculated binding energy.
The MSLN structure is composed of a combination of HEAT and ARM motifs, which generally mediate protein-protein interactions by furnishing unique mechanical properties.48 To investigate if Fn3_5 binding would influence the MSLN internal dynamics, we have performed a principal component analysis (PCA, Figure S4) using the MD trajectories obtained from the ClusPro and AlphaFold3 models. We have observed in the MSLN structure, and in both systems, a breathing motion where the N- and C- termini move with respect to a hinge centered on domain C. By comparing this movement with an MD simulation performed on MSLN alone, we have noticed that this motion is not affected by the presence of Fn3_5. Finally, we have observed a local rigidification of the MSLN region implicated in Fn3_5 binding, mostly involving domains A and B for ClusPro and domains B and C for the AlphaFold3 model (Figure S5).
2.2. Domain-level epitope mapping system for MSLN binding established using yeast surface display
To experimentally validate the proposed binding interaction between Fn3_5 and MSLN, we designed a set of yeast-surface displayed constructs composed of combinations of MSLN domains (Figure 3A), based on the published MSLN crystal structure.20 MSLN was divided into four domains, following the naming convention of domains A, B, C, and D from Zhan et al.20 We refer to the full length domain as ABCD. Yeast surface display has previously been used as a method to map the binding domains on EGFR for a selection of anti-EGFR antibodies, by displaying fragments of EGFR.49 Because of the differences in yeast and human glycosylation patterns, with yeast exhibiting hypermannosylation that could sterically hinder binding interactions, we removed the three N-linked glycosylation sites of MSLN, by mutating asparagine to glutamine in the gene encoding the full-length shed MSLN, at the following three sites: Asn388Gln (in domain B), Asn488Gln (in domain C), Asn515Gln (in domain D). We confirmed that each of the 10 domain combinations were successfully expressed on the surface of yeast with an antibody that recognizes a c-myc tag at the C-terminus of each displayed construct (Figure 3B). A non-expressing population of yeast is characteristic of all yeast surface displayed proteins, and provides an internal negative control for these epitope mapping experiments.
Figure 3. Independent domains of MSLN can be expressed on the yeast surface.

(A) Design of MSLN domains for epitope mapping using yeast surface display. (B) All domain combinations expressed well on the surface of yeast, detected by an antibody to the C-terminal c-myc epitope tag. A non-expressing population of yeast is characteristic of yeast surface display. Expression tests were conducted at least four times for each construct. Representative expression data shown, with all constructs expressed the same day in freshly prepared media. Percent expressing population is shown for each histogram, and median fluorescence intensity is indicated for each positive population.
We then determined the domain of MSLN bound by a commercially available anti-MSLN antibody. The epitope of MSLN that the antibody was raised against was previously not reported. We simultaneously labeled each MSLN construct with the anti-MSLN antibody and an anti-c-myc antibody, to separate those yeast that were expressing from those that were not. Our data showed that domain C was necessary and sufficient for binding the anti-MSLN antibody (Figure 4, top). According to the manufacturer website, in addition to applications in flow cytometry, the antibody was suitable for western blots, suggesting that the antibody recognizes a linear epitope on MSLN, which would be recognizable regardless of whether the displayed MSLN was properly folded.
Figure 4. Validation of domain-level epitope mapping of MSLN binding.

Expression of each domain on the yeast surface is shown on the x-axis, as measured in flow cytometry by a C-terminal c-myc epitope tag for each construct. The construct being tested is labeled in the upper left corner of each plot. The y-axis shows the detection of binding to either an anti-MSLN antibody (top) or a fragment of MUC16 (bottom). The anti-MSLN antibody recognizes an epitope in domain C. MUC16 binds domain A, as previously reported.21
Due to the eukaryotic protein quality control machinery of S. cerevisiae, the expression of a protein in yeast is often properly folded, although correct fold must be experimentally confirmed, as there are also examples of misfolded, expressed domains in yeast surface display.50,51 Towards validating that the MSLN surface-displayed fragments were properly folded, we incubated the displayed MSLN constructs with a soluble domain of MUC16, which is reported to predominantly bind MSLN domain A, with minor contributions from domain B.21 We co-incubated MUC16 with an antibody that recognizes the C-terminal c-myc epitope tag in yeast surface display, to identify those yeast that were expressing each construct. For the expressing population of yeast, the presence of domain A was necessary and sufficient for binding MUC16 (Figure 4, bottom), confirming proper fold for domain A of each construct. The strongest binding signal was observed for domains ABCD and ABC compared to AB and isolated domain A. This data suggests that while the domains can be independently expressed, each domain may contribute to stabilizing the overall fold of MSLN, and therefore each domain may play an indirect role in enhancing the binding of MSLN with targeting molecules.
2.3. Binding assays support in silico modeling of Fn3_5/MSLN interaction
After establishing the domain-level epitope mapping strategy using subdomains of MSLN displayed on the surface of yeast, we then measured the dissociation constant (KD) of Fn3_5 and the full-length MSLN construct ABCD using an equilibrium titration binding assay and flow cytometry. Using human cancer cell lines with high levels of expression of MSLN on their surface, we have measured a KD = 11 +/− 4 nM (unpublished, manuscript under review). For the human cell line measurement, the MSLN is full length, encompassing domains ABCD and an additional short C-terminal fragment with a glycosylphosphatidylinositol anchor to the mammalian cell membrane. The MSLN expressed in cancer cell lines also has intact N-linked glycosylation. For the yeast-displayed ABCD and Fn3_5, we measured KD = 36 +/− 13 nM (mean ± standard deviation, ten replicates) (Figure 5A), consistent with the interaction between Fn3_5 and human cancer cells with MSLN, further validating our approach to mapping the binding domains for Fn3_5.
Figure 5. Fn3_5 binding interaction mediated by a combination of domains A, B, C, and D of MSLN.

(A) Equilibrium titration binding assay of Fn3_5 with yeast surface displayed MSLN domain ABCD. KD = 36 ± 13 nM (mean ± standard deviation of ten replicates, representative curve shown). (B) Domain mapping of all MSLN domain constructs with 100 nM and 400 nM Fn3_5 co-incubated with an anti-c-myc tag antibody detecting domain expression. Binding was detected to ABCD, ABC, BCD, and BC, without conclusive evidence of binding for other subdomains.
We then assessed the binding of Fn3_5 to each displayed MSLN construct, measuring at both 100 and 400 nM of protein (Figure 5B). We chose 100 nM as a near-saturating amount for binding to the ABCD construct. Since our in silico calculations predict that Fn3_5 interacts with both domains A and B, in the case of the ClusPro model or with both domain B and C, in the case of the AlphaFold3 model, we hypothesized that the higher 400 nM concentration would be needed to identify binding to MSLN fragments that did not encompass the entire binding interface. At the 100 nM concentration, the following MSLN fragments demonstrated binding to Fn3_5: ABCD, ABC, and BCD. We also observed moderate binding to domain BC, but did not observe detectable binding for other constructs. For the 400 nM concentration of Fn3_5, we saw similar binding signals comparing ABCD, ABC, and BCD, suggesting they were near saturating concentrations of Fn3_5. We also observed an increase in binding signal for domain BC, consistent with 100 nM and 400 nM both being at sub-saturating levels for BC. We did not observe any conclusive binding for other domains displayed on the surface of yeast. A full titration equilibrium binding assay with domain AB further confirmed that there was no measurable binding to domain AB when expressed alone on the surface of yeast (Figure S6). It is also possible that there is binding between Fn3_5 and domains A, B, or C alone, but that the interaction is sufficiently weak not to be measurable using flow cytometry at concentrations up to 400 nM of Fn3_5. These data are in agreement with the model obtained by AlphaFold3, which predicts an interaction of the B and C domains of MSLN with Fn3_5. This also agrees with the relative rankings of our computational predictions, showing a higher ΔGb for the AlphaFold3 model compared with the other models.
2.4. Interacting residues predicted from AlphaFold3 model validated via MSLN point mutagenesis and binding assays
To validate potential interacting residues identified via our in silico approach on the ClusPro and AlphaFold3 models, we have selected some of the amino acids mostly contributing to binding (Table 1), mutated them into alanine (Ala), and repeated the titration binding assays between the resulting MSLN mutants and Fn3_5. For the ClusPro model, the Glu400Ala mutation was selected, since in our simulations Glu400 is involved in a persistent salt bridge with Arg52 of Fn3_5, resulting in a stabilization of the Fn3_5/MSLN complex. On the other hand, for the AlphaFold3 model, we have chosen to mutate the residue mostly contributing to the binding, Ser445Ala, as highlighted by our per-residue decomposition analysis (Table 1). Then we also selected an amino acid slightly contributing to the stabilization of both the ClusPro and AlphaFold3 models by establishing salt bridges with arginine residues of Fn3_5 (Figure 2), selecting Glu406Ala. Finally, a double mutant, Glu406Ala/Ser445Ala, was also constructed for testing the AlphaFold3 model.
The four full-length MSLN variants expressed on the surface of yeast and bound MUC16, confirming that domain A was properly folded (Figure 6A). Titration binding assays of Fn3_5 with the MSLN variants were repeated in at least triplicate, and a mean KD and standard deviation determined (Figure 6B). To assess statistical significance of the differences in dissociation constants, comparisons of the mean KD values of at least three replicates were analyzed using a two-tailed t-test. Resulting p-values are given in Table S5. Mutation of the key residue for the ClusPro model, Glu400Ala, did not diminish binding of Fn3_5 compared to the ABCD wildtype construct (WT: KD=36 ± 13 nM vs Glu400Ala: KD=37 ± 10 nM, p-value = 0.91). Meanwhile, the individual mutants from the AlphaFold3 model both had decreased Fn3_5 binding compared to ABCD wildtype for Glu406Ala (KD=62 ± 16 nM, p-value = 0.0143) and Ser445Ala (KD=117 ± 19 nM, p-value < .0001), with an even greater decrease in binding for the double mutant Glu406Ala/Ser445Ala (KD=164 ± 40 nM, p < .0001). Combined with the domain-level epitope mapping data, these experiments support the AlphaFold3 docking pose.
Figure 6. Fine epitope mapping of Fn3_5 binding to MSLN.

(A) Three residues in ABCD MSLN were chosen for mutation to Ala based on the AlphaFold3 or ClusPro models, generating three single mutants and one double mutant. All four MSLN variants expressed on the surface of yeast and bound MUC16. (B) Titration binding assays revealed that Glu406 and Ser445 were important for binding, supporting the AlphaFold3 model. Meanwhile, the Glu400Ala mutant corresponding to the ClusPro model binding interface maintained Fn3_5 binding that was not statistically different compared to wildtype ABCD. Representative binding curves of at least three replicates conducted on different days are shown, and mean and standard deviation reported for all replicates. Statistical analysis of differences comparing ABCD WT and mutants are given in the text and in Table S5.
3. DISCUSSION
In this study, we reported the computational modeling and experimental validation of the interaction between cancer antigen MSLN and a high-affinity MSLN-binding Fn3 variant. Prior to this work, Fn3 variants were engineered to bind tumor cell surface protein MSLN, and the Fn3 proteins were shown to be internalized and have a targeted therapeutic effect on MSLN-positive cancer cell lines.27,28 However, there was little understanding of the molecular interaction between Fn3 variants and MSLN that enabled the observed internalization and cell killing.
To unravel the putative three-dimensional structure of the Fn3_5/MSLN complex along with the interface’s residues mostly contributing to the binding, we have applied a combination of computational techniques. Notably, while the protein-protein docking algorithms ClusPro, HDOCK, and pyDock point towards a general consensus on the binding site, which is close to the N-terminus of MSLN, the newly developed deep-leaning based algorithm AlphaFold3 suggests a different binding pose, in which Fn3_5 interacts with the B and C domains of MSLN (Figure 1A). All the obtained models were refined using μs-long MD simulations, followed by an evaluation of the binding free energy. This approach allowed us to obtain a relative energetic ranking and a per-residue description of the interactions (Table 1). Finally, the experimental data are in agreement with the model obtained using AlphaFold3, showing that MSLN domains B and C are the most important for interacting with Fn3_5.
MSLN has three N-linked glycosylation sites (Asn388, Asn488, and Asn515). There is experimental evidence that the glycan patterns can vary across cancer cell lines and patient samples, with some data indicating that differential glycosylation may influence MSLN binding to MUC16.5,11,20,52 In the present study, the computational prediction did not incorporate the MSLN glycosylation data. Since the sites were mutated for the validation step, MSLN expressed on the yeast surface was not glycosylated; thus, the epitope mapping revealed that Fn3_5 can bind MSLN without glycans. The model supports this experimental data, in that the Fn3_5-MSLN binding site was predicted to be far from the MSLN glycosylation sites. Therefore, Fn3_5-MSLN interaction does not seem dependent on MSLN glycosylation (Figure S7). However, since one of the glycosylation sites (Asn388) is located in domain B (Tyr360-Lys427), in future analyses, we could explore if the glycosylation state of MSLN could stabilize the MSLN subdomain structures and/or Fn3_5-MSLN binding.
In this work, the MSLN-Fn3_5 binding prediction was carried out using the structure of MSLN in its shed form (residues Glu296-Gly580), not including the C-terminal fragment. Therefore, data about the membrane-proximal region of MSLN are not included in the in silico analysis. Additionally, neither the engineering process nor the experimental validation (MSLN and the subdomains expressed on the yeast surface) incorporates the C-terminus. Thus, our results highlighted that Fn3_5 can bind MSLN with high affinity, despite the disordered C-terminus, suggesting that this region is not critical for Fn3_5-MSLN complex formation. In recent years, the membrane-proximal region of MSLN has been a target for other studies, in which antibodies or CAR-T cells were produced to bind the C-terminus of MSLN.18,53,54 Targeting this disordered C-terminal fragment seems important for CAR-T cell and antibody-based strategies but may be less critical for MSLN-binding Fn3s that were reported to be quickly internalized.27 Targeting the unshed portion of MSLN was proposed as an alternative to avoid the “off-tumor” effect due to SMRP in a patient’s bloodstream that acted like a decoy for previous antibody-based strategies, which resulted in low anti-tumor efficacy.10,24 Therefore, when testing Fn3_5 in further research, with in vivo models or in patients, the SMRP concentration in the bloodstream should be considered, adjusting the Fn3_5 dose to avoid or limit the effect that “off-tumor” MSLN binding may have on reducing tumor binding.
In conclusion, we have identified a binding interface between Fn3_5 and MSLN by a combination of computational modeling and experimental validation. This study supports the reported consensus that the AlphaFold3 algorithm greatly improves the prediction of the three-dimensional structure of protein-protein complexes, which, in our case, outperforms existing protein-protein docking methods. In this regard, AlphaFold3 seems to bypass the well-known problems that plague docking calculations, such as the rigid model receptor, which is intrinsically overcome by the new methodology. Thus, the identification of the MSLN-Fn3_5 binding interface can serve as a foundation for the design of novel Fn3 variants to enable an even stronger binding affinity toward MSLN. Efforts to target MSLN for cancer diagnosis and therapy are ongoing,55–58 and these results can inform the further development of candidate anti-MSLN molecules for diagnostic and therapeutic purposes. Additionally, the methods reported are also relevant for studying the interaction of other targeting macromolecules and their receptors.
4. MATERIALS AND METHODS
4.1. Model building
The structure of mesothelin was obtained from the protein data bank (pdb ID 8CX3).20 The Fn3 variants described by Sirois et al. to bind MSLN were further matured as a fifth-generation library through yeast surface display and directed evolution, using our previously reported methods;27,28 details will be reported separately (unpublished, manuscript under review). The structure of variant Fn3_5 was modeled using the SWISS-MODEL server,59 starting as a template from the 6XAY pdb. The model was then relaxed with a 1 μs-long MD simulation, and the most representative structure was extracted using a cluster analysis for the protein-protein docking calculations.
4.2. Docking calculations
We performed blind protein–protein docking calculations to identify the three-dimensional structure of the MSLN/Fn3_5 complex. Due to the limitations of docking algorithms in correctly predicting the native protein–protein binding interface, we have employed a consensus docking strategy, which has been successfully performed for similar systems.43 We used programs based on different search algorithms and scoring functions to verify if the predicted binding poses were reproducible. In particular, we confronted several docking Web servers such as ClusPro,44 HDOCK,44 and pyDock46 using the suggested parameters and without specifying any constraints on interacting residues. These all share search algorithms based on fast-Fourier transform (FFT) methods for grid matching and similar scoring functions, mainly based on desolvation and electrostatic contributions, with the exception of HDOCK, which, instead, employs an iterative knowledge-based scoring function. Moreover, AlphaFold3 was used to predict the 3D structure of the MSLN/Fn3_5 complex. The generated model reported an overall Predicted Local Distance Difference Test (pLDDT) greater than 90, indicating high confidence in the accuracy of the structured regions. The overall quality of the model, as indicated by the high pLDDT values, suggests that the well-structured regions are reliable for further analysis. Then, we selected the highest-ranking model from each used algorithm for performing MD simulations.
4.3. Molecular dynamics (MD) simulations
We performed three 1 μs-long replicas of the highest-ranking docking models obtained with all the software. In particular, these were done using the GROMACS 2021.2 software package60 and the Amber ff99SB-ILDN force field61. The models were embedded in a 15 Å layer of TIP3P water molecules. The total charge was neutralized, and additional Na+ and Cl− ions were added to achieve a physiological salt concentration of 0.150 M. All the topologies were built with Ambertools 18 62 and were subsequently converted in a GROMACS format using the software parmed. We have used a soft equilibration protocol in all the MD simulations, as described previously.63 The systems initially went through an energy minimization employing a steepest descent algorithm considering a force convergence criterion of 1000 kJ/mol·nm2. Next, the models were annealed from 0 to 300 K using a temperature gradient of 50 K every 2 ns, totaling 12 ns. The proteins were subjected to harmonic position restraints with a force constant of 1000 kJ/mol·nm2 while water molecules and ions were allowed to move. Once the target temperature of 300 K was reached, 20 ns of NPT simulations were conducted to stabilize the pressure to 1 bar using a Berendsen barostat. The same restraints used in the heating phase were retained in this phase. Temperature control at 300 K was achieved by stochastic velocity rescaling thermostat. Afterward, the barostat was switched to Parrinello-Rahman, and the restraints were restricted only to the backbone atoms. These restraints were then gradually decreased into three consecutive runs of 20 ns each, during which the force constant was set to 1000, 250, and 50 kJ/mol·nm2, respectively. Thus, after a long equilibration protocol of ~100 ns, all the position restraints were released, and all production runs were performed for 1000 ns for each model. Finally, productive MD simulations were performed using the isothermal-isobaric ensemble (NPT). The particle mesh Ewald method was used to account for long-range electrostatic interactions with a cutoff of 10 Å, and a LINCS algorithm was used to constrain the bonds involving hydrogen atoms. An integration time step of 2 fs was used in all simulations. To test for the conformational stability of MSLN domains, three replicas of a 3 μs-long MD simulations were carried out.
4.4. MM-PBSA calculations
Binding free energies (ΔGb) between the MSLN and Fn3_5 proteins were calculated using the molecular mechanics-Poisson Boltzmann surface area (MM-PBSA) method47 using the MMPBSA.py program,64 as successfully employed in other studies.65 Errors are reported as their corresponding standard deviations. MM-PBSA calculations were performed on 100 frames extracted from the MD trajectories. The free energy’s entropy contribution was not considered, as it was suggested that this term does not improve the quality of the results using the MM-PBSA.66
4.5. Protein production and purification
MSLN-binding Fn3 variant Fn3_5 was produced as previously described.27,28 Briefly, Fn3_5 gene was cloned into a pET vector with a C-terminal hexahistidine tag and expressed in BL21(DE3) E. coli. Cultures were grown in LB and induced at 37°C for 3 hours with 0.5 mM isopropyl-b-D-thiogalactopyranoside (IPTG). Cells were resuspended in lysis buffer (35 mM Na2HPO4 dibasic, 15 mM NaH2PO4 monobasic, 500 mM NaCl, 5 mM CHAPS, 25 mM imidazole, 5% glycerol) supplemented with an EDTA-free protease inhibitor (Pierce), and lysed by freeze-thaw cycles. Soluble fractions were isolated through centrifugation and passed through 0.45 μm and 0.22 μm filters. Then, Fn3 proteins were purified by cobalt affinity chromatography with HisPur cobalt resin (Thermo Fisher). Samples were dialyzed into water, lyophilized, and resuspended with phosphate buffered saline (PBS) to the desired concentration. Fn3 purity was assessed by SDS-PAGE on a BioRad ChemiDoc MP imaging system.
4.6. Cloning and expression of MSLN domains using yeast surface display
Based on the recently reported crystal structure of MSLN,20 the sequence for the full length shed MSLN (residues Glu296 - Gly580) or combinations of the reported domains A, B, C, and D were cloned into the yeast surface display pCT plasmid between the NheI and BamHI restriction sites following established methods.67,68 The following domain combinations were chosen: single domains A (N-terminal domain, Glu296-Leu359), B (Tyr360-Lys427), C (Asp428-Gly501), D (Gly502-Gly580); two domain combinations AB, BC, CD; three domain combinations ABC, BCD; and full length shed MSLN ABCD. A gene block of the full length ABCD gene was designed (Integrated DNA Technologies) as a PCR template, with the three N-linked glycosylation sites at Asn388, Asn488, and Asn515 removed by mutation of Asn to Gln. Subdomains were cloned using primers designed to add the NheI and BamHI cloning sites. Following subcloning and transformation into XL1-Blue E. coli, sequences were confirmed by standard Sanger sequencing. Plasmids were transformed into EBY100 S. saccharomyces surface display strain following protocols for the Frozen-EZ Yeast Transformation II Kit (Zymo Research). An individual yeast colony with plasmid from an SD-CAA plate was grown overnight in SD-CAA liquid culture at 30°C with aeration, then induced in SG-CAA liquid media at 20°C with aeration for 18 h. Expression of all domains was confirmed by flow cytometry using the mouse 9E10 antibody that recognizes the C-terminal c-myc tag (Fisher Scientific, 13-250-0, 1:50 dilution) and a goat anti-mouse PE conjugate antibody (Sigma, P9670,1:25 dilution). Experiments to assess the ability to express each domain were repeated at least three times for each construct. For residue-level fine epitope mapping studies, additional gene blocks were designed and ordered (Integrated DNA Technologies), with a codon for Ala replacing the codons corresponding to the residues identified for mutation, and the above procedures were followed. The MSLN residues mutated to Ala individually were Glu400, Glu406, or Ser 445. Double mutant Glu406Ala/Ser445Ala was also constructed.
4.7. Validation of yeast surface displayed MSLN system
All yeast surface display incubations and washes were performed using FACS buffer, composed of PBS with 0.1% bovine serum albumin. To determine the domain on MSLN bound by a rabbit anti-MSLN antibody (Abcam, EPR19025–42), 5 × 105 yeast of each domain to be tested were incubated for 1 h at room temperature and gentle mixing with mouse 9E10 anti-c-myc antibody to measure domain expression and with rabbit anti-MSLN antibody (1:50 dilution) in a total volume of 50 μl. Samples were washed, then incubated for 30 min at 4°C with a goat anti-mouse AF488 conjugate (Thermo Fisher, A11001, 1:200 dilution) and a goat anti-rabbit PE conjugate (Abcam, ab72465, 1:200 dilution) in 25 μl volume. Samples were washed, and immediately analyzed on a Guava flow cytometer. Data were compensated for spectral overlap. Domain-level epitope mapping of MUC16 binding was performed similarly, with alternate antibodies to avoid cross-species reactivity. For each MSLN domain construct, a near-saturating concentration (100 nM) of a MSLN-binding domain of MUC16 with a hexahistidine tag (Acro Biosystems) was incubated with induced yeast during the primary incubation step simultaneously with chicken anti-c-myc antibody (Gallus Immunotech, #ACMYC, 1:330 dilution) for 1 h at room temperature with gentle mixing. Cells were washed, and incubated for 30 min at 4°C with a mouse DyLight 488 anti-6X His tag antibody (Abcam, ab117512, 1:50 dilution) and a goat anti-chicken PE antibody (Abcam, ab72482, 1:50 dilution).
4.8. Domain-level epitope mapping and titration binding assays of Fn3_5 with yeast displayed MSLN constructs
An equilibrium titration binding assay was performed with purified Fn3_5 and the MSLN ABCD construct, or with mutants of ABCD. A range of concentrations of Fn3_5 with hexahistidine tag, from 0.1 nM to 400 nM, was incubated with 5 × 105 yeast displaying an MSLN ABCD variant, co-incubated with chicken anti-c-myc antibody, for 1 h at room temperature with gentle rotation. Yeast were washed, and incubated for 30 min at 4°C with mouse DyLight 488 anti-6X His tag antibody and goat anti-chicken PE antibody. The experiment was performed at least three times for each variant. For each replicate, data were plotted in Kaleidagraph and fit with a sigmoidal curve. The dissociation constant KD was determined for each replicate as the concentration yielding the half maximal signal, and the mean and standard deviation of all replicates reported. To determine statistical significance of the differences between mean KD values comparing mutants to ABCD wildtype, a two tailed t-test was conducted, and p-values are reported in Table S5.
Domain-level epitope mapping was performed for each MSLN construct displayed on the surface of yeast, for two concentrations of Fn3_5, at 100 nM or 400 nM of protein. All other experimental conditions and antibodies were as described for the equilibrium titration binding assay. All data were compensated for spectral overlap.
Supplementary Material
Supporting Information File 1 includes:
Figure S1. Homology modeling of Fn3
Figure S2. Superposition with MORAb-009 structure
Figure S3. Root mean square deviation (RMSD)
Figure S4. Principal component analysis (PCA) of MSLN
Figure S5. Root mean square fluctuation (RMSF)
Figure S6. MSLN domains AB on the surface of yeast do not demonstrate detectable binding to Fn3_5
Figure S7. Visualization of glycosylation sites on MSLN
Table S1. Per-residue decomposition of the binding free-energy (ΔGb) of the ClusPro complex.
Table S2. Per-residue decomposition of the binding free-energy (ΔGb) of the AlphaFold3 complex.
Table S3. Per-residue decomposition of the binding free-energy (ΔGb) of the HDOCK complex.
Table S4. Per-residue decomposition of the binding free-energy (ΔGb) of the pyDock complex.
Table S5. Statistical analysis comparing KD values of Fn3_5 binding MSLN variants.
Supporting Information File 2 includes: A video reporting the motion of MSLN, as extracted from the molecular dynamic trajectory of the MSLN/Fn3_5 ClusPro complex. Principal component 1 is shown.
Supporting Information File 3 includes: A video reporting the motion of MSLN, as extracted from the molecular dynamic trajectory of the MSLN/Fn3_5 AlphaFold3 complex. Principal component 1 is shown.
SIGNIFICANCE.
A combined experimental and computational approach furnishes the first model of the interaction between cancer biomarker mesothelin and an engineered binding scaffold protein. We identified the region on MLSN that is mainly involved in the binding interface. Gaining atomistic insights could pave the way for developing novel binding variants with stronger affinity, and broadly inform the development of mesothelin-targeting diagnostics and therapies. Further, the methods reported can be used to experimentally validate protein docking models for other biochemical systems.
ACKNOWLEDGEMENTS
The research was funded in part by the National Institutes of Health (NIH) grant R15CA198927-01 (S.J.M) and R15GM151648-01 (S.J.M), Italian Association for Cancer Research (AIRC) grant IG-25708 (F.G., M.P.), the European Union-NextGenerationEU under PNRR-M4C2-I1.3 Project PE_00000019 “HEAL ITALIA” CUP (B73C22001250006) (G.B., V.B., A.S., F.G., S.L.). The authors thank Serena Geroe (Smith College) for preliminary protein modeling efforts.
Footnotes
M.P., S.L., F.G. and S.J.M. are pursuing intellectual property discussing engineered proteins that bind mesothelin for use in cancer targeting.
REFERENCES
- (1).Chang K; Pastan I Molecular Cloning of Mesothelin, a Differentiation Antigen Present on Mesothelium, Mesotheliomas, and Ovarian Cancers. Proc Natl Acad Sci U S A 1996, 93 (1), 136–140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (2).Morello A; Sadelain M; Adusumilli PS Mesothelin-Targeted CARs: Driving T Cells to Solid Tumors. Cancer Discovery 2016, 6 (2), 133–146. 10.1158/2159-8290.CD-15-0583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (3).Chang K; Pastan I; Willingham MC Isolation and Characterization of a Monoclonal Antibody, K1, Reactive with Ovarian Cancers and Normal Mesothelium. Intl Journal of Cancer 1992, 50 (3), 373–381. 10.1002/ijc.2910500308. [DOI] [PubMed] [Google Scholar]
- (4).Hassan R; Remaley AT; Sampson ML; Zhang J; Cox DD; Pingpank J; Alexander R; Willingham M; Pastan I; Onda M Detection and Quantitation of Serum Mesothelin, a Tumor Marker for Patients with Mesothelioma and Ovarian Cancer. Clinical Cancer Research 2006, 12 (2), 447–453. 10.1158/1078-0432.CCR-05-1477. [DOI] [PubMed] [Google Scholar]
- (5).Rump A; Morikawa Y; Tanaka M; Minami S; Umesaki N; Takeuchi M; Miyajima A Binding of Ovarian Cancer Antigen CA125/MUC16 to Mesothelin Mediates Cell Adhesion*. Journal of Biological Chemistry 2004, 279 (10), 9190–9198. 10.1074/jbc.M312372200. [DOI] [PubMed] [Google Scholar]
- (6).Bera TK; Pastan I Mesothelin Is Not Required for Normal Mouse Development or Reproduction. Molecular and Cellular Biology 2000, 20 (8), 2902–2906. 10.1128/MCB.20.8.2902-2906.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (7).Yen MJ; Hsu C-Y; Mao T-L; Wu T-C; Roden R; Wang T-L; Shih I-M Diffuse Mesothelin Expression Correlates with Prolonged Patient Survival in Ovarian Serous Carcinoma. Clinical Cancer Research 2006, 12 (3), 827–831. 10.1158/1078-0432.CCR-05-1397. [DOI] [PubMed] [Google Scholar]
- (8).Winter JM; Tang LH; Klimstra DS; Brennan MF; Brody JR; Rocha FG; Jia X; Qin L-X; D’Angelica MI; DeMatteo RP; Fong Y; Jarnagin WR; O’Reilly EM; Allen PJ A Novel Survival-Based Tissue Microarray of Pancreatic Cancer Validates MUC1 and Mesothelin as Biomarkers. PLoS ONE 2012, 7 (7), e40157. 10.1371/journal.pone.0040157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (9).Li YR; Xian RR; Ziober A; Conejo-Garcia J; Perales-Puchalt A; June CH; Zhang PJ; Tchou J Mesothelin Expression Is Associated with Poor Outcomes in Breast Cancer. Breast Cancer Res Treat 2014, 147 (3), 675–684. 10.1007/s10549-014-3077-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (10).Yeo D; Castelletti L; van Zandwijk N; Rasko JEJ Hitting the Bull’s-Eye: Mesothelin’s Role as a Biomarker and Therapeutic Target for Malignant Pleural Mesothelioma. Cancers (Basel) 2021, 13 (16), 3932. 10.3390/cancers13163932. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (11).Gubbels JA; Belisle J; Onda M; Rancourt C; Migneault M; Ho M; Bera TK; Connor J; Sathyanarayana BK; Lee B; Pastan I; Patankar MS Mesothelin-MUC16 Binding Is a High Affinity, N-Glycan Dependent Interaction That Facilitates Peritoneal Metastasis of Ovarian Tumors. Mol Cancer 2006, 5 (1), 50. 10.1186/1476-4598-5-50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (12).Shimizu A; Hirono S; Tani M; Kawai M; Okada K; Miyazawa M; Kitahata Y; Nakamura Y; Noda T; Yokoyama S; Yamaue H Coexpression of MUC 16 and Mesothelin Is Related to the Invasion Process in Pancreatic Ductal Adenocarcinoma. Cancer Science 2012, 103 (4), 739–746. 10.1111/j.1349-7006.2012.02214.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (13).Chen S-H; Hung W-C; Wang P; Paul C; Konstantopoulos K Mesothelin Binding to CA125/MUC16 Promotes Pancreatic Cancer Cell Motility and Invasion via MMP-7 Activation. Sci Rep 2013, 3 (1), 1870. 10.1038/srep01870. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (14).Coelho R; Marcos-Silva L; Ricardo S; Ponte F; Costa A; Lopes JM; David L Peritoneal Dissemination of Ovarian Cancer: Role of MUC16-Mesothelin Interaction and Implications for Treatment. Expert Review of Anticancer Therapy 2018, 18 (2), 177–186. 10.1080/14737140.2018.1418326. [DOI] [PubMed] [Google Scholar]
- (15).Huo Q; Xu C; Shao Y; Yu Q; Huang L; Liu Y; Bao H Free CA125 Promotes Ovarian Cancer Cell Migration and Tumor Metastasis by Binding Mesothelin to Reduce DKK1 Expression and Activate the SGK3/FOXO3 Pathway. Int. J. Biol. Sci 2021, 17 (2), 574–588. 10.7150/ijbs.52097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (16).Laheru DA; Cohen SJ; Phillips M; Armstrong DK; Brahmer J; Jaffee EM; Weil S; Pastan I; Hassan R A Phase I Study of MORab-009, a Monoclonal Antibody against Mesothelin, in Mesothelioma, Pancreatic and Ovarian Cancer. JCO 2008, 26 (15_suppl), 3578–3578. 10.1200/jco.2008.26.15_suppl.3578. [DOI] [Google Scholar]
- (17).Hassan R; Kindler HL; Jahan T; Bazhenova L; Reck M; Thomas A; Pastan I; Parno J; O’Shannessy DJ; Fatato P; Maltzman JD; Wallin BA Phase II Clinical Trial of Amatuximab, a Chimeric Antimesothelin Antibody with Pemetrexed and Cisplatin in Advanced Unresectable Pleural Mesothelioma. Clinical Cancer Research 2014, 20 (23), 5927–5936. 10.1158/1078-0432.CCR-14-0804. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (18).Liu X; Onda M; Watson N; Hassan R; Ho M; Bera TK; Wei J; Chakraborty A; Beers R; Zhou Q; Shajahan A; Azadi P; Zhan J; Xia D; Pastan I Highly Active CAR T Cells That Bind to a Juxtamembrane Region of Mesothelin and Are Not Blocked by Shed Mesothelin. Proc. Natl. Acad. Sci. U.S.A 2022, 119 (19), e2202439119. 10.1073/pnas.2202439119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (19).Zhou X; Yang M; Yu J; Tan J; Xu N; Zhou Y; Zhang W; Ma J; Zhang Z; Friedlaender A; Taylor J; Yu L; Yan Z Regional Delivery of Mesothelin-Targeted Chimeric Antigen Receptor T-Cell Effectively and Safely Targets Colorectal Cancer Liver Metastases in Mice. J Gastrointest Oncol 2024, 15 (1), 312–329. 10.21037/jgo-24-25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (20).Zhan J; Lin D; Watson N; Esser L; Tang WK; Zhang A; Liu X; Hassan R; Gleinich A; Shajahan A; Azadi P; Pastan I; Xia D Structures of Cancer Antigen Mesothelin and Its Complexes with Therapeutic Antibodies. Cancer Research Communications 2023, 3 (2), 175–191. 10.1158/2767-9764.CRC-22-0306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (21).Kaneko O; Gong L; Zhang J; Hansen JK; Hassan R; Lee B; Ho M A Binding Domain on Mesothelin for CA125/MUC16. Journal of Biological Chemistry 2009, 284 (6), 3739–3749. 10.1074/jbc.M806776200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (22).Hassan R; Ebel W; Routhier EL; Patel R; Kline JB; Zhang J; Chao Q; Jacob S; Turchin H; Gibbs L; Phillips MD; Mudali S; Iacobuzio-Donahue C; Jaffee EM; Moreno M; Pastan I; Sass PM; Nicolaides NC; Grasso L Preclinical Evaluation of MORAb-009, a Chimeric Antibody Targeting Tumor-Associated Mesothelin. Cancer Immun 2007, 7, 20. [PMC free article] [PubMed] [Google Scholar]
- (23).Ma J; Tang WK; Esser L; Pastan I; Xia D Recognition of Mesothelin by the Therapeutic Antibody MORAb-009. Journal of Biological Chemistry 2012, 287 (40), 33123–33131. 10.1074/jbc.M112.381756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (24).Chu Q Targeting Mesothelin in Solid Tumours: Anti-Mesothelin Antibody and Drug Conjugates. Curr Oncol Rep 2023, 25 (4), 309–323. 10.1007/s11912-023-01367-8. [DOI] [PubMed] [Google Scholar]
- (25).Klampatsa A; Dimou V; Albelda SM Mesothelin-Targeted CAR-T Cell Therapy for Solid Tumors. Expert Opinion on Biological Therapy 2021, 21 (4), 473–486. 10.1080/14712598.2021.1843628. [DOI] [PubMed] [Google Scholar]
- (26).Dean AQ; Luo S; Twomey JD; Zhang B Targeting Cancer with Antibody-Drug Conjugates: Promises and Challenges. mAbs 2021, 13 (1), 1951427. 10.1080/19420862.2021.1951427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (27).Sirois AR; Deny DA; Baierl SR; George KS; Moore SJ Fn3 Proteins Engineered to Recognize Tumor Biomarker Mesothelin Internalize upon Binding. PLoS ONE 2018, 13 (5), e0197029. 10.1371/journal.pone.0197029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (28).Sirois AR; Deny DA; Li Y; Fall YD; Moore SJ Engineered Fn3 Protein Has Targeted Therapeutic Effect on Mesothelin-expressing Cancer Cells and Increases Tumor Cell Sensitivity to Chemotherapy. Biotech & Bioengineering 2020, 117 (2), 330–341. 10.1002/bit.27204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (29).Koide S; Koide A; Lipovšek D Target-Binding Proteins Based on the 10th Human Fibronectin Type III Domain (10Fn3). In Methods in Enzymology; Elsevier, 2012; Vol. 503, pp 135–156. 10.1016/B978-0-12-396962-0.00006-9. [DOI] [PubMed] [Google Scholar]
- (30).Woldring DR; Holec PV; Zhou H; Hackel BJ High-Throughput Ligand Discovery Reveals a Sitewise Gradient of Diversity in Broadly Evolved Hydrophilic Fibronectin Domains. PLoS ONE 2015, 10 (9), e0138956. 10.1371/journal.pone.0138956. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (31).Getmanova EV; Chen Y; Bloom L; Gokemeijer J; Shamah S; Warikoo V; Wang J; Ling V; Sun L Antagonists to Human and Mouse Vascular Endothelial Growth Factor Receptor 2 Generated by Directed Protein Evolution In Vitro. Chemistry & Biology 2006, 13 (5), 549–556. 10.1016/j.chembiol.2005.12.009. [DOI] [PubMed] [Google Scholar]
- (32).Bloom L; Calao V FN3: A New Protein Scaffold Reaches the Clinic. Drug Discovery Today 2009, 14 (19–20), 949–955. 10.1016/j.drudis.2009.06.007. [DOI] [PubMed] [Google Scholar]
- (33).Wojcik J; Hantschel O; Grebien F; Kaupe I; Bennett KL; Barkinge J; Jones RB; Koide A; Superti-Furga G; Koide S A Potent and Highly Specific FN3 Monobody Inhibitor of the Abl SH2 Domain. Nat Struct Mol Biol 2010, 17 (4), 519–527. 10.1038/nsmb.1793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (34).Hackel BJ; Kimura RH; Gambhir SS Use of 64 Cu-Labeled Fibronectin Domain with EGFR-Overexpressing Tumor Xenograft: Molecular Imaging. Radiology 2012, 263 (1), 179–188. 10.1148/radiol.12111504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (35).Natarajan A; Hackel BJ; Gambhir SS A Novel Engineered Anti-CD20 Tracer Enables Early Time PET Imaging in a Humanized Transgenic Mouse Model of B-Cell Non-Hodgkins Lymphoma. Clinical Cancer Research 2013, 19 (24), 6820–6829. 10.1158/1078-0432.CCR-13-0626. [DOI] [PubMed] [Google Scholar]
- (36).Donnelly DJ; Smith RA; Morin P; Lipovšek D; Gokemeijer J; Cohen D; Lafont V; Tran T; Cole EL; Wright M; Kim J; Pena A; Kukral D; Dischino DD; Chow P; Gan J; Adelakun O; Wang X-T; Cao K; Leung D; Bonacorsi SJ; Hayes W Synthesis and Biologic Evaluation of a Novel 18 F-Labeled Adnectin as a PET Radioligand for Imaging PD-L1 Expression. J Nucl Med 2018, 59 (3), 529–535. 10.2967/jnumed.117.199596. [DOI] [PubMed] [Google Scholar]
- (37).Huisman MC; Niemeijer A-LN; Windhorst AD; Schuit RC; Leung D; Hayes W; Poot A; Bahce I; Radonic T; Oprea-Lager DE; Hoekstra OS; Thunnissen E; Hendrikse NH; Smit EF; De Langen AJ; Boellaard R Quantification of PD-L1 Expression with 18 F-BMS-986192 PET/CT in Patients with Advanced-Stage Non–Small Cell Lung Cancer. J Nucl Med 2020, 61 (10), 1455–1460. 10.2967/jnumed.119.240895. [DOI] [PubMed] [Google Scholar]
- (38).De Vivo M; Masetti M; Bottegoni G; Cavalli A Role of Molecular Dynamics and Related Methods in Drug Discovery. J. Med. Chem 2016, 59 (9), 4035–4061. 10.1021/acs.jmedchem.5b01684. [DOI] [PubMed] [Google Scholar]
- (39).Abramson J; Adler J; Dunger J; Evans R; Green T; Pritzel A; Ronneberger O; Willmore L; Ballard AJ; Bambrick J; Bodenstein SW; Evans DA; Hung C-C; O’Neill M; Reiman D; Tunyasuvunakool K; Wu Z; Žemgulytė A; Arvaniti E; Beattie C; Bertolli O; Bridgland A; Cherepanov A; Congreve M; Cowen-Rivers AI; Cowie A; Figurnov M; Fuchs FB; Gladman H; Jain R; Khan YA; Low CMR; Perlin K; Potapenko A; Savy P; Singh S; Stecula A; Thillaisundaram A; Tong C; Yakneen S; Zhong ED; Zielinski M; Žídek A; Bapst V; Kohli P; Jaderberg M; Hassabis D; Jumper JM Accurate Structure Prediction of Biomolecular Interactions with AlphaFold 3. Nature 2024, 630 (8016), 493–500. 10.1038/s41586024-07487-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (40).Keskin O; Tuncbag N; Gursoy A Predicting Protein–Protein Interactions from the Molecular to the Proteome Level. Chem. Rev 2016, 116 (8), 4884–4909. 10.1021/acs.chemrev.5b00683. [DOI] [PubMed] [Google Scholar]
- (41).Kastritis PL; Bonvin AMJJ On the Binding Affinity of Macromolecular Interactions: Daring to Ask Why Proteins Interact. J. R. Soc. Interface 2013, 10 (79), 20120835. 10.1098/rsif.2012.0835. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (42).Chen F; Liu H; Sun H; Pan P; Li Y; Li D; Hou T Assessing the Performance of the MM/PBSA and MM/GBSA Methods. 6. Capability to Predict Protein–Protein Binding Free Energies and Re-Rank Binding Poses Generated by Protein–Protein Docking. Phys. Chem. Chem. Phys 2016, 18 (32), 22129–22139. 10.1039/C6CP03670H. [DOI] [PubMed] [Google Scholar]
- (43).Spinello A; Vecile E; Abbate A; Dobrina A; Magistrato A How Can Interleukin-1 Receptor Antagonist Modulate Distinct Cell Death Pathways? J. Chem. Inf. Model 2019, 59 (1), 351–359. 10.1021/acs.jcim.8b00565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (44).Kozakov D; Hall DR; Xia B; Porter KA; Padhorny D; Yueh C; Beglov D; Vajda S The ClusPro Web Server for Protein–Protein Docking. Nat Protoc 2017, 12 (2), 255–278. 10.1038/nprot.2016.169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (45).Yan Y; Tao H; He J; Huang S-Y The HDOCK Server for Integrated Protein–Protein Docking. Nat Protoc 2020, 15 (5), 1829–1852. 10.1038/s41596-020-0312-x. [DOI] [PubMed] [Google Scholar]
- (46).Jiménez-García B; Pons C; Fernández-Recio J pyDockWEB: A Web Server for Rigid-Body Protein–Protein Docking Using Electrostatics and Desolvation Scoring. Bioinformatics 2013, 29 (13), 1698–1699. 10.1093/bioinformatics/btt262. [DOI] [PubMed] [Google Scholar]
- (47).Kollman PA; Massova I; Reyes C; Kuhn B; Huo S; Chong L; Lee M; Lee T; Duan Y; Wang W; Donini O; Cieplak P; Srinivasan J; Case DA; Cheatham TE Calculating Structures and Free Energies of Complex Molecules: Combining Molecular Mechanics and Continuum Models. Acc. Chem. Res 2000, 33 (12), 889–897. 10.1021/ar000033j. [DOI] [PubMed] [Google Scholar]
- (48).Andrade MA; Petosa C; O’Donoghue SI; Müller CW; Bork P Comparison of ARM and HEAT Protein Repeats. Journal of Molecular Biology 2001, 309 (1), 1–18. 10.1006/jmbi.2001.4624. [DOI] [PubMed] [Google Scholar]
- (49).Cochran JR; Kim Y-S; Olsen MJ; Bhandari R; Wittrup KD Domain-Level Antibody Epitope Mapping through Yeast Surface Display of Epidermal Growth Factor Receptor Fragments. Journal of Immunological Methods 2004, 287 (1–2), 147–158. 10.1016/j.jim.2004.01.024. [DOI] [PubMed] [Google Scholar]
- (50).Ellgaard L; Helenius A Quality Control in the Endoplasmic Reticulum. Nat Rev Mol Cell Biol 2003, 4 (3), 181–191. 10.1038/nrm1052. [DOI] [PubMed] [Google Scholar]
- (51).Kim Y; Bhandari R; Cochran JR; Kuriyan J; Wittrup KD Directed Evolution of the Epidermal Growth Factor Receptor Extracellular Domain for Expression in Yeast. Proteins 2006, 62 (4), 1026–1035. 10.1002/prot.20618. [DOI] [PubMed] [Google Scholar]
- (52).Chang K; Pastan I Molecular Cloning of Mesothelin, a Differentiation Antigen Present on Mesothelium, Mesotheliomas, and Ovarian Cancers. Proc. Natl. Acad. Sci. U.S.A 1996, 93 (1), 136–140. 10.1073/pnas.93.1.136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (53).Zhang Y-F; Phung Y; Gao W; Kawa S; Hassan R; Pastan I; Ho M New High Affinity Monoclonal Antibodies Recognize Non-Overlapping Epitopes On Mesothelin For Monitoring And Treating Mesothelioma. Sci Rep 2015, 5 (1), 9928. 10.1038/srep09928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (54).Liu XF; Onda M; Schlomer J; Bassel L; Kozlov S; Tai C-H; Zhou Q; Liu W; Tsao H-E; Hassan R; Ho M; Pastan I Tumor Resistance to Anti-Mesothelin CAR-T Cells Caused by Binding to Shed Mesothelin Is Overcome by Targeting a Juxtamembrane Epitope. Proc. Natl. Acad. Sci. U.S.A 2024, 121 (4), e2317283121. 10.1073/pnas.2317283121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (55).Nicolaides NC; Kline JB; Grasso L NAV-001, a High-Efficacy Antibody-Drug Conjugate Targeting Mesothelin with Improved Delivery of a Potent Payload by Counteracting MUC16/CA125 Inhibitory Effects. PLoS ONE 2023, 18 (5), e0285161. 10.1371/journal.pone.0285161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (56).Sun Z; Chu X; Adams C; Ilina TV; Guerrero M; Lin G; Chen C; Jelev D; Ishima R; Li W; Mellors JW; Calero G; Dimitrov DS Preclinical Assessment of a Novel Human Antibody VH Domain Targeting Mesothelin as an Antibody-Drug Conjugate. Molecular Therapy - Oncolytics 2023, 31, 100726. 10.1016/j.omto.2023.09.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (57).Chakraborty A; Onda M; O’Shea T; Wei J; Liu X; Bera TK; Pastan I A Bispecific Antibody That Targets the Membrane-Proximal Region of Mesothelin and Retains High Anticancer Activity in the Presence of Shed Mesothelin. Molecular Cancer Therapeutics 2024, OF1–OF10. 10.1158/1535-7163.MCT-23-0233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (58).Grasso L; Jiang Q; Hassan R; Nicolaides NC; Kline JB NAV-003, a Bispecific Antibody Targeting a Unique Mesothelin Epitope and CD3ε with Improved Cytotoxicity against Humoral Immunosuppressed Tumors. Eur J Immunol 2023, 53 (8), 2250309. 10.1002/eji.202250309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (59).Waterhouse A; Bertoni M; Bienert S; Studer G; Tauriello G; Gumienny R; Heer FT; de Beer TAP; Rempfer C; Bordoli L; Lepore R; Schwede T SWISS-MODEL: Homology Modelling of Protein Structures and Complexes. Nucleic Acids Research 2018, 46 (W1), W296–W303. 10.1093/nar/gky427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (60).Van Der Spoel D; Lindahl E; Hess B; Groenhof G; Mark AE; Berendsen HJC GROMACS: Fast, Flexible, and Free. J Comput Chem 2005, 26 (16), 1701–1718. 10.1002/jcc.20291. [DOI] [PubMed] [Google Scholar]
- (61).Lindorff-Larsen K; Piana S; Palmo K; Maragakis P; Klepeis JL; Dror RO; Shaw DE Improved Side-chain Torsion Potentials for the Amber ff99SB Protein Force Field. Proteins 2010, 78 (8), 1950–1958. 10.1002/prot.22711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (62).Case DA; Aktulga HM; Belfon K; Cerutti DS; Cisneros GA; Cruzeiro VWD; Forouzesh N; Giese TJ; Götz AW; Gohlke H; Izadi S; Kasavajhala K; Kaymak MC; King E; Kurtzman T; Lee T-S; Li P; Liu J; Luchko T; Luo R; Manathunga M; Machado MR; Nguyen HM; O’Hearn KA; Onufriev AV; Pan F; Pantano S; Qi R; Rahnamoun A; Risheh A; Schott-Verdugo S; Shajan A; Swails J; Wang J; Wei H; Wu X; Wu Y; Zhang S; Zhao S; Zhu Q; Cheatham TE; Roe DR; Roitberg A; Simmerling C; York DM; Nagan MC; Merz KM AmberTools. J. Chem. Inf. Model 2023, 63 (20), 6183–6191. 10.1021/acs.jcim.3c01153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (63).Spinello A; Janos P; Rozza R; Magistrato A Cancer-Related Mutations Alter RNA-Driven Functional Cross-Talk Underlying Premature-Messenger RNA Recognition by Splicing Factor SF3b. J. Phys. Chem. Lett 2023, 14 (27), 6263–6269. 10.1021/acs.jpclett.3c01402. [DOI] [PubMed] [Google Scholar]
- (64).Miller BR; McGee TD; Swails JM; Homeyer N; Gohlke H; Roitberg AE MMPBSA.Py : An Efficient Program for End-State Free Energy Calculations. J. Chem. Theory Comput 2012, 8 (9), 3314–3321. 10.1021/ct300418h. [DOI] [PubMed] [Google Scholar]
- (65).Spinello A; Borišek J; Malcovati L; Magistrato A Investigating the Molecular Mechanism of H3B-8800: A Splicing Modulator Inducing Preferential Lethality in Spliceosome-Mutant Cancers. IJMS 2021, 22 (20), 11222. 10.3390/ijms222011222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (66).Genheden S; Ryde U The MM/PBSA and MM/GBSA Methods to Estimate Ligand-Binding Affinities. Expert Opinion on Drug Discovery 2015, 10 (5), 449–461. 10.1517/17460441.2015.1032936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (67).Gai SA; Wittrup KD Yeast Surface Display for Protein Engineering and Characterization. Current Opinion in Structural Biology 2007, 17 (4), 467–473. 10.1016/j.sbi.2007.08.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (68).Boder ET; Wittrup KD Yeast Surface Display for Screening Combinatorial Polypeptide Libraries. Nat Biotechnol 1997, 15 (6), 553–557. 10.1038/nbt0697-553. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supporting Information File 1 includes:
Figure S1. Homology modeling of Fn3
Figure S2. Superposition with MORAb-009 structure
Figure S3. Root mean square deviation (RMSD)
Figure S4. Principal component analysis (PCA) of MSLN
Figure S5. Root mean square fluctuation (RMSF)
Figure S6. MSLN domains AB on the surface of yeast do not demonstrate detectable binding to Fn3_5
Figure S7. Visualization of glycosylation sites on MSLN
Table S1. Per-residue decomposition of the binding free-energy (ΔGb) of the ClusPro complex.
Table S2. Per-residue decomposition of the binding free-energy (ΔGb) of the AlphaFold3 complex.
Table S3. Per-residue decomposition of the binding free-energy (ΔGb) of the HDOCK complex.
Table S4. Per-residue decomposition of the binding free-energy (ΔGb) of the pyDock complex.
Table S5. Statistical analysis comparing KD values of Fn3_5 binding MSLN variants.
Supporting Information File 2 includes: A video reporting the motion of MSLN, as extracted from the molecular dynamic trajectory of the MSLN/Fn3_5 ClusPro complex. Principal component 1 is shown.
Supporting Information File 3 includes: A video reporting the motion of MSLN, as extracted from the molecular dynamic trajectory of the MSLN/Fn3_5 AlphaFold3 complex. Principal component 1 is shown.
