Abstract
The identity and biological activity of most metabolites still remain unknown. A bottleneck in the exploration of metabolite structures and pharmaceutical activities is the compound purification needed for bioactivity assignments and downstream structure elucidation. To enable bioactivity-focused compound identification from complex mixtures, we develop a scalable native metabolomics approach that integrates non-targeted liquid chromatography tandem mass spectrometry and detection of protein binding via native mass spectrometry. A native metabolomics screen for protease inhibitors from an environmental cyanobacteria community reveals 30 chymotrypsin-binding cyclodepsipeptides. Guided by the native metabolomics results, we select and purify five of these compounds for full structure elucidation via tandem mass spectrometry, chemical derivatization, and nuclear magnetic resonance spectroscopy as well as evaluation of their biological activities. These results identify rivulariapeptolides as a family of serine protease inhibitors with nanomolar potency, highlighting native metabolomics as a promising approach for drug discovery, chemical ecology, and chemical biology studies.
Subject terms: Mass spectrometry, Natural products, Metabolomics, Proteases
Bioactivity-guided isolation of specialized metabolites is an iterative process. Here, the authors demonstrate a native metabolomics approach that allows for fast screening of complex metabolite extracts against a protein of interest and simultaneous structure annotation.
Introduction
Specialized metabolites, often referred to as natural products, are a tremendous pool of chemically diverse and pharmaceutically active organic compounds. By some estimates, more than 50% of all current pharmaceuticals are based on or inspired by natural products1. Nevertheless, the vast majority of biological activities and pharmaceutical potential of specialized metabolites, as well as their ecological functions, still remain to be discovered2.
While mining genome and metagenome data has begun to provide an overview of the biosynthetic potential of nature3,4, most specialized metabolites remain inaccessible, as the living organism that produces these metabolites cannot be cultured or their gene clusters remain silent under laboratory culture conditions. Natural product discovery and chemical ecology studies in environmental systems are hence becoming more and more attractive5. Along with next-generation sequencing technologies, recent instrument and computational advances in nuclear magnetic resonance (NMR) spectroscopy and non-targeted liquid chromatography-tandem mass spectrometry (LC-MS/MS) offer tremendous assistance in exploring the uncharted metabolic space of nature6. These tools enable large-scale compound dereplication, rapid identification of chemical analogs, and de novo annotation of molecular formulas, substructures, chemical classes, and structures7–12. However, the assignment of bioactivity to newly identified metabolites typically requires assays using pure compounds. Therefore, the isolation of specialized metabolites is typically guided by repetitive bioactivity assays. Together with full structure elucidation, this process usually takes months and is, therefore, a major bottleneck for the systematic exploration of nature for novel pharmaceutically active compounds and comprehensive chemical ecology studies.
A logical next step is the development of activity metabolomics13 or functional-metabolomics14 approaches that aim to add functional information to the metabolites detected in a given system. Native electrospray ionization (ESI) such as immobilized enzyme MS (IEMS)15 and affinity mass spectrometry (MS) such as pulsed ultrafiltration (UF) MS, size exclusion (SEC) or affinity bead-based pull-down assays are increasingly being used to analyze non-covalent binding of biomolecules16–21. An important difference between native MS and affinity MS is that native MS detects ligands directly bound to a protein, whereas affinity MS approaches typically measure ligand binding indirectly as free compounds. For affinity MS, the target protein is captured by UF, SEC, centrifugation, or magnetic removal and the released ligand is subsequently identified by small molecule MS analysis22–25. Both native and affinity MS approaches have been applied with single compounds as well as substrate pools, which allows for the simultaneous screening of complex natural compound libraries26–28. Depending on the experimental setup, an important limitation in the use of ligand pools is that multiple ligands compete for binding of the target at the same site, and therefore compounds with the highest affinity or concentration are more easily discovered. However, this limitation can be overcome by using a molar excess of protein compared with the total molar concentration of the ligand library15. Additionally, the annotation of bound ligands remains a challenging task, especially if the ligand pool contains multiple isobaric compounds. While direct infusion native MS workflows have been developed that can identify metabolites bound to proteins by MS/MS29, combining the separation power of ultra-high-performance liquid chromatography (UHPLC) and the selectivity of native MS and MS/MS would offer a promising improvement to decipher protein-metabolite interactions out of complex biological mixtures, such as environmental samples. However, typical UHPLC mobile phase conditions disfavor non-covalent protein binding due to an acidic pH and high organic solvent content. To perform native MS coupled to UHPLC, we developed an experimental setup that increases pH and water content of the mobile phase post-column and infuses a protein binding partner before entering the ESI interface (Fig. 1). As the protein is constantly infused post-column, one can monitor the intact protein mass over the LC-MS run and observe mass shifts when eluting metabolites bind to the protein at a defined retention time. Using collision-induced dissociation (e.g., Higher Energy C-Trap Dissociation (HCD) in the setup), the complex can be dissociated again in the mass spectrometer, and a “binding threshold” can be applied to distinguish between specific and nonspecific binding. In combination with parallel non-targeted MS/MS analyses, the mass and compound ID or compound class can be assigned (level 2 or level 3 annotation30,31).
Cyanobacteria are known to be a rich source of highly bioactive natural products32–35, and in particular, protease inhibitors from numerous chemical classes36–41. Protease inhibitors are key compounds used for the treatment of viral infections (SARS-CoV-2, HIV, and Hepatitis C)42,43, cancer44, diabetes45, hypertension46, and as general anticoagulants47. Several of the approved protease inhibitors are analogs of natural products such as aliskiren, captopril, and carfilzomib that target renin, angiotensin-converting enzyme, and the proteasome, respectively48.
In this study, we screen for protease inhibitors from an environmental cyanobacteria biofilm as an application of the native metabolomics workflow. We use chymotrypsin as the protease target to identify inhibitors from a marine cyanobacteria community. We identify 30 chymotrypsin binders in the methanolic crude extract with a single LC-MS run. The masses and MS/MS spectra of the binders are then queried against structural and spectral databases, revealing that most of them are unknown. This leads to the targeted isolation and structure elucidation of a family of highly potent protease inhibitors, which we termed rivulariapeptolides.
Results
Development of the native metabolomics approach
In a crude extract, native metabolomics provides binding information about each compound towards a protein of interest. In the experimental setup, we utilized a single 10-min LC-MS run to discover compounds that bind to the serine protease, chymotrypsin. The workflow is as follows: a crude extract is analyzed using native ESI while the protein of interest is infused post-column throughout the entire LC gradient. Binding of a small molecule to the protein of interest results in a peak with a mass corresponding to the protein bound to the compound. The m/z difference between the protein-ligand complex and the unbound protein reveals the molecular weight of the ligand while the ratio of the intensity of the protein-ligand peaks relative to the unbound protein peaks hints toward the relative binding affinity under the given conditions. To reduce nonspecific binding, native metabolomics experiments were performed in all-ion fragmentation (AIF) mode, using a threshold collision energy of 20% in the HCD cell.
We first optimized the pH for the native mass spectrometric acquisition of chymotrypsin and confirmed that the enzyme remains active under the native metabolomics buffer conditions. As a positive control for binding, we used molassamide49, a known non-covalent serine protease inhibitor of the 3-amino-6-hydroxy-2-piperidone (Ahp)-cyclodepsipeptide family. We found that an ammonium acetate solution of pH 4.5 showed the highest peak intensity (Supplementary Fig. 1a). Volatile ammonium acetate solution is the most commonly used non-denaturing solvent in native MS measurements of proteins. Even though having initially neutral pH, ammonium acetate has no buffering capacity at pH 7, and in positive ion mode, there are several factors that tend to acidify the analyte solution. In the presence of ammonium acetate, the pH may drop to values as low as 4.75 ± 1 in the final ESI droplets, reflecting the pKa of acetate buffer50. Next, we injected a serial dilution of molassamide into the native metabolomics LC-MS setup, where it was mixed post-column with a constant concentration of chymotrypsin. The protein-ligand complex was detected at a deconvoluted mass of 26195.1 Da and the unbound apoprotein at 25232.6 Da (Supplementary Fig. 1b). The observed Δ mass of 962.5 Da matches the mass of molassamide (962.4749 Da). We also observed a minor peak with a mass shift that corresponds to two molassamide molecules in the multiple charged spectrum (Supplementary Fig. 1d), which can be an indication of nonspecific binding51. However, as molassamide is a known nanomolar inhibitor, we cannot rule out whether the second peak could be due to nonspecific binding effects or if it represents a secondary weaker interaction. After deconvolution and integrating the peak area of the protein-ligand complex and plotting it against the molassamide concentration in the peak window, we obtained a binding curve. The resulting curve depicts a concentration-dependent increase of protein-ligand to unbound protein ratio with increasing ligand concentration (Supplementary Fig. 1c). The limit of detection for the molassamide-chymotrypsin interaction was between 0.1 and 1 µg/mL (1 − 10 ng on column) (Supplementary Fig. 1e). To further test the biological relevance of the native metabolomics conditions, chymotrypsin was assayed with a crude extract from the cyanobacterium Rivularia sp. using a fluorescence substrate competition assay. The bioassay conditions were designed to mimic the pH and solvent composition expected in the native mass spectrometry setup. Although chymotrypsin is optimally active in near-neutral pH, it retains activity in 10 mM NH4Acetate at pH 4.5. Under these conditions, chymotrypsin was completely inhibited by 10 µg/mL of the extract with 50% inhibition at 0.84 µg/mL (Supplementary Fig. 1f). Chymotrypsin was then assayed in increasing concentrations of acetonitrile (ACN) to determine if enzyme activity was retained in this solvent. The activity was reduced by 9 to 34% in ACN concentrations up to 33.3% v/v. In the presence of 41.7% v/v ACN, which corresponds to the end of the UHPLC gradient after the make-up addition, activity was decreased by 70% (Supplementary Fig. 1g). These results confirmed that chymotrypsin can be used as a target protease for native metabolomics as it retains activity at pH 4.5 in ACN concentrations up to 41.7% and binds to compounds from an inhibitory crude extract. The NH4Acetate concentration can have a large influence on the ionization and dissociation constants of non-covalent protein-binder complexes. Interestingly, higher NH4Acetate concentration can lead to both higher or lower binding affinities depending on the protein-ligand pair investigated52. We tested a series of NH4Acetate concentrations (10–100 mM, Supplementary Fig. 2) and found 10 mM to be the most favorable for studying chymotrypsin-molassamide complexes (highest relative abundance, highest chymotrypsin-molassamide to chymotrypsin ratio, lowest abundance of interfering NH4Acetate clusters). To test the variability of the changing ACN concentration during the LC separation, we performed a series of flow injections (without column) over the full gradient (Supplementary Fig. 3a). The XIC of the molassamide bound chymotrypsin reveals similar signal responses throughout the gradient (5-99% ACN on column). While injecting a pool of control compounds at concentrations of 10 µg/mL, we observed compound-specific signal responses of the chymotrypsin-ligand complexes (Supplementary Fig. 3b). To further assess the specificity of the protein-small molecule interactions, we evaluated the binding of the linear oligopeptidic cysteine protease inhibitor gallinamide A, the cyclic depsipeptide FR900359, the isoflavone genistein, the phenol phloroglucinol, and the anthraquinone quinalizarin34,53,54. We did not observe the binding of these negative controls to chymotrypsin under native MS conditions (Supplementary Fig. 3c).
Native metabolomics reveals chymotrypsin binders
Following the successful proof-of-concept experiments, we next screened for potential chymotrypsin binders from a crude extract of a biofilm from the marine cyanobacterium Rivularia sp, collected from coral sediments at Carlos Rosario Beach in Culebra, Puerto Rico, U.S. The methanol extract was separated by reversed-phase UHPLC and ammonium acetate buffer and chymotrypsin were infused post-column, prior to native ESI and acquisition of mass spectrometry data in the high m/z range (2500-5000 m/z). The crude extract was subsequently re-injected, without the infusion of chymotrypsin to obtain high-resolution LC-MS/MS data of compounds in the extract in the low m/z range (300–2000 m/z).
As the first step of data analysis, we plotted the total ion current (TIC) of the crude extract metabolomics (Fig. 2a, left) and extracted ion chromatogram (XIC) of the apo-chymotrypsin from charge state deconvoluted native mass spectrometry data (Fig. 2a, right) which shows several negative peaks in the range of 4.5–5.5 min. The decrease in a signal of the apoprotein in that retention time range is due to the emergence of larger masses that correspond to protein-ligand complexes. After feature finding of the deconvoluted masses and matching of the parallel metabolomics LC-MS/MS data of the crude extracts by retention time and exact mass matching, we could identify more than 30 potential small molecule-protein complexes. To display the family of small molecules that form protein-ligand complexes and to show their structural relations, we visualized them in a correlation molecular network (Fig. 2b and Supplementary Fig. 4) that is based on their MS/MS similarity (gray line), retention time, and mass matching between protein and small molecules through the red dashed lines.
Two of the most abundant chymotrypsin binders have m/z 1186.6400 and m/z 1156.5923 that also show a perfect overlap of the chromatographic profiles between intact protein and metabolomics LC-MS/MS data (Fig. 2c and Supplementary Fig. 5). Based on the high relative abundance we targeted them for further purification, structure determination by NMR and orthogonal protease inhibition assays.
Rivulariapeptolides, a family of new Ahp-cyclodepsipeptides
The potential chymotrypsin-binder with the m/z 1186.6400, identified by native metabolomics, was next targeted for isolation and structure elucidation, using state-of-the-art high-resolution MS/MS and NMR approaches55 (see also Supplementary Methods). We first separated the Rivularia crude extract into four fractions of decreasing polarity via solid phase extraction. SMART NMR11 analysis was applied to the most hydrophilic fraction and all but one structure of the top 10 SMART results were predicted as cyclic depsipeptides (Fig. 3a, b and Supplementary Fig. 6), including 6 of the top 10 as Ahp-cyclodepsipeptides (three from marine filamentous cyanobacteria: somamide B, molassamide, lyngbyastatin 6)49,56–58. Complementarily, MS/MS-based molecular networking analysis of the Rivularia crude fractions assisted with the annotation of the known Ahp-peptides molassamide, kurahamide, and loggerpeptin A, along with several putatively new ones (Fig. 3a, b and Supplementary Fig. 7a)49,57,59. Next, the SIRIUS and ZODIAC tools10,60 were applied to determine the molecular formula of the chymotrypsin-binding feature, with exact mass m/z 1186.6400 [M + H]+, as C61H87N9O15 (0.5 ppm). Subsequently, we classified the MS/MS spectrum as indicative of a “cyclic depsipeptide” based on the classification with CANOPUS9. Further substructures of the molecule were predicted as benzene, hydroxy-benzene, and proline/N-acyl-pyrrolidine derivatives, as well as piperidinone/delta-lactam for the Ahp-family defining moiety (Fig. 3c and Supplementary Fig. 7b).
To unambiguously determine the structure, we isolated m/z 1186.6400, named rivulariapeptolide 1185 (1), and performed 1D/2D NMR experiments and manual MS/MS interpretation (Fig. 3d, Supplementary Figs. 8–13, 43, and Supplementary Table 1). Subsequently, we targeted the isolation of further rivulariapeptolides by preparative HPLC, based on their protein-ligand complex ratios from the native metabolomics experiments as well as their relative abundance. In that way, we isolated and elucidated the planar structures of the rivulariapeptolides 1185, 1155, 1121, and 989 (1, 2, 3, 4) with the exact masses 1186.6400 [M + H]+ (C61H88N9O15, 0.5 ppm), 1156.5923 [M + H]+ (C59H82N9O15, − 0.2 ppm), 1122.6080 [M + H]+ (C56H84N9O15, − 0.1 ppm) and 989.4978 [M + H]+ (C50H69N8O13, − 0.1 ppm). In addition to rivulariapeptolides, we identified a new molassamide derivative that we termed “molassamide B” (6) with m/z 1041.3924 [M + H]+ (C48H66BrN8O13, −0.3 ppm), which is ortho-brominated (Supplementary Figs. 14–43 and Supplementary Tables 2, 3). The absolute configurations of the amino acids were determined by UHPLC-MS analysis of the acid hydrolysates of 2 and its pyridinium dichromate oxidation product and subsequent advanced Marfey’s analysis (Supplementary Fig. 44a). The analyses revealed L-configurations for all amino acids as is the case for other cyanobacterial Ahp-cyclodepsipeptides. The relative configuration of the stereocenters of the (3 S, 6 R)-Ahp-unit as well as the geometry of the double bond (Z-configuration) of the 2-amino-2-butenoic acid (Abu) moiety was determined by NOESY and HMBC NMR experiments (Supplementary Fig. 44b). Assuming that the other rivulariapeptolides (1, 3, 4) and molassamides (5 + 6) described here originate from the same biosynthetic nonribosomal peptide synthetase, they most probably also share the same backbone configuration. The highly comparable MS/MS and NMR data sets of compounds 1-6 (Supplementary Figs. 8–43 and Supplementary Tables 1–4) provide additional evidence that the discovered Ahp-cyclodepsipeptides from this study share the same configuration.
Finally, the chymotrypsin inhibitory activities of the purified Ahp-cyclodepsipeptides 1-6 were assessed by specific biochemical assays and confirmed the results of the native metabolomic protein infusion MS experiments (Fig. 4). All six compounds were found to be nanomolar chymotrypsin inhibitors with compound 1 being the most potent (Fig. 4, IC50 = 13.17 nM). The newly described family of rivulariapeptolides is characterized by a rare (duplicated) N-butyrylated proline moiety in the side chain. Peptides 1-4 and 6 are among the six most potent natural chymotrypsin inhibitors from the compound class of Ahp-cyclodepsipeptides that have been reported to date36 (Supplementary Table 5).
Intriguingly, a single ortho-bromination in the N-methyltyrosine moiety led to a 35-fold increase in potency for the new compound 6 (IC50 = 24.65 nM), when compared to the known compound 5 (IC50 = 862.60 nM). These promising results led us to test these six Ahp-cyclodepsipeptides against two other serine proteases, elastase and proteinase K. Elastase is produced in either the pancreas, for digestion of food, or by neutrophils for degradation of foreign proteins. Neutrophil elastase is a well-established drug target for the treatment of acute lung injury and acute respiratory distress syndrome61. Proteinase K is a fungal serine endopeptidase that is commonly used in molecular biology procedures62. This enzyme family plays important roles in the fungal infection of insects63 and mammals64. While 2 was found to be a potent elastase inhibitor (IC50 = 4.94 nM), 6 was discovered to be the most potent proteinase K inhibitor known to date (IC50 = 5.42 nM). The isolated compounds were docked by induced-fit, inside the binding pocket of alpha-chymotrypsin (PDBID 4Q2K), and all were found to have a similar binding mode (Supplementary Methods and Supplementary Fig. 45) that was revealed by crystal structures of Ahp-cyclodepsipeptides in complex with serine proteases65,66.
Discussion
Here we describe the use of native metabolomics protein infusion MS to simultaneously detect protein-metabolite binding and annotate their molecular structures. This approach can be used for rapid screening of small molecule modulators for proteins of interest, directly from crude extracts. In our case study, we identified 30 chymotrypsin-binding natural products in a 10 min LC-MS run from a few µg of crude extract (not including downstream isolation and de novo structure elucidation). In comparison to flow-injection experiments (injection of crude extract in the native MS setup without column), we observed a strong signal decrease, most likely through collective ion suppression effects (Supplementary Fig. 3d, left panel). When injecting preincubated chymotrypsin-crude extract mixture into our system (Supplementary Fig. 3d, right panel) we observed a more acceptable signal response and several protein-metabolite complexes, which we attributed to molassamide B, a molassamide derivative, and rivulariapeptolide 1185. However, in comparison to native metabolomics, the number of putative binders observed was significantly lower, indicating that chromatographic separation is an important factor in the sensitivity of the approach. Most importantly, the chromatographic dimension is also essential for the unambiguous linking of binding information to MS/MS features which facilitates dereplication and downstream structure elucidation. Online Post-column complex formation by addition of metal binding partner has been used, to study carbohydrates67 and siderophores68, respectively, which we here expand to protein targets as binding partners.
Initial tests with 12 proteins showed the accessibility of a wide range of protein targets by native metabolomics (Supplementary Fig. 46) as well as the need to optimize buffer concentrations and mass spectrometry settings. While this experiment highlights the accessibility of diverse proteins under native metabolomics conditions, we did not monitor ligand binding for these proteins. Since we did not assess binding properties with differentially affine ligands and proteins of different molecular weights, the suitability of our approach for studying protein-small molecule interactions of lower affinity and with larger proteins remains to be characterized. Still, we assume that native metabolomics is generalizable to other protein targets that are accessible via native ESI69–71 provided that they are available in large enough quantities. The amounts of protein needed for native metabolomics high-throughput screening are a few mg for a 24 h screen of 96 samples with a 10 min LC-MS method, which is achievable for many commercially available proteins or those coming from in-house heterologous protein expression.
Besides the ad hoc experimental determination of binding information, we systematically organized this information in the public GNPS spectral library through spectrum tags. Hence, native metabolomics-derived properties are accessible in future experiments and can provide biological context to complex metabolomes.
We primarily used the method for an initial screening approach to assign binary binding information (binder/non-binder). In this study we did not determine absolute binding affinities by native metabolomics. Nevertheless, informed estimations can be made about relative binding affinities by fitting the intensity ratio of bound and unbound protein as a function of ligand concentration21,72. This method first assumes that no dissociation takes place during the transmission through the mass spectrometer, and second, that the observed gas-phase intensity ratio correlates with the solution ratio. To test the degree of in-source dissociation of protein-binder complexes using our native metabolomics approach, we varied the ESI parameters (in-source CID energy, HESI aux gas temperature, S-Lense RF Level) and observed the resulting change in protein-binder complex ratios compared to the apoprotein (Supplementary Fig. 47). We concluded that the protein-binder complex of interest does not undergo in-source dissociation over a wide range of parameter settings. Only very high CID levels (80 eV) or low S-Lense RF Levels (10%) can lead to the complete loss of signal for the protein-binder complex or low absolute intensity, respectively. These assumptions imply that ESI titration measurements can deliver a relative “snapshot” of the solution concentrations to reflect solution-phase binding affinities. Nevertheless, it is important to point out that the native metabolomics approach should be considered as a first hypothesis-generating tool, and that putative binders should be further validated with orthogonal methods to both determine absolute binding affinities, enzyme inhibition rates, and to out rule nonspecific binding effects.
From a natural product discovery perspective, it is very interesting that 30 putative bioactive molecules for a target protein were discovered from a single extract. This indicates that the chemical space for certain bioactive molecular families can often be underestimated when compared to traditional bioactivity-guided approaches, as they are typically biased towards the most abundant or most active compounds. At least for Ahp-cyclodepsipeptides, recent biosynthetic studies suggest that the high structural diversity of these compounds is mainly driven by the hypervariability of amino acids in the positions proceeding and following the Ahp-unit (see Fig. 4a for the definitions of residues R3 and R2, respectively)73. The events impacting R2 can be explained by high-frequency point mutations. This is sought to provide an evolutionary platform to iteratively test combinations while maintaining the central activity. However, the amino acid substitutions at R3 most likely occur via recombination events, thereby allowing for evolutionary shortcuts73. These biosynthetic hypotheses are supported by the compounds isolated in this study and add to a better understanding of structure-activity relationships for Ahp-cyclodepsipeptides. Comparing rivulariapeptolide 1185 (1) to rivulariapeptolide 1121 (3), a leucine residue is swapped for an Abu unit at R2, and phenylalanine is replaced by leucine at R3. The most surprising structure-activity relationship gained from this study, however, was that a single substitution of bromine (molassamide B, 6) for hydrogen (molassamide, 5) led to a thirtyfive-, three-, and four-fold increase in potency towards chymotrypsin, elastase, and proteinase K, respectively.
The protease inhibition of the compounds discovered with the native metabolomics workflow was confirmed with an orthogonal fluorescence assay against three proteases. At nanomolar concentration, their IC50 show high potency and exhibits distinct selectivity when compared to other Ahp-cyclodepsipeptides36, Fig. 4 and Supplementary Table 5). For example, molassamide (5) is the second most potent inhibitor screened against proteinase K but is the least potent inhibitor for chymotrypsin and elastase. Rivulariapeptolide 1155 (2), on the other hand, is the most potent elastase inhibitor but shows much lower inhibition against both chymotrypsin and proteinase K.
Together, these findings highlight the utility of the native metabolomics approach presented herein. Beyond the discovery of protease inhibitors, we anticipate that native metabolomics may be applicable for the screening of a broad variety of interactions of biomolecules from complex mixtures at scale. We envision that native metabolomics might become a central tool for activity and functional-metabolomics workflows. This would not only benefit drug discovery and chemical ecology studies but could potentially also be leveraged for the generation of large-scale training data for machine learning approaches to predict protein-ligand interactions.
Methods
Cyanobacterial collection and taxonomy
Marine cyanobacteria biofilm samples were collected in an intertidal zone growing on rock/reef substrate near Las Palmas Beach, Manatí, Puerto Rico, U.S. (GPS coordinates: 18°28'32.0″N 66°30'00.5″W) on May 14th, 2019, and at 0.5−2.0 m of water at Carlos Rosario Beach in Culebra, Puerto Rico, U.S (GPS coordinates: 18°19'30.0″N 65°19'48.0″W) on April 6th, 2019. Biomass for both samples was hand collected (DRNA Permit O-VS-PVS15-SJ-01165-15102020). Microscopic examination indicated that this collection was morphologically consistent with the genus Rivularia. 16 S rDNA analysis confirmed the identity as Rivularia spp. PCC 7116. Voucher specimen available from E.C.D. as collection no. MAP14MAY19-1, and from W. H. G. as collection no. CUR6APR19-1.
Micro-flow LC-MS/MS data acquisition
For micro-flow UHPLC-MS/MS analysis 2 µL were injected into vanquish UHPLC system coupled to a Q-Exactive (setup A) or a Q-Exactive HF (setup B) quadrupole orbitrap mass spectrometer (Thermo Fisher Scientific, Bremen, Germany) with an Agilent 1260 quaternary HPLC pump (Agilent, Santa Clara, USA) or in setup B a fully integrated vanquish quaternary UHPLC pump (Thermo Fisher Scientific, Bremen, Germany) as make-up pumps. For reversed-phase chromatographic, a C18 core-shell micro-flow column (Kinetex C18, 150 × 1 mm, 1.8 um particle size, 100 A pore size, Phenomenex, Torrance, USA) was used. The mobile phase consisted of solvent A (H2O + 0.1% formic acid (FA)) and solvent B (acetonitrile (ACN) + 0.1% FA). The flow rate was set to 150 µL/min (setup A) or 100 µL/min (setup B). In setup A, a linear gradient from 5–50% B between 0–4 min and 50–99% B between 4 and 5 min, followed by a 2 min washout phase at 99% B and a 3 min re-equilibration phase at 5% B. In setup B, a linear gradient from 5–50% B between 0-8 min and 50–99% B between 8 and 10 min, followed by a 3 min washout phase at 99% B and a 5 min re-equilibration phase at 5% B.
Metabolomics data-dependent acquisition (DDA) of MS/MS spectra was performed in positive mode. Electrospray ionization (ESI) parameters were set to 40 arbitrary units (arb. units) sheath gas flow, auxiliary gas flow was set to 10 arb. units and sweep gas flow was set to 0 AU. The auxiliary gas temperature was set to 400 °C. The spray voltage was set to 3.5 kV and the inlet capillary was heated to 320 °C. S-lens level was set to 70 V applied. MS scan range was set to 200–2000 m/z with a resolution at m/z 200 (Rm/z 200) of 70,000 with one micro-scan. The maximum ion injection time was set to 100 ms with automatic gain control (AGC) target of 5E5. Up to two MS/MS spectra per duty cycle were acquired at Rm/z 200 17,000 with one micro-scan. The maximum ion injection time for MS/MS scans was set to 100 ms with an AGC target of 5.0E5 ions and a minimum of 5% AGC. The MS/MS precursor isolation window was set to m/z 1. The normalized collision energy was stepped from 20 to 30 to 40% with z = 1 as the default charge state. MS/MS scans were triggered at the apex of chromatographic peaks within 2 to 15 s from their first occurrence. Dynamic precursor exclusion was set to 5 s. Ions with unassigned charge states were excluded from MS/MS acquisition, as well as isotope peaks.
For native metabolomics experiments, the same chromatographic parameters were used and in addition, 220 µL/min (setup A) or 150 µL/min (setup B) ammonium acetate buffer was infused post-column through a make-up pump and a PEEKT-splitter and enzyme solution was infused with 2 µL/min flow rate via the integrated syringe pump. ESI settings were set to 40 arb. units sheath gas flow, the auxiliary gas flow was set to 10 arb. units and sweep gas flow was set to 0 arb. units. The auxiliary gas temperature was set to 150 °C. The spray voltage was set to 3 kV and the inlet capillary was heated to 253 °C. S-lens level was set to 30 V applied. MS scan range was set to 2000-4000 m/z (setup A) and 2500–4000 m/z (setup B) with a resolution Rm/z 200 140,000 (setup A) and 120,000 (setup B) with 2 microscans. MS acquisition was performed in all-ion fragmentation (AIF) mode with Rm/z 200 with 20% HCD collision energy and an isolation window of 2000 –4000 m/z (setup A) or 2500–4000 m/z (setup B).
Native metabolomics data analysis
For native LC-MS data, multiple charged spectra were deconvoluted using the Xtract algorithm in Qualbrowser, part of the Xcalibur software (Thermo Scientific) and also using FLASHDeconv (version 2.0Beta) {Jeong, K. et al. FLASHDeconv: Ultrafast, High-Quality Feature Deconvolution for Top- Down Proteomics. Cell Syst 10, 213-218 e216 (2020).}. For Xtract algorithm, both deconvoluted native LC-MS and metabolomics LC-MS/MS.raw files were converted to the centroid.mzML file format using MSconvert of the proteowizard software package. For FLASHDeconv, the raw files were converted into centroid m/zML again using MSconvert software first. Then the converted mzML files were deconvolved to generate deconvolved.mzML format files. Feature finding of both file types was performed using a modified version of MZmine2.37 (corr.17.7). Feature tables from both intact protein mass and metabolomics data were matched by their retention time (RT) and an m/z offset corresponding to the mass of chymotrypsin (25234 Da) with an RT tolerance of 0.2 min and a mass tolerance of 4 Da.
Feature tables (.csv), MS/MS spectra files (.mgf), and ion identity networking results (.csv) were exported and uploaded to the MassIVE repository. LC-MS/MS data were submitted to GNPS for feature-based molecular networking analysis. Downstream combined Molecular-Networks and chymotrypsin small molecule binding were visualized as networks in cytoscape (3.8.2).
Optimization of ESI settings, buffer concentration, and pH dependency of native metabolomics
To optimize ESI setting as well as the influence of NH4Acetate concentration and pH, we used molassamide as a test compound. For optimization of ESI settings, we performed flow injections without a column and ramped ESI aux gas temperature, in-source CID energy, S-lense RF voltage. Optimization of pH values over the entire LC gradient and peak intensities were both assessed using three different make-up buffers. Make-up buffer 1 was water, make-up buffer 2 was 10 mM NH4Acetate buffer, and make-up buffer 3 was 10 mM NH4Acetate buffer + 0.2% ammonium hydroxide, v/v. Effects of NH4Acetate concentration was assessed with an online mixed make-up flow (using Experimental Setup B) with the low-pressure gradient pump. As solvent A we used LC-MS grade water and 100 mM NH4Acetate as solvent B. Molassamide was prepared as 100 µM solutions from a 10 mM stock solution in DMSO by preparing a 1:100 dilution into solvent mixture 1 (water + 10% acetonitrile + 0.1% formic acid). Samples were analyzed as described in micro-flow LC-MS/MS data acquisition (setup A); 2 µL of each solution were injected into the mass spectrometer, while chymotrypsin (Sigma), dissolved in water to a final concentration of 2 mg/mL, was injected through the syringe pump at a flow rate of 2 µL/min. pH values were assessed by disconnecting the flow to the source and collecting ~30 µL of solvent every minute, then testing this value on pH paper.
Titration of ligands and concentration dependency
Molassamide was prepared as 100 µM solutions from a 10 mM stock solution in DMSO by preparing a 1:100 dilution into solvent mixture 1 (water + 10% acetonitrile + 0.1% formic acid). From this 100 µM solution, dilutions were prepared at 10, 1, and 0.1 µM into solvent mixture 1. 2 µL of each solution were injected into the mass spectrometer. Samples were analyzed as described in Micro-Flow LC-MS/MS data acquisition (setup A), while chymotrypsin (Sigma Aldrich) was dissolved in water to a final concentration of 2 mg/mL and injected through the syringe pump at a flow rate of 2 µL/min. The ratio of bound to unbound protein was plotted against the ligand concentration in a given HPLC peak window. Data points were fitted using the solver function.
Determination of limit of detection, selectivity, and flow-injection experiments
Molassamide was dissolved in 50% MeOH and a serial dilution with a dilution factor of 2 and 10 were performed yielding final concentrations of 100, 50, 10, 1, and 0.1 µg/mL. Two microliters of each dilution were injected into the native metabolomics micro-flow LC-MS system (setup B) and peak areas from chymotrypsin-bound molassamide were extracted and plotted against their concentration. For testing the selectivity of the method, a series of standards (gallinamide A, FR900359, quinalizarin, genistein, phloroglucinol, cymodepsipeptide A, lingaoamide, molassamide B, rivulariapeptolide 1121, rivulariapeptolide 1185, and tutuilamide A) were dissolved to 100 µg/mL and pooled to final concentrations of 10 µg/mL in 50% MeOH. For flow-injection analysis, the 10 µg/mL molassamide standard was used and the UHPLC column was bypassed with a stainless steel union. Continuous injections during the micro-flow LC gradient were performed manually through the direct control function in the Xcalibur software (Thermo Scientific) with ~1 min spacing.
Chymotrypsin activity assays in native mass spectrometry buffer
Cyanobacteria extract (1 mg/ml, methanol) was diluted in 10 mM ammonium acetate pH 4.5 to 30 ug/mL and then sequentially diluted 1.5-fold to 0.52 µg/mL in the same buffer. Bovine chymotrypsin (Sigma Aldrich) and Suc-Ala-Ala-Pro-Phe-AMC (Calbiochem, 230914) were diluted to 300 nM and 150 µM, respectively in 10 mM ammonium acetate pH 4.5. In a 384-well black microplate, 10 µL of enzyme, substrate, and cyanobacteria extract were combined (30 µL final volume) such that the concentrations in the reaction were 100 nM chymotrypsin, 50 µM of Suc-Ala-Ala-Pro-Phe-AMC and 10 µM to 0.17 nM of cyanobacteria extract. For solvent compatibility assays, chymotrypsin (100 nM) was assayed with 10 µg/mL and 1 µg/mL extract in 10 mM ammonium acetate, pH 4.5 containing 8.3 to 41.7% acetonitrile. A control assay lacked acetonitrile and cyanobacteria extract. All assays were performed in triplicate wells at 25 °C in a Synergy HTX Multi-Mode Microplate Reader (BioTek, Winooski, VT) with excitation and emission wavelengths of 360 and 460 nm, respectively. The initial reaction velocity in each well was recorded and the dose-response curve was generated using GraphPad Prism 9 software.
Calculation of IC50 values
Chymotrypsin (1 nM), proteinase K (10 nM), and elastase (20 nM) was preincubated with 0 to 3 µM of each compound for 40 min in Dulbecco’s phosphate buffered saline, pH 7.4 containing 0.01% Tween-20. The reaction was initiated by the addition of 25 µM of Suc-Ala-Ala-Pro-Phe-AMC (Calbiochem, 230914) for proteinase K and chymotrypsin, 25 µM of MeOSuc-Ala-Ala-Pro-Val-AMC (Cayman, 14907) for elastase in a final volume of 30 µL, respectively. The release of the AMC fluorophore was recorded in a Synergy HTX multi-mode reader (BioTek Instruments, Winooski, VT) with excitation and emission wavelengths at 340 and 460 nm, respectively. The maximum velocity was calculated in RFU/sec over 10 sequential points on the linear part of the progress curve. The IC50 values were determined by nonlinear regression in GraphPad Prism 9.
NMR spectroscopy
Deuterated NMR solvents were purchased from Cambridge Isotope Laboratories. 1H NMR and 2D NMR spectra were collected on a Bruker Avance III DRX-600 NMR with a 1.7 mm dual-tune TCI cryoprobe (600 and 150 MHz for 1H and 13 C NMR, respectively) and a JEOL ECZ 500 NMR spectrometer equipped with a 3 mm inverse detection probe. NMR spectra were referenced to residual solvent DMSO signals (δH 2.50 and δC 39.5 as internal standards). The NMR spectra were processed using MestReNova (Mnova 12.0, Mestrelab Research) or TopSpin 3.0 (Bruker Biospin) software.
Extraction and isolation
The preserved cyanobacterial biomass from collection no. CUR6APR19-1 was filtered through cheesecloth, and then (98.7 g dry wt) was extracted repeatedly by soaking in 500 mL of 2:1 CH2Cl2 /MeOH with warming (<30 °C) for 30 min to afford 1.44 g of dried extract. A portion of the extract was fractionated by reversed-phase solid phase extraction (C18-SPE) using a stepwise gradient solvent system of decreasing polarity (Fr. 1–1 35% ACN/H2O, 124.4 mg; Fr. 1–2 70% ACN/H2O, 76.1 mg; Fr. 1-3 100% ACN, 77 mg; Fr. 1–4 100% MeOH, 254.9 mg. Fr. 1–2 was dissolved in 70% ACN/H2O and purified by preparative HPLC using a Kinetex 5 μm RP 100 Å column (21.00 × 150 mm) and isocratic elution using 50% ACN/H2O for 8 min, then ramping up to 100% in 14 min at the flow rate of 20 mL/min, yielding 56 subfractions. Rivulariapeptolide 1185 (compound 1) and 1155 (compound 2) were isolated from subfractions 1-2-20 to 1-2-23 that were combined (4.5 mg) and further purified by semi-preparative HPLC using a Synergi 4 μm Hydro-RP 80 Å column (10.00 × 250 mm) and isocratic elution gradient elution using 35% ACN/65% H2O isocratic at the flow rate of 3.5 mL/min for 3 min the ramping up to 55% ACN in 22 min, then ramping up to 100% ACN in 1 min and holding the gradient at 100% ACN for another 5 min yielding 1 (1.6 mg, RT = 23.3 min) and 2 (1.3 mg, RT = 21.8 min) as a colorless, amorphous solid. The same HPLC conditions were used to isolate compounds 3 (rivulariapeptolide 1121, from subfraction 1-2-10 and 1-2-11, 1.1 mg, RT = 18.5 min) and 4 (rivulariapeptolide 988, from subfraction 1-2-7, 1.3 mg, RT = 12.6 min). Fr. 1-1 was dissolved in 30% ACN/H2O and purified by preparative HPLC using a Kinetex 5 μm RP 100 Å column (21.00 × 150 mm) and isocratic elution using 30% ACN/H2O for 10 min, then ramping up to 50% in 10 min and then to 95% in 2 min at the flow rate of 20 mL/min, yielding 29 subfractions. Molassamide (compound 5) was isolated from subfractions 1-1-6 and 2-1-10 were combined (3.3 mg) and further purified by semi-preparative HPLC using a Synergi 4 μm Hydro-RP 80 Å column (10.00 × 250 mm) and isocratic elution gradient elution using 35% ACN/65% H2O isocratic at the flow rate of 3.5 mL/min for 3 min the ramping up to 55% ACN in 22 min, then ramping up to 100% ACN in 1 min and holding the gradient at 100% ACN for another 5 min yielding 5 as a colorless, amorphous solid (1.8 mg) at RT = 10.8 min. The same HPLC conditions were used to isolate compound 6 (Molassamide B, from subfraction 1-1-4 and 1-1-5, 1.8 mg, RT = 13.7 min).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We thank the Deutsche Forschungsgemeinschaft for the support of D.P. through a postdoctoral research fellowship (PE 2600/1-1) and of D.P., P.S.1, B.W., K.S., P.S.2., H.B.-O., and C.C.H. through the CMFI Cluster of Excellence (EXC 2124, project-IDs 390838134, and 1-06.010_0) and the Collaborative Research Center CellMap (TRR 261, project-ID 398967434). R.R., P.C.D., and W.H.G. were supported by the Gordon and Betty Moore Foundation (GBMF7622) and by the US National Institutes of Health (R01 GM107550, P41 GM103484, and R03 CA211211). W.B. was supported in part by the Research Foundation—Flanders (12W0418N). We thank the Spanish Ministry of Education and Science for their support of A.I.P.-L. through the program FPU (FPU19/00289), of C.M.-S. through the program Juan de la Cierva-Incorporación (IJC2018-036923-I) and D.R. (PID2019-107724GB-I00). D.R. was supported by the European Research Council with a Starting Grant (BacBio 637971) and the Proyectos I + D + I Program PAIDI 2020 of Junta de Andalucía (P20_00479).
Source data
Author contributions
R.R., and D.P. conceived the study. W.H.G. supervised the fieldwork. R.R., K.L.A., C.B.N., E.J.C., and W.H.G. collected and extracted environmental samples. A.T.A. and D.P. developed the native metabolomics approach. A.I.P.-L., J.H.W.S., P.M.M.H., W.E.D., C.M.-S., D.R., K.S., P.S., H.B.-O., and D.P. provided proteins. W.B. and K.J. wrote the software. M.W. aided in integration with GNPS tags. R.R., A.T.A., P.S., B.W., and D.P. performed MS experiments. R.R. and M.L.M-H. performed compound isolation. R.R. carried out NMR experiments. R.R., P.S., C.C.H., and D.P. performed total hydrolysis and derivatization experiments. P.F., C.L., and A.J.O. performed activity assays. I.Y.B.S. performed docking studies. P.C.D., W.H.G., and D.P. acquired funding and provided infrastructure. R.R. and D.P. wrote the manuscript. All authors edited and approved the final manuscript.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Data availability
All raw (.raw), deconvoluted (xtract.raw), and centroided (.mzXML or.mzML) mass spectrometry data as well as processed data feature table (.csv) and MS/MS spectra (.mgf) are available through the MassIVE repository (massive.ucsd.edu) with the identifier MSV000087964 [10.25345/C5G25F], MSV000088586 [10.25345/C5N585], and MSV000088578 [10.25345/C5NW1R]. The MS/MS spectra of the newly described derivatives, including tags as protease inhibitors, have been added to the GNPS library (gnps.ucsd.edu) with the following IDs: rivulariapeptolide 1185 (1): CCMSLIB00005723387; rivulariapeptolide 1155 (2): CCMSLIB00005723986, CCMSLIB00005720236; rivulariapeptolide 1121 (3): CCMSLIB00005723398; rivulariapeptolide 988 (4): CCMSLIB00005723393; molassamide (5): CCMSLIB00005723404; molassamide B (6): CCMSLIB00006710020. Raw NMR data and structures for compounds 1 -6 have been deposited to Zenodo at 10.5281/zenodo.6956292. Structural analysis of rivulariapeptolides peptides bound to α-chymotrypsin was based on the previously published Protein Data Bank entries 4GVU and 4Q2K. Source data are provided with this paper.
Code availability
The modified version of MZmine2.37 (corr.17.7) used in this study is available at https://github.com/robinschmid/mzmine2/releases. The code for the mass-offset matching for native metabolomics data analysis is available under https://github.com/Functional-Metabolomics-Lab/Native-Metabolomics and 10.5281/zenodo.6786930. FLASHDeconv is a part of OpenMS and is available as platform independent open-source software under a BSD three-clause license at https://OpenMS.org/FLASHDeconv.
Competing interests
P.C.D. and W.H.G. are scientific advisors of Sirenas. P.C.D. is a scientific advisor of Galileo, Cybele, and scientific advisor and co-founder of Ometa Labs LLC and Enveda with approval by the UC San Diego. M.W. is the founder of Ometa Labs LLC. The remaining authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
William H. Gerwick, Email: wgerwick@health.ucsd.edu
Daniel Petras, Email: daniel.petras@uni-tuebingen.de.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-022-32016-6.
References
- 1.Newman DJ, Cragg GM. Natural products as sources of new drugs over the nearly four decades from 01/1981 to 09/2019. J. Nat. Prod. 2020;83:770–803. doi: 10.1021/acs.jnatprod.9b01285. [DOI] [PubMed] [Google Scholar]
- 2.Pye CR, Bertin MJ, Lokey RS, Gerwick WH, Linington RG. Retrospective analysis of natural products provides insights for future discovery trends. Proc. Natl Acad. Sci. USA. 2017;114:5601–5606. doi: 10.1073/pnas.1614680114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Gavriilidou A, et al. Compendium of specialized metabolite biosynthetic diversity encoded in bacterial genomes. Nat. Microbiol. 2022;7:726–735. doi: 10.1038/s41564-022-01110-2. [DOI] [PubMed] [Google Scholar]
- 4.Medema MH, de Rond T, Moore BS. Mining genomes to illuminate the specialized chemistry of life. Nat. Rev. Genet. 2021;22:553–571. doi: 10.1038/s41576-021-00363-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hooft JJJvander, et al. Linking genomics and metabolomics to chart specialized metabolic diversity. Chem. Soc. Rev. 2020;49:3297–3314. doi: 10.1039/D0CS00162G. [DOI] [PubMed] [Google Scholar]
- 6.Peisl BYL, Schymanski EL, Wilmes P. Dark matter in host-microbiome metabolomics: tackling the unknowns–A review. Analytica Chim. Acta. 2018;1037:13–27. doi: 10.1016/j.aca.2017.12.034. [DOI] [PubMed] [Google Scholar]
- 7.Scheubert K, et al. Significance estimation for large scale metabolomics annotations by spectral matching. Nat. Commun. 2017;8:1494. doi: 10.1038/s41467-017-01318-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Schmid R, et al. Ion identity molecular networking for mass spectrometry-based metabolomics in the GNPS environment. Nat. Commun. 2021;12:3832. doi: 10.1038/s41467-021-23953-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Dührkop K, et al. Systematic classification of unknown metabolites using high-resolution fragmentation mass spectra. Nat. Biotechnol. 2021;39:462–471. doi: 10.1038/s41587-020-0740-8. [DOI] [PubMed] [Google Scholar]
- 10.Ludwig M, et al. Database-independent molecular formula annotation using Gibbs sampling through ZODIAC. Nat. Mach. Intell. 2020;2:629–641. doi: 10.1038/s42256-020-00234-6. [DOI] [Google Scholar]
- 11.Reher R, et al. A convolutional neural network-based approach for the rapid annotation of molecularly diverse natural products. J. Am. Chem. Soc. 2020;142:4114–4120. doi: 10.1021/jacs.9b13786. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Stravs, M. A., Dührkop, K., Böcker, S. & Zamboni, N. MSNovelist: de novo structure generation from mass spectra. Nat. Methods19, 865–870 (2022). [DOI] [PMC free article] [PubMed]
- 13.Rinschen MM, Ivanisevic J, Giera M, Siuzdak G. Identification of bioactive metabolites using activity metabolomics. Nat. Rev. Mol. Cell Biol. 2019;20:353–367. doi: 10.1038/s41580-019-0108-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Yan M, Xu G. Current and future perspectives of functional metabolomics in disease studies–A review. Analytica Chim. Acta. 2018;1037:41–54. doi: 10.1016/j.aca.2018.04.006. [DOI] [PubMed] [Google Scholar]
- 15.Cancilla MT, Leavell MD, Chow J, Leary JA. Mass spectrometry and immobilized enzymes for the screening of inhibitor libraries. Proc. Natl Acad. Sci. USA. 2000;97:12008–12013. doi: 10.1073/pnas.220403997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Muchiri RN, van Breemen RB. Affinity selection–mass spectrometry for the discovery of pharmacologically active compounds from combinatorial libraries and natural products. J. Mass Spectrom. 2021;56:e4647. doi: 10.1002/jms.4647. [DOI] [PubMed] [Google Scholar]
- 17.Liu J, et al. Evaluation of estrogenic activity of plant extracts for the potential treatment of menopausal symptoms. J. Agric. Food Chem. 2001;49:2472–2479. doi: 10.1021/jf0014157. [DOI] [PubMed] [Google Scholar]
- 18.Liu M, Van Voorhis WC, Quinn RJ. Development of a target identification approach using native mass spectrometry. Sci. Rep. 2021;11:2387. doi: 10.1038/s41598-021-81859-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yen H-Y, et al. PtdIns(4,5)P2 stabilizes active states of GPCRs and enhances selectivity of G-protein coupling. Nature. 2018;559:423–427. doi: 10.1038/s41586-018-0325-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Vu H, et al. Plasmodium gametocyte inhibition identified from a natural-product-based fragment library. ACS Chem. Biol. 2013;8:2654–2659. doi: 10.1021/cb400582b. [DOI] [PubMed] [Google Scholar]
- 21.Su H, et al. Anti-SARS-CoV-2 activities in vitro of Shuanghuanglian preparations and bioactive ingredients. Acta Pharm. Sin. 2020;41:1167–1177. doi: 10.1038/s41401-020-0483-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Breemen, R. B. van & Choi, Y. Development of a screening assay forligands to the estrogen receptor based on magnetic microparticles and LC-MS. Comb. Chem. High Throughput Screen.11, 1–6 (2008). [DOI] [PMC free article] [PubMed]
- 23.van Breemen RB, et al. Pulsed ultrafiltration mass spectrometry: a new method for screening combinatorial libraries. Anal. Chem. 1997;69:2159–2164. doi: 10.1021/ac970132j. [DOI] [PubMed] [Google Scholar]
- 24.Kelly MA, et al. Characterization of SH2−ligand interactions via LibraryAffinity selection with mass spectrometric detection. Biochemistry. 1996;35:11747–11755. doi: 10.1021/bi960571x. [DOI] [PubMed] [Google Scholar]
- 25.Kaur S, McGuire L, Tang D, Dollinger G, Huebner V. Affinity selection and mass spectrometry-based strategies to identify lead compounds in combinatorial libraries. J. Protein Chem. 1997;16:505–511. doi: 10.1023/A:1026369729393. [DOI] [PubMed] [Google Scholar]
- 26.Nguyen GTH, et al. Multiplexed screening of thousands of natural products for protein–ligand binding in native mass spectrometry. J. Am. Chem. Soc. 2021;143:21379–21387. doi: 10.1021/jacs.1c10408. [DOI] [PubMed] [Google Scholar]
- 27.van Breemen RB, et al. Cannabinoids block cellular entry of SARS-CoV-2 and the emerging variants. J. Nat. Prod. 2022;85:176–184. doi: 10.1021/acs.jnatprod.1c00946. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Park H, et al. Mass spectrometry-based shotgun glycomics for discovery of natural ligands of glycan-binding proteins. Anal. Chem. 2020;92:14012–14020. doi: 10.1021/acs.analchem.0c02931. [DOI] [PubMed] [Google Scholar]
- 29.Gault J, et al. Combining native and ‘omics’ mass spectrometry to identify endogenous ligands bound to membrane proteins. Nat. Methods. 2020;17:505–508. doi: 10.1038/s41592-020-0821-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Sumner LW, et al. Proposed minimum reporting standards for chemical analysis. Metabolomics. 2007;3:211–221. doi: 10.1007/s11306-007-0082-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Schymanski EL, et al. Identifying small molecules via hHigh resolution mass spectrometry: communicating confidence. Environ. Sci. Technol. 2014;48:2097–2098. doi: 10.1021/es5002105. [DOI] [PubMed] [Google Scholar]
- 32.Hong J, Luesch H. Largazole: from discovery to broad-spectrum therapy. Nat. Prod. Rep. 2012;29:449–456. doi: 10.1039/c2np00066k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Gerwick WH, et al. Structure of Curacin A, a novel antimitotic, antiproliferative and brine shrimp toxic natural product from the marine Cyanobacterium Lyngbya majuscula. J. Org. Chem. 1994;59:1243–1245. doi: 10.1021/jo00085a006. [DOI] [Google Scholar]
- 34.Boudreau PD, et al. Design of gallinamide A analogs as potent inhibitors of the cysteine proteases human cathepsin L and Trypanosoma cruzi Cruzain. J. Med. Chem. 2019;62:9026–9044. doi: 10.1021/acs.jmedchem.9b00294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Luesch H, Yoshida WY, Moore RE, Paul VJ, Corbett TH. Total structure determination of apratoxin A, a potent novel cytotoxin from the marine Cyanobacterium Lyngbya m ajuscula. J. Am. Chem. Soc. 2001;123:5418–5423. doi: 10.1021/ja010453j. [DOI] [PubMed] [Google Scholar]
- 36.Köcher, S. et al. From dolastatin 13 to cyanopeptolins, micropeptins, and lyngbyastatins: the chemical biology of Ahp-cyclodepsipeptides. Nat. Prod. Rep. 2 (2019). [DOI] [PubMed]
- 37.Lodin-Friedman A, Carmeli S. Microginins from a Microcystis sp. Bloom material collected from the Kishon Reservoir, Israel. Mar. Drugs. 2018;16:78. doi: 10.3390/md16030078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Murakami M, et al. Microviridins, elastase inhibitors from the cyanobacterium Nostoc minutum (NIES-26) Phytochemistry. 1997;45:1197–1202. doi: 10.1016/S0031-9422(97)00131-3. [DOI] [Google Scholar]
- 39.Todorova AK, Juettner F, Linden A, Pluess T, von Philipsborn W. Nostocyclamide: a new macrocyclic, thiazole-containing allelochemical from Nostoc sp. 31 (cyanobacteria) J. Org. Chem. 1995;60:7891–7895. doi: 10.1021/jo00129a032. [DOI] [Google Scholar]
- 40.Issac M, et al. Cyclotheonellazoles A–C, potent protease inhibitors from the marine sponge Theonella aff. swinhoei. J. Nat. Prod. 2017;80:1110–1116. doi: 10.1021/acs.jnatprod.7b00028. [DOI] [PubMed] [Google Scholar]
- 41.Adiv S, Carmeli S. Protease inhibitors from microcystis aeruginosa bloom material collected from the Dalton Reservoir, Israel. J. Nat. Prod. 2013;76:2307–2315. doi: 10.1021/np4006844. [DOI] [PubMed] [Google Scholar]
- 42.Owen, D. R. et al. An oral SARS-CoV-2 Mpro inhibitor clinical candidate for the treatment of COVID-19. Science. 374, 1586–1593 (2021). [DOI] [PubMed]
- 43.Anderson, J., Schiffer, C., Lee, S.-K. & Swanstrom, R. in Antiviral Strategies (eds. Kräusslich, H.-G. & Bartenschlager, R.) (Springer, 2009).
- 44.Ito S. Proteasome inhibitors for the treatment of multiple myeloma. Cancers. 2020;12:265. doi: 10.3390/cancers12020265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Deacon CF. A review of dipeptidyl peptidase-4 inhibitors. Hot topics from randomized controlled trials. Diabetes Obes. Metab. 2018;20:34–46. doi: 10.1111/dom.13135. [DOI] [PubMed] [Google Scholar]
- 46.Messerli FH, Bangalore S, Bavishi C, Rimoldi SF. Angiotensin-converting enzyme inhibitors in hypertension. J. Am. Coll. Cardiol. 2018;71:1474–1482. doi: 10.1016/j.jacc.2018.01.058. [DOI] [PubMed] [Google Scholar]
- 47.Lee CJ, Ansell JE. Direct thrombin inhibitors. Br. J. Clin. Pharmacol. 2011;72:581–592. doi: 10.1111/j.1365-2125.2011.03916.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Al-Awadhi FH, Luesch H. Targeting eukaryotic proteases for natural products-based drug development. Nat. Prod. Rep. 2020;37:827–860. doi: 10.1039/C9NP00060G. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Gunasekera SP, Miller MW, Kwan JC, Luesch H, Paul VJ. Molassamide, a depsipeptide serine protease inhibitor from the marine Cyanobacterium Dichothrix utahensis. J. Nat. Prod. 2010;73:459–462. doi: 10.1021/np900603f. [DOI] [PubMed] [Google Scholar]
- 50.Konermann L. Addressing a common misconception: ammonium acetate as neutral pH “buffer” for native electrospray mass spectrometry. J. Am. Soc. Mass Spectrom. 2017;28:1827–1835. doi: 10.1007/s13361-017-1739-3. [DOI] [PubMed] [Google Scholar]
- 51.Pedro L, Quinn RJ. Native mass spectrometry in fragment-based drug discovery. Molecules. 2016;21:984. doi: 10.3390/molecules21080984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Gavriilidou AFM, Gülbakan B, Zenobi R. Influence of ammonium acetate concentration on receptor–ligand binding affinities measured by native nano ESI-MS: a systematic study. Anal. Chem. 2015;87:10378–10384. doi: 10.1021/acs.analchem.5b02478. [DOI] [PubMed] [Google Scholar]
- 53.Miller B, et al. The marine cyanobacterial metabolite gallinamide A is a potent and selective inhibitor of human cathepsin L. J. Nat. Products. 2014;77:92–99. doi: 10.1021/np400727r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Crüsemann M, et al. Heterologous expression, biosynthetic studies, and ecological function of the selective Gq-signaling inhibitor FR900359. Angew. Chem. Int. Ed. 2018;57:836–840. doi: 10.1002/anie.201707996. [DOI] [PubMed] [Google Scholar]
- 55.H. Medema M. The year 2020 in natural product bioinformatics: an overview of the latest tools and databases. Nat. Prod. Rep. 2021;38:301–306. doi: 10.1039/D0NP00090F. [DOI] [PubMed] [Google Scholar]
- 56.Nogle LM, Williamson RT, Gerwick WH. Somamides A and B, two new depsipeptide analogues of dolastatin 13 from a Fijian cyanobacterial assemblage of Lyngbya majuscula and Schizothrix species. J. Nat. Prod. 2001;64:716–719. doi: 10.1021/np000634j. [DOI] [PubMed] [Google Scholar]
- 57.Al-Awadhi FH, Paul VJ, Luesch H. Structural diversity and anticancer activity of marine-derived elastase inhibitors: key features and mechanisms mediating the antimetastatic effects in invasive breast cancer. ChemBioChem. 2018;19:815–825. doi: 10.1002/cbic.201700627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Taori K, Matthew S, Rocca JR, Paul VJ, Luesch H. Lyngbyastatins 5–7, potent elastase inhibitors from Floridian marine cyanobacteria, Lyngbya spp. J. Nat. Prod. 2007;70:1593–1600. doi: 10.1021/np0702436. [DOI] [PubMed] [Google Scholar]
- 59.Iwasaki A, Sumimoto S, Ohno O, Suda S, Suenaga K. Kurahamide, a cyclic depsipeptide analog of dolastatin 13 from a marine cyanobacterial assemblage of Lyngbya sp. BCSJ. 2014;87:609–613. doi: 10.1246/bcsj.20140008. [DOI] [Google Scholar]
- 60.Dührkop K, et al. SIRIUS 4: a rapid tool for turning tandem mass spectra into metabolite structure information. Nat. Methods. 2019;16:299–302. doi: 10.1038/s41592-019-0344-8. [DOI] [PubMed] [Google Scholar]
- 61.Sahebnasagh A, et al. Neutrophil elastase inhibitor (sivelestat) may be a promising therapeutic option for management of acute lung injury/acute respiratory distress syndrome or disseminated intravascular coagulation in COVID-19. J. Clin. Pharm. Ther. 2020;45:1515–1519. doi: 10.1111/jcpt.13251. [DOI] [PubMed] [Google Scholar]
- 62.Sweeney, P. J. & Walker, J. M. Proteinase K (EC 3.4.21.14). in Enzymes of Molecular Biology (ed. Burrell, M. M.) (Humana Press, 1993). [DOI] [PubMed]
- 63.Gao B-J, Mou Y-N, Tong S-M, Ying S-H, Feng M-G. Subtilisin-like Pr1 proteases marking the evolution of pathogenicity in a wide-spectrum insect-pathogenic fungus. Virulence. 2020;11:365–380. doi: 10.1080/21505594.2020.1749487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.O’Donoghue AJ, et al. Destructin-1 is a collagen-degrading endopeptidase secreted by Pseudogymnoascus destructans, the causative agent of white-nose syndrome. Proc. Natl Acad. Sci. USA. 2015;112:7478–7483. doi: 10.1073/pnas.1507082112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Salvador LA, et al. Potent elastase inhibitors from cyanobacteria: structural basis and mechanisms mediating cytoprotective and anti-inflammatory effects in bronchial epithelial cells. J. Med. Chem. 2013;56:1276–1290. doi: 10.1021/jm3017305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Keller L, et al. Tutuilamides A–C: vinyl-chloride-containing cyclodepsipeptides from marine cyanobacteria with potent elastase inhibitory properties. ACS Chem. Biol. 2020;15:751–757. doi: 10.1021/acschembio.9b00992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Kohler M, Leary JA. LC/MS/MS of carbohydrates with postcolumn addition of metal chlorides using triaxial electrospray probe. Anal. Chem. 1995;67:3501–3508. doi: 10.1021/ac00115a019. [DOI] [PubMed] [Google Scholar]
- 68.Aron AT, et al. Native mass spectrometry-based metabolomics identifies metal-binding compounds. Nat. Chem. 2022;14:100–109. doi: 10.1038/s41557-021-00803-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Lössl P, Waterbeemd M, Heck AJ. The diverse and expanding role of mass spectrometry in structural and molecular biology. EMBO J. 2016;35:2634–2657. doi: 10.15252/embj.201694818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Wörner TP, et al. Resolving heterogeneous macromolecular assemblies by Orbitrap-based single-particle charge detection mass spectrometry. Nat. Methods. 2020;17:395–398. doi: 10.1038/s41592-020-0770-7. [DOI] [PubMed] [Google Scholar]
- 71.Webb IK. Recent technological developments for native mass spectrometry. Biochim. Biophys Acta Proteins Proteom. 2022;1870:140732. doi: 10.1016/j.bbapap.2021.140732. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Sun J, Kitova EN, Wang W, Klassen JS. Method for distinguishing specific from nonspecific protein−ligand complexes in nanoelectrospray ionization mass spectrometry. Anal. Chem. 2006;78:3010–3018. doi: 10.1021/ac0522005. [DOI] [PubMed] [Google Scholar]
- 73.Baunach M, Chowdhury S, Stallforth P, Dittmann E. The landscape of recombination events that create nonribosomal peptide diversity. Mol. Biol. Evol. 2021;38:2116–2130. doi: 10.1093/molbev/msab015. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All raw (.raw), deconvoluted (xtract.raw), and centroided (.mzXML or.mzML) mass spectrometry data as well as processed data feature table (.csv) and MS/MS spectra (.mgf) are available through the MassIVE repository (massive.ucsd.edu) with the identifier MSV000087964 [10.25345/C5G25F], MSV000088586 [10.25345/C5N585], and MSV000088578 [10.25345/C5NW1R]. The MS/MS spectra of the newly described derivatives, including tags as protease inhibitors, have been added to the GNPS library (gnps.ucsd.edu) with the following IDs: rivulariapeptolide 1185 (1): CCMSLIB00005723387; rivulariapeptolide 1155 (2): CCMSLIB00005723986, CCMSLIB00005720236; rivulariapeptolide 1121 (3): CCMSLIB00005723398; rivulariapeptolide 988 (4): CCMSLIB00005723393; molassamide (5): CCMSLIB00005723404; molassamide B (6): CCMSLIB00006710020. Raw NMR data and structures for compounds 1 -6 have been deposited to Zenodo at 10.5281/zenodo.6956292. Structural analysis of rivulariapeptolides peptides bound to α-chymotrypsin was based on the previously published Protein Data Bank entries 4GVU and 4Q2K. Source data are provided with this paper.
The modified version of MZmine2.37 (corr.17.7) used in this study is available at https://github.com/robinschmid/mzmine2/releases. The code for the mass-offset matching for native metabolomics data analysis is available under https://github.com/Functional-Metabolomics-Lab/Native-Metabolomics and 10.5281/zenodo.6786930. FLASHDeconv is a part of OpenMS and is available as platform independent open-source software under a BSD three-clause license at https://OpenMS.org/FLASHDeconv.