Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Jan 17.
Published in final edited form as: ACS Chem Biol. 2013 Nov 8;9(1):301–309. doi: 10.1021/cb400699p

The genetic basis for the biosynthesis of the pharmaceutically important class of epoxyketone proteasome inhibitors

Michelle Schorn , Judith Zettler §,, Joseph P Noel , Pieter C Dorrestein , Bradley S Moore †,¶,*, Leonard Kaysser †,§,⊥,*
PMCID: PMC4041076  NIHMSID: NIHMS539114  PMID: 24168704

Abstract

The epoxyketone proteasome inhibitors are an established class of therapeutic agents for the treatment of cancer. Their unique α′,β′-epoxyketone pharmacophore allows binding to the catalytic β-subunits of the proteasome with extraordinary specificity. Here we report the characterization of the first gene clusters for the biosynthesis of natural peptidyl-epoxyketones. The clusters for epoxomicin, the lead compound for the anti-cancer drug Kyprolis, and for eponemycin were identified in the actinobacterial producer strains ATCC 53904 and Streptomyces hygroscopicus ATCC 53709, respectively, using a modified protocol for Ion Torrent PGM genome sequencing. Both gene clusters code for a hybrid non-ribosomal peptide synthetase/polyketide synthase multifunctional enzyme complex and homologous redox enzymes. Epoxomicin and eponemycin were heterologously produced in Streptomyces albus J1046 via whole pathway expression. Moreover, we employed mass spectral molecular networking for a new comparative metabolomics approach in a heterologous system and discovered a number of putative epoxyketone derivatives. With this study we have definitively linked epoxyketone proteasome inhibitors and their biosynthesis genes for the first time in any organism, which will now allow for their detailed biochemical investigation.

INTRODUCTION

The 26S proteasome is the essential enzymatic complex for non-lysosomal proteolytic degradation in eukaryotes.(1) It mediates levels of key factors in a variety of essential cellular processes that are deregulated in cancer cells and pivotal elements in carcinogenesis and tumorigenesis. Thus, the inhibition of the proteasome specifically targets heavily proliferating cells over quiescent cells.(2, 3) The first proteasome inhibitor bortezomib (Figure 1, 1) (marketed as Velcade® by Millenium Pharmaceuticals) was approved by the US Food and Drug Administration (FDA) in 2003. It is currently applied as a first-line treatment for multiple myeloma and mantle cell lymphoma. However, intravenous administration of the drug is associated with significant side effects. The development of proteasome inhibitors with improved properties is therefore an ongoing effort.

Figure 1.

Figure 1

Chemical structures of proteasome inhibitors.

A number of potent proteasome inhibitors have been isolated from nature, predominantly from microorganisms.(4) The most prominent class are the peptide epoxyketones which comprise epoxomicin (2)(5), eponemycin (3)(6) and several related compounds (48)(79) (Figure 1). All these molecules consist of a short peptidic core structure with a terminal C3-extended leucine derivative. 2 is particularly potent with IC50 values against the proteasome as low as 2.5 nM.(7) The compound has been used as a lead for the development of carfilzomib (9, Kyprolis, Onyx Pharmaceuticals), which was granted accelerated approval by the FDA in July 2012 for the treatment of refractory and relapsed multiple myeloma.(10) The drug appears to be better tolerated by patients than 1 and can therefore be applied in higher and more effective doses.(11, 12) Beside their usage as anticancer drugs, epoxyketones have shown excellent activity against parasites.(13) In particular, Plasmodium falciparum, the deadly malaria pathogen, is vitally affected by this class of proteasome inhibitors at different stages of its life cycle.(14) Co-crystallization experiments with 2 and the yeast proteasome revealed an irreversible two-step binding mechanism and the essential role of the α′,β′-epoxyketone warhead.(15) The unprecedented terminal α′,β′-position of the C3-carbonyl and the ring-strained epoxide constitutes two strongly electrophilic groups in the immediate proximity to each other which are very accessible for nucleophilic attack. We were thus highly interested in the biosynthesis of these compounds and their unique pharmacophore. The biotransformation of the carboxy-terminus of a polyketide intermediate to an epoxide is biochemically not trivial and may involve either new polyketide biochemistry and/or unique enzymatic redox reactions.

In this study we present the epoxomicin and the eponemycin gene cluster from an unspecified actinomycete strain ATCC 53904 and Streptomyces hygroscopicus strain ATCC 53709, respectively. Both compounds were produced by heterologous pathway expression in S. albus J1074 to definitively link epoxyketone proteasome inhibitors and their biosynthesis genes for the first time in any organism. This genetic linkage allowed us to locate homologous orphan gene clusters in various bacteria that promise the discovery of new bioactive derivative molecules.

RESULTS AND DISCUSSION

Identification of the epoxomicin and eponemycin gene clusters by Ion PGM genome sequencing

To investigate the biosynthetic pathways of epoxyketone proteasome inhibitors, we attempted to isolate the genes responsible for the formation of the prototypes 2 and 3. To this end we subjected genomic DNA of the producer strains ATCC 53904(5), an unspecified actinomycete, and S. hygroscopicus ATCC 53709(6) to semiconductor sequencing from Ion Torrent.(16) Recently, we employed Ion Torrent technology in the de novo sequencing of the draft genome of Thalassospira sp. CNJ-328, which has a GC-content at around 50%.(17) However, we were unsuccessful in the sequencing of DNA with high GC-content such as from actinobacteria using standard protocols provided by the manufacturer. To address the sequencing problems, we slightly modified the procedure for manual template preparation as described in the Ion PGM 200 Xpress Template Kit. Betaine has been shown previously to substantially improve the amplification of difficult GC-rich DNA sequences.(18) As the Ion PGM template preparation is PCR based, we thus added betaine to a final concentration of 1 M to the amplification mix. After template preparation, enrichment and sequencing with the Ion PGM system, the assembly of the obtained sequence data resulted in the generation of two draft genomes. The assembled sequence of the epoxomicin producer ATCC 53904 genome contains 8.9 Mb with a GC-content of 71.8% and was presented on 426 contigs with a 66-fold coverage. Similarly, the 9.8 Mb assembled genome sequence of the eponemycin producer ATCC 53709 has a GC-content of 71.1% and was presented on 490 contigs with a 79-fold coverage. Our modified protocol proved efficient and might thus facilitate the future application of semiconductor technology for the genome sequencing of other high-GC bacteria.

To assess the secondary metabolomic potential of both strains, we submitted the draft genome sequences to in silico analysis via the software antiSMASH.(19) Among 52 and 70 preliminary putative biosynthetic gene clusters, we identified potential clusters for the formation of 2 and 3 at 27.9 kb and 23.8 kb, respectively (Figure 2A). The homologous clusters encode hybrid non-ribosomal peptide synthetase (NRPS)/polyketide synthase (PKS) multifunctional enzymes consistent with the putative formation of the core structure of the compounds. In addition, we identified analogous genes for a putative P450 monooxygenase and a conserved acyl-CoA dehydrogenase. A detailed summary of the proposed function of the genes can be found in the Supporting Information, Supplementary Table S1.

Figure 2.

Figure 2

The biosynthesis of natural epoxyketones. A Relative organization of the epoxomicin (epx), eponemycin (epn) and orphan S. bingchenggensis gene clusters. B Predicted NRPS/PKS (non-ribosomal peptide synthetase/polyketide synthase) assembly line synthesis of epoxomicin (2) and eponemycin (3). Abbreviations: ACAD, acyl-CoA dehydrogenase; CYP450, cytochrome P450; A, adenylation domain; ACP, acyl carrier protein; PCP, peptidyl carrier protein; C, condensation domain; MT, methyltransferase; TE, thioesterase; KS ketosynthase; AT acyltransferase domain.

These observations and the presence of a gene putatively encoding a resistant β-proteasome subunit homologous to the salinosporamide resistance enzyme(20) further suggested that the identified gene clusters code for epoxomicin (epx) and eponemycin (epn) production.

Heterologous production of epoxomicin and eponemycin in S. albus

To confirm the suspected functions of the epx and epn gene clusters, we designed experiments to produce 2 and 3 in a surrogate host organism. To this end, we generated two fosmid libraries from the genomic DNA of both producer strains, ATCC 53904 and ATCC 53709, to isolate the gene clusters. The genomic libraries comprised ~1800 individual clones each and were screened by PCR. Both clusters were found intact on single fosmids, the epoxomicin gene cluster on fosmid 15C3 and the eponemycin gene cluster on fosmid 2H4. A heterologous expression approach was used to confirm the identity of the clusters.

For this purpose we replaced the chloramphenicol resistance gene in the fosmid backbone by λ-Redmediated recombination with an integration cassette we generated previously.(21) The cassette int_neo contains the attP attachment site and the integrase gene (int) of phage ΦC31, a kanamycin resistance gene (neo) and an origin of transfer (oriT) and allows site-specific integration in most Streptomyces chromosomes.(22) The resulting fosmids were named epxMS01 and epnLK01. In order to express the introduced pathway, the transcriptional and translational machinery of the host strain must recognize the promoter, ribosomal binding sites (RBS) and the regulatory system of the gene cluster. Consequently, the transfer of a biosynthetic gene cluster into a phylogenetically related strain is preferable for heterologous expression. While the eponemycin producer ATCC 53709 belongs to the genus Streptomyces, the taxonomic specification of the epoxomicin producer ATCC 53904 was not defined. We thus analyzed two phylogenetic markers, the 16S rRNA and the rpoB gene, for their relatedness to genes from other bacteria. Both markers classify the strain as a member of the Actinobacteria and the taxonomic family Pseudonocardiaceae. The rpoB gene shows highest homology (89% identity) to Actinosynnema mirum DSM 43827. The 16S rRNA confirms the close relatedness to A. mirum (96% identity) but indicates an even greater similarity to Goodfellowia coeruleoviolacea NRRL-B 24058 at 99% sequence identity.(23) Hence, we strongly consider ATCC 53904 to be a Goodfellowia species.

Both the epoxomicin and the eponemycin gene cluster were transferred into S. albus J1046 by intergeneric conjugation,(24) as this strain is one of the few in which non-Streptomyces-derived gene clusters have before been successfully expressed.(25) Three individual kanamycin resistant clones were selected and named S. albus epxMS01-(13) and S. albus epnLK01-(13), respectively. Ethyl acetate culture extracts of the mutant strains were analyzed by HPLC-MS and compared to the metabolic profiles of native S. albus J1064 and the producer strains ATCC 53904 and ATCC 53709 (Figure 3).

Figure 3.

Figure 3

HPLC-MS analysis of A S. albus epxMS01-(13) expressing the epoxomicin gene cluster and B S. albus epnLK01-(13) expressing the eponemycin gene cluster. LC-MS base peak chromatograms (BPC, top) and extracted ion chromatograms (EIC, bottom) are depicted. Mass peaks of epoxomicin (2) and eponemycin (3) are indicated as well as unique peaks (*) in the S. albus heterologous mutants correlating to derivatives of 2 and 3 (see Supporting Information Table S2).

In the case of S. albus epnLK01, we discovered a new chromatographic peak not observed in S. albus J1064 (3, Figure 3B). Based on its chromatographic properties, its high-resolution mass, and its mass spectral fragmentation fingerprint (Supplementary Figure S1), the product of the S. albus-expressed epn gene cluster is eponemycin (m/z calcd. 399.2495 [M+H]+; obsvd. 399.2500). Moreover, we clearly identified at least 11 additional epn-based products suggestive of the production of a series of eponemycin derivatives (Figure 3B and Supporting Information Table S2).

Analysis of the S. albus epxMS01 extracts (Figure 3A) confirmed the accumulation of 2 (m/z calcd. 555.3758 [M+H]+; obsvd. 555.3758). However, heterologous production of 2 was low and detected only by HPLC retention time and MS/MS comparisons with authentic material. A possible explanation for its low production is the phylogenetic distance of S. albus and the epoxomicin producer ATCC 53904 and the resulting incompatibility of their expression system and regulatory networks. However, a distinct mass peak present in the ion chromatograms of the mutant strains clearly indicates the accumulation of an epoxomicin congener (Figure 3B and Supporting Information Table S2). Our genetic experiments successfully linked the biosynthesis of 2 and 3 to the epx and epn gene sets, respectively. This experimental outcome represents the first genes-to-molecules confirmation of the pharmaceutically important family of proteasome epoxyketone inhibitors.

MS molecular networking reveals new eponemycin derivatives

While examining the base peak chromatograms of S. albus containing the epoxyketone gene clusters, we observed a number of additional unique mass peaks (Figure 3B), particularly in the epn mutant. Motivated by the prospect of new eponemycin derivatives, we subjected S. albus epnLK01 to MS molecular networking for comparative metabolic profiling. The spectral networks paradigm was originally developed for application in proteomics(26) but has recently been adapted as a general MS/MS-data analysis tool.(27), (28) A molecular network is created based on the relationships of MS/MS spectra for any molecule that is captured by mass spectrometry, even across multiple experiments. The network we generated from the combined data of S. albus J1046 with and without the epn gene cluster displayed ten major MS/MS clusters comprising six or more nodes (circles) of distinct mass fragmentation (Figure 4). These clusters likely represent structurally related molecules and are also referred to as molecular families (MFs).(29) Upon subtracting mass ions consistent with the S. albus J1046 wildtype or with only a subset of the three heterologous mutants, eight of the ten major MFs were eliminated. One of the two remaining clusters forms a tight network centered on a node with m/z 399.246 representing 3 (Figure 4). Notably, at least twelve individual mass ions are directly related to this node.

Figure 4.

Figure 4

Molecular networks of mass spectra from S. albus containing the eponemycin gene cluster. Node (circle) colors indicate the source of the ions: blue (found in S. albus J1046), red (found only in S. albus epnLK01-1, -2, or -3), and green (found only in all three S. albus epnLK01-1, -2, and -3 strains). Node size indicates mass range of parent ions (m/z 200.17 – 1329.51). Edge line width indicates relatedness of MS/MS spectra represented by two connected nodes (cos 0.59 – 0.99). Selected masses and cosine values are noted in the eponemycin clade.

The analysis of the MS/MS spectra incorporated in the network revealed a number of structural derivatives of 3 that are produced by the heterologous mutants (Supplementary Figure S2). This is especially interesting as 3 has been reported as a single compound from the wildtype producer ATCC 53709. Based on the fragmentation patterns, we postulate that most of the variation applies to the length and oxidation status of the fatty acid side chain. The heterologously produced eponemycin analogues contain shorter (C4) or longer (C9) acyl moieties with double bonds, hydroxyl and/or keto groups consistent with distinct HPLC retention times (Supplementary Table S2 and Figure S2). This observation suggests that the enzyme responsible for the attachment of the fatty acid group to the peptide is promiscuous and therefore may facilitate future bioengineering efforts. In addition some of the mass spectra suggest the production of congeners of 3 with an altered epoxyketone pharmacophore. Notably, di- and tetrahydro derivatives are so far only known as synthetic compounds(6) but may be prominent metabolites in the S. albus epnLK01 extracts (Supplementary Table S2). We plan to report the structures and biological properties of new eponemycin analogues separately.

The biosynthetic pathways for epoxomicin and eponemycin

The peptidic backbone of the two epoxyketone compounds is assembled by hybrid non-ribosomal peptide synthetase/polyketide synthase multifunctional enzymes (Figure 2B). EpxD consists of a tetra-modular NRPS and analysis of the adenylation (A) domain substrate specificities strongly correlated A1 and A3 to the activation of isoleucine and threonine, respectively.(30, 31) The specificities for the A2 and A4 domains were less evident (Supplementary Table S3). The megasynthetase begins with a putative primer fatty acyl condensation (C) domain(32) known to transfer fatty acids onto the initial amino acid residue.(33) We thus postulate that this domain is responsible for the installation of the acetyl moiety in 2. The methyltransferase (MT) domain in the first module of EpxD probably further modifies the Ile1 residue by introducing the N-methyl group.

Interestingly, the corresponding eponemycin NRPS, EpnG, does not contain a specific primer C-domain. This observation rather suggests that the eponemycin fatty acyl moiety is not constructed in the same way as in 2. Branched odd-chain fatty acids in the C6–C10 range, such as 6-methyl heptanoic acid (6-MHA), can be found in natural products(34, 35) but are not common in bacterial primary metabolism. Hence, a dedicated pathway for the generation and incorporation of 6-MHA might be encoded in the epn cluster. Normally, the biosynthesis of branched chain fatty acids involves FabH (KAS III) which accepts small CoA-activated acyl groups derived from leucine, valine or isoleucine.(36) FabH catalyzes the Claisen condensation of these substrates with one unit of malonyl-ACP to initiate type II fatty acid synthesis (FAS). For the formation of 6-MHA, one would typically expect that valinederived isobutyrate undergoes two rounds of malonate extension, one round with the help of FabH and the other carried out by common type II FAS enzymes. Notably, EpnD is a FabH-homolog similar to other enzymes that are occasionally encoded in secondary metabolite gene clusters and participate in the priming of type II PKS systems with unusual starter units(37) or the generation of acyl side chains in lipopeptide biosynthesis.(35) However, the presence of the discrete NRPS tridomain EpnJ, consisting of an mbtH-like protein, a leu-specific A-domain and a peptidyl carrier protein (PCP), might indicate a more unusual mechanism for the biosynthesis of 6-MHA (Figure 2B). Here, EpnJ-bound leucine may be subjected to deamination and reduction prior to C2-extension by the FabH homolog EpnD using malonyl-EpnE. Further reduction to afford 6-MHA would likely be performed by primary fatty acid synthase enzymes.

The terminal PKS-modules of the epoxomicin and the eponemycin assembly lines, EpxE and EpnH, respectively, are identically organized. Both comprise a malonyl-CoA-specific acyltransferase (AT)-domain and a putative C-methyltransferase (cMT) domain, which suggests that the substituted epoxy moiety does not derive from methylmalonyl-CoA but rather from malonyl-CoA and Sadenosylmethionine (Figure 2B). A C-terminal thioesterase (TE) domain in EpxE and EpnH intimates that the peptide-polyketide hybrid product is released from the enzyme as the carboxylic acid. In this case, the construction of the rare α′,β′-epoxyketone unit would have to be mediated by auxiliary enzymes likely acting subsequent or in trans to the assembly of the core structure. The transformation from the free acid to the epoxide may include two reductions, a dehydration and an epoxidation. Consequently, we analyzed the conserved biosynthesis genes in the epoxomicin and eponemycin loci, and gratifyingly, we identified two homologous genes common to both clusters – the putative acyl-CoA dehydrogenases (ACADs) epxF/epnF and the cytochrome P450 (CYP) monooxygenases epxC/epnI.

Cytochrome P450 enzymes are known to catalyze epoxidation reactions.(38) EpxC and EpnI are therefore plausible candidates to introduce an epoxy group into an unsaturated precursor molecule. Consequently, the reduction of the acid to the alcohol or the olefin may be performed by the ACADs EpxF/EpnF. Bacterial 4e reductases that catalyze such reactions have been studied in the reductive off-loading of NRPSs(39) and in wax formation.(40) However, these enzymes rather belong to the short-chain dehydrogenase/reductase (SDR) superfamily, which is clearly distinct from the flavin adenine dinucleotide (FAD)-dependent ACADs. An alternative biosynthetic route to the epoxyketone moiety may involve the EpxE/EpnH cMT-domain if it introduces two methyl groups at C2. This scenario has been proposed in various polyketides with gem-dimethyl groups.(41) Subsequently, decarboxylation of the terminal carboxylic acid group to the dimethylketone may initiate a concerted reaction involving EpxF/EpnF and EpxC/EpnI to install the epoxide. On the basis of the biosynthetic features of the epx and epn gene clusters, the construction of the α′,β′-epoxyketone pharmacophore is anticipated to involve new biochemical reactions. Detailed investigations of the pathway are therefore now underway. The distinct C8-OH group in eponemycin is likely implemented by the second CYP EpnK, which is unique to the epn cluster.

Genome mining identifies homologous gene clusters in various bacteria

We next explored other microorganisms with the prospect to identify analogous pathways to new proteasome inhibitors. EpxF, the ACAD we predict to be essential for epoxyketone formation, was employed as a probe for a BLAST sequence homology search in the National Center for Biotechnology Information (NCBI) database. Strikingly, the genes with highest similarity to epxF (Expect (E) value < 1 e−70) are all co-located with other genes from secondary metabolism. Most of these orphan biosynthetic pathways involve small hybrid NRPS/PKS systems and are present in a variety of different bacterial families (Supplementary Figure S3). Notably, such homologous gene clusters can be found in a rhizosphere-associated S. canus 299MFChir4.1 strain as well as the human pathogen Nocardia cyriacigeorgica GUH-2. It might therefore be interesting to investigate if the compounds produced by these clusters have similar functions in symbiosis, virulence and predation as the proteasome inhibitor family of syrbactins.(42, 43) The cluster from S. bingchenggensis BCW-1 is particularly intriguing because it is highly analogous to the epoxomicin gene cluster (Figure 2A). We previously identified this set of genes and suggested it encodes a proteasome inhibitor as it contains a secondary proteasome β-subunit.(20) However, the S. bingchenggensis gene cluster lacks CYP homologues such as EpxC/EpnC that we propose are involved in the epoxidation reaction. We thus predict that this cluster may alternatively direct the formation of an acylated tripeptide in which the terminal amino acid moiety is modified not to an epoxyketone but rather as a vinylketone (enone) functional group. We recently showed in a separate study that synthetic carmaphycin enone derivatives are potent and irreversible proteasome inhibitors (B.S. Moore, unpublished data). Thus we postulate that the orphan pathway in S. bingchenggensis BCW-1 encodes an unprecedented peptidic proteasome inhibitor.

In conclusion, we identified the epoxomicin and the eponemycin gene clusters, the first clusters for the biosynthesis of natural peptidyl epoxyketones. The genetic information suggests that their powerful pharmacophore is generated by a series of unprecedented biotransformations. With this study we set the genetic basis to study the formation of the natural epoxyketone proteasome inhibitors in detail. Unexpectedly, we found a set of eponemycin congeners in extracts of a surrogate host organism with the epn cluster by molecular networking, demonstrating the benefits of this new technique for comparative metabolomics. Moreover, the presence of homologous gene clusters in other bacteria may facilitate the discovery of more new bioactive derivatives in the near future.

METHODS

Bacterial strains and general methods

Chemicals, microbiological and molecular biological agents were purchased from standard commercial sources. Actinomycete strain ATCC 53904, Streptomyces hygroscopicus ATCC 53709, Streptomyces albus J1046 and their respective derivatives were maintained and grown on either MS agar (2% (w/v) soy flour, 2% (w/v) mannitol, 2% (w/v) agar; components purchased from Becton Dickinson) or TSB medium (Becton Dickinson). Escherichia coli strains were cultivated in LB medium (components purchased from Becton Dickinson) supplemented with appropriate antibiotics. DNA isolation and manipulations were carried out according to standard methods for E. coli (44) and Streptomyces (45).

MS data were collected with an Agilent 6530 Accurate-Mass Quadrupole Time-of-flight (QTOF) LC-MS instrument (Agilent Technologies), and the analytes were separated with a reversed-phase C18 column (Phenomenex Luna 5μ C18(2), 4.6 mm × 150 mm) on a 1260 Infinity LC-System (Agilent Technologies) using a flow rate of 0.1 mL/min.

Production of epoxyketone proteasome inhibitors

10 ml TSB broth was inoculated with a spore suspension of Streptomyces albus J1046/epxMS01 or Streptomyces albus J1046/epnLK01 and incubated for 2 days at 30°C and 200 rpm. Then 1% of the culture was transferred to 50 mL of R5 medium (103 g L−1 sucrose, 0.25 g L−1 K2SO4, 10.12 g L−1 MgCl2·6H2O, 10 g L−1 glucose, 0.1 g L−1 casaminoacids, 5 g L−1 yeast extract, 5.73 g L−1 TES (N-tris(hydroxymethyl)methyl-2-aminoethanesulfonic acid), 80 μg L−1 ZnCl2, 400 μg L−1 FeCl3·6H2O, 20 μg L−1 CuCl2·2H2O, 20 μg L−1 MnCl2·4H2O, 20 μg L−1 Na2B4O7·10H2O, 20 μg L−1 (NH4)6Mo7O24·4H2O, 50 mg L−1 KH2PO4, 3 g L−1 Lproline, 2.94 g L−1 CaCl2, and 280 μg L−1 NaOH). After 6 days of incubation at 28 °C, 50 mL of EtOAc was added and incubated for 1 h at 200 rpm, and the EtOAc layer was recovered. The solvent was evaporated under reduced pressure. The residue was dissolved in 1 mL of MeCN, and the solution was filtered through a C18 sorbent (Spice C18 Sample Preparation Cartridges, Analtech). The filtrate was evaporated under reduced pressure in a 14-mL scintillation vial, and the residue was stored at − 20 °C until LC/MS analysis.

Analysis of culture extracts and MS molecular networking

The residue was dissolved in 1000 μL of MeCN, and 5 μL of the dissolved residue was injected onto a reversed-phase HPLC column coupled to an mass spectrometer with an electrospray ionization interface (ESI) interface (heated capillary temperature 320°C; sheath and collision gas nitrogen). The following solvent composition was used to separate the analytes: 10% (v/v) MeCN in H2O for 4 min, 10–100% (v/v) MeCN in H2O for 36 min, 100% (v/v) MeCN in H2O for 3 min, 100–10% (v/v) MeCN in H2O for 2 min. HR-MS data were acquired in positive mode ((+)-ESI). MS and MS/MS spectra were recorded with a scan rate of one and four spectra/second, respectively. Collision energy was 10 eV. Molecular formulae were calculated from mono-isotopic masses using ChemCalc.(46)

To construct molecular networks MS/MS spectra recorded from extracts of S. albus J1046/epnLK01-(13) and S. albus J1046 wildtype were clustered using MS-Cluster.(47) Cluster-consensus spectra were further processed as described by Watrous et al.(28) Each spectrum comprised the 10 highest-cosine alignments in both directions. To define the MS/MS network pair wise alignments were considered with cosine ≥0.55 and ≥6 matched peaks. Custom scripts and the attributes to the molecular network were added as described.(28) The MS networks were visualized with Cytoscape (2.8.3.).(48)

DNA sequencing and bioinformatic analysis

Ion Torrent libraries were prepared from genomic DNA (gDNA) of strains ATCC 53904 (epoxomicin producer) and ATCC 53709 (eponemycin producer), respectively, using the Ion Plus Fragment Library kit (Life Technologies) with a gDNA input of 1 μg. An S220 Focused-ultrasonicator (Covaris Inc.) was used to shear gDNA to obtain fragments of 100–250 bp size. Separation and extraction of DNA fragments was performed on the electrophoresis platform Pippin Prep (Sage Science Inc.). The Ion Library Quantitation Kit (Life Technologies) was used to quantify the libraries by quantitative real-time PCR on a LightCycler 480 (Roche Applied Science). No additional library amplification was performed. Samples were prepared manually using the Ion PGM 200 Xpress Template Kit (Life Technologies). To facilitate the amplification of the high GC content DNA, betaine was added to the amplification mix to a final concentration of 1M. The thermo profile was modified (95°C for 10 minutes; 15 cycles, 95°C for 30 seconds and 68°C for 4 minutes; 30 cycles, 95°C for 30 seconds and 68°C for 6 minutes). Enrichment was performed on the Ion OneTouch ES (Life Technologies). Two sequencing runs per library were conducted on an Ion Personal Genome Machine® (PGM) System using Ion 316 chips and the Ion PGM Sequencing 200 Kit v2 (Life Technologies). Sequencing data was assembled with the CLC Genomics Workbench software version 5.01 (CLC Bio). The draft genomes were subjected to the online tool antibiotics & Secondary Metabolite Analysis Shell (antiSMASH).(19) The epx and epn gene clusters were both found split on two contigs. The sequence gaps were closed by Sanger sequencing using primer walking and the shotgun method (GenoTech, Baejeon, Korea). Manual in silico sequence analysis was performed using GC frame-plot(49) and BLAST(50). The Geneious software package (Biomatters Ltd.) and Artemis (Wellcome Trust Genome Campus) were used for sequence analysis and annotation.

Generation and screening of fosmid libraries

The genomic fosmid libraries were constructed for strains ATCC 53904 and Streptomyces hygroscopicus ATCC 53709. High-molecular weight chromosomal DNA was randomly sheared to obtain fragments of ~40 kb size and cloned into pCC1FOS (Epicentre Biotechnologies). Fosmid libraries with ~1800 clones each were generated in E. coli EPI300 according to the manufacturers’ instructions. For identification of the biosynthetic gene clusters the fosmid libraries were screened by PCR with primer pairs epxA_f GAATCTCAAGCGCGAGGGG/epxA_r GGTGTCGCGGAAGTAGTCC and epxF_f GCGCACCATGTCGCTGTTG/epxF_r GTAGTCGGGTGTCTCCTCC (library ATCC 53904) as well as epnA_f GTGTGGCCGTGAGCGGATTC/epnA_r GCGGCCACGTTCCGATCTTG and epnK_f CAGCATGCTGCTGCAAGCCC/epnK_r CCCGGATGAAGTTCGACCGC (library ATCC 53709). The primers were designed to amplify small specific fragments (0.3–0.5 kb) from the borders of the respective gene clusters.

Heterologous expression of the epx and the epn gene clusters

An XbaI restriction fragment from merLK01 was generated representing an integration cassette (int_neo) for stable chromosomal integration. int_neo was used to replace cat in fosmids 15C3 (epx cluster) and 2H4 (epn cluster) as described previously.(21) The resulting fosmids epxMS01 and epnLK01 were verified by restriction analysis. The fosmids were transferred into E. coli ET12567 and introduced into S. albus J1046 by triparental intergeneric conjugation with the help of E. coli ET12567/pUB307. Kanamycin resistance mutants were selected and designated as S. albus 7J1046/epxMS01-(13) and S. albus J1046/epnLK01- (13), respectively.

Supplementary Material

1_si_001

Acknowledgments

We thank Dr. R. Kersten for assistance with mass spectrometry analysis. We are grateful to Dr. A. Lechner and H. Sun for help with molecular networking and semiconductor sequencing, respectively. This work was supported by the NIH (CA127622 to B.S.M. and GM097509 to P.C.D. and B.S.M.), a Feodor Lynen postdoctoral fellowship from the Alexander von Humboldt Foundation to L.K., and an NIH instrument grant (S10-OD010640).

Footnotes

Accession codes. The complete sequences of the epoxomicin (27908 bp) and the eponemycin (23794 bp) biosynthetic gene clusters were deposited at NCBI under the accession numbers KF647219 and KF647220, respectively.

Supporting Information. Supporting tables and graphics are provided. This material is available free of charge via the Internet at http://pubs.acs.org.

References

  • 1.Ciechanover A. The ubiquitin-proteasome proteolytic pathway. Cell. 1994;79:13–21. doi: 10.1016/0092-8674(94)90396-4. [DOI] [PubMed] [Google Scholar]
  • 2.Lopes UG, Erhardt P, Yao R, Cooper GM. p53-dependent induction of apoptosis by proteasome inhibitors. J Biol Chem. 1997;272:12893–12896. doi: 10.1074/jbc.272.20.12893. [DOI] [PubMed] [Google Scholar]
  • 3.Drexler HC. Activation of the cell death program by inhibition of proteasome function. Proc Natl Acad Sci U S A. 1997;94:855–860. doi: 10.1073/pnas.94.3.855. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Grawert MA, Groll M. Exploiting nature’s rich source of proteasome inhibitors as starting points in drug development. Chem Commun (Camb) 2012;48:1364–1378. doi: 10.1039/c1cc15273d. [DOI] [PubMed] [Google Scholar]
  • 5.Hanada M, Sugawara K, Kaneta K, Toda S, Nishiyama Y, Tomita K, Yamamoto H, Konishi M, Oki T. Epoxomicin, a new antitumor agent of microbial origin. J Antibiot (Tokyo) 1992;45:1746–1752. doi: 10.7164/antibiotics.45.1746. [DOI] [PubMed] [Google Scholar]
  • 6.Sugawara K, Hatori M, Nishiyama Y, Tomita K, Kamei H, Konishi M, Oki T. Eponemycin, a new antibiotic active against B16 melanoma. I Production, isolation, structure and biological activity. J Antibiot (Tokyo) 1990;43:8–18. doi: 10.7164/antibiotics.43.8. [DOI] [PubMed] [Google Scholar]
  • 7.Pereira AR, Kale AJ, Fenley AT, Byrum T, Debonsi HM, Gilson MK, Valeriote FA, Moore BS, Gerwick WH. The carmaphycins: new proteasome inhibitors exhibiting an alpha, beta-epoxyketone warhead from a marine cyanobacterium. Chembiochem. 2012;13:810–817. doi: 10.1002/cbic.201200007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Koguchi Y, Kohno J, Suzuki S, Nishio M, Takahashi K, Ohnuki T, Komatsubara S. TMC-86A, B and TMC-96, new proteasome inhibitors from Streptomyces sp. TC 1084 and Saccharothrix sp. TC 1094. II Physico-chemical properties and structure determination. J Antibiot (Tokyo) 2000;53:63–65. doi: 10.7164/antibiotics.53.63. [DOI] [PubMed] [Google Scholar]
  • 9.Koguchi Y, Nishio M, Suzuki S, Takahashi K, Ohnuki T, Komatsubara S. TMC-89A and B, new proteasome inhibitors from Streptomyces sp. TC 1087. J Antibiot (Tokyo) 2000;53:967–972. doi: 10.7164/antibiotics.53.967. [DOI] [PubMed] [Google Scholar]
  • 10.Kuhn DJ, Chen Q, Voorhees PM, Strader JS, Shenk KD, Sun CM, Demo SD, Bennett MK, van Leeuwen FW, Chanan-Khan AA, Orlowski RZ. Potent activity of carfilzomib, a novel, irreversible inhibitor of the ubiquitin-proteasome pathway, against preclinical models of multiple myeloma. Blood. 2007;110:3281–3290. doi: 10.1182/blood-2007-01-065888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.McCormack PL. Carfilzomib: in relapsed, or relapsed and refractory, multiple myeloma. Drugs. 2012;72:2023–2032. doi: 10.2165/11209010-000000000-00000. [DOI] [PubMed] [Google Scholar]
  • 12.Curran MP, McKeage K. Bortezomib: a review of its use in patients with multiple myeloma. Drugs. 2009;69:859–888. doi: 10.2165/00003495-200969070-00006. [DOI] [PubMed] [Google Scholar]
  • 13.Glenn RJ, Pemberton AJ, Royle HJ, Spackman RW, Smith E, Jennifer Rivett A, Steverding D. Trypanocidal effect of alpha’, beta’-epoxyketones indicates that trypanosomes are particularly sensitive to inhibitors of proteasome trypsin-like activity. Int J Antimicrob Agents. 2004;24:286–289. doi: 10.1016/j.ijantimicag.2004.02.023. [DOI] [PubMed] [Google Scholar]
  • 14.Czesny B, Goshu S, Cook JL, Williamson KC. The proteasome inhibitor epoxomicin has potent Plasmodium falciparum gametocytocidal activity. Antimicrob Agents Chemother. 2009;53:4080–4085. doi: 10.1128/AAC.00088-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Groll M, Kim KB, Kairies N, Huber R, Crews CM. Crystal Structure of Epoxomicin: 20S Proteasome Reveals a Molecular Basis for Selectivity of α′,β′-Epoxyketone Proteasome Inhibitors. J Am Chem Soc. 2000;122:1237–1238. [Google Scholar]
  • 16.Rothberg JM, Hinz W, Rearick TM, Schultz J, Mileski W, Davey M, Leamon JH, Johnson K, Milgrew MJ, Edwards M, Hoon J, Simons JF, Marran D, Myers JW, Davidson JF, Branting A, Nobile JR, Puc BP, Light D, Clark TA, Huber M, Branciforte JT, Stoner IB, Cawley SE, Lyons M, Fu Y, Homer N, Sedova M, Miao X, Reed B, Sabina J, Feierstein E, Schorn M, Alanjary M, Dimalanta E, Dressman D, Kasinskas R, Sokolsky T, Fidanza JA, Namsaraev E, McKernan KJ, Williams A, Roth GT, Bustillo J. An integrated semiconductor device enabling non-optical genome sequencing. Nature. 2011;475:348–352. doi: 10.1038/nature10242. [DOI] [PubMed] [Google Scholar]
  • 17.Ross AC, Xu Y, Lu L, Kersten RD, Shao Z, Al-Suwailem AM, Dorrestein PC, Qian PY, Moore BS. Biosynthetic multitasking facilitates thalassospiramide structural diversity in marine bacteria. J Am Chem Soc. 2013;135:1155–1162. doi: 10.1021/ja3119674. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Henke W, Herdel K, Jung K, Schnorr D, Loening SA. Betaine improves the PCR amplification of GC-rich DNA sequences. Nucleic Acids Res. 1997;25:3957–3958. doi: 10.1093/nar/25.19.3957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Medema MH, Blin K, Cimermancic P, de Jager V, Zakrzewski P, Fischbach MA, Weber T, Takano E, Breitling R. antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 2011;39:339–346. doi: 10.1093/nar/gkr466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Kale AJ, McGlinchey RP, Lechner A, Moore BS. Bacterial self-resistance to the natural proteasome inhibitor salinosporamide A. ACS Chem Biol. 2011;6:1257–1264. doi: 10.1021/cb2002544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kaysser L, Bernhardt P, Nam SJ, Loesgen S, Ruby JG, Skewes-Cox P, Jensen PR, Fenical W, Moore BS. Merochlorins A-D, cyclic meroterpenoid antibiotics biosynthesized in divergent pathways with vanadium-dependent chloroperoxidases. J Am Chem Soc. 2012;134:11988–11991. doi: 10.1021/ja305665f. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Bierman M, Logan R, O’Brien K, Seno ET, Rao RN, Schoner BE. Plasmid cloning vectors for the conjugal transfer of DNA from Escherichia coli to Streptomyces spp. Gene. 1992;116:43–49. doi: 10.1016/0378-1119(92)90627-2. [DOI] [PubMed] [Google Scholar]
  • 23.Labeda DP, Kroppenstedt RM. Goodfellowia gen. nov., a new genus of the Pseudonocardineae related to Actinoalloteichus, containing Goodfellowia coeruleoviolacea gen nov., comb nov. Int J Syst Evol Microbiol. 2006;56:1203–1207. doi: 10.1099/ijs.0.64170-0. [DOI] [PubMed] [Google Scholar]
  • 24.Flett F, Mersinias V, Smith CP. High efficiency intergeneric conjugal transfer of plasmid DNA from Escherichia coli to methyl DNA-restricting streptomycetes. FEMS Microbiol Lett. 1997;155:223–229. doi: 10.1111/j.1574-6968.1997.tb13882.x. [DOI] [PubMed] [Google Scholar]
  • 25.Lombo F, Velasco A, Castro A, de la Calle F, Brana AF, Sanchez-Puelles JM, Mendez C, Salas JA. Deciphering the biosynthesis pathway of the antitumor thiocoraline from a marine actinomycete and its expression in two streptomyces species. Chembiochem. 2006;7:366–376. doi: 10.1002/cbic.200500325. [DOI] [PubMed] [Google Scholar]
  • 26.Bandeira N, Tsur D, Frank A, Pevzner PA. Protein identification by spectral networks analysis. Proc Natl Acad Sci U S A. 2007;104:6140–6145. doi: 10.1073/pnas.0701130104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Guthals A, Watrous JD, Dorrestein PC, Bandeira N. The spectral networks paradigm in high throughput mass spectrometry. Mol Biosyst. 2012;8:2535–2544. doi: 10.1039/c2mb25085c. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Watrous J, Roach P, Alexandrov T, Heath BS, Yang JY, Kersten RD, van der Voort M, Pogliano K, Gross H, Raaijmakers JM, Moore BS, Laskin J, Bandeira N, Dorrestein PC. Mass spectral molecular networking of living microbial colonies. Proc Natl Acad Sci U S A. 2012;109:1743–1752. doi: 10.1073/pnas.1203689109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Nguyen DD, Wu CH, Moree WJ, Lamsa A, Medema MH, Zhao X, Gavilan RG, Aparicio M, Atencio L, Jackson C, Ballesteros J, Sanchez J, Watrous JD, Phelan VV, van de Wiel C, Kersten RD, Mehnaz S, De Mot R, Shank EA, Charusanti P, Nagarajan H, Duggan BM, Moore BS, Bandeira N, Palsson BO, Pogliano K, Gutierrez M, Dorrestein PC. MS/MS networking guided analysis of molecule and gene cluster families. Proc Natl Acad Sci U S A. 2013;110:2611–2620. doi: 10.1073/pnas.1303471110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Challis GL, Ravel J, Townsend CA. Predictive, structure-based model of amino acid recognition by nonribosomal peptide synthetase adenylation domains. Chem Biol. 2000;7:211–224. doi: 10.1016/s1074-5521(00)00091-0. [DOI] [PubMed] [Google Scholar]
  • 31.Stachelhaus T, Mootz HD, Marahiel MA. The specificity-conferring code of adenylation domains in nonribosomal peptide synthetases. Chem Biol. 1999;6:493–505. doi: 10.1016/S1074-5521(99)80082-9. [DOI] [PubMed] [Google Scholar]
  • 32.Rausch C, Hoof I, Weber T, Wohlleben W, Huson DH. Phylogenetic analysis of condensation domains in NRPS sheds light on their functional evolution. BMC Evol Biol. 2007;7:78. doi: 10.1186/1471-2148-7-78. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Kraas FI, Giessen TW, Marahiel MA. Exploring the mechanism of lipid transfer during biosynthesis of the acidic lipopeptide antibiotic CDA. FEBS Lett. 2012;586:283–288. doi: 10.1016/j.febslet.2012.01.003. [DOI] [PubMed] [Google Scholar]
  • 34.Wilkinson S, Lowe LA. Structure of Polymyxin B1. Nature. 1964;202:1211. doi: 10.1038/2021211a0. [DOI] [PubMed] [Google Scholar]
  • 35.Powell A, Borg M, Amir-Heidari B, Neary JM, Thirlway J, Wilkinson B, Smith CP, Micklefield J. Engineered biosynthesis of nonribosomal lipopeptides with modified fatty acid side chains. J Am Chem Soc. 2007;129:15182–15191. doi: 10.1021/ja074331o. [DOI] [PubMed] [Google Scholar]
  • 36.Revill WP, Bibb MJ, Scheu AK, Kieser HJ, Hopwood DA. Beta-ketoacyl acyl carrier protein synthase III (FabH) is essential for fatty acid biosynthesis in Streptomyces coelicolor A3(2) J Bacteriol. 2001;183:3526–3530. doi: 10.1128/JB.183.11.3526-3530.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Moore BS, Hertweck C. Biosynthesis and attachment of novel bacterial polyketide synthase starter units. Nat Prod Rep. 2002;19:70–99. doi: 10.1039/b003939j. [DOI] [PubMed] [Google Scholar]
  • 38.Thibodeaux CJ, Chang WC, Liu HW. Enzymatic chemistry of cyclopropane, epoxide, and aziridine biosynthesis. Chem Rev. 2012;112:1681–1709. doi: 10.1021/cr200073d. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Read JA, Walsh CT. The lyngbyatoxin biosynthetic assembly line: chain release by four-electron reduction of a dipeptidyl thioester to the corresponding alcohol. J Am Chem Soc. 2007;129:15762–15763. doi: 10.1021/ja077374d. [DOI] [PubMed] [Google Scholar]
  • 40.Hofvander P, Doan TT, Hamberg M. A prokaryotic acyl-CoA reductase performing reduction of fatty acyl-CoA to fatty alcohol. FEBS Lett. 2011;585:3538–3543. doi: 10.1016/j.febslet.2011.10.016. [DOI] [PubMed] [Google Scholar]
  • 41.Gulder TA, Freeman MF, Piel J. The Catalytic Diversity of Multimodular Polyketide Synthases: Natural Product Biosynthesis Beyond Textbook Assembly Rules. Top Curr Chem. 2011 doi: 10.1007/128_2010_113. [DOI] [PubMed] [Google Scholar]
  • 42.Stein ML, Beck P, Kaiser M, Dudler R, Becker CF, Groll M. One-shot NMR analysis of microbial secretions identifies highly potent proteasome inhibitor. Proc Natl Acad Sci U S A. 2012;109:18367–18371. doi: 10.1073/pnas.1211423109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Krahn D, Ottmann C, Kaiser M. The chemistry and biology of syringolins, glidobactins and cepafungins (syrbactins) Nat Prod Rep. 2011;28:1854–1867. doi: 10.1039/c1np00048a. [DOI] [PubMed] [Google Scholar]
  • 44.Sambrook J, Russell DW. Molecular cloning A laboratory manual. Cold Spring Harbor Laboratory Press; New York: 2001. [Google Scholar]
  • 45.Kieser T, Bibb M, Buttner M, Chater K, Hopwood D. Practical Streptomyces Genetics. The John Innes Foundation; Norwich, UK: 2000. [Google Scholar]
  • 46.Patiny L, Borel A. ChemCalc: a building block for tomorrow’s chemical infrastructure. J Chem Inf Model. 2013;53:1223–1228. doi: 10.1021/ci300563h. [DOI] [PubMed] [Google Scholar]
  • 47.Frank AM, Bandeira N, Shen Z, Tanner S, Briggs SP, Smith RD, Pevzner PA. Clustering millions of tandem mass spectra. J Proteome Res. 2008;7:113–122. doi: 10.1021/pr070361e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T. Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics. 2011;27:431–432. doi: 10.1093/bioinformatics/btq675. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Bibb MJ, Findlay PR, Johnson MW. The relationship between base composition and codon usage in bacterial genes and its use for the simple and reliable identification of protein-coding sequences. Gene. 1984;30:157–166. doi: 10.1016/0378-1119(84)90116-1. [DOI] [PubMed] [Google Scholar]
  • 50.Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1_si_001

RESOURCES