Abstract
Cytochrome P450 monooxygenases (CYPs/P450s) are well known for their role in organisms’ primary and secondary metabolism. Among 20 P450s of the tuberculosis-causing Mycobacterium tuberculosis H37Rv, CYP128A1 is particularly important owing to its involvement in synthesizing electron transport molecules such as menaquinone-9 (MK9). This study employs different in silico approaches to understand CYP128 P450 family’s distribution and structural aspects. Genome data-mining of 4250 mycobacterial species has revealed the presence of 2674 CYP128 P450s in 2646 mycobacterial species belonging to six different categories. Contrast features were observed in the CYP128 gene distribution, subfamily patterns, and characteristics of the secondary metabolite biosynthetic gene cluster (BGCs) between M. tuberculosis complex (MTBC) and other mycobacterial category species. In all MTBC species (except one) CYP128 P450s belong to subfamily A, whereas subfamily B is predominant in another four mycobacterial category species. Of CYP128 P450s, 78% was a part of BGCs with CYP124A1, or together with CYP124A1 and CYP121A1. The CYP128 family ranked fifth in the conservation ranking. Unique amino acid patterns are present at the EXXR and CXG motifs. Molecular dynamic simulation studies indicate that the CYP128A1 bind to MK9 with the highest affinity compared to the azole drugs analyzed. This study provides comprehensive comparative analysis and structural insights of CYP128A1 in M. tuberculosis.
Keywords: cytochrome P450 monooxygenenases, CYP128A1, Mycobacterium tuberculosis H37Rv, tuberculosis, molecular dynamic simulations, azole drugs, menaquinone
1. Introduction
Tuberculosis (TB), caused by Mycobacterium tuberculosis H37Rv, remains a serious public health problem despite the existence of international TB control programs [1]. Recent data from the World Health Organization (WHO) shows that about 10 million people fell ill with TB in 2018 [1]. TB’s global threat to human health has been exacerbated in recent years by the emergence of widespread multi- and extensively drug-resistant M. tuberculosis strains [1]. The developing countries of South-East Asia and Africa have shown high incidence rates of TB. The prevalence of the disease in these countries is mainly due to lack of basic sanitation (causing an increase in transmission of the disease), human immunodeficiency virus (HIV) infection and lack of drugs to treat the disease [1]. Recent statistics from South Africa revealed that TB is the major killer among infectious diseases, indicating that this disease is still a major challenge in the country [2].
In 1998, determination of the M. tuberculosis H37Rv genome sequence encouraged more investigation of new anti-tubercular drugs and the seeking of more knowledge on the complex biology of the M. tuberculosis bacterium [3]. This highlighted the importance of lipid metabolism in M. tuberculosis; novel biosynthetic pathways were found to be involved in the synthesis of compounds such as phenolphthiocerol, mycolic acids and mycocerosic acid for the complex cell wall structure of the bacterium [4]. Among the enzymes involved in lipid metabolism, cytochrome P450 monooxygenases (CYPs/P450s) in M. tuberculosis were found to play a key role in the metabolism of lipids [5,6]. P450s are heme-thiolate proteins found in all species across biological domains [7]. Recent studies revealed the presence of a large number of P450s in mycobacterial species and most of these P450s were found to be involved in lipid metabolism [6]. M. tuberculosis H37Rv has 20 P450s in its genome and some of these P450s are indeed involved in lipid metabolism [5]. Furthermore, one of the P450 genes, namely CYP125A1, was used as a key factor in determining the cholesterol degrading ability of mycobacterial species [8].
Among M. tuberculosis H37Rv P450s, CYP128A1 gained particular interest among researchers owing to its history indicating its essentiality and its physiological importance. Transposon site hybridization mutagenesis studies indicated that CYP128A1 is essential for in vitro survival of M. tuberculosis H37Rv [9]. Interestingly, another study, which used a similar approach, revealed that CYP128A1 is not essential for survival of M. tuberculosis CDC1551 [10]. However, this study had a backdrop of limited gene coverage in its mutant library. In Vitro M. tuberculosis H37Rv latency model studies including a carbon starvation model [11] and hypoxia model [12] showed up-regulation of CYP128A1, suggesting that this P450 has a potential role in M. tuberculosis latency. Until 2016, the nature of CYP128A1 with respect to its essentiality remained a mystery. Research revealed that CYP128A1 is non-essential for survival of M. tuberculosis H37Rv as the CYP128A1 gene knock-out strain survives [13]. However, the CYP128A1 gene knock-out strain has proven to be hyper-virulent [13], indicating this gene actually playing a role in the synthesis of a compound that acts as a negative regulator of virulence.
Heterologous expression of CYP128A1 posed a great challenge to researchers, as the expression of this particular P450 in Escherichia coli has been unsuccessful [14,15]. Genomic analysis revealed that CYP128A1 is part of an operon that consists of two other genes, stf3 and rv2269c [16]. In Vivo studies using M. smegmatis as model strain demonstrated that CYP128A1 is involved in hydroxylation of menaquinone-9 (MK9) and is essential in the synthesis of this compound, whereas Stf3 was found to introduce the sulfate group to menaquinone-9 and rv2269c was found to act as a promoter [13]. The sequence of reaction is that CYP128A1 introduces the hydroxyl group into MK9, followed by the addition of the sulfate group by the Stf3 that leads to the synthesis of sulfomenaquinone [13].
Lipoquinones are electron transport molecules that are involved in the respiratory function of bacteria and mainly consist of menaquinone and ubiquinone [17]. Menaquinones (2-methyl-3-polyprenyl-1,4-naphthoquinones) especially MK9 is ubiquitous and unique to mycobacteria [17], indicating that CYP128A1 should be present in all mycobacterial species. However, to date, the distribution of CYP128A1 in such a large number of mycobacterial species belonging to six different mycobacterial categories, i.e., Mycobacterium tuberculosis complex (MTBC), M. chelonae-abscessus complex (MCAC), M. avium complex (MAC), mycobacteria causing leprosy (MCL), non-tuberculosis mycobacteria (NTM) and saprophytes (SAP) is still unknown. Anti-fungal azole drugs were shown to be promising new tools to fight TB, particularly as they showed high antimycobacterial activity [18,19,20] and interestingly, to date, characterized M. tuberculosis P450s have been found to bind quite a number of azole drugs [21], leading to M. tuberculosis P450s becoming the main focus as novel drug targets against this pathogen [5]. The binding of azole drugs to CYP128A1 could have an undesired effect, as this could lead to possible disruption of enzyme function, a vital component in the virulence modulation of M. tuberculosis.
To date, genome-wide analysis of CYP128 P450s has only been carried out in 60 mycobacterial species [22] and because of the failure of CYP128A1 heterologous expression [14,15], analysis of binding patterns of azole drugs to CYP128A1 has not been performed. To address these research gaps, in this study, genome-wide data mining, annotation, and phylogenetic analysis of CYP128A1 were carried out in 4250 mycobacterial species. Furthermore, in silico analysis of binding of different azole drugs with the CYP128A1 model was assessed. The results for CYP128A1 were discussed in the context of gaining more knowledge on the role of this P450 in mycobacterial species.
2. Results and Discussion
2.1. Presence of CYP128 P450s in Five of the Six Mycobacterial Category Species
Comprehensive comparative analysis of CYP128 P450s in 4250 mycobacterial species belonging to six different categories (MTBC, MCAC, MAC, MCL, NTM and SAP) revealed that this P450 family is present in 2646 mycobacterial species (Figure 1 and Figure 2; Supplementary Dataset 1). Thirty-seven mycobacterial species were found to have short P450 sequences that have none or only one of the highly conserved P450 motifs, i.e., EXXR and CXG (Supplementary Dataset 1). Thus, these short sequences are not included in the study and the species are considered not to have CYP128 P450. Among mycobacterial species that were used in this study, most of the species (99%) belonging to the MTBC category have this P450 (Figure 2). Among 2350 MTBC species only 38 species do not have CYP128 (Figure 2). In contrast to MTBC species, most of the species belonging to another five mycobacterial categories do not have this P450 (Figure 2). CYP128 P450s were found in only 34% of NTM species, followed by 26% of MAC species, 12% of MCAC species and 18% of SAP species. The results observed in this study revealed that none of the MCL species has CYP128 P450s, which is consistent with previous observations that MCL species do not have this P450 [22]. Overall, based on the results of this study and comparison with the results reported earlier [22], we conclude that CYP128 P450s can be found in species belonging to the five mycobacterial categories except in species of the MCL category. Most of the CYP128 P450s (81%) are 489 amino acids in length (Supplementary Dataset 1). An anomaly was observed that two CYP128 P450 sequences, one from M. tuberculosis BTB10-120 and the other from M. tuberculosis SIT745/EAI1-MYS, has 877 amino acids. However, analysis of genome sequences revealed that CYP128 and the sulfotransferase (stf3) genes were present next to each other on different DNA strands, indicating an error in annotation leading to the fused protein. A single copy of the CYP128 gene was found in almost all species, except for 25 species where two copies (22 species), three copies (two species) and four copies (one species) of this gene were found (Supplementary Dataset 1). The species with more than one copy of this P450 gene were found across four different categories, where eight species were from MTBC, 12 species from NTM, three species from MAC and two species from MCAC (Supplementary Dataset 1). CYP128 P450 sequences identified in the study are presented in Supplementary Dataset 2.
2.2. Different Mycobacterial Category Species Have Different CYP128 Subfamily Preferences
CYP128 subfamily analysis revealed the presence of a new CYP128 subfamily in mycobacterial species. Thus, at present two CYP128 subfamilies can be found in mycobacterial species, i.e., A and B (Figure 3). Phylogenetic analysis revealed an interesting feature in CYP128 subfamilies with respect to their categories (Figure 1). In a previous study, it was observed that P450s belonging to different mycobacterial categories grouped together according to their category, indicating high conservation of P450 protein sequences after speciation into different categories [22]. In this study, the same phenomenon was observed for CYP128 subfamilies A and B, as these subfamily proteins were grouped together according to their mycobacterial categories, despite the subfamilies being clearly separated on the tree (Figure 1). Also, contrasting subfamily features were observed among different mycobacterial categories (Figure 3). Except for one species, M. tuberculosis XTB13-223, all species belonging to the MTBC category have CYP128 subfamily A (Figure 3). However, in contrast to MTBC species, in other mycobacterial categories subfamily B was dominantly present (Figure 3). This indicates that during evolution mycobacterial species had both subfamilies, but owing to their lifestyles or ecological niches only one subfamily was favored. This phenomenon of enriching a particular type of P450 family or subfamilies in microbial species is well known [6,22,23,24,25,26,27,28]. An interesting pattern of subfamilies was found in species having more than one copy of the CYP128 gene. Eight MTBC species have two copies of the CYP128 gene, both belonging to the same subfamily A. However, in other category species subfamilies A and B are present; especially in species belonging to the NTM category, subfamily B is populated (Supplementary Dataset 1).
2.3. CYP128 Family Ranked Fifth among P450 Families
It has been proposed that the present-day P450s are evolved from the ancient P450 CYP51 [29,30]. Evolution of different P450 families and their rate of evolution indicate that the higher the evolutionary rate, the more catalytically diverse they are [6,22,24]. Parvez and co-workers [22] proposed a ranking system for P450 families where different P450 families are given a rank based on the number of conserved amino acids and it was proposed that the higher the conservation, the less the catalytic diversity. Furthermore, a recent study revealed that better ranking of P450 families can be achieved using a larger sample size [31]. In a previous study, only 49 CYP128 P450s were used to predict the ranking [22]. Identification of quite a large number of CYP128 P450s in this study necessitates re-assessment of ranking, in order to calculate the accurate number of conserved amino acids in P450s, as it was mentioned that P450s with similar amino acid length should be used [22,25,31,32]. Thus, in this study, 2191 CYP128 P450s with amino acid lengths ranging from 479 to 489 amino acids (Supplementary Dataset 1) were used to assess the conservation ranking of the CYP128 P450 family. PROfile Multiple Alignment with Predicted Local Structures and 3D Constraints (PROMALS3D) analysis [33] revealed the presence of 217 amino acids that are invariantly conserved in CYP128 P450s (Table 1). Comparison with other P450 families from different biological kingdoms placed CYP128 family in the fifth position, compared to 23rd position previously (Table 1), indicating that this P450 family is one of the best conserved families. A complete table with updated P450 family ranking is presented in Table S1.
Table 1.
P450 Family | Number of Member P450s | Kingdom | PROMALS3D Conservation Index | Rank (Highest to Lowest Conservation) | ||||
---|---|---|---|---|---|---|---|---|
5 | 6 | 7 | 8 | 9 | ||||
CYP141 | 29 | Bacteria | 0 | 0 | 0 | 0 | 389 | 1 |
CYP51 | 50 | Bacteria | 11 | 102 | 0 | 0 | 264 | 2 |
CYP137 | 38 | Bacteria | 145 | 0 | 0 | 0 | 251 | 3 |
CYP121 | 34 | Bacteria | 0 | 0 | 0 | 0 | 233 | 4 |
CYP128 | 2191 | Bacteria | 118 | 25 | 0 | 0 | 217 | 5 (previously 23rd) |
CYP132 | 39 | Bacteria | 175 | 0 | 0 | 0 | 217 | 5 |
CYP5619 | 23 | Stramenopila | 118 | 38 | 170 | 0 | 199 | 6 |
CYP124 | 71 | Bacteria | 52 | 35 | 59 | 0 | 170 | 7 |
CYP139 | 894 | Bacteria | 0 | 127 | 0 | 0 | 165 | 8 |
CYP188 | 67 | Bacteria | 62 | 0 | 100 | 0 | 141 | 9 |
CYP123 | 74 | Bacteria | 62 | 0 | 82 | 0 | 137 | 10 |
The bold shows the position of CYP128 as it been revised from 23rd place to 5th because of this study results.
2.4. CYP128 Family Has Distinctive Amino Acid Patterns at EXXR and CXG Motif
A study of the analysis of P450 motifs EXXR and CXG revealed that all P450 families have a unique amino acid pattern that serves as a signature characteristic of that particular P450 family [34]. Subsequent studies of P450 families strongly supported this hypothesis [25,31,32]. Analysis of the EXXR and CXG P450 motifs in the CYP128 family has not been performed, and the large number of CYP128 P450s identified in this study provides an opportunity to analyze the amino acid patterns at the EXXR and CXG-motifs for this family. Analysis of the amino acids at the EXXR motif indicated the presence of E-T(84%)/Q(11%)/H(1%) -W(84%)/L(15%)-R amino acid patterns at this motif, with most of the CYP128 P450s (84%) having the E-T-W-R amino acid pattern (Figure 4). Analysis of amino acid patterns at the CXG motif revealed the presence of F-G-S(88%)/Y(12%) -G-I(88%)/V(6%)/A(5%)/P(1%) -H-L(89%)/M(11%) -C-P(87%)/I(7%)/L(6%) -G amino acid patterns, with the majority of the CYP128 P450s (86%) containing the F-G-S-G-I-H-L-C-P-G amino acid patterns (Figure 4). Amino acids patterns at the EXXR and CXG motifs of the CYP128 family were found to be unique compared to P450 families from different biological kingdoms [26,31,32,34], further supporting the hypothesis that amino acid patterns at these motifs are a signature of the P450 family [34].
2.5. Most CYP128 P450s Exist in Secondary Metabolite Biosynthetic Gene Clusters
P450s are well known to play a key role in the synthesis of different secondary metabolites [35,36] and a recent study undertaking comparative analysis of secondary metabolite biosynthetic gene clusters (BGCs) between 48 Streptomyces species and 60 mycobacterial species revealed that CYP128 P450s are part of a secondary metabolite BGC [6]. The CYP128 P450 BGCs were found to have one or two other P450s in the cluster and the CYP128 P450 is always found with CYP124A1 or together with CYP124A1 and CYP121A1 P450s [6]. In this study, secondary metabolite BGC analysis revealed that 78% of CYP128 P450s were found to be part of secondary metabolite BGCs; 2090 CYP128 P450s from 2674 CYP128 P450s were found to be part of secondary metabolite BGCs. Analysis of cluster types revealed 1994 CYP128 P450 clusters belonging to the tRNA-dependent cyclodipeptide synthases (CDPS) cluster type, followed by 94 having no cluster types, indicating a novel BGC cluster; one belongs to Type I Polyketide synthase (T1PKS) and the last one belongs to the non-ribosomal peptide synthetase (NRPS) cluster type (Supplementary Dataset 1). In 25 species with two or more CYP128 P450s, four species (M. tuberculosis KI_19771, M. tuberculosis 402267, M. canettii CIPT 140070010, and M. canettii CIPT 140010059) were found to have all their CYP128 P450s as part of a cluster type, three species (M. tuberculosis BTB10-253, M. tuberculosis M1415, and Mycobacterium sp. GA-1199) had only one and the rest of the species had no CYP128 P450s as part of any cluster types (Supplementary Dataset 1).
An interesting contrast pattern was observed when comparing CYP128 P450 BGCs among different mycobacterial category species (Table 2). Most of the CYP128 P450s were found to be part of BGCs in MTBC species, whereas only a handful of CYP128 P450s were part of BGCs in species belonging to the categories MCAC, NTM, and MAC (Table 2). Furthermore, most BGCs of MTBC species have three P450s, CYP128, CYP121, and CYP124, followed by BGCs with CYP128 P450 and BGCs with CYP128 and CYP124 P450s (Table 2). Interestingly, the remaining four mycobacterial category species BGCs have only CYP128 P450 (Table 2). None of the BGCs from all five different mycobacterial categories has the combination of P450s, CYP128, and CYP121 (Table 2).
Table 2.
Data type | Mycobacterial Category | ||||
---|---|---|---|---|---|
MTBC | MCAC | NTM | MAC | SAP | |
Total number of CYP128 P450s | 2328 | 169 | 140 | 35 | 2 |
Number of CYP128 P450s not part of BGCs | 282 | 147 | 120 | 34 | 1 |
Number of CYP128 P450s part of BGCs | 2046 | 22 | 20 | 1 | 1 |
Number of BGCs with CYP128, CYP121 and CYP124 P450s | 1908 | 0 | 0 | 0 | 0 |
Number of BGCs with CYP128 P450 | 136 | 22 | 21 | 1 | 1 |
Number of BGCs with CYP128 and CYP121 P450s | 0 | 0 | 0 | 0 | 0 |
Number of BGCs with CYP128 and CYP124 P450s | 2 | 0 | 0 | 0 | 0 |
Abbreviations: MTBC, Mycobacterium tuberculosis complex; NTM, non-tuberculosis mycobacteria; MCAC, Mycobacterium chelonae-abscessus complex; MAC, M. avium complex; and SAP, saprophytes; BGCs, biosynthetic gene clusters.
2.6. CYP128A1 Protein Model Has High Affinity to Menaquinone-9
Since the expression of CYP128A1 in heterologous hosts such as E. coli was found to be difficult [14,15] and at present there is no scope to generate the CYP128A1 crystal structure, in this study a 3D model of CYP128A1 was built to explore its binding affinity with MK9 and different azole drugs. The CYP128A1 model was built using the structure of Vitamin D3 hydroxylase (Vdh) from Pseudonocardia autotrophica as a template (PDB ID: 3VRM, 33.8% identity) [37]. The model was optimized using molecular-dynamics (MD) simulations (Figure 5) as described in the subsection on Materials and Methods. Notably, other models based on templates sharing similar coverage and identity compared to 3VRM were tested. Other structures include 1Z8P (32.4% identity) [38], 3A4G (33.5% identity) [39] and 5GNM (33.6% identity) [40]. Although some of these structures have a bit better resolution than 3VRM, they either correspond to apo structures (5GNM) or structures bound to small ligands (1Z8P and 3A4G), leaving the catalytic site “packed up” and thus inaccessible through docking simulations. This was confirmed by visual inspection together with our inability to generate binding poses in the catalytic site when docking on models based on 5GNM or 3A4G templates (results not shown). Conversely, 3VRM corresponds to the structure of Vdh mutant bound with Vitamin D3 where the catalytic site is open enough to enable the binding of ligands as big as MK9. As a result, docking simulations on this model were successful in finding binding poses for all the ligands tested.
Homology modeling was carried out using the Molecular Operating Environment (MOE) software as described in the Materials and Methods section. Importantly, sequence alignment did not reveal any major indel regions compared to our target sequence (see Figure S1). We reported a five-residue loop as the most significant insertion with a fairly large distance to the catalytic site (23.5 Å to the ferric ion of the HEME group) and a seven-residue gap as the most important deletion. During our protocol, 10 intermediate models were generated independently. For every model, the RMSD (root-mean-square deviation) to the mean conformation was calculated and a low standard deviation () to the mean structure was obtained ( for residues in the five-residue insertion, for residues on both sides of the seven-residue deletion, for the whole structure), suggesting good convergence and a limited number of possible variations in the target model. Our final CYP128A1 model (Figure 5), selected based on MOE’s built-in score, was used as a target in a first round of docking simulations. For each compound, the 10 top docked complexes were then equilibrated through MD (six complexes in the case of MK9, see Materials and Methods section). Next, MK9 and azole drugs were removed and re-docked independently onto each of their corresponding equilibrated receptors. In Table 3, we reported the results of our re-docking procedure with top scores (in kcal/mol) listed in column 2. The order of binding was as follows: MK9 > itraconozole > posaconazole > ketoconazole > econazole > miconzaole > voriconazole > clotrimazole > fluconazole (Table 2).
Table 3.
Compound | Top Score (kcal/mol) | Score of First Well-Oriented Pose (kcal/mol) | Rank of First Well-Oriented Pose |
---|---|---|---|
Menaquinone-9 (MK9) | −13.48 | −12.83 (1) | 7 |
Itraconazole | −12.57 | −10.72 (3) | 10 |
Posaconazole | −11.81 | −11.81 (2) | 1 |
Ketoconazole | −10.47 | −9.84 (4) | 9 |
Econazole | −8.24 | −7.56 (7) | 6 |
Miconazole | −8.10 | −8.10 (5) | 1 |
Voriconazole | −7.63 | −7.57 (6) | 2 |
Clotrimazole | −7.46 | −7.21 (9) | 5 |
Fluconazole | −7.38 | −7.38 (8) | 1 |
In P450 enzyme systems, substrates and azole drugs are known to coordinate with the ferric core of the heme group. In the case of MK9, coordination takes place at the end of its hydrophobic tail, consistent with omega hydroxylation of the molecule [13,16]. For azole drugs, coordination occurs via the imidazole ring. Importantly, docking of MK9 and azole drugs was performed in a non-covalent way in the present paper. Therefore, no coordination was predicted for the tested ligands, which would require more thorough investigation, especially considering changes in electronic density. However, since non-covalent binding is a critical step in the establishment of covalent interactions, we believe the present results still provide a good indication of how azoles drugs can interact with CYP128A1 and how they rank in terms of affinity.
Column 3 in Table 3 shows the score of the first well-oriented pose, i.e., where the end of the hydrocarbon tail (MK9) or the imidazole ring (azole drugs) faces the ferric core of the heme. Interestingly, considering well-oriented poses does not significantly affect the ranking of compounds. Column 4 shows the rank of the first well-oriented pose over all the poses generated for each ligand. While top-ranked poses are already well oriented in some cases (posaconazole, miconazole, voriconazole, fluconazole), well-oriented poses of other compounds such as itraconazole and ketoconazole exhibit lower ranking (10 and 9, respectively). Nonetheless, we observed that at least one pose among the top 10 is well oriented for each compound, leading to a score value similar to the top-ranked pose in each case. We depicted the first well-oriented poses of each compound in Figure 6.
In summary, in silico results indicated that the CYP128A1 indeed binds to the substrate (MK9) with highest affinity compared to the azole drugs analyzed in the study (Table 3). Among azole drugs itraconazole, posaconazole and ketoconazole have the highest binding affinity with CYP128A1. One possible reason for the high binding affinity of these compounds compared to other azoles is that these molecules are more extended depicting the longer side chain of MK9. A point to be noted that same phenomenon of better interactions of azoles due to their more extended structures was also observed for CYP51 of Sporothrix schenckii [41]. CYP128A1 binding to different azole drugs is also consistent with other M. tuberculosis H37Rv P450s that are experimentally shown to bind azole drugs albeit with different affinities [21].
3. Materials and Methods
3.1. Species and Databases
A total of 4250 mycobacterial species genomes (permanent draft) available at the Joint Genome Institute (JGI)/Integrated Microbial Genomes and Microbes (IMG/M) database (from February 2019 to March 2019) were used in this study [42]. Species were grouped in six different mycobacterial categories, as described elsewhere [22,31]. The six categories included MTBC (2350 species), MCAC (1391 species), MAC (130 species), MCL (7 species), NTM (361 species), and SAP (11 species). Mycobacterial species along with their genome IDs and their mycobacterial categories were presented in Supplementary Dataset 1.
3.2. Genome Data Mining and Annotation of CYP128 P450s
CYP128 P450s mining in different bacterial species was carried out following the method described elsewhere [22,31]. Briefly, BLAST analysis was performed with the M. tuberculosis H37Rv CYP128A1 (Rv2268c) P450 sequence with default settings against individual mycobacterial species genomes at the IMG/M database. Considering the International Cytochrome P450 Nomenclature criteria, i.e., P450s showing > 40% identity belong to the same family [43,44,45], all the hit proteins with more than 40% identity were selected and subjected to P450 characteristic motifs analysis as described elsewhere [34,46,47]. Proteins that showed all P450 characteristic motifs were selected for further analysis. Proteins that were short in amino acid length and had none or only one of the highly conserved P450 motifs, such as EXXR and CXG, were considered P450 fragments and not included in the study. The selected proteins were then subjected to BLAST analysis at the P450 webpage (http://www.p450.unizulu.ac.za/) to identify the named homology protein, in this case CYP128 P450. Hit proteins that showed homology to CYP128 were then selected and different subfamilies were assigned following the International Cytochrome P450 Nomenclature criteria, i.e., P450s showing >55% identity belong to the same subfamily [43,44,45].
3.3. Phylogenetic Analysis of CYP128 P450s
Phylogenetic analysis of CYP128 P450s was carried out following the method described elsewhere [31]. First, the protein sequences were aligned by MAFFT v6.864 embedded on the Trex web server [48]. Thereafter, the alignments were automatically subjected to tree inferring and optimization by the Trex web server [49]. Finally, the best-inferred tree was envisioned and colored using iTOL (http://itol.embl.de/about.cgi) [50].
3.4. Amino Acid Conservation Analysis
Amino acid conservation analysis of CYP128 P450s was carried out following the methods described elsewhere [22,31,32]. Briefly, 2191 CYP128 P450s with amino acid length ranging from 475–496 amino acids (Supplementary Dataset 1) were selected and subjected to PROMALS3D analysis [33]. In order to calculate the accurate number of conserved amino acids in P450s, it was mentioned that P450s with similar amino acid length should be used [22,25,31,32]. PROMALS3D analysis provided the amino acid conservation index at different protein sequence positions [33], using the numbers from 5 to 9, where 9 is the invariantly conserved amino acid. The conserved number of amino acids for each conservation index was counted and compared with other P450 families from different biological kingdoms [22,25,31,32] to determine the CYP128 P450 family conservation rank.
3.5. Generation of EXXR and CXG Sequence Logos
CYP128 P450 family EXXR and CXG sequence logos were constructed using the method described elsewhere [22,32,34]. Briefly, all CYP128 P450 sequences were aligned using ClustalW multiple alignments embedded in MEGA7 [51]. Then the EXXR and CXG region amino acids (4 and 10 amino acids, respectively), were copied and pasted in the WebLogo program (http://weblogo.berkeley.edu/logo.cgi) [52,53]. As a selection parameter, the image format was selected as PNG (bitmap) at 300 dpi resolution. The percentage predominance of amino acids at specific positions was calculated, taking into account the total number of amino acids as 100%. The generated EXXR and CXG logos were used for analysis and comparison to the different P450 family EXXR and CXG logos that have been published and are accessible to the public [22,31,32,34].
3.6. Identification of CYP128 P450 Secondary Metabolite BGCs
Secondary metabolite BGCs analysis of CYP128 P450s was carried out following the method described elsewhere [31]. Mycobacterial species BGCs listed at the IMG/M website were manually searched for the presence of CYP128 P450s using their gene ID [42]. The BGCs that contained CYP128 P450 were selected and the entire gene cluster sequence was downloaded. The listed BGCs at IMG/M are unspecific and to identify the particular BGC type, the downloaded gene cluster sequence was subjected to secondary metabolite BGC analysis using anti-SMASH [54]. The type of BGC, percentage similarity to a known cluster and the known cluster name were recorded from the anti-SMASH analysis. Standard BGC abbreviation terminology developed by anti-SMASH was used in the study.
3.7. CYP128 Homology Modeling
The Molecular Operating Environment (Chemical Computing Group) was used to build a 3D model of CYP128A1. The crystallographic structure of the P450 Vitamin D3 hydoxylase bound with vitamin D3 (VD3) (PDB ID: 3VRM) [37] was used as a template, showing 33.8% identity and 48% similarity with the targeted sequence after alignment. Homology modeling of CYP128A1 was performed by setting the number of generated models to 10 and by selecting the final model based on MOE’s Generalized Born/Volume Integral (GB/VI) scoring function. During the modeling, the heme group of the template—including the ferric ion—was kept as part of the environment and included in the refinement step. The final model was eventually protonated at neutral pH and minimized using a MOE’s built-in protocol.
3.8. Molecular Docking of the CYP128 Model
Non-covalent docking of menaquinone-9 (MK9), a known substrate of CYP128A1, and different azole compounds (Figure S2) was done with MOE’s dock utility. Prior to this, all compounds were “washed” using MOE, i.e., we generated the most dominant protonation state of each compound at neutral pH, computed its atomic partial charges, and minimized the generated 3D structure. Docking was performed into the catalytic site of our CYP128A1 model, setting the placement method to “Triangle Matcher” and the scoring and rescoring methods to “London dG” and “GBVI/WSA dG”, respectively. After docking, the ligand structures were further refined using the fixed receptor option. This refinement step entails energy minimization using the conventional Amber10:Extended Huckel Theory molecular mechanics force field to take electronic effects into account. For each compound, the top 10 complexes as identified from GBVI/WSA-dG scores were considered for further MD equilibration (see Section 3.10). In the case of MK9, since the molecule was predicted to undergo omega hydroxylation via the heme group, only poses with proper orientation (hydrophobic tail facing the heme group) were kept, resulting in six (out of 10) poses being selected.
3.9. Density Functional Theory
MD parameters—e.g., partial charges and force constants—for non-standard residues like the heme group are not provided by standard MD force fields. Hence, to run further equilibration of our docked complexes, it was necessary to generate those parameters via quantum calculations. Such calculations were performed using Gaussian 09 (g09) together with the Metal Center Parameter Builder (MCPB.py) available in the Amber16 package [55,56]. MCPB.py was applied to create the correct g09 input files by including the heme ferric cation and its nearby residues from our 3D structures. g09 was successively utilized for geometry optimization, force constant calculation and Merz-Kollman RESP charge calculation of the selected atoms. Every calculation was performed at the B3LYP/6-31G* level of theory. Finally, the MCPB.py program was re-applied to fit RESP charges and generate parameters compatible with Amber’s ff14SB force field. Note that a partial charge of 0.250e was calculated for the ferric cation in the heme group. Keeping in mind that the partial charge of a metal ion is less than 2e, no matter how big its formal charge (oxidation state) is, our result looks reasonable.
3.10. Molecular Dynamics
While MCPB.py and g09 were used to get the correct force field parameters for the ferric core of the heme and nearby atoms, Amber’s antechamber utility [55,56] was applied to generate MD parameters for MK9 and azole compounds. Amber’s tleap program was applied to solvate all our docked complexes in TIP3P water and to generate the correct topology file to conduct MD simulations using the ff14SB force field. For each complex, a rectangular box with at least 10 Å distance between the box edges and the protein was considered, resulting in about 16,380 water molecules in each case. Na+ and Cl− ions were added to approximate 0.15 M concentration as well as to neutralize the system. Minimization, NVT, NPT, and MD production runs were all performed with Amber’s pmemd utility. Minimization of each structure was carried out in two phases, both using the steepest descent and conjugate gradient methods successively. Briefly, minimization was done in 10,000 minimizations steps on hydrogens and solvent atoms only, i.e., by restraining the protein–ligand complexes. Next, a 20,000-step minimization was run without restraints. The structures were then equilibrated in the NVT ensemble during 20 ps and in the NPT ensemble during 40 ps, setting the temperature to 298 K and the pressure to 1 bar. Finally, MD production was run to relax each complex for at least 20 ns. The stability of each complex was assessed by checking if the RMSD of both the protein and the ligand reached a plateau by the end of the simulation (see Figure S3). In general, we found 20 ns sufficient to equilibrate the complexes, although in some cases, an extra 5–10 ns may have been required to equilibrate the structure fully.
3.11. Redocking of Compounds
MK9 and the azoles compounds were re-docked on their corresponding set of equilibrated structures using the MOE’s dock (six structures for MK9, 10 structures for each of the azole drugs). The same options as in the first docking step were used (see Section 3.8).
4. Conclusions
The availability of quite a large number of bacterial genome sequences and different bioinformatics tools gives us an opportunity to understand the role of different genes/proteins in bacterial communities at large rather than confining the results to a single bacterium. In this study, we utilized such information and tools to understand the CYP128 P450 family profiles in different mycobacterial species and understand its structure. The study revealed interesting aspects; for example, P450 was found to be highly conserved in the MTBC species causing lung disease in humans and other animals, indicating the important function of this enzyme during the latent phase of these organisms, as this P450 was found to be expressed during such stage [11,12]. Furthermore, only 12–34% of non-MTBC species that have this P450 strongly support its important role during the latent phase of MTBC species. Contrasting CYP128 subfamily profiles between MTBC and four other different mycobacterial categories revealed by this study suggests that CYP128 P450s may play different roles in different category species, as previously observed for the CYP53 family in fungi [25]. Molecular dynamic simulation of CYP128A1 interactions with different ligands revealed that this P450 has the highest binding affinity to its substrate compared to azole drugs. Binding of azole drugs to CYP128A1 protein models further shows the complex biology of mycobacterial species, as azole drugs have been found to be effective against Mycobacterium tuberculosis [18,19,20], indicating the expression of CYP128A1 in the latent phase [11,12], and possible binding of azole drugs during experimentation did not resulted in a hypervirulent bacterium. This may be due to the inhibition of essential P450s such as CYP121A1 and CYP125A1, leading to the death of M. tuberculosis H37Rv. One of the interesting observations of this study is that more than one copy of this gene was present in some species and one of these genes was found not to be part of the gene cluster. This poses a fascinating evolutionary question on the need for having more than one copy of this gene. Research on the CYP128 P450s that are not part of gene clusters would be interesting in discerning other roles of CYP128 P450s in mycobacterial species. It is also important that CYP128 gene clusters were found to have CYP124A1 or both CYP124A1 and CYP121A1, indicating the collective efforts of these P450s in generating complex lipids in mycobacterial species, especially in MTBC species, as none of the other four mycobacterial category species has this type of combination in its CYP128 gene clusters. Future studies should include cloning of the entire CYP128A1 gene cluster (containing CYP121A1 and CYP124A1) and analyzing the effect of the cluster molecule in M. tuberculosis pathogenesis.
Acknowledgments
The authors want to thank Barbara Bradley, Pretoria, South Africa for English language editing.
Supplementary Materials
The following are available online at https://www.mdpi.com/1422-0067/21/14/4816/s1, Figure S1: 3VRM/CYP128A1 alignement., Figure S2: 2D structures of substrate (menaquinone 9) and azole compounds used in the study, Figure S3: RMSD vs time during MD simulations of MK9-CYP128A1 complexes (colors are explained in the legend of each figure), Table S1: Comparative amino acid conservation acid analysis of CYP128 P450 family with top 10 ranked families, Supplementary Dataset 1: Comprehensive comparative analyses of CYP128 P450s and those associated with secondary metabolite biosynthetic gene clusters in mycobacterial species, Supplementary Dataset 2: CYP128 P450 sequences identified and annotated in mycobacterial species, Supplementary Dataset 3: A high-resolution phylogenetic tree of CYP128 P450s.
Author Contributions
Conceptualization, K.S.; data curation, N.S.N., Z.E.C., W.C., J.-H.Y., D.R.N., J.A.T., J.P. and K.S.; formal analysis, N.S.N., Z.E.C., W.C., J.-H.Y., D.R.N., J.A.T., J.P. and K.S.; funding acquisition, K.S.; investigation, N.S.N., Z.E.C., W.C., J.-H.Y., D.R.N., J.A.T., J.P. and K.S.; methodology, N.S.N., Z.E.C., W.C., J.-H.Y., D.R.N., J.A.T., J.P. and K.S.; project administration, K.S.; resources, K.S.; supervision, K.S.; validation, N.S.N., Z.E.C., W.C., J.-H.Y., D.R.N., J.A.T., J.P. and K.S.; visualization, N.S.N., W.C., J.P. and K.S.; writing—original draft, N.S.N., W.C., J.-H.Y., D.R.N, J.P. and K.S.; writing—review and editing, N.S.N., W.C., J.-H.Y., D.R.N., J.P. and K.S. All authors have read and agreed to the published version of the manuscript.
Funding
The work presented in this article is part of a National Research Foundation (NRF), South Africa grant awarded to Khajamohiddin Syed (Grant No. 114159), where all the international authors involved in the study are listed as international collaborators. Master’s study bursaries were provided for Nokwanda Samantha Ngcobo (year 2019) and Zinhle Edith Chiliza (year 2018) from the same grant. Zinhle Edith Chiliza also thanks the NRF, South Africa for a DST-NRF Innovation Master’s Scholarship for the year 2019 (Grant No. 117182). Khajamohiddin Syed expresses sincere gratitude to the University of Zululand Research Committee for funding (Grant No. C686) and for the laboratory facilities.
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.World Health Organization (WHO) Global Tuberculosis Report 2019. [(accessed on 7 December 2019)];2019 ISBN 978-92-4-156571-4. Available online: https://www.who.int/tb/publications/global_report/en/
- 2.SSA Mortality and Causes of Death in South Africa, 2016: Findings From Death Notification; Statistics South Africa. [(accessed on 22 March 2019)];2018 Available online: http://www.statssa.gov.za/publications/P03093/P030932016.pdf.
- 3.Cole S., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S., Eiglmeier K., Gas S., Barry Iii C. Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature. 1998;393:537. doi: 10.1038/31159. [DOI] [PubMed] [Google Scholar]
- 4.Ghazaei C. Mycobacterium tuberculosis and lipids: Insights into molecular mechanisms from persistence to virulence. J. Res. Med Sci. Off. J. Isfahan Univ. Med. Sci. 2018;23:63. doi: 10.4103/jrms.JRMS_904_17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ortiz de Montellano P.R. Potential drug targets in the Mycobacterium tuberculosis cytochrome P450 system. J. Inorg. Biochem. 2018;180:235–245. doi: 10.1016/j.jinorgbio.2018.01.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Senate L.M., Tjatji M.P., Pillay K., Chen W., Zondo N.M., Syed P.R., Mnguni F.C., Chiliza Z.E., Bamal H.D., Karpoormath R. Similarities, variations, and evolution of cytochrome P450s in Streptomyces versus Mycobacterium. Sci. Rep. 2019;9:3962. doi: 10.1038/s41598-019-40646-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Nelson D.R. Cytochrome P450 diversity in the tree of life. Biochim. Biophys. Acta Proteins Proteom. 2018;1866:141–154. doi: 10.1016/j.bbapap.2017.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Van Wyk R., van Wyk M., Mashele S.S., Nelson D.R., Syed K. Comprehensive comparative analysis of cholesterol catabolic genes/proteins in mycobacterial species. Int. J. Mol. Sci. 2019;20:1032. doi: 10.3390/ijms20051032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sassetti C.M., Rubin E.J. Genetic requirements for mycobacterial survival during infection. Proc. Natl. Acad. Sci. USA. 2003;100:12989–12994. doi: 10.1073/pnas.2134250100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lamichhane G., Zignol M., Blades N.J., Geiman D.E., Dougherty A., Grosset J., Broman K.W., Bishai W.R. A postgenomic method for predicting essential genes at subsaturation levels of mutagenesis: Application to Mycobacterium tuberculosis. Proc. Natl. Acad. Sci. USA. 2003;100:7213–7218. doi: 10.1073/pnas.1231432100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Betts J.C., Lukey P.T., Robb L.C., McAdam R.A., Duncan K. Evaluation of a nutrient starvation model of Mycobacterium tuberculosis persistence by gene and protein expression profiling. Mol. Microbiol. 2002;43:717–731. doi: 10.1046/j.1365-2958.2002.02779.x. [DOI] [PubMed] [Google Scholar]
- 12.Rustad T.R., Harrell M.I., Liao R., Sherman D.R. The enduring hypoxic response of Mycobacterium tuberculosis. PLoS ONE. 2008;3:e1502. doi: 10.1371/journal.pone.0001502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sogi K.M., Holsclaw C.M., Fragiadakis G.K., Nomura D.K., Leary J.A., Bertozzi C.R. Biosynthesis and Regulation of Sulfomenaquinone, a Metabolite Associated with Virulence in Mycobacterium tuberculosis. ACS Infect. Dis. 2016;2:800–806. doi: 10.1021/acsinfecdis.6b00106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ouellet H., Johnston J.B., de Montellano P.R.O. The Mycobacterium tuberculosis cytochrome P450 system. Arch. Biochem. Biophys. 2010;493:82–95. doi: 10.1016/j.abb.2009.07.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Driscoll M. Ph.D. Thesis. University of Manchester; Manchester, UK: 2011. Investigating Orphan Cytochromes P450 from Mycobacterium tuberculosis: The Search for Potential Drug Targets. [Google Scholar]
- 16.Holsclaw C.M., Sogi K.M., Gilmore S.A., Schelle M.W., Leavell M.D., Bertozzi C.R., Leary J.A. Structural characterization of a novel sulfated menaquinone produced by stf3 from Mycobacterium tuberculosis. ACS Chem. Biol. 2008;3:619624. doi: 10.1021/cb800145r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Dhiman R.K., Mahapatra S., Slayden R.A., Boyne M.E., Lenaerts A., Hinshaw J.C., Angala S.K., Chatterjee D., Biswas K., Narayanasamy P. Menaquinone synthesis is critical for maintaining mycobacterial viability during exponential growth and recovery from non-replicating persistence. Mol. Microbiol. 2009;72:85–97. doi: 10.1111/j.1365-2958.2009.06625.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Sun Z., Zhang Y. Antituberculosis activity of certain antifungal and antihelmintic drugs. Tuber. Lung Dis. 1999;79:319–320. doi: 10.1054/tuld.1999.0212. [DOI] [PubMed] [Google Scholar]
- 19.Ahmad Z., Sharma S., Khuller G.K. The potential of azole antifungals against latent/persistent tuberculosis. FEMS Microbiol. Lett. 2006;258:200–203. doi: 10.1111/j.1574-6968.2006.00224.x. [DOI] [PubMed] [Google Scholar]
- 20.Ahmad Z., Sharma S., Khuller G. In vitro and ex vivo antimycobacterial potential of azole drugs against Mycobacterium tuberculosis H37Rv. FEMS Microbiol. Lett. 2005;251:19–22. doi: 10.1016/j.femsle.2005.07.022. [DOI] [PubMed] [Google Scholar]
- 21.Chenge J.T., Duyet L.V., Swami S., McLean K.J., Kavanagh M.E., Coyne A.G., Rigby S.E., Cheesman M.R., Girvan H.M., Levy C.W., et al. Structural Characterization and Ligand/Inhibitor Identification Provide Functional Insights into the Mycobacterium tuberculosis Cytochrome P450 CYP126A1. J. Biol. Chem. 2017;292:1310–1329. doi: 10.1074/jbc.M116.748822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Parvez M., Qhanya L.B., Mthakathi N.T., Kgosiemang I.K., Bamal H.D., Pagadala N.S., Xie T., Yang H., Chen H., Theron C.W., et al. Molecular evolutionary dynamics of cytochrome P450 monooxygenases across kingdoms: Special focus on mycobacterial P450s. Sci. Rep. 2016;6:33099. doi: 10.1038/srep33099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Mthethwa B., Chen W., Ngwenya M., Kappo A., Syed P., Karpoormath R., Yu J.-H., Nelson D., Syed K. Comparative analyses of cytochrome P450s and those associated with secondary metabolism in Bacillus species. Int. J. Mol. Sci. 2018;19:3623. doi: 10.3390/ijms19113623. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Syed K., Shale K., Pagadala N.S., Tuszynski J. Systematic identification and evolutionary analysis of catalytically versatile cytochrome P450 monooxygenase families enriched in model basidiomycete fungi. PLoS ONE. 2014;9:e86683. doi: 10.1371/journal.pone.0086683. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Jawallapersand P., Mashele S.S., Kovačič L., Stojan J., Komel R., Pakala S.B., Kraševec N., Syed K. Cytochrome P450 monooxygenase CYP53 family in fungi: Comparative structural and evolutionary analysis and its role as a common alternative anti-fungal drug target. PLoS ONE. 2014;9:e107209. doi: 10.1371/journal.pone.0107209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Sello M.M., Jafta N., Nelson D.R., Chen W., Yu J.-H., Parvez M., Kgosiemang I.K.R., Monyaki R., Raselemane S.C., Qhanya L.B. Diversity and evolution of cytochrome P450 monooxygenases in Oomycetes. Sci. Rep. 2015;5:11572. doi: 10.1038/srep11572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Qhanya L.B., Matowane G., Chen W., Sun Y., Letsimo E.M., Parvez M., Yu J.-H., Mashele S.S., Syed K. Genome-wide annotation and comparative analysis of cytochrome P450 monooxygenases in Basidiomycete biotrophic plant pathogens. PLoS ONE. 2015;10:e0142100. doi: 10.1371/journal.pone.0142100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kgosiemang I.K.R., Syed K., Mashele S.S. Comparative genomics and evolutionary analysis of cytochrome P450 monooxygenases in fungal subphylum Saccharomycotina. J. Pure Appl. Microbiol. 2014;8:291–302. [Google Scholar]
- 29.Nelson D.R. Cytochrome P450 and the individuality of species. Arch. Biochem Biophys. 1999;369:1–10. doi: 10.1006/abbi.1999.1352. [DOI] [PubMed] [Google Scholar]
- 30.Yoshida Y., Aoyama Y., Noshiro M., Gotoh O. Sterol 14-demethylase P450 (CYP51) provides a breakthrough for the discussion on the evolution of cytochrome P450 gene superfamily. Biochem. Biophys. Res. Commun. 2000;273:799–804. doi: 10.1006/bbrc.2000.3030. [DOI] [PubMed] [Google Scholar]
- 31.Syed P.R., Chen W., Nelson D.R., Kappo A.P., Yu J.-H., Karpoormath R., Syed K. Cytochrome P450 Monooxygenase CYP139 Family Involved in the Synthesis of Secondary Metabolites in 824 Mycobacterial Species. Int. J. Mol. Sci. 2019;20:2690. doi: 10.3390/ijms20112690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Bamal H.D., Chen W., Mashele S.S., Nelson D.R., Kappo A.P., Mosa R.A., Yu J.-H., Tuszynski J.A., Syed K. Comparative analyses and structural insights of the novel cytochrome P450 fusion protein family CYP5619 in Oomycetes. Sci. Rep. 2018;8:6597. doi: 10.1038/s41598-018-25044-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Pei J., Kim B.-H., Grishin N.V. PROMALS3D: A tool for multiple protein sequence and structure alignments. Nucleic Acids Res. 2008;36:2295–2300. doi: 10.1093/nar/gkn072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Syed K., Mashele S.S. Comparative analysis of P450 signature motifs EXXR and CXG in the large and diverse kingdom of fungi: Identification of evolutionarily conserved amino acid patterns characteristic of P450 family. PLoS ONE. 2014;9:e95616. doi: 10.1371/journal.pone.0095616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Greule A., Stok J.E., De Voss J.J., Cryle M.J. Unrivalled diversity: The many roles and reactions of bacterial cytochromes P450 in secondary metabolism. Nat. Prod. Rep. 2018;35:757–791. doi: 10.1039/C7NP00063D. [DOI] [PubMed] [Google Scholar]
- 36.Podust L.M., Sherman D.H. Diversity of P450 enzymes in the biosynthesis of natural products. Nat. Prod. Rep. 2012;29:1251–1266. doi: 10.1039/c2np20020a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Yasutake Y., Nishioka T., Imoto N., Tamura T. A single mutation at the ferredoxin binding site of P450 Vdh enables efficient biocatalytic production of 25-hydroxyvitamin D (3) Chembiochem. 2013;14:2284–2291. doi: 10.1002/cbic.201300386. [DOI] [PubMed] [Google Scholar]
- 38.Nagano S., Cupp-Vickery J.R., Poulos T.L. Crystal Structures of the Ferrous Dioxygen Complex of Wild-type Cytochrome P450eryF and Its Mutants, A245S and A245T INVESTIGATION OF THE PROTON TRANSFER SYSTEM IN P450eryF. J. Biol. Chem. 2005;280:22102–22107. doi: 10.1074/jbc.M501732200. [DOI] [PubMed] [Google Scholar]
- 39.Yasutake Y., Fujii Y., Nishioka T., Cheon W.K., Arisawa A., Tamura T. Structural evidence for enhancement of sequential vitamin D3 hydroxylation activities by directed evolution of cytochrome P450 vitamin D3 hydroxylase. J. Biol. Chem. 2010;285:31193–31201. doi: 10.1074/jbc.M110.147009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Yasutake Y., Kameda T., Tamura T. Structural insights into the mechanism of the drastic changes in enzymatic activity of the cytochrome P450 vitamin D3 hydroxylase (CYP107BR1) caused by a mutation distant from the active site. Acta Crystallogr. Sect. F Struct. Biol. Commun. 2017;73:266–275. doi: 10.1107/S2053230X17004782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Matowane R.G., Wieteska L., Bamal H.D., Kgosiemang I.K.R., Van Wyk M., Manume N.A., Abdalla S.M.H., Mashele S.S., Gront D., Syed K. In silico analysis of cytochrome P450 monooxygenases in chronic granulomatous infectious fungus Sporothrix schenckii: Special focus on CYP51. Biochim. Biophys. Acta Proteins Proteom. 2018;1866:166–177. doi: 10.1016/j.bbapap.2017.10.003. [DOI] [PubMed] [Google Scholar]
- 42.Chen I.-M.A., Chu K., Palaniappan K., Pillay M., Ratner A., Huang J., Huntemann M., Varghese N., White J.R., Seshadri R. IMG/M v. 5.0: An integrated data management and comparative analysis system for microbial genomes and microbiomes. Nucleic Acids Res. 2018;47:D666–D677. doi: 10.1093/nar/gky901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Nelson D.R. Cytochrome P450 Protocols. Springer; Berlin, Germany: 2006. Cytochrome P450 Nomenclature, 2004; pp. 1–10. [DOI] [PubMed] [Google Scholar]
- 44.Nelson D.R. Cytochrome P450 Protocols. Springer; Berlin, Germany: 1998. Cytochrome P450 nomenclature; pp. 15–24. [DOI] [PubMed] [Google Scholar]
- 45.Nelson D.R. The cytochrome p450 homepage. Hum. Genom. 2009;4:59. doi: 10.1186/1479-7364-4-1-59. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Sirim D., Widmann M., Wagner F., Pleiss J. Prediction and analysis of the modular structure of cytochrome P450 monooxygenases. BMC Struct. Biol. 2010;10:34. doi: 10.1186/1472-6807-10-34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Gotoh O. Substrate recognition sites in cytochrome P450 family 2 (CYP2) proteins inferred from comparative analyses of amino acid and coding nucleotide sequences. J. Biol. Chem. 1992;267:83–90. [PubMed] [Google Scholar]
- 48.Katoh K., Kuma K.-i., Toh H., Miyata T. MAFFT version 5: Improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 2005;33:511–518. doi: 10.1093/nar/gki198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Boc A., Diallo A.B., Makarenkov V. T-REX: A web server for inferring, validating and visualizing phylogenetic trees and networks. Nucleic Acids Res. 2012;40:W573–W579. doi: 10.1093/nar/gks485. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Letunic I., Bork P. Interactive tree of life (iTOL) v3: An online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 2016;44:W242–W245. doi: 10.1093/nar/gkw290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kumar S., Stecher G., Tamura K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 2016;33:1870–1874. doi: 10.1093/molbev/msw054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Schneider T.D., Stephens R.M. Sequence logos: A new way to display consensus sequences. Nucleic Acids Res. 1990;18:6097–6100. doi: 10.1093/nar/18.20.6097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Crooks G.E., Hon G., Chandonia J.-M., Brenner S.E. WebLogo: A sequence logo generator. Genome Res. 2004;14:1188–1190. doi: 10.1101/gr.849004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Blin K., Pascal Andreu V., de los Santos E.L.C., Del Carratore F., Lee S.Y., Medema M.H., Weber T. The antiSMASH database version 2: A comprehensive resource on secondary metabolite biosynthetic gene clusters. Nucleic Acids Res. 2018;47:D625–D630. doi: 10.1093/nar/gky1060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Salomon-Ferrer R., Case D.A., Walker R.C. An overview of the Amber biomolecular simulation package. WIREs Comput. Mol. Sci. 2013;3:198–210. doi: 10.1002/wcms.1121. [DOI] [Google Scholar]
- 56.Case D., Berryman J., Betz R., Cerutti D., Cheatham T., III, Darden T., Duke R., Giese T., Gohlke H., Goetz A. AMBER 2015. University of California; San Francisco, CA, USA: 2015. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.