Skip to main content
ACS Omega logoLink to ACS Omega
. 2020 Sep 1;5(36):23385–23392. doi: 10.1021/acsomega.0c03338

Correlation of Conservation of Sequence and Structures of Mycobacterial Hemerythrin-like Proteins with Evolutionary Relationship and Host Pathogenicity

Zhongxin Ma , Maria Luiza Caldas Nogueira , Daniela Priscila Marchi-Salvador §, Victor L Davidson ‡,*
PMCID: PMC7496005  PMID: 32954191

Abstract

graphic file with name ao0c03338_0007.jpg

The Rv2633c gene of Mycobacterium tuberculosis, which plays a role in infection, encodes a hemerythrin-like protein (HLP). The crystal structure of an orthologue of Rv2633c, the HLP from Mycobacterium kansasii, revealed that it possessed structural features that were distinct from other hemerythrins and HLPs. These and other orthologous proteins comprise a distinct class of non-heme di-iron HLPs that are only found in mycobacteria. This study presents an analysis and comparison of protein sequences, putative structures, and evolutionary relationship of HLPs from 20 mycobacterial species that are known to cause tuberculosis or pulmonary disorders in humans. The results of this analysis allowed correlation of the physicochemical characteristics of amino acid residues that are substituted in these highly conserved sequences with their position in structures, possible effects on function, and evolutionary relationships. The sequences of the proteins from M. tuberculosis, Mycobacterium bovis, and other members of the M. tuberculosis complex, which cause tuberculosis, have substitutions not seen in the other non-tuberculous mycobacteria. Furthermore, groups of species that are closely related, based on phylogenetic analysis, possess substitutions of otherwise conserved residues not seen in other species that are less related. This information is correlated with the occurrence and clinical presentations of these groups of mycobacterial species. The results of this study provide a framework for structure–function studies to determine how subtle differences in the primary sequences of members of this family of proteins correlate with their structures and activities and how this may influence the infectious properties of the host species.

Introduction

A di-iron hemerythrin-like protein (HLP) encoded by the Rv2633c gene of Mycobacteria tuberculosis (Mtb) was recently characterized.1 Subsequently, the orthologous HLP was isolated from Mycobacteria kansasii (Mka), and its crystal structure was determined.2 Despite the two proteins exhibiting 78% similarity and 86% identity, the two proteins exhibited notable differences in solution properties and enzymatic activity. The Mtb HLP could be isolated only in very low yields and had limited solubility. It was a homodimer that was unstable as it precipitated from solution. This protein did exhibit catalase activity, which was an unprecedented activity for an HLP. In contrast, the Mka HLP was isolated in reasonable yields and was a stable monomer in solution. These properties allowed its crystallization and structure determination. The Mka HLP also exhibited catalase activity, albeit with lower activity than the Mtb HLP, and interacted with nitric oxide. A homology model of the Mtb HLP was generated using the Mka HLP structure as a template.2 The model suggested that the overall structures of the two proteins were very similar with five α-helices connected by relatively short loops. The amino acids that coordinate the two iron ions were completely conserved (Figure 1). An interesting distinction was that the Mtb HLP model contained two proline residues (Pro63 and Pro145 situated at α-helices 3 and 5, respectively) that were not present in the Mka HLP. These substitutions suggested that those helices might be disrupted and alter the true structure of the Mtb HLP in solution.

Figure 1.

Figure 1

Overall structure and the di-iron center of the Mka HLP. (A) Overall protein structure showing the five α-helices and loops. Helices are numbered and the sequence numbers of the residues in each helix are listed using the Mka HLP primary sequence as reference. (B) Oxo-bridged di-iron center of the protein. Amino acids residues that provide ligands for the irons are shown with carbons colored pink. Amino acid residues that provide second sphere ligands, those that interact with the primary ligands, are shown with carbons colored green. Nitrogens are blue, and oxygens including the one between the irons are red. Interactions with the irons and between first and second sphere ligands are shown as dotted lines. The irons are labeled. The structure shown uses the coordinates in PDB entry 6Q09.2

Phylogenetic analysis of sequence databases revealed that these two HLPs were members of a family of HLPs that is found only in mycobacteria. The mycobacterial HLPs are distinct from true hemerythrins and hemerythrin-like domains of other proteins.1,2 These distinctions were not solely based on overall sequence. True hemerythrins exhibit a HHE cation-binding domain with the residues that provide iron ligands in the order HHEHHHD, with variable lengths of separation between residues in the sequence motif.3 Some HLPs possess the same sequence motif as hemerythrin, while others have HHEHHE or HHEEHHE motifs.4 In contrast to these motifs, the sequence motif in the mycobacterial HLPs proteins is HHEYHHE.2 These are the only group of HLPs that utilize a Tyr as an iron ligand. Overall structural features and function also distinguish the mycobacterial HLPs. The di-iron site with a bridging oxygen reside in subunits comprised of four α-helices in true hemerythrins. The holoproteins usually possess six or eight of these identical subunits,3,4 although monomeric hemerythrins have been reported.5,6 Hemerythrins are seen primarily in marine invertebrates and function as oxygen carriers. HLPs are found in all phylogenetic kingdoms. HLPs also house an oxo-bridged di-iron site within a core structure of four α-helices. However, these proteins are not comprised of multiple identical subunits. Instead, the α-helical domain is fused to another protein domain that dictates the biological function of the HLP. These functions include signal transduction,7 oxygen-sensing,8 iron sensing,9 and repair of Fe–S centers.10 The mycobacterial HLPs are the only ones that have a structure with five α-helices, and they are not fused to other protein domains. In vitro studies have shown that the Mka HLP is a monomer2 and the Mtb HLP is a homodimer1 rather than a large multimer. While the exact physiological roles of these two proteins are not known, they do exhibit catalase activity, which is not seen with hemerythrin or other HLPs. This study extends the analysis and comparison of sequences, putative structures, and evolutionary relationship of HLPs from 20 mycobacterial species that are known to cause tuberculosis or pulmonary disease in humans.

A remarkable feature of this group of mycobacterial HLPs is the high level of conservation of sequence, which suggests conservation of structures. Each protein has approximately 162 amino acid residues with very few small gaps or insertions in sequence. As such, alignment and comparative analysis of the protein sequences are very straightforward, and the likely accuracy of homology models is enhanced. Of particular interest are the positions in sequence of residues that provide the first and second sphere ligands for the iron ions (Figure 1B). The first sphere ligands are residues that interact directly with the two irons in the active site. The second sphere ligands are residues that interact with the first sphere ligands. Determining the extent of conservation of these residues is important because the ligand set observed in the Mka HLP structure had not been observed in any other hemerythrins or HLPs.

Another point of interest is the identification of substitutions of specific amino acid residues in the HLP sequences in certain subsets of this group of mycobacteria. In the present study, conserved residues in this family of mycobacterial proteins are identified, which are not iron ligands but in the proximity of the di-iron site and which suggest specific roles in protein function. The sequences of the proteins from M. tuberculosis and Mycobacterium bovis, which are capable of causing tuberculosis in humans, are shown to have multiple unique features that are distinct from those of the other non-tuberculous mycobacteria (NTM). Among the sequences of the HLPs of the 20 species in this study, substitutions of otherwise conserved residues are correlated with their positions in the protein structure and presence within related species of mycobacteria. Overall, these results provide a much broader view of the subtle but important structural distinctions that are present in certain members of this highly conserved family of proteins and which are correlated with the evolutionary relationship between these mycobacterial species. These results are discussed in the context of reported clinical presentations associated with infection by these mycobacteria.

Results

Sequence and Phylogenetic Analysis

A BLAST search using the sequence of the Mtb HLP as the template was performed with the current database. The parameters were set for a maximum e-value of 1 × 10–40 and the search yielded 262 sequences. All of the sequences were of proteins from bacteria in the Mycobacteriaceae family. The sequences of proteins from 20 of these mycobacterial species were chosen for further analysis. A criterion for choosing these mycobacterial species is that each have been reported to cause tuberculosis or pulmonary disorders in humans. Furthermore, these sequences were highly conserved as they exhibited e values less than 5 × 10–63. An alignment of the sequences of these 20 mycobacterial HLPs is present in Figure 2. Key residues are highlighted to facilitate comparison.

Figure 2.

Figure 2

Alignment of the sequences of HLPs. The residues that provide ligands for the irons in the di-iron site are colored red and those that are second sphere ligands are colored green. Residues that are common to M. tuberculosis and M. bovis, while different in other species, are colored blue.

A phylogenetic tree of the 20 host species of the mycobacterial HLPs, which is based on the sequence alignment in Figure 2, is shown in Figure 3A. Another phylogenetic tree of these 20 species, which is based 16s rRNA gene sequences, is shown in Figure 3B.

Figure 3.

Figure 3

Phylogenetic trees based on the sequences of the mycobacterial species that possess the HLPs in this study. (A) Phylogenetic tree that is based on the sequences of the HLPs listed in Figure 2. (B) Phylogenetic tree that is based on 16s rRNA nucleotide sequences of these species.

Homology Modeling

In order to estimate the extent of the conservation of the structure of the mycobacterial orthologues of the Mka HLP, homology models of each of the proteins were constructed using Swiss-model homology modeling online software11 (Table 1). The QMEAN Z-value is a measure of the quality of the structural model. A value near zero reflects strong agreement between the structural model and known structures of similar size. A value less than −4.0 is considered to be of poor quality.12 The GMQE score is a value between 0 and 1 with a value of 1 reflecting the highest level of reliability of the model.13Mka is the first species listed in Table 1, as its structure was the template that was used to generate the models from the other sequences. The other HLP sources are listed in the decreasing order of GMQE. It is worth noting that the reliability of the model does not strictly correlate with the extent of sequence identity.

Table 1. Parameters for Homology Models of Mycobacterial HLPs Generated from Each Sequence Using the Mka HLP Structure as a Template.

organism source sequence accession number QMEAN Z-score GMQE sequence identity (%)
M. kansasii WP_023372785.1      
M. colombiense WP_007774632.1 –0.10 0.98 93.17
M. nebraskense WP_046184253.1 0.00 0.98 91.30
M. parascrofulaceum WP_007172067.1 –0.03 0.98 90.68
M. avium WP_104182873.1 0.14 0.97 90.68
M. malmoense WP_065482948.1 –0.02 0.97 90.06
M. triplex WP_036468396.1 –0.44 0.97 89.44
M. intracellulare WP_064939987.1 –0.11 0.96 86.96
M. szulgai WP_085670647.1 –0.69 0.94 83.23
M. gordonae WP_055576538.1 –0.43 0.92 80.12
M. asiaticum WP_036355112.1 –0.31 0.92 79.50
M. xenopi WP_003919947.1 –0.05 0.90 70.89
M. heckeshornense WP_048892962.1 –0.63 0.88 70.25
M. tuberculosis WP_003413625.1 –0.68 0.87 77.02
M. bovis KAN9011.1 –0.82 0.85 76.40
M. kyorinense WP_065015270.1 –0.10 0.83 70.00
M. abscessus WP_074334317.1 –0.48 0.82 63.29
M. fortuitum WP_064897428.1 –0.11 0.80 56.52
M. heraklionense WP_065040284.1 –0.51 0.80 60.49
M. arupense WP_046187572.1 –0.63 0.79 59.26

The root-mean-square deviation (rmsd) of atomic positions was calculated for comparison of each model and the Mka HLP structure using PyMol. Because the models were constructed using that structure as a template, the resulting rmsd values were small, as would be expected. For this reason, the parameters shown in Table 1 provide the best estimates of the goodness of the model, which will reflect the similarity in structure to the known Mka HLP structure. While the modeling cannot predict exactly what the structure will look like, the importance of the parameters in Table 1 is that they provide a measure of the extent to which the structures of the other HLPs deviate from that of the Mka HLP. These parameters indicate that the overall structures are in fact very similar. This suggests that differences in solution properties and function are related to minor changes in the structure or substitution of amino acid residues in key positions that alter the physicochemical properties of that residue.

Correlation of Conservation of Sequence and Structure with Evolutionary Relationship

Residues near the di-iron Site

Inspection of the structure of the di-iron site of the Mka HLP (Figure 1B) describes the first and second sphere ligands of the iron ions. The first sphere ligands, residues that provide ligands for the two irons in the active site, are completely conserved in each of the sequences (colored red in Figure 2). A key feature of the structure of the Mka HLP2 is the presence of a Tyr ligand for an iron in this position in the di-iron site where His provides the ligand in true hemerythrins. This Tyr54 is conserved in the sequences of each of these mycobacterial orthologues. The structure of the Mka HLP also shows residues that provide second sphere ligands for the di-iron site. These residues interact with the primary ligands. Each of the residues that provided these second sphere ligands are also conserved in every sequence (colored green). Thus, any observed differences in the physical properties or functions of the HLPs from the different species are not due to differences in residues that define the structure of the oxo-bridged di-iron site.

Phe46 is another amino acid residue in the vicinity of the di-iron site that is highly conserved in sequence and in structure in the homology models. Phe46 is present in all but four of the protein sequences. It resides approximately 4.0 Å from Fe2 (Figure 4). The position of the Phe side-chain in the structure could block access of hydrophilic species that could potentially react with iron. This idea is supported by the observation that there are two water molecules in the Mka HLP structure on the side of Phe46 opposite to that of the iron (Figure 4). The four sequences that do not have this Phe instead have a Met in this position. These sequences are from the HLPs from Mycobacterium arupensis, Mycobacterium heraklionenese, Mycobacterium fortuitum, and Mycobacterium abscessus. This Met residue could be performing a similar function or signify a difference in reactivity of the di-iron site in these proteins. As seen in Figure 3, these species are grouped together in a major branch at the bottom of the phylogenetic trees, separate from the other 16 species.

Figure 4.

Figure 4

Position of Phe46 near the di-iron site. Helices and residues are numbered, and the oxo-bridged di-iron center and relevant waters are shown with carbons colored gray, nitrogens colored blue, and oxygens including waters and the one bridging the irons are red. Phe 46 is highlighted as colored green. Interactions of residues with the irons, and between first and second sphere ligands, are shown as dashed lines. The distances between the waters and Phe46 are given and shown as dashed lines. Irons are labeled. The structure shown uses the coordinates in PDB entry 6Q09.

Residues Present Only in HLPs from Species that Cause Tuberculosis

Mtb and M. bovis are two closely related (see Figure 3) and highly pathogenic species of mycobacteria. The analysis in Figure 2 reveals several residues that are only present in the sequences from these two species (colored blue in Figure 2) and highly conserved as different amino acids in the other species. It was previously noted that two Pro residues in the sequence of the Mtb protein in positions that are in α-helices in the structure of the Mka HLP2. The same substitutions of two Pro residues at positions 63 and 145 are only found in the sequence from M. bovis. All other sequences have a Leu at position 63. The sequence from M. abscessus does have a Pro at position 145, while the other sequences show a variety of residues at this position, Ala, Thr, Ser, or Lys. The sequences of Mtb and M. bovis also substitute a Thr for Ala67, which is conserved in all of the other 18 sequences. This residue is positioned 3.5 Å from the Tyr54 side chain (Figures 1B and 5), which provides an iron ligand, on the side opposite that which faces Fe2. The larger size of the Thr side chain, and its potential ability to form a hydrogen bond, could be related to reactivity as these may influence the position of the Tyr ligand or interact with water. Another interesting substitution seen only in Mtb and M. bovis is the Val at position 108. In all of the other sequences, the 108 position is occupied by a Glu, and it is the first of three consecutive Glu residues, which are then followed by an Asp. These residues are located on the α-helix 4 and strongly influence the di-iron site. The side chain of Glu109 provides ligands to both Fe1 and Fe2, and Glu110 is a second sphere ligand that interacts with the His11 ligand of Fe1. If the substitution of Glu108 with Val has even a subtle influence on Glu109 and Glu110, this could influence properties of the di-iron center. Furthermore, Glu108 forms a hydrogen bond with one of two waters that approach the di-iron site (Figure 5). Thus, the replacement with Val may also affect access from the surface to the di-iron site.

Figure 5.

Figure 5

Positions of Val108 and Thr67 substitutions in the Mtb model. Portions of the Mka structure and Mtb model are superimposed. Carbons are colored gray for Mka residues and green for Mtb residues. Nitrogens are colored blue, and oxygens including waters and the one bridging the irons are colored red. Interactions described in the text are shown as dashed lines. The distances between the waters and Glu108 are given. The structure shown uses the coordinates in PDB entry 6Q09 for Mka HLP and the homology model of the Mtb HLP.

Mtb and M. bovis are members of the Mycobacterium tuberculosis complex (MTBC),14,15 a group of mycobacteria that are known to cause tuberculosis. The BLAST search that identified the proteins in Figure 2 also identified proteins from other members of the MTBC, which were listed as subspecies of Mtb. These mycobacteria with the accession numbers for the sequences, in parentheses, are Mycobacterium africanum (AMC64977.1), Mycobacterium canettii (CCK64818.1), Mycobacterium caprae (APU26589.1), Mycobacterium microti (AMC60342.1), and Mycobacterium pinnipedii (PRH90547.1). These HLP sequences are not included in Figure 2 because each of these protein sequences was identical to that of the Mtb HLP. In fact, the only non-identical sequence of the MTBC species is that from the M. bovis protein, which has only one difference, a Val instead of a Leu at residue 102. Thus, the multiple substitutions described above in Mtb and M. bovis are completely conserved in the sequences of proteins of other species that cause tuberculosis and do not appear in any of the NTM species.

Deletions and Insertions of Residues

While 15 of the 20 sequences have 161 residues, 5 have residues deleted or inserted. The most extreme case is seen in the sequences from Mycobacterium heckeshornense and Mycobacterium xenopi. Each contain a gap of three missing residues in the same position in the sequences and thus they have only 158 residues. These two closely related species comprise a branch of the phylogenetic trees that is distinct from the other species (see Figure 3). The position of these gaps in residues in the Mka HLP structure (residues 27–29) correspond to the last two residues in the loop that connects α-helices 1 and 2 and the first residue of α-helix 2 (see Figure 1A). This suggests a shorter loop in these proteins. One sequence from Mycobacterium kyorinense is missing the final residue and is only 160 residues long. This species is isolated in the phylogenetic trees from any other species (Figure 3). In contrast to the sequences with missing residues, the sequences from M. arupense and M. heraklionense each have an additional residue inserted after the initial Met. These species, which are closely related, comprise the bottom branch of the trees.

Discussion

Mycobacterial species continue to be identified, and over 190 species have been reported.16,17 The majority of tuberculosis cases in humans are caused by Mtb. Tuberculosis in humans can also be caused by M. bovis(18) and members of the MTBC.14,15 Many other mycobacterial species are categorized as NTM. While these may be considered potentially pathogenic, the NTM do not typically cause severe disease in healthy individuals. However, certain NTM, which are included in this study, are known to cause pulmonary disease in humans.19 Phylogenetic analyses of bacteria are typically based on comparison of polymeric molecules such as RNA, DNA, or particular proteins. In the present study, the amino acid sequences of the HLPs were used to determine the evolutionary relationship of the species that encode this protein. For comparison, a phylogenetic tree was also constructed using the 16s rRNA genes sequences. The results of this analysis allowed correlation of the physicochemical characteristics of amino acid residues that are substituted in these highly conserved sequences with their position in structure, possible effects on function, and evolutionary relationships.

The sequences of the HLPs from Mtb, M. bovis, and other MTBC species exhibit distinct differences from the other 18 NTM sequences. These are the only two sequences with Pro residues present at both positions 63 and 145. These positions correspond to residues within α-helices in the Mka HLP structure. This finding suggests that the actual structures may deviate from those observed in the homology models and could affect their solution properties. Alterations in structures such as this could explain why the Mtb HLP is poorly soluble and precipitates, whereas the Mka HLP is readily soluble and stable in solution.1,2 The proteins from Mtb and M. bovis are also the only two that substitute a Thr for the Ala67 near the di-iron site. This substitution could affect the properties of Tyr54, the iron ligand that is unique to mycobacterial HLPs. A substitution of Val for the conserved Glu108 may also influence the properties of the di-iron site and access for water to that site. Thus, the HLPs from mycobacteria that cause tuberculosis contain multiple substitutions in the protein sequence that are conserved within this subset of mycobacteria and replace different amino acids which are highly conserved in the NTM which are infectious but do not cause tuberculosis. This information provides a basis for further experimental studies to determine precisely what effects these sequence substitutions, which are unique to the MTBC, have on the properties of their HLPs in vitro and to gain insights into how these HLPs may contribute to the unique ability of the MTBC species to cause tuberculosis.

The sequences of the HLPs of the four species in the major branch at the bottom of the phylogentic trees, M. arupense, M. heraklionenese, M. fortuitum, and M. abscessus, are distinct from the other 16 in having a Met substituted for Phe46, which is close to the di-iron site and may restrict the access of potential hydrophilic reactants. M. abscessus accounts for a significant number of NTM infections.20 It typically causes chronic lung infection, skin and soft tissue infection, and tenosynovitis. This group is subdivided into two branches of the phylogenetic tree. The members of one branch M. arupense and M. heraklionense have a further distinction of each having an additional residue inserted after the initial Met.

Only two species exhibited a three-residue gap in the sequence. These species, M. xenopi and M. heckeshornense, comprise a distinct branch of each of the phylogenetic trees. M. xenopi exhibits high prevalence and increasing clinical importance and is a leading cause of NTM lung disease in Europe and Canada.21 It is seen in patients with and without underlying chronic lung disease. The closely related M. heckeshornense exhibits even greater pathogenicity.22

The M. kyorinense sequence is distinct among these mycobacterial HLPs as it is the only one missing the final residue. It occupies a similar position in each phylogenetic tree that is separated from any other species. M. kyorinense has been isolated from immunocompetent patients with a history of pulmonary disease, primarily in Japan,23 and causes significant respiratory disease. More recently, it was also described in Brazilian24 and Australian25 patients.

The remaining NTM that were analyzed and were most similar to M. kansasii include some members of the Mycobacterium avium complex. These are frequent causative agents of chronic pulmonary infections in adults, lymphadenitis in children, and extrapulmonary and disseminated infections in immunocompromised patients.26 The members of this complex that are included in this analysis are M. avium, Mycobacterium colombiense and Mycobacterium intracellulare. While not a formal member of this group, M. kansasii is one of the most prevalent causes of NTM disease and is the second most frequent NTM found in HIV-infected patients, after the M. avium complex.27

It is reasonable to infer from these results that these orthologous mycobacterial HLPs are relevant to infectivity and pathogenicity of the host species. The Rv2633c HLP from Mtb was initially studied because the Rv2633c gene was upregulated during infection following phagocytosis by macrophages,28,29 and a transposon mutant screen indicated that Mtb was significantly attenuated by Tn insertions that inactivated Rv2633c.30 It is also very interesting to see multiple substitutions that appear in the sequences of Mtb, M. bovis, and the MTBC species, which are different and completely conserved in the other 18 species. Thus, the two most pathogenic species have multiple unique features that are distinct from those of the other infectious NTM. Furthermore, groups of species that are determined to be related based on the sequence of the orthologous HLPs seem to also be related based on clinical presentations. Together, these data strongly suggest that these HLPs play relevant roles in the infectious properties and clinical outcomes of Mtb, M. bovis, and NTMs that cause pulmonary disease in humans.

While the phylogenetic trees in Figure 3 are not identical, they are quite similar. The groups of species that were identified to possess substitutions in sequence that were not seen in other species each occupy the same positions in both trees. Thus, the evolution of the sequences of the HLPs tracks with the overall evolution of the species. This finding suggests that these HLPs were not introduced into mycobacteria via gene transfer but evolved independently. The sum of these data supports the designation of these proteins as a family distinct from true hemerythrins and other HLPs. The results of this study provide a framework for structure function studies to correlate how the subtle differences in the protein sequences affect their structures and activities and how these influence the infectious properties of the host species.

Methods

Basic Local Alignment Search Tool (BLAST) searches were performed using the blastp suite provided by the NCBI.31 The amino acid sequence of the hemerythrin-like Rv2633c HLP of Mtb was used as the query sequence. NCBI COBALT32 was used to create the multiple sequence alignment of proteins.

A phylogenetic tree of the 20 host organisms of the proteins under analysis was constructed, which is based on the sequences of these orthologous proteins. Their evolutionary history was inferred using the maximum composite likelihood method that was based on the JTT matrix-based model33 and general time reversible model. The evolutionary analysis of the protein sequences was conducted using MEGAX.34 The tree with the highest log likelihood is drawn to scale, with branch lengths measured in the number of substitutions per site. Another phylogenetic tree of the 20 host organisms was constructed, which is based on their 16s rRNA sequences. This analysis involved 20 nucleotide sequences. There were a total of 1449 positions in the final data set. All of the sequences were obtained from GenBank. The methods for analysis were the same as described above for the protein sequence-based tree.

Homology models of the 19 proteins of the unknown structure were generated using the Swiss-model homology modeling online software (https://swissmodel.expasy.org),11 with the structure of Mka protein (PDB entry 6Q09)2 as the template. All structural figures presented here were prepared using Pymol (http://www.pymol.org).

Acknowledgments

The authors thank Yu Tang for technical assistance and Drs. Kyle Rohde and Jonathan Caranto for critically reading the manuscript and providing helpful suggestions. This research was supported by the National Institute of General Medical Sciences of the National Institutes of Health under award number R35GM130173 (V.L.D.).

Glossary

Abbreviations

HLP

hemerythrin-like protein

Mka

Mycobacteria kansasii

Mtb

Mycobacteria tuberculosis

MTBC

Mycobacteria tuberculosis complex

NTM

non-tuberculosis mycobacteria

Accession Codes

PDB: 6Q09. UniProtKB: WP_023372785.1, WP_007774632.1, WP_046184253.1, WP_007172067.1, WP_104182873.1, WP_065482948.1, WP_036468396.1, WP_064939987.1, WP_085670647.1, WP_055576538.1, WP_036355112.1, WP_003919947.1, WP_048892962.1, WP_003413625.1, KAN9011.1, WP_065015270.1, WP_074334317.1, WP_064897428.1, WP_065040284.1, WP_046187572.1, AMC64977.1, CCK64818.1, APU26589.1, AMC60342.1, PRH90547.1.

Author Present Address

(D.P.M.S.) Department of Molecular Biology, Center of Exact and Natural Sciences, Federal University of Paraiba, João Pessoa, PB, Brazil.

Author Contributions

All authors performed data analysis and modeling, and contributed conceptually to the study and the writing of the paper.

The authors declare no competing financial interest.

References

  1. Ma Z.; Strickland K. T.; Cherne M. D.; Sehanobish E.; Rohde K. H.; Self W. T.; Davidson V. L. The Rv2633c protein of Mycobacterium tuberculosis is a non-heme di-iron catalase with a possible role in defenses against oxidative stress. J. Biol. Chem. 2018, 293, 1590–1595. 10.1074/jbc.ra117.000421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Ma Z.; Abendroth J.; Buchko G. W.; Rohde K. H.; Davidson V. L. Crystal structure of a hemerythrin-like protein from Mycobacterium kansasii and homology model of the orthologous Rv2633c protein of M. tuberculosis. Biochem. J. 2020, 477, 567–581. 10.1042/bcj20190827. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Stenkamp R. E. Dioxygen and hemerythrin. Chem. Rev. 1994, 94, 715–726. 10.1021/cr00027a008. [DOI] [Google Scholar]
  4. Alvarez-Carreño C.; Alva V.; Becerra A.; Lazcano A. Structure, function and evolution of the hemerythrin-like domain superfamily. Protein Sci. 2018, 27, 848–860. 10.1002/pro.3374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Chen K. H.-C.; Chuankhayan P.; Wu H.-H.; Chen C.-J.; Fukuda M.; Yu S. S.-F.; Chan S. I. The bacteriohemerythrin from Methylococcus capsulatus (Bath): Crystal structures reveal that Leu114 regulates a water tunnel. J. Inorg. Biochem. 2015, 150, 81–89. 10.1016/j.jinorgbio.2015.04.001. [DOI] [PubMed] [Google Scholar]
  6. Martins L. J.; Hill C. P.; Ellis W. R. Jr. Structures of wild-type chloromet and L103N hydroxomet Themiste zostericola myohemerythrins at 1.8 A resolution. Biochemistry 1997, 36, 7044–7049. 10.1021/bi9630422. [DOI] [PubMed] [Google Scholar]
  7. Alvarez-Carreño C.; Becerra A.; Lazcano A. Molecular evolution of the oxygen-binding hemerythrin domain. PLoS One 2016, 11, e0157904 10.1371/journal.pone.0157904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Isaza C. E.; Silaghi-Dumitrescu R.; Iyer R. B.; Kurtz D. M. Jr.; Chan M. K. Structural basis for O2 sensing by the hemerythrin-like domain of a bacterial chemotaxis protein: substrate tunnel and fluxional N terminus. Biochemistry 2006, 45, 9023–9031. 10.1021/bi0607812. [DOI] [PubMed] [Google Scholar]
  9. Ruiz J. C.; Bruick R. K. F-box and leucine-rich repeat protein 5 (FBXL5): sensing intracellular iron and oxygen. J. Inorg. Biochem. 2014, 133, 73–77. 10.1016/j.jinorgbio.2014.01.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Lo F.-C.; Hsieh C.-C.; Maestre-Reyna M.; Chen C.-Y.; Ko T.-P.; Horng Y.-C.; Lai Y.-C.; Chiang Y.-W.; Chou C.-M.; Chiang C.-H.; Huang W.-N.; Lin Y.-H.; Bohle D. S.; Liaw W.-F. Crystal structure analysis of the repair of iron centers protein YtfE and its interaction with NO. Chemistry 2016, 22, 9768–9776. 10.1002/chem.201600990. [DOI] [PubMed] [Google Scholar]
  11. Waterhouse A.; Bertoni M.; Bienert S.; Studer G.; Tauriello G.; Gumienny R.; Heer F. T.; de Beer T. A. P.; Rempfer C.; Bordoli L.; Lepore R.; Schwede T. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018, 46, W296–W303. 10.1093/nar/gky427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Benkert P.; Biasini M.; Schwede T. Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics 2011, 27, 343–350. 10.1093/bioinformatics/btq662. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Studer G.; Rempfer C.; Waterhouse A. M.; Gumienny R.; Haas J.; Schwede T. QMEANDisCo-distance constraints applied on model quality estimation. Bioinformatics 2020, 36, 1765–1771. 10.1093/bioinformatics/btz828. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Brosch R.; Gordon S. V.; Marmiesse M.; Brodin P.; Buchrieser C.; Eiglmeier K.; Garnier T.; Gutierrez C.; Hewinson G.; Kremer K.; Parsons L. M.; Pym A. S.; Samper S.; van Soolingen D.; Cole S. T. A new evolutionary scenario for the Mycobacterium tuberculosis complex. Proc. Natl. Acad. Sci. U.S.A. 2002, 99, 3684–3689. 10.1073/pnas.052548299. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Forrellad M. A.; Klepp L. I.; Gioffré A.; Sabio y García J.; Morbidoni H. R.; de la Paz Santangelo M.; Cataldi A. A.; Bigi F. Virulence factors of the Mycobacterium tuberculosis complex. Virulence 2013, 4, 3–66. 10.4161/viru.22329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Forbes B. A. Mycobacterial taxonomy. J. Clin. Microbiol. 2017, 55, 380–383. 10.1128/jcm.01287-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. King H. C.; Khera-Butler T.; James P.; Oakley B. B.; Erenso G.; Aseffa A.; Knight R.; Wellington E. M.; Courtenay O. Environmental reservoirs of pathogenic mycobacteria across the Ethiopian biogeographical landscape. PLoS One 2017, 12, e0173811 10.1371/journal.pone.0173811. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Torres-Gonzalez P.; Cervera-Hernandez M. E.; Martinez-Gamboa A.; Garcia-Garcia L.; Cruz-Hervert L. P.; Bobadilla-Del Valle M.; Ponce-de Leon A.; Sifuentes-Osornio J. Human tuberculosis caused by Mycobacterium bovis: a retrospective comparison with Mycobacterium tuberculosis in a Mexican tertiary care centre, 2000-2015. BMC Infect. Dis. 2016, 16, 657. 10.1186/s12879-016-2001-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Stout J. E.; Koh W.-J.; Yew W. W. Update on pulmonary disease due to non-tuberculous mycobacteria. Int. J. Infect. Dis. 2016, 45, 123–134. 10.1016/j.ijid.2016.03.006. [DOI] [PubMed] [Google Scholar]
  20. Lee M.-R.; Sheng W. H.; Hung C. C.; Yu C. J.; Lee L. N.; Hsueh P. R. Mycobacterium abscessus complex infections in humans. Emerg. Infect. Dis. 2015, 21, 1638–1646. 10.3201/2109.141634. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hirama T.; Marchand-Austin A.; Ma J.; Alexander D. C.; Brode S. K.; Marras T. K.; Jamieson F. B. Mycobacterium xenopi genotype associated with clinical phenotype in lung disease. Lung 2018, 196, 213–217. 10.1007/s00408-018-0087-9. [DOI] [PubMed] [Google Scholar]
  22. Jauréguy F.; Ioos V.; Marzouk P.; Hornstein M.; Picard B.; Gutierrez M. C.; Valeyre D. Mycobacterium heckeshornense: An emerging pathogen responsible for a recurrent lung infection. J. Infect. 2007, 54, e33–e35. 10.1016/j.jinf.2006.03.026. [DOI] [PubMed] [Google Scholar]
  23. Ohnishi H.; Yonetani S.; Matsushima S.; Wada H.; Takeshita K.; Kuramochi D.; de Souza Caldas P. C.; Campos C. E. D.; Porphirio da Costa B.; Ramos J. P.; Mikura S.; Narisawa E.; Fujita A.; Funayama Y.; Kobashi Y.; Sakakibara Y.; Ishiyama Y.; Takakura S.; Goto H.; Watanabe T. Mycobacterium kyorinense infection. Emerg. Infect. Dis. 2013, 19, 508–510. 10.3201/eid1903.120591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Campos C. E. D.; de Souza Caldas P. C.; Ohnishi H.; Watanabe T.; Ohtsuka K.; Matsushima S.; Ferreira N. V.; da Silva M. V. B.; Redner P.; de Carvalho L. D.; Medeiros R. F. d. M.; Abbud Filho J. A.; Fandinho Montes F. C. O.; Galvao T. C.; Ramos J. P. First isolation of Mycobacterium kyorinense from clinical specimens in Brazil. J. Clin. Microbiol. 2012, 50, 2477–2478. 10.1128/jcm.00023-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Muruganandan S.; Jayaram L.; Wong J. S. J.; Guy S. Pulmonary cavitary Mycobacterium kyorinense (M. kyorinense) infection in an Australian woman. IDCases 2015, 2, 37–39. 10.1016/j.idcr.2015.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. van Ingen J.; Turenne C. Y.; Tortoli E.; Wallace R. J. Jr.; Brown-Elliott B. A. A definition of the Mycobacterium avium complex for taxonomical and clinical purposes, a review. Int. J. Syst. Evol. Microbiol. 2018, 68, 3666–3677. 10.1099/ijsem.0.003026. [DOI] [PubMed] [Google Scholar]
  27. DeStefano M. S.; Shoen C. M.; Cynamon M. H. Therapy for Mycobacterium kansasii Infection: Beyond 2018. Front. Microbiol. 2018, 9, 2271. 10.3389/fmicb.2018.02271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Homolka S.; Niemann S.; Russell D. G.; Rohde K. H. Functional genetic diversity among Mycobacterium tuberculosis complex clinical isolates: delineation of conserved core and lineage-specific transcriptomes during intracellular survival. PLoS Pathog. 2010, 6, e1000988 10.1371/journal.ppat.1000988. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Rohde K. H.; Abramovitch R. B.; Russell D. G. Mycobacterium tuberculosis invasion of macrophages: linking bacterial gene expression to environmental cues. Cell Host Microbe 2007, 2, 352–364. 10.1016/j.chom.2007.09.006. [DOI] [PubMed] [Google Scholar]
  30. Sassetti C. M.; Rubin E. J. Genetic requirements for mycobacterial survival during infection. Proc. Natl. Acad. Sci. U.S.A. 2003, 100, 12989–12994. 10.1073/pnas.2134250100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Boratyn G. M.; Camacho C.; Cooper P. S.; Coulouris G.; Fong A.; Ma N.; Madden T. L.; Matten W. T.; McGinnis S. D.; Merezhuk Y.; Raytselis Y.; Sayers E. W.; Tao T.; Ye J.; Zaretskaya I. BLAST: a more efficient report with usability improvements. Nucleic Acids Res. 2013, 41, W29–W33. 10.1093/nar/gkt282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Papadopoulos J. S.; Agarwala R. COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics 2007, 23, 1073–1079. 10.1093/bioinformatics/btm076. [DOI] [PubMed] [Google Scholar]
  33. Jones D. T.; Taylor W. R.; Thornton J. M. The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci. 1992, 8, 275–282. 10.1093/bioinformatics/8.3.275. [DOI] [PubMed] [Google Scholar]
  34. Kumar S.; Stecher G.; Li M.; Knyaz C.; Tamura K.; Mega X. Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. 10.1093/molbev/msy096. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from ACS Omega are provided here courtesy of American Chemical Society

RESOURCES