Abstract
Brucellosis, also known as “undulant fever” is a zoonotic disease caused by Brucella, which is a facultative intracellular bacterium. Despite efforts to eradicate this disease, infection in uncontrolled domestic animals persists in several countries and therefore transmission to humans is common. Brucella evasion of the innate immune system depends on its ability to evade the mechanisms of intracellular death in phagocytic cells. The BvrR-BvrS two-component system allows the bacterium to detect adverse conditions in the environment. The BvrS protein has been associated with genes of virulence factors, metabolism, and membrane transport. In this study, we predicted the DNA sequence recognized by BvrR with Gibbs Recursive Sampling and identified the three-dimensional structure of BvrR using I-TASSER suite, and the interaction mechanism between BvrR and DNA with Protein-DNA docking and molecular dynamics (MD) simulation. Based on the Gibbs recursive Sampling analysis, we found the motif AAHTGC (H represents A, C, and T nucleotides) as a possible sequence recognized by BvrR. The docking and EMD simulation results showed that C-terminal effector domain of BvrR protein is likely to interact with AAHTGC sequence. In conclusion, we predicted the structure, recognition motif, and interaction of BvrR with DNA.
Keywords: BvrR, protein structure prediction, Gibbs sampling, protein-DNA docking, MD simulation
1. Introduction
Brucellosis is one of the most common zoonoses in the world and is caused by microorganisms of the genus Brucella, which can be transmitted directly or indirectly to humans. The disease is transmitted from animals to humans through genital excretions and contaminated milk, the greatest source of human infection [1,2]. Infection can occur through the skin or mucous membrane lesions and by inhalation of contaminated dust or aerosols (an estimated dose of 10 to 100 microorganisms is enough to establish aerial infection) and is one of the most common laboratory-acquired infections [3,4]. Infections by B. abortus and B. suis usually affect occupational groups (veterinarians, farmers, and abattoir workers), whereas that caused by B. melitensis is more frequent in the community [5,6]. The disease presents polymorphous clinical manifestations and is often asymptomatic [7]. Acute brucellosis manifests as a febrile disease. The fever increases progressively until reaching a plateau that lasts for several days and then descends slowly; this first febrile cycle can precede other shorter successive febrile cycles [8]. Brucellosis without treatment progresses to a disabling chronic disease with severe complications, such as central nervous system (CNS) affectations, osteomyelitis, keratitis, and endocarditis. The susceptibility to brucellosis in humans depends on the immunological state of the person, the route of infection, the size of the inoculum, and the virulence of the species [2].
Brucella possesses mechanisms of immune response evasion that allow it to be an intracellular parasite in macrophages, monocytes, and epithelial cells [9]. For the internalization, the bacteria’s protein Hsp60 and lipopolysaccharide (LPS) are recognized by proteins located on the lipid rafts of the cell membrane [10]. Once internalized, the bacterium resides in Brucella-containing vacuoles (BCVs) [11] that interact with multivesicular bodies (MVB) and early compartments of the endocytic pathway [12,13]. Then, they interact with late endocytic organelles [11,13] and partially fuse with the lysosomes [14]. Finally, the BCVs intercept the endoplasmic reticulum exit sites, fuse with them, and form an organelle that is permissive to replication [11]. The interaction between BCVs and endosomes and lysosomes is controlled to allow acidification, which activates the BvrS/BvrR two-component system, composed of a transmembrane histidine kinase sensor (BvrS) and a cytosolic response regulator (BvrR). This system controls the expression of outer membrane proteins, as Omp22 and Omp25, and the structure of the LPS [15,16,17,18]; it also controls the expression of the virB operon coding for the type IV secretion system, which secretes bacterial factors that modulate the maturation of the BCV [19,20]. The BvrS/BvrR two-component system is essential to the detection of changes in the phagosomal environment and the modification of the extracellular lifestyle into an intracellular one [16,21]. The two-component system works through the transduction of environmental signals. While acidity is essential, other factors also contribute and are first detected by the histidine kinase sensor, which is autophosphorylated in a histidine residue. The phosphate group is transferred to an aspartate residue in the regulatory protein, which mediates the changes in gene expression [21]. This system also provides Brucella with resistance to polycationic detergents and increases permeability to surfactants, so it has been proposed that some molecular characteristics of the outer membrane are under the control of the BvrR/BvrS system [22,23]. Mutant strains in the BvrR/BvrS system are avirulent in mouse, show a lower invasive capacity in macrophages and HeLa cells, and are unable to replicate intracellularly [15].
In this study, the structure of BvrR, the motif it recognizes in DNA, and the interaction between BvrR and DNA were predicted (Figure 1). In order to carry out this study, the three-dimensional structure of BvrR was constructed. Then the motif that is probably recognized by BvrR was predicted with Gibbs Recursive Sampling. The sequence AAHTGC (H represents the A, C, and T nucleotides) was found as the most probable motif recognized by BvrR. Therefore, the three-dimensional structure of the DNA-motif was constructed. Subsequently, the initial interaction between BvrR/DNA-motif was predicted by protein/DNA docking. Finally, the BvrR/DNA-motif interaction was analyzed by MD simulation.
2. Results and Discussion
2.1. Structure Prediction of BvrR
In the absence of homologous structures of significant similarity deposited in the Protein Data Bank (PDB), the tridimensional structure of the BvrR was predicted by means of a composite threading method using the I-TASSER software. The modeling yielded 5 structural models, all them with C-score values between −5 and 2. The best model was the only one with a positive C-score = +0.15, RMSD = 5.4 ± 3.4 Å, and TM-score = 0.73 ± 0.11 (Supplemental Information S1). According to Yang and Zang, the C-score estimates the confidence of the model with higher values meaning greater quality [24]. Moreover, models with C-score > −1.5 and TM-score > 0.5 have usually a correct fold. The TM-score is a measure of the structural similarity which is independent of the sequence length. TM-score values that are higher than 0.5 generally correspond to highly similar structures in the same SCOP/CATH fold family as reported by Xu and Zang [25]. Therefore we select only the optimal model as the best guess of the protein model.
I-TASSER uses a Local Meta-Threading Server (LOMETS) to select templates with high significance in threading alignments to build the model; the significance is measured by the Z-score. An alignment with a Normalized Z-score > 1 indicates a good template for modeling. The best templates selected by LOMETS were the mutant of response regulator KdpE complexed to DNA (PDB code: 4kfcA), the response regulator KdpE complexed to its promoter (PDB code: 4knyA), and the response regulator MtrA (PDB code: 2gwrA) with normalized Z-scores of 4.21, 3.91, and 4.11, respectively (Supplemental information S1). We used the COACH server to predict possible ligand binding sites on BvrR. This site generates two binding site predictions using TM-SITE and S-SITE methods, which recognize ligand-binding templates from the BioLiP protein function database by binding-specific substructure and sequence profile comparisons. The server uses a confidence score (C-score) of predicted binding site ranking from 0 to 1, where a higher score indicates a more reliable prediction [26,27]. According to TM-SITE results, BvrR have binding sites for beryllium trifluoride ion, magnesium, and manganese with a C-score of 0.45; and accordingly to S-SITE, BvrR have binding sites for peptides, sulfate, and lanthanum ions with a C-score of 0.18 and two possible binding sites for nucleic acids with a C-score of 0.12 (Supplemental information S2, Figure S2.1 and S2.2). We also used COFACTOR server to predict possible BvrR biological functions. COFACTOR uses the three-dimensional (3D) structural model to thread through the BioLiP protein function database and identify Gene Ontology (GO), Enzyme Commission (EC), and functional insights. This server evaluates global and local similarity using the C-scoreGO, a confidence score of predicted GO terms, and values a range in between (0–1) where a higher value indicates a better prediction of the function [28,29]. COFACTOR defined the molecular function of BvrR in GO terms as a possible heterocyclic compound binding (GO:1901363, GO:0097159), a nucleic acid binding (GO:0003676), and a signal transducer activity (GO:0004871) with a CscoreGO of 0.94, 0.89, and, 0.73 respectively. The biological process defined in GO terms showed a biological regulation (GO:0065007), signal transduction (GO:0007165), and a metabolic process activity (GO:0008152) with a CscoreGO of 0.97, 0.89, and 0.71, respectively (Supplemental information S2). We also searched the evolutionary relationships of the BvrR domains in the CATH database. This resource identifies protein domain structures within 3D-protein structures from the PDB and assigns domains sharing evolutionary similarities to the same superfamily within the CATH hierarchical structure classification [30]. According to the sequence homology, the BvrR protein has two domains: an N-terminal receiver domain and a C-terminal effector domain. The C-terminal effector domain of BvrR has structural similarities with a domain of AraC-family transcriptional activator protein ToxT from Vibrio cholerae (PDB code: 4MLOA02) (Supplemental information S2, Figure 2a). This domain belongs to the Superfamily 3.40.50.12330 of the CATH classification, which is characterized for having protein histidine kinase activity (GO:0004673), phosphorelay response regulator activity (GO:0000156), pathogenesis (GO:0009405), and cellular response to osmotic stress (GO:0071470). The N-terminal receiver domain of BvrR shares a structural similarity with an N-terminal receiver domain of Response Regulator PmrA from Escherichia coli (PDB code: 3W9SB00) and an N-terminal receiver domain of a signal transduction histidine kinase from Aspergillus oryzae (PDB code: 3C97A01) (Supplemental information S2, Figure 2b). These domains belong to the Superfamily 3.40.50.2300 of the CATH classification, characterized by having protein binding activity (GO:0005515) and phosphorelay signal transduction system (GO:0000160).
The quality of the predicted structure was first assessed using PROCHECK software, which evaluates the stereo-chemical data of the amino acids on the structure and compares them with the data obtained from a refined, high-resolution structure [32,33]. A Ramachandran plot was used to evaluate the BvrR model showing that 75.5% of the residues were in the most favored regions, 19.4% in the additional allowed regions, 2.3% in the generously allowed regions, and 2.8% in the disallowed regions (Supplemental Information S3). Although the software indicates that a good quality model would be expected to have over 90% in the most favored region, it also determines that unusual highlighted regions are not necessarily errors as such, but can be unusual features for which there is a reasonable explanation [32]. VERIFY3D software was used to determine the compatibility of the atomic model (3D) with its own amino acid sequence (1D) by assigning a structural class based on its location and environment [34,35]. Results showed that 85.77% of the residues have an averaged 3D–1D score ≥ 0.2, suggesting that the model is highly accurate (Supplemental Information S4). ERRAT software was used to analyze the BvrR model; it reliably identifies regions of error by examining the statistics of six types of non-covalently bonded atom-atom interactions (CC, CN, CO, NN, NO, and OO) in protein structures and it was used to evaluate the BvrR model [36]. In this analysis, a good protein structure was expected to show a confidence level above 95%. Results indicated that the structure has a quality factor of 96.53% (Supplemental Information S5). PROVE (PROtein Volume Evaluation) software was used to validate the volume-based structure of the BvrR model by determining the deviations of the atomic volumes from the standard values [37]. The program found 49 buried outliner protein atoms and an absolute Z score = 5.6 on the BvrR structure (Supplemental Information S6). Absolute Z-scores are used to identify problems in specific regions within a protein model. Proteins having absolute Z-scores > 3 occur at or near regions in the structure with unusual stereochemistry. This result correlates with the analysis obtained using PROCHECK with which we found unusual regions on the BvrR structure. Finally, ProSA (Protein Structure Analysis) software was used to validate the BvrR structure. This software calculates the overall model quality with the Z-score for a specific input structure, and the score is compared with those typically found for native proteins of similar size. If this score is outside a range characteristic for native proteins, the structure probably contains errors. The BvrR structure obtained a Z-score of −6.57, which is within the range of Z-scores found for native proteins of similar size, indicating that the overall quality of our model is high (Supplemental Information S7).
2.2. Prediction of DNA-Motif Recognized by BvrR
Viadas et al. [18] performed a microarray analysis comparing the expression of all the RNA of a B. abortus wild-type and a mutant bvrR− strain. The analysis showed that 127 genes had an alteration in their expression in the bvrR− strain. Subsequently, real-time PCR analysis was performed to validate their data. They selected 48 genes related to carbon and nitrogen metabolism, control of external membrane proteins, transport, transcription factors, and virulence [18]. We selected and analyzed 32 genes of biological relevance from those analyzed in the real-time PCR and those mentioned in the discussion of the article to find the sequence of a potential DNA-motif recognized by BvrR. For the analysis, we used a 149 bp upstream and 21 bp downstream sequence from the initiation codon, making a final sequence of 170 bp (Supplemental information S8). Three motif analyses were performed with the Gibbs Motif Sampler software on the Gibbs Recursive Sampling mode. First, the software was configured to search for 10 base-pairs motifs, limited to one motif per sequence and two different motifs in all sequences. The first motif obtained was determined by the software as the most optimal result. The data of this motif, its location and the genes where this sequence was found are shown in Table S1 (Supplemental information S9) while the graphic representation of this sequence is shown in Figure 3a. The second motif obtained was determined as being formed by hits that occur with a probability greater than 50% in an optimal alignment in 500 iterations. Table S2 shows the data from this motif (Supplemental information S9) and the graphic representation of this sequence is shown in Figure 3b. On the third motif analysis, the software was configured to search for 10 bp motifs, limited to three motifs per sequence and three different motifs in all sequences. The data of this motif, its location, and the genes where this sequence was found are shown in Table S3 (Supplemental information S9). Finally, the graphic representation of this sequence is shown in Figure 3c.
The three-dimensional structure of the DNA-motif AAHTGC (H represents A, C, and T nucleotides in FASTA format) was built using the 3D-DART server. To fill the gap in the sequence and make the DNA sequence longer to evaluate the docking with BvrR, we choose the sequence showing the motif for the omp25a gene, which codes for an Omp25 protein (Supplemental Information S9). The final sequence model built as double-stranded DNA-B was GCGGCACGAAATGCCCCATT.
2.3. Docking of BvrR into the DNA-motif
The docking analysis was performed with HDOCK server, developed by Huang Laboratory at the Huazhong University of Science & Technology [38,39]. According to structural similarity, the docked pose 7 showed a probable interaction between the DNA-motif and the C-terminal effector domain of BvrR that shares structure similarity with a domain of the transcription regulator protein ToxT from Vibrio cholerae (PDB code: 4MLOA). This docked pose obtained a Docking score of −448.3 and ligand RMSD = 126.84 Å (Figure 4) (Supplemental information S10). HDOCK uses a distance-dependent knowledge-based scoring function (ITScore-PP) to predict interactions. The ITScore-PP improves the interatomic pair potentials using a statistical mechanics-based iterative method, in which the pairwise distance-dependent atomic interaction potentials were derived from experimentally determined complex structures [40,41,42].
2.4. MD Simulations
The docked pose 7 was selected to carry out the Equilibrium Molecular Dynamics (EMD) simulation (Supplemental Information S11). The RMSD (root mean square deviation) was plotted during the 20 ns simulation. The RMSD values were plotted and are shown in Figure 5a. Additionally, the number of hydrogen bonds between BvrR and the DNA-motif during the EMD simulation were plotted. The average number of hydrogen bonds was 3.13 (Figure 5b) (Supplemental Information S12).
We analyzed the Coulombic surface force of the C-terminal domain on frame 7507, to identify the charge in the surface of the C-terminal effector domain (Figure 6a,b). To analyze the energies involved in the C-terminal effector domain and the DNA, two frames (Frames 5523 and 7507) were extracted from the MD simulation and analyzed with the FireDock server. Averaging the results of both frames, the docked pose 7 obtained an attractive Van der Waals (VdW) force of −26.77 Kcal/mol, a repulsive VdW force of 7.56 Kcal/mol, an atomic contact energy (ACE) of 5.18 Kcal/mol, a global binding energy of −16.31 Kcal/mol and five possible hydrogen bonds between the BvrR protein and DNA (Figure 6c). To evaluate the energies involved in the DNA when it has a different motif, we analyzed frame 7507 interacting with the same DNA but changed the motif sequence AAHTGC for CCGGTA. This docked pose obtained an attractive VdW force of −15.78 Kcal/mol, a repulsive VdW force of 6.16 Kcal/mol, an ACE of 3.58 Kcal/mol, Global binding energy of −2.91 Kcal/mol, and one possible hydrogen bond (Supplemental Information S13).
3. Materials and Methods
3.1. Structure Prediction
The BvrR sequence was obtained from the Gene Products Data Bank (GenPept code AAC28777) [44]. The analysis and modeling of BvrR were performed using I-TASSER (Iterative Threading ASSEmbly Refinement) Protein Structure & Function Predictions developed by Zhang Lab University of Michigan [45,46,47] to predict the possible three-dimensional structure of the BvrR protein. Molecular graphics and analyses were performed with UCSF Chimera, developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco, with support from NIH P41-GM103311 [31]. The DNA three-dimensional structure modeling was performed using the 3DNA-Driven DNA Analysis and Rebuilding Tool server (3D-DART server) [48]. The quality analysis of the predicted structure was conducted using SAVES (PROCHECK, VERIFY3D, ERRAT, PROVE) and ProSA servers. Finally, the modeled structure was visualized using Chimera v 1.13.
3.2. Gibbs Sampling
The analysis of the possible regulator zone on the DNA of BvrR was made with Gibbs Motif Sampler Homepage, Gibbs v 3.1 [49,50,51]. We used 32 genes from a previous work for the analysis [18] and selected 170 bp upstream of the starting codon of each gene, where the regulator zone is probably located (Supplemental information S8). The Recursive Sampling mode was specifically used to perform three motif analyses. Firstly, we searched for 10-bp motifs, one motif per sequence, and a maximum of 2 motifs on all the sequences. Afterward, we searched for 10-bp motifs, 3 motifs per sequence, and a maximum of 3 motifs on all the sequences. To graphically represent the multiple sequence alignment, we used WebLogo v 2.8.2 [52].
3.3. Protein-DNA Docking
Protein-DNA docking was performed to analyze the interactions between BvrR and the DNA sequence found with Gibbs sampling. The analysis was carried out with HDOCK server, developed by Huang Laboratory at the Huazhong University of Science & Technology [38,39,40,41]. HDOCK server can model protein–protein and protein–DNA/RNA docking based on a hybrid docking algorithm of template-based modeling and free docking, in which cases with misleading templates can be rescued by the free docking protocol [39].
3.4. Molecular Dynamics Simulations
The docking structures in PDB format were refined with MolProbity server developed by the Department of Biochemistry at Duke University [53]. The Equilibrium Molecular dynamics simulations of the complex were performed with Nanoscale Molecular Dynamics (NAMD2) version 2.12. a molecular dynamics code based on Charm++ parallel objects, designed for high-performance simulation of large biomolecular systems [54]. Additionally, Visual Molecular Dynamics (VMD) version 1.9 was used for simulation setup and trajectory analysis [43,55,56]. The PSF and PDB model representations were build using the CHARMM36 Force Field with the complex immersed into a rectangular periodic box of water of dimensions 60 Å x 92 Å x 90Å; the distance established from the molecule to the edge of the box was 10 Å. The system was neutralized with NaCl. The dynamics ran to a constant temperature of 310 K in explicit solvent. The simulations of the system were carried out using the Langevin dynamics [57], consisting of minimization and equilibration of the molecule in the water box, with a timestep of 2 fs/step considering as rigid all bonds. A thousand steps for minimization were employed and a total number of 107 of simulation steps for a total simulated time of 20 ns for the production dynamics [54].
3.5. Analysis of Dynamics Simulations
First, we used the output files from the minimization and equilibration of the molecule to calculate the RMSD values and analyze the extent of equilibration of the simulation; the RMSD values were calculated for the entire molecule (excluding the hydrogen) every 2 fs of the simulation. With this information, we constructed a plot of RMSD over time to know the stability of the protein. If the RMSD is stable at the end of the run, it means that the molecule is equilibrated. To analyze the binding energies we used the FireDock server developed by the Raymond and Beverly Sackler Faculty of Exact Sciences at the Tel Aviv University [58]. The FireDock server scores the interaction between the protein and the DNA according to Atomic Contact Energy [59], softened Van der Waals interactions, partial electrostatics, and additional estimations of the binding free energy.
4. Discussion
In the structure prediction, the evolutionary relationships of the BvrR domains showed that the C-terminal effector domain of BvrR has high homology with response regulators as proteins KdpE (4knyA), MtrA (2gwrA), and ToxT (4MLOA). This domain belongs to the Superfamily 3.40.50.12330 which have known histidine kinase and phosphorelay response regulator activities related to pathogenesis and response to osmotic stress. The N-terminal receiver domain has high homology with protein PmrA (3W9SB) and a signal transduction histidine kinase (3C97A). This domain belongs to the Superfamily 3.40.50.2300 and is the one that interacts with the sensor BvrS, which correlates with its protein binding activity and phosphorelay signal transduction activities. All these results, including the ones obtained with VERIFY3D, ERRAT and ProSA that prove the accuracy of the model, validate the BvrR structural model, even though some small regions were modeled in an unusual way (accordingly to PROCHECK and PROVE). This may be because the modeling template contained those variations, which were transferred to the model; it may also be due to the lack of better templates on which to base the prediction of the BvrR structure.
In the search of the DNA motif recognized by BvrR, it can be observed that the first two motifs found by the software have the same sequence, AAHTGC. This sequence was found before the coding sequence for 13 genes, as nitric oxide reductase NorC, Lipoprotein, methyltransferase Cfa, transcriptional regulator OmpR, GntR regulator, thermal shock protein HtpX, and glutaminase R, among others (Supplemental Information S9). Most importantly, the motif was found for the outer membrane protein Omp25, which is known to be regulated by BvrR [16]. This could indicate that the motif is probably the one recognized by this regulator on the DNA. The third motif was found only on 3 genes; for this reason, this motif was discarded for further analysis. Interestingly, the sequence AAHTGC has a great similarity to the sequence AAATGTG, one of the sequences recognized by the response regulator KdpE (PDB code: 4knyA). The latter was one of the threading templates used by I-TASSER to build the BvrR model, as mentioned above, and was the only template crystalized with its DNA motif. It should be noted that the motif AAHTGC was found after the start codon in the genes coding for a Heat Shock Protein HtpX (BAB1_1821) and in the Negative exopolysaccharide regulator ExoR (BAB1_0891). This suggested a possible negative regulation on those genes (Table S4, Supplemental Information S9).
In the docking analysis, we selected the docked pose 7 because it showed an interaction between BvrR and the DNA-motif related to the one predicted by COACH server with the method S-SITE (Figure S2.1, Supplemental Information S2) and because it also had a good docking score, showing a high probability that it is the right zone of interaction. This was related to the EMD simulation, where the RMSD values were kept constant almost from the beginning and the number of hydrogen bonds remained stable, indicating that the interaction between the C-terminal effector domain and the DNA-motif is in equilibrium. This also correlated with the analysis of the Coulombic surface force on the site of interaction, where the C-terminal effector domain was positively charged and interacted with the negatively charged DNA, making the union in that area highly probable. Finally, those results were confirmed with the analysis of the energies involved in the interaction, where the global binding energy between BvrR and the AAHTGC motif was higher compared to that observed when using an unrelated DNA-motif.
All results shown above were performed by computational models, and need to be validated experimentally by means of ChIP-Seq and the crystallization of BvrR interacting with DNA. Unfortunately, specific monoclonal antibodies against BvrR, necessary to purify the protein and carry out these studies, are not available. In the future, we would like to continue the research on this response regulator, as it is possibly a major virulence factor in the infection caused by Brucella. We believe that knowledge of the structure, binding site to DNA, and regulated genes could lead to a better understanding of the infection mechanism, and that knowledge in the future could aid in the selection of new targets for the development of new vaccines. Also, BvrR could be used as a new biomarker for diagnostic chronic infections of brucellosis, as Vitale et al. [60] used the cyclin-dependent kinase inhibitor p16INK4a, in high-risk HPV infections, to identify which low-grade intraepithelial lesion (LSIL) cases were inclined to the progression of the disease and the possible development of cervical cancer.
5. Conclusions
According to the results obtained, the three-dimensional structure of BvrR was consistently predicted. It was also found that the AAHTGC sequence is likely the motif recognized in the DNA by BvrR. The BvrR protein will probably interact with the DNA motif sequence found in Gibbs Recursive Sampler. The interaction between the DNA and the C-terminal effector domain showed a good equilibrium when analyzing the RMSD of all atoms as compared to the initial position of the molecule. The hydrogen bonds between BvrR and DNA remained stable during the simulation. According to the analysis of the surface electrical charge, the BvrR C-terminal effector domain is positively charged in the area where it interacts with the DNA, which possibly favors that interaction. When analyzing the energies involved, high attractive VdW energies and global binding energy were observed. Likewise, the analysis showed little interference of repulsive VdW energies as compared to the observed energies when the DNA-motif is exchanged for an unrelated one. The data obtained in this work were in silico and must be validated experimentally in the future.
Supplementary Materials
The following are available online. Supplemental Information S1: I-TASSER results for predicted BvrR structure; Supplemental Information S2: CATH results for BvrR homology; Supplemental Information S3: PROCHECK results for BvrR; Supplemental Information S4: Verify 3D results for BvrR; Supplemental Information S5: ERRAT results for BvrR; Supplemental Information S6: PROVE results for BvrR; Supplemental Information S7: ProSA results for BvrR; Supplemental Information S8: Sequences of the 32 genes used for the Gibbs Sampling analysis; Supplemental Information S9: Gibbs recursive sampling results obtained with the 32 sequences; Supplemental Information S10: HDOCK docking results for BvrR and DNA-motif; Supplemental Information S11: MD simulations results for docked pose 7; and Supplemental Information S12: Firedock analysis results for frames 5523 and 7507.
Author Contributions
E.A.R.-G. and R.L.-S. designed the experiments. E.A.R.-G. and A.M.-T. performed the computational analysis. E.A.R.-G., M.C.M.-L., M.E.C.-D., I.E.-G., A.M.-T., and R.L.-S. analyzed the data. R.L.-S. and M.C.M.-L. were in charge of funding acquisition, resources, project administration, and review and editing. E.A.R.-G. wrote the original draft.
Funding
This research was funded by Instituto Politécnico Nacional (SIP project 20181661, SIP project 20195796 and SIP project 20181218).
Conflicts of Interest
The authors declare no conflict of interest.
Footnotes
Sample Availability: Not available.
References
- 1.Ariza J., Bosilkovski M., Cascio A., Colmenero J.D., Corbel M.J., Falagas M.E., Memish Z.A., Roushan M.R.H., Rubinstein E., Sipsas N.V., et al. Perspectives for the Treatment of Brucellosis in the 21st Century: The Ioannina Recommendations. PLoS Med. 2007;4:e317. doi: 10.1371/journal.pmed.0040317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Corbel M.J. Brucellosis in Humans and Animals. WHO Press; Geneva, Switzerland: 2006. pp. 1–102. [Google Scholar]
- 3.Xavier M.N., Paixao T.A., den Hartigh A.B., Tsolis R.M., Santos R.L. Pathogenesis of Brucella spp. Open Vet. Sci. J. 2010;4:109–118. doi: 10.2174/1874318801004010109. [DOI] [Google Scholar]
- 4.Bossi P., Tegnell A., Baka A., Van Loock F., Hendriks J., Werner A., Maidhof H., Gouvras G. Task Force on Biological and Chemical Agent Threats, Public Health Directorate, European Commission, Luxembourg Bichat guidelines for the clinical management of brucellosis and bioterrorism-related brucellosis. Euro Surveill. 2004;9:E15–E16. doi: 10.2807/esm.09.12.00506-en. [DOI] [PubMed] [Google Scholar]
- 5.Pappas G., Akritidis N., Bosilkovski M., Tsianos E. Brucellosis. N. Engl. J. Med. 2005;352:2325–2336. doi: 10.1056/NEJMra050570. [DOI] [PubMed] [Google Scholar]
- 6.Fugier E., Pappas G., Gorvel J.P. Virulence factors in brucellosis: Implications for aetiopathogenesis and treatment. Expert Rev. Mol. Med. 2007;9:1–10. doi: 10.1017/S1462399407000543. [DOI] [PubMed] [Google Scholar]
- 7.Acha P.N., Szyfres B. Zoonosis y enfermedades transmisibles comunes al hombre y a los animales. vol. 1—Bacteriosis y micosis. Rev. Inst. Med. Trop. Sao Paulo. 2001;43:338. doi: 10.1590/S0036-46652001000600015. [DOI] [Google Scholar]
- 8.Murray P.R., Rosenthal K.S., Pfaller M.A. Microbiologia Médica. Elsevier; Amsterdam, The Netherland: 2017. [Google Scholar]
- 9.Gorvel J.P., Moreno E. Brucella intracellular life: From invasion to intracellular replication. Vet. Microbiol. 2002;90:281–297. doi: 10.1016/S0378-1135(02)00214-6. [DOI] [PubMed] [Google Scholar]
- 10.Watarai M., Kim S., Erdenebaatar J., Makino S., Horiuchi M., Shirahata T., Sakaguchi S., Katamine S. Cellular prion protein promotes Brucella infection into macrophages. J. Exp. Med. 2003;198:5–17. doi: 10.1084/jem.20021980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Celli J., de Chastellier C., Franchini D.-M., Pizarro-Cerda J., Moreno E., Gorvel J.-P. Brucella Evades Macrophage killing via VirB-dependent sustained interactions with the endoplasmic reticulum. J. Exp. Med. 2003;198:545–556. doi: 10.1084/jem.20030088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Mayorga L.S., Bertini F., Stahl P.D. Fusion of newly formed phagosomes with endosomes in intact cells and in a cell-free system. J. Biol. Chem. 1991;266:6511–6517. [PubMed] [Google Scholar]
- 13.Desjardins M., Celis J.E., van Meer G., Dieplinger H., Jahraus A., Griffiths G., Huber L.A. Molecular characterization of phagosomes. J. Biol. Chem. 1994;269:32194–32200. [PubMed] [Google Scholar]
- 14.Starr T., Ng T.W., Wehrly T.D., Knodler L.A., Celli J. Brucella intracellular replication requires trafficking through the late endosomal/lysosomal compartment. Traffic. 2008;9:678–694. doi: 10.1111/j.1600-0854.2008.00718.x. [DOI] [PubMed] [Google Scholar]
- 15.Sola-Landa A., Pizarro-Cerdá J., Grilló M.J., Moreno E., Moriyón I., Blasco J.M., Gorvel J.P., López-Goñi I. A two-component regulatory system playing a critical role in plant pathogens and endosymbionts is present in Brucella abortus and controls cell invasion and virulence. Mol. Microbiol. 1998;29:125–138. doi: 10.1046/j.1365-2958.1998.00913.x. [DOI] [PubMed] [Google Scholar]
- 16.Guzman-Verri C., Manterola L., Sola-Landa A., Parra A., Cloeckaert A., Garin J., Gorvel J.-P., Moriyon I., Moreno E., Lopez-Goni I. The two-component system BvrR/BvrS essential for Brucella abortus virulence regulates the expression of outer membrane proteins with counterparts in members of the Rhizobiaceae. Proc. Natl. Acad. Sci. USA. 2002;99:12375–12380. doi: 10.1073/pnas.192439399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lamontagne J., Butler H., Chaves-Olarte E., Hunter J., Schirm M., Paquet C., Tian M., Kearney P., Hamaidi L., Chelsky D., et al. Extensive cell envelope modulation is associated with virulence in Brucella abortus. J. Proteome Res. 2007;6:1519–1529. doi: 10.1021/pr060636a. [DOI] [PubMed] [Google Scholar]
- 18.Viadas C., Rodríguez M.C., Sangari F.J., Gorvel J.-P., García-Lobo J.M., López-Goñi I. Transcriptome analysis of the Brucella abortus BvrR/BvrS Two-Component Regulatory System. PLoS ONE. 2010;5:e10216. doi: 10.1371/journal.pone.0010216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Martinez-Nunez C., Altamirano-Silva P., Alvarado-Guillen F., Moreno E., Guzman-Verri C., Chaves-Olarte E. The Two-Component System BvrR/BvrS regulates the expression of the type IV Secretion System VirB in Brucella abortus. J. Bacteriol. 2010;192:5603–5608. doi: 10.1128/JB.00567-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Boschiroli M.L., Ouahrani-Bettache S., Foulongne V., Michaux-Charachon S., Bourg G., Allardet-Servent A., Cazevieille C., Liautard J.P., Ramuz M., O’Callaghan D. The Brucella suis virB operon is induced intracellularly in macrophages. Proc. Natl. Acad. Sci. USA. 2002;99:1544–1549. doi: 10.1073/pnas.032514299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Altamirano-Silva P., Meza-Torres J., Castillo-Zeledón A., Ruiz-Villalobos N., Zuñiga-Pereira A.M., Chacón-Díaz C., Moreno E., Guzmán-Verri C., Chaves-Olarte E. Brucella abortus senses the intracellular environment through the two-component system BvrR/BvrS allowing the adaptation to its replicative niche. Infect. Immun. 2018;86:e00713-17. doi: 10.1128/IAI.00713-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Martínez de Tejada G., Pizarro-Cerdá J., Moreno E., Moriyón I. The outer membranes of Brucella spp. are resistant to bactericidal cationic peptides. Infect. Immun. 1995;63:3054–3061. doi: 10.1128/iai.63.8.3054-3061.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Freer E., Moreno E., Moriyón I., Pizarro-Cerdá J., Weintraub A., Gorvel J.P. Brucella-Salmonella lipopolysaccharide chimeras are less permeable to hydrophobic probes and more sensitive to cationic peptides and EDTA than are their native Brucella sp. counterparts. J. Bacteriol. 1996;178:5867–5876. doi: 10.1128/jb.178.20.5867-5876.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Yang J., Zhang Y. Protein structure and function prediction using I-TASSER. Curr. Protoc. Bioinform. 2015;52:5–8. doi: 10.1002/0471250953.bi0508s52. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Xu J., Zhang Y. How significant is a protein structure similarity with TM-score = 0.5? Bioinformatics. 2010;26:889–895. doi: 10.1093/bioinformatics/btq066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Yang J., Roy A., Zhang Y. Protein–ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment. Bioinformatics. 2013;29:2588–2595. doi: 10.1093/bioinformatics/btt447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Yang J., Roy A., Zhang Y. BioLiP: A semi-manually curated database for biologically relevant ligand-protein interactions. Nucleic Acids Res. 2013;41:D1096–D1103. doi: 10.1093/nar/gks966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Roy A., Yang J., Zhang Y. COFACTOR: An accurate comparative algorithm for structure-based protein function annotation. Nucleic Acids Res. 2012;40:W471–W477. doi: 10.1093/nar/gks372. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Zhang C., Freddolino P.L., Zhang Y. COFACTOR: Improved protein function prediction by combining structure, sequence and protein–protein interaction information. Nucleic Acids Res. 2017;45:W291–W299. doi: 10.1093/nar/gkx366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Dawson N.L., Lewis T.E., Das S., Lees J.G., Lee D., Ashford P., Orengo C.A., Sillitoe I. CATH: An expanded resource to predict protein function through structure and sequence. Nucleic Acids Res. 2017;45:D289–D295. doi: 10.1093/nar/gkw1098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Pettersen E.F., Goddard T.D., Huang C.C., Couch G.S., Greenblatt D.M., Meng E.C., Ferrin T.E. UCSF Chimera-A visualization system for exploratory research and analysis. J. Comput. Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- 32.Laskowski R.A., MacArthur M.W., Moss D.S., Thornton J.M. IUCr PROCHECK: A program to check the stereochemical quality of protein structures. J. Appl. Crystallogr. 1993;26:283–291. doi: 10.1107/S0021889892009944. [DOI] [Google Scholar]
- 33.Laskowski R.A., Rullmannn J.A., MacArthur M.W., Kaptein R., Thornton J.M. AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR. J. Biomol. NMR. 1996;8:477–486. doi: 10.1007/BF00228148. [DOI] [PubMed] [Google Scholar]
- 34.Bowie J.U., Lüthy R., Eisenberg D. A method to identify protein sequences that fold into a known three-dimensional structure. Science. 1991;253:164–170. doi: 10.1126/science.1853201. [DOI] [PubMed] [Google Scholar]
- 35.Lüthy R., Bowie J.U., Eisenberg D. Assessment of protein models with three-dimensional profiles. Nature. 1992;356:83–85. doi: 10.1038/356083a0. [DOI] [PubMed] [Google Scholar]
- 36.Colovos C., Yeates T.O. Verification of protein structures: Patterns of nonbonded atomic interactions. Protein Sci. 1993;2:1511–1519. doi: 10.1002/pro.5560020916. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Pontius J., Richelle J., Wodak S.J. Deviations from standard atomic volumes as a quality measure for protein crystal structures. J. Mol. Biol. 1996;264:121–136. doi: 10.1006/jmbi.1996.0628. [DOI] [PubMed] [Google Scholar]
- 38.Yan Y., Wen Z., Wang X., Huang S.-Y. Addressing recent docking challenges: A hybrid strategy to integrate template-based and free protein-protein docking. Proteins Struct. Funct. Bioinform. 2017;85:497–512. doi: 10.1002/prot.25234. [DOI] [PubMed] [Google Scholar]
- 39.Yan Y., Zhang D., Zhou P., Li B., Huang S.-Y. HDOCK: A web server for protein–protein and protein–DNA/RNA docking based on a hybrid strategy. Nucleic Acids Res. 2017;45:W365–W373. doi: 10.1093/nar/gkx407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Huang S.-Y., Zou X. A knowledge-based scoring function for protein-RNA interactions derived from a statistical mechanics-based iterative method. Nucleic Acids Res. 2014;42:e55. doi: 10.1093/nar/gku077. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Huang S.-Y., Zou X. An iterative knowledge-based scoring function for protein-protein recognition. Proteins Struct. Funct. Bioinform. 2008;72:557–579. doi: 10.1002/prot.21949. [DOI] [PubMed] [Google Scholar]
- 42.Yan Y., Huang S. A new pairwise shape-based scoring function to consider long-range interactions for protein-protein docking. Biophys. J. 2017;112:470a. doi: 10.1016/j.bpj.2016.11.2521. [DOI] [Google Scholar]
- 43.Humphrey W., Dalke A., Schulten K. VMD: Visual molecular dynamics. J. Mol. Graph. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
- 44.National Center of Biotechnology Information (NCBI) [(accessed on 7 December 2018)]; Available online: https://www.ncbi.nlm.nih.gov/protein/AAC28777.1?report=genbank&log$=protalign&blast_rank=24&RID=WZ0WJA2M014.
- 45.Zhang Y. I-TASSER server for protein 3D structure prediction. BMC Bioinform. 2008;9:40. doi: 10.1186/1471-2105-9-40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Roy A., Kucukural A., Zhang Y. I-TASSER: A unified platform for automated protein structure and function prediction. Nat. Protoc. 2010;5:725–738. doi: 10.1038/nprot.2010.5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Yang J., Yan R., Roy A., Xu D., Poisson J., Zhang Y. The I-TASSER Suite: Protein structure and function prediction. Nat. Methods. 2015;12:7–8. doi: 10.1038/nmeth.3213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.van Dijk M., Bonvin A.M.J.J. 3D-DART: A DNA structure modelling server. Nucleic Acids Res. 2009;37:W235–W239. doi: 10.1093/nar/gkp287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Thompson W., Rouchka E.C., Lawrence C.E. Gibbs Recursive Sampler: Finding transcription factor binding sites. Nucleic Acids Res. 2003;31:3580–3585. doi: 10.1093/nar/gkg608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Newberg L.A., Thompson W.A., Conlan S., Smith T.M., McCue L.A., Lawrence C.E. A phylogenetic Gibbs sampler that yields centroid solutions for cis-regulatory site prediction. Bioinformatics. 2007;23:1718–1727. doi: 10.1093/bioinformatics/btm241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Thompson W.A., Newberg L.A., Conlan S., McCue L.A., Lawrence C.E. The Gibbs Centroid Sampler. Nucleic Acids Res. 2007;35:W232–W237. doi: 10.1093/nar/gkm265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Crooks G.E., Hon G., Chandonia J.-M., Brenner S.E. WebLogo: A Sequence Logo Generator. Genome Res. 2004;14:1188–1190. doi: 10.1101/gr.849004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Chen V.B., Arendall W.B., Headd J.J., Keedy D.A., Immormino R.M., Kapral G.J., Murray L.W., Richardson J.S., Richardson D.C., Richardson D.C. MolProbity: All-atom structure validation for macromolecular crystallography. Acta Crystallogr. Sect. D Biol. Crystallogr. 2010;66:12–21. doi: 10.1107/S0907444909042073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Phillips J.C., Braun R., Wang W., Gumbart J., Tajkhorshid E., Villa E., Chipot C., Skeel R.D., Kalé L., Schulten K. Scalable Molecular Dynamics with NAMD. J. Comput. Chem. 2005;26:1781–1802. doi: 10.1002/jcc.20289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Stone J.E., Gullingsrud J., Schulten K. A system for interactive molecular dynamics simulation; Proceedings of the 2001 Symposium on Interactive 3D Graphics; Chapel Hill, NC, USA. 26–29 March 2001; New York, NY, USA: ACM; 2001. pp. 191–194. [Google Scholar]
- 56.Eargle J., Wright D., Luthey-Schulten Z. Multiple alignment of protein structures and sequences for VMD. Bioinformatics. 2006;22:504–506. doi: 10.1093/bioinformatics/bti825. [DOI] [PubMed] [Google Scholar]
- 57.Langevine P. On the theory of Brownian motion. Comptes Rendus Acad. Bulg. des Sci. 1908;146:530–533. [Google Scholar]
- 58.Mashiach E., Schneidman-Duhovny D., Andrusier N., Nussinov R., Wolfson H.J. FireDock: A web server for fast interaction refinement in molecular docking. Nucleic Acids Res. 2008;36:W229–W232. doi: 10.1093/nar/gkn186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Zhang C., Vasmatzis G., Cornette J.L., DeLisi C. Determination of atomic desolvation energies from the structures of crystallized proteins 1 1Edited by B. Honig. J. Mol. Biol. 1997;267:707–726. doi: 10.1006/jmbi.1996.0859. [DOI] [PubMed] [Google Scholar]
- 60.Vitale S.G., Valenti G., Rapisarda A.M.C., Cali I., Marilli I., Zigarelli M., Sarpietro G., Cianci A. P16INK4a as a progression/regression tumour marker in LSIL cervix lesions: Our clinical experience. Eur. J. Gynaecol. Oncol. 2016;37:685–688. [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.