Skip to main content
. 2012 Apr 12;7(4):e34754. doi: 10.1371/journal.pone.0034754

Table 1. Workflow summarizing the steps followed in the identification of HGT genes in mycobacteria.

1. The proteomes of 15 Mycobacterium spp., D. discoideum and 34 amoeba-resistant bacteria (Table S1) were downloaded from the National Center for Biotechnology Information (NCBI).
2. Search for homologous genes of mycobacterial open reading frames (ORFs) in the genomes of D. discoideum and 34 amoeba-resistant bacteria using the BLASTp program from NCBI (E-value<1.10–4, similarity >30% and coverage >80%).
3. Search for the homologous sequences of the mycobacterial ORFs found in step 2 in the NR database using BLASTp (E-value<1.10–4, similarity >30% and coverage >80%).
4. Selection of ORFs from mycobacteria found in step 3 presenting significant homology with D. discoideum and 34 amoeba-resistant bacteria in the first 100 hits.
5. Infer phylogenetic relationships between the protein sequences found in steps 4 using MUSCLE for alignment and two construction methods (Maximum Likelihood within the PHYML program and M. Bayes).
6. Analysis of the trees generated in step 5, looking for possible HGT between Mycobacterium spp. and amoebae and/or amoeba-resistant bacteria.

More details can be found in the materials and methods.