The tree includes 69 amino acid sequences of PB transposases and domesticated PB transposases from ciliates and other species. To construct the tree, the alignment of all transposase core domains (
Supplementary file 1) was edited to remove specific insertions restricted to one particular PgmL family. All accession numbers are indicated, except for the PiggyBat transposase from
Myotis lucifugus (
Mitra et al., 2013). PB and domesticated PB proteins from
Tetrahymena thermophila are in blue. The evolutionary history was inferred by using the Maximum Likelihood method based on the JTT matrix-based model (
Jones et al., 1992). The tree with the highest log likelihood (−35952.17) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches (bootstrap = 100). Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites (five categories (+G, parameter = 2.8098)). There were a total of 541 positions in the final dataset. The tree is drawn to scale, with branch lengths measured as the number of substitutions per site. Evolutionary analyses were conducted in MEGA7 (
Kumar et al., 2016).