Skip to main content
. 2020 Jan 15;577(7791):519–525. doi: 10.1038/s41586-019-1916-6

Extended Data Fig. 7. Maximum-likelihood tree of Asgard archaea l-threonine/l-serine dehydratase.

Extended Data Fig. 7

a, Tree calculated for target Asgard archaea l-threonine/l-serine dehydratase (TdcB) and homologues. TdcB homologues were obtained by BLASTp analysis of the Asgard archaea sequences against the UniProt reference proteome and SwissProt database (release 2019_06). Of homologues with sequence similarity ≥40%, overlap ≥70% and predicted prosite domain PS00165 (serine/threonine dehydratases pyridoxal-phosphate attachment site), representative sequences were selected using CD-HIT with a clustering cut-off of 70% similarity (otherwise default settings were used). Additional homologues with verified biochemical activity, sequence similarity ≥30% and overlap ≥70% were obtained by BLASTp analysis of the Asgard archaea sequences against the UniProt/SwissProt database (2019_05). Sequences were aligned using MAFFT v.7 with default settings. Positions with gaps in more than 10% of the sequences were excluded from the alignment using trimAl v.1.2 (-gt 0.9; and otherwise default settings were used). The maximum-likelihood tree was constructed using PhyML using a fixed empirical substitution matrix (LG), 4 discrete GAMMA categories, empirical amino acid frequencies from the alignment and 100 bootstrap replicates (-b 100 -d aa -m LG -v e). In total, 308 sites of the alignment were used for tree construction. b, Tree calculated for a subset of sequences contained in a section of the original tree (branches that are coloured blue). Sequences were realigned and trimmed as described for a. In total, 308 sites of the alignment were used for tree construction.