Skip to main content
. 2018 Sep 8;34(17):i671–i679. doi: 10.1093/bioinformatics/bty589

Fig. 3.

Fig. 3.

Simulation setup and comparison measures. (a) Given the number m of taxa and n of characters, we use the ms package (Hudson, 2002) to simulate a perfect phylogeny tree. Subsequently, we introduce at most k losses per character using a rate λ, yielding the simulated phylogenetic tree T* and matrix B*. (b) We then perturb the entries of B*=[bp,c*] given a false positive rate FPR(D)=α* and false negative rate FPNR(D)=β*, yielding the input matrix D=[dp,c]. Entry dp,c=0 is a true negative (TN) if bp,c*=0 and a false negative (FN) if bp,c*=1. Conversely, dp,c=1 is a false positive (FP) if bp,c*=0 and a true positive (TP) if bp,c*=1. (c) Given D, α* and β*, a phylogeny estimation method yields output matrix B=[bp,c]. (d) In addition, such a method outputs a phylogenetic tree T whose leaves form the rows of output matrix B. (e) To compare T and T*, we compute the recall in terms of pairs of character states that are ancestral (A), on distinct branches (incomparable, I), or on the same edge (clustered, C). A recall of 1 for all three measures implies that (the internal nodes of) T and T* are identical. To compare B and B*, we compute FPR(B) and FNR(B)—if both are 0 then B=B*