Skip to main content
. 2013 Mar 6;14:85. doi: 10.1186/1471-2105-14-85

Table 1.

Model comparison using the harmonic mean estimator (HME) for each of the three data sets

Data set Model Root order HME sHME log BF
Laurasiatheria
GTR
Independence (Second)
-173720.23
-173342.71
-
 
HG04
Second
-170093.65
-169819.78
3522.93
Pseudogenes
GTR
Independence (First)
-14026.11
-14001.45
-
 
HG04
First
-13772.80
-13738.83
262.62
Nuclear SSU rRNA
GTR
Independence (Zero)
-8441.69
-8409.53
-
 
HG04
Zero
-8412.27
-8374.33
35.20
 
GTR
Independence (First)
-8435.35
-8409.82
-
 
HG04
First
-8395.11
-8364.34
45.48
 
GTR
Independence (Second)
-8445.00
-8408.39
-
  HG04 Second -8399.92 -8353.75 54.64

Harmonic mean estimates (HME) and stabilized harmonic mean estimates (sHME) for both the site-independent general time-reversible model (GTR) and the context-dependent model of Hwang and Green (HG04) [23], combined with the one or more ancestral root distributions [24]. Great care needs to be taken to make sure that under the GTR model, the same sites are taken into account when calculating the likelihood, this due to the presence of gaps. We emphasize this by stating which ancestral root distribution is used for the GTR model, on top of eliminating those sites for which the dependency pattern contains one or more gaps. The proposed context-dependent model (HG04) clearly outperforms the site-independent GTR model for all the data sets, according to the HME and sHME. Further, for the nuclear SSU rRNA data set and according to the sHME, the difference in model fit increases in favor of the HG04 model with increasing dependencies at the ancestral root.