Skip to main content
. 2014 May 15;10(5):e1004342. doi: 10.1371/journal.pgen.1004342

Figure 3. Graphical models for Discretized Sequentially Markov Coalescent (DSMC) models.

Figure 3

(A) Full DSMC model for Inline graphic samples with local trees, Inline graphic, recombinations, Inline graphic, and alignment columns, Inline graphic. Together, Inline graphic and Inline graphic define an ancestral recombination graph, Inline graphic. Solid circles indicate observed variables and empty circles indicate latent variables. Arrows indicate direct dependencies between variables and correspond to conditional probability distributions described in the text. Notice that the Inline graphic variables can be integrated out of this model, leading to the conventional graph topology for a hidden Markov model. (B) The same model as in (A), but now partitioning the latent variables into components that describe the history of the first Inline graphic sequences (Inline graphic and Inline graphic) and components specific to the Inline graphicth sequence (Inline graphic and Inline graphic). The Inline graphic and Inline graphic variables are represented by solid circles because they are now “clamped” at specific values. A sample of Inline graphic represents a threading of the Inline graphicth sequence through the ARG. (C) Reduced model after elimination of Inline graphic by integration, enabling efficient sampling of coalescent threadings Inline graphic. This is the model used by the first step in our two-step sampling approach. In the second step, the Inline graphic variables are sampled conditional on Inline graphic, separately for each Inline graphic. In this model, the grouped nodes have complex joint dependencies, leading to a heterogeneous state space and normalization structure, but the linear conditional independence structure of an HMM is retained.