Skip to main content
. 2014 Feb 27;9(2):e89244. doi: 10.1371/journal.pone.0089244

Figure 1. Overview of the inference methodology.

Figure 1

(a) The occurrence and co-occurrence frequencies of the cancer gene mutations Inline graphic and Inline graphic are determined from available samples, where Inline graphic, and Inline graphic is the number of the cancer genes targeted in the study. An occurrence of a gene will be counted if it is mutated in one of the samples, and a co-occurrence of a pair of genes will be counted if both are mutated in one of the samples; therefore, Inline graphic and Inline graphic. (b) Based on the principle of maximum entropy, the initial values of the sequential co-occurrence frequencies are set as Inline graphic. (c) The carcinogenesis information conductivities, Inline graphic, are calculated from the vector of Inline graphic and the matrix of Inline graphic. It should be noted that Inline graphic might not be equal to Inline graphic, implying that the matrix of Inline graphic represents a directed network. (d) For each of the Inline graphic samples in question, the probabilities of every potential order of the mutant genes in sample Inline graphic are computed according to the CICs of each order (Methods). (e) The matrix of Inline graphic is redetermined by the matrix of Inline graphic and the ratio of the probability-weighted number of the orders indicated that i occurs before j to the number of co-occurrence frequency, it is important to note that Inline graphic is not equal to Inline graphic in general. If the matrix of Inline graphic has not reached the criterion of convergence, the inferred orders will not be regarded as stable and a new loop of the calculation of Inline graphic and Inline graphic will be performed. Otherwise (f), the orders with a probability higher than random chance and the corresponding probabilities Inline graphic and Inline graphic are regarded as the referred results. For example, of all 6 potential orders for a sample with three mutant cancer genes a, b and c, orders Inline graphic and Inline graphic are identified as the probable ones due to probabilities of 0.7 and 0.2 (higher than a random chance of 1/6).