Skip to main content
. 2018 Oct 28;20(11):826. doi: 10.3390/e20110826

Figure 1.

Figure 1

In probability mass diagrams, height represents the probability mass of each joint event from X×Y which must sum to 1. The leftmost of the diagrams depicts the joint distribution P(X,Y), while the central diagrams depict the joint distribution after the occurence of the event yY leads to exclusion of the probability mass associated with the complementary event y¯. By convention, vertical and diagonal hatching represent informative and misinformative exclusions, respectively. The rightmost diagrams represent the conditional distribution after the remaining probability mass has been normalised. Top row: A purely informative probability mass exclusion, p(x¯,y¯)>0 and p(x,y¯)=0, leading to p(x|y)>p(x) and hence i(x;y)>0. Middle row: A purely misinformative probability mass exclusion, p(x¯,y¯)=0 and p(x,y¯)>0, leading to p(x|y)<p(x) and hence i(x;y)<0. Bottom row: The general case p(x¯,y¯>0) and p(x,y¯)>0. Whether p(x|y) turns out to be greater or less than p(x) depends on the size of both the informative and misinformative exclusions.