Fig. 2. Overview of C. jejuni attribution study.
(A) Random Forest analyses were performed on training data from chicken, cattle, and wild bird sources to score patterns of unitigs according to their ability to predict isolate source. Dots within each box show how common each pattern is in the two intersecting hosts. Patterns within the small, dotted boxes are common in the host on the horizontal axis, but rare in the host labelled on the vertical axis. (B) Patterns of unitigs with highly discriminatory mutual information (MI) scores were used to select markers for different hosts. (C) Genes from which unitig markers were selected and assigned allele numbers for attribution. (D) Markers were tested on a subset of the data for accuracy (overall accuracy > 90%); and (E) used to predict the source of 8160 C. jejuni infection cases.
