Table 1. Illustration of the three different encoding schemes for SNP data.
SNPi | Add count | Rec | Gen | |||
---|---|---|---|---|---|---|
A | B | AA | AB | BB | ||
AA | 0 | 1 | 0 | 1 | 0 | 0 |
AB | 1 | 1 | 1 | 0 | 1 | 0 |
BB | 2 | 0 | 1 | 0 | 0 | 1 |
We investigated three different encoding schemes for SNP data, here with two alleles A (major) and B (minor). The additive encoding (Add) represents each genotype through the minor allele count. The recessive/dominant (Rec) encoding encodes the presence of at least one allele for each of the two. The genotypic (Gen) encoding consists of three features, one for each possible genotype.