Fig 1.
An example of how feature vectors are generated: if we believe that the mutation rate at a position depends on the 4-mer (i.e. length 4 motif) starting one position to its left, then the feature vector for position j is a one-hot encoding of the sequence that appears in position j − 1 through j + 2. More formally, each element in the feature vector at position j indicates whether or not a motif m appears from start position j − j′ +1 through end position j − j′ + len(m) (here m = 4 and j′ = 2). The start and end positions are derived by aligning position j of the sequence with position j′ of the motif.