Fig 1. Protein sequence splitting.
To prepare the training data, each protein sequence is represented as three sequences (1, 2, 3) of 3-grams.
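The splitting can be illustrated with a minimal sketch. It assumes that the three sequences are obtained by reading non-overlapping 3-grams starting at the first, second, and third residue, and that trailing residues too short to form a full 3-gram are dropped. Neither detail is stated in the text above, and the function name and the example sequence are hypothetical.

```python
def split_into_ngram_sequences(sequence, n=3):
    """Return n lists of non-overlapping n-grams, one per starting offset.

    Assumption: list k starts reading at residue k (0-based) and
    leftover residues shorter than n at the end are discarded.
    """
    splits = []
    for offset in range(n):
        # Step through the sequence in strides of n, starting at `offset`.
        grams = [sequence[i:i + n]
                 for i in range(offset, len(sequence) - n + 1, n)]
        splits.append(grams)
    return splits


# Hypothetical 10-residue sequence, used only for illustration.
example = "MAFSAEDVLK"
splits = split_into_ngram_sequences(example)
for number, grams in enumerate(splits, start=1):
    print(number, grams)
# 1 ['MAF', 'SAE', 'DVL']
# 2 ['AFS', 'AED', 'VLK']
# 3 ['FSA', 'EDV']
```

Under these assumptions each protein contributes three "sentences" of 3-gram "words" to the training data, so every 3-gram in the original sequence appears in exactly one of them.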