Figure 1. An iterative pairwise sequence similarity training scheme used for constructing a protein's feature vector.
The feature vector for a particular protein X is FX = (fX1, fX2, …, fXn), where n is the total number of allergens in the training data set and fXi is the Smith-Waterman alignment score of sequence X against the ith allergen in the training data set.
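The construction described in the caption can be sketched in a few lines of Python. This is a minimal illustration, not the scoring setup used in the study: the function names `smith_waterman_score` and `feature_vector` are hypothetical, and the match/mismatch values and linear gap penalty are assumptions chosen for brevity. A practical implementation would more likely use an amino acid substitution matrix with affine gap penalties.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Assumed scoring: fixed match/mismatch values and a linear gap penalty.
    Only the score is computed, so two rows of the DP table are enough.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            curr[j] = max(
                0,                  # start a new local alignment
                prev[j - 1] + sub,  # align a[i-1] with b[j-1]
                prev[j] + gap,      # gap in b
                curr[j - 1] + gap,  # gap in a
            )
            best = max(best, curr[j])
        prev = curr
    return best


def feature_vector(query, training_allergens):
    """Score the query against every training allergen, in order.

    Entry i of the result is the alignment score against the ith allergen,
    so the vector length equals the number of training allergens.
    """
    return [smith_waterman_score(query, ref) for ref in training_allergens]


if __name__ == "__main__":
    # Toy sequences for illustration only; these are not real allergens.
    training = ["ACDE", "CD", "WWW"]
    print(feature_vector("ACDE", training))
```

With this layout, every protein is mapped to a vector of the same length regardless of its own sequence length, which is what allows the vectors to be passed to a fixed-input classifier.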
