2018 Nov 10;35(12):2009–2016. doi: 10.1093/bioinformatics/bty937

Fig. 3.

We represented each protein sequence by the counts of the overlapping trigrams (three-residue subsequences) present in that sequence. With 20 standard amino acids there are 20^3 = 8000 possible trigrams, so this yields a sparse vector of size 8000. The vector was reduced to size 200 using Singular Value Decomposition. We used the size-200 vector as the baseline representation.
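
The baseline described in this caption can be sketched roughly as follows. This is a minimal illustration, not the authors' code: it assumes the alphabet is the 20 standard amino acids (which gives 20^3 = 8000 trigrams, matching the stated vector size), it skips trigrams containing any other character, and it uses a dense NumPy SVD where a large dataset would more plausibly use a sparse truncated SVD. The names `trigram_counts`, `svd_reduce`, and the toy sequences are hypothetical.

```python
from itertools import product

import numpy as np

# Assumption: the 20 standard amino acids; the source does not list its alphabet.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# Map each of the 20**3 = 8000 possible trigrams to a column index.
TRIGRAM_INDEX = {"".join(t): i for i, t in enumerate(product(AMINO_ACIDS, repeat=3))}


def trigram_counts(sequence):
    """Return the length-8000 vector of overlapping trigram counts."""
    vec = np.zeros(len(TRIGRAM_INDEX))
    for i in range(len(sequence) - 2):
        idx = TRIGRAM_INDEX.get(sequence[i:i + 3])
        if idx is not None:  # skip trigrams with non-standard residues
            vec[idx] += 1
    return vec


def svd_reduce(count_matrix, n_components=200):
    """Project each row onto the top right-singular vectors of the matrix."""
    # Thin SVD: count_matrix = U @ diag(S) @ Vt, singular values in descending order.
    _, _, vt = np.linalg.svd(count_matrix, full_matrices=False)
    # The number of components cannot exceed the number of singular vectors.
    k = min(n_components, vt.shape[0])
    return count_matrix @ vt[:k].T


# Hypothetical toy sequences, for illustration only.
sequences = ["MKTAYIAKQR", "MKTAYLLKQR", "GAVLIGAVLI"]
counts = np.vstack([trigram_counts(s) for s in sequences])
reduced = svd_reduce(counts, n_components=2)
```

With only three toy sequences the count matrix has rank at most 3, so the example reduces to 2 components; on a real dataset with many sequences, `n_components=200` would give the size-200 representation the caption describes.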