Skip to main content
. 2015 Nov 14;32(6):932–936. doi: 10.1093/bioinformatics/btv663

Fig. 2.

Fig. 2.

Schematic representation of KMAD’s scoring function; i and j are positions in the profile and in the sequence, respectively; profilei is a vector of length 20 that contains the frequencies of the 20 amino acid types m at position i in the alignment that resulted from the previous iteration (in the first iteration all profile values are 0 or 1, according to the first sequence). D is an IDP-specific 20 × 20 substitution matrix (Midic et al., 2009). The residue score is the convolution of the profile vector for i and the D column for j. ωft are weights for the three feature types ft (ωft weights optionally can be set by the user). Uf,i values for PTMs and SLiMs are determined from the conservation of that feature at i. Uf,i values for domains are the conservation of the domain at position i minus the conservation of all other domains at that position. The terms R relate to the perceived reliability (PR) of the feature; experimentally determined PTMs, for example, weigh three times higher than predicted ones. Rf,i,j is the product of the average PR of the feature f at position i in the profile and the PR of this feature at position j in the sequence. The metascore for j is the weighted sum of scores for all observed features