Skip to main content
. 2016 Sep 7;45(1):e2. doi: 10.1093/nar/gkw798

Figure 1.

Figure 1.

COME workflow: a coding potential calculator based on multiple features. COME integrates multiple features with a supervised model to classify protein coding transcripts (mRNAs) and non-coding transcripts (lncRNAs). Multiple features (GC content, sequence conservation score, etc.) are processed by a decompose–compose procedure: feature values are initially calculated and indexed at the bin level (B). They are first indexed at the whole genome level, then mapped to each transcript (A). (C) The feature vectors of each transcript are composed at the transcript level by the maximum, mean and variance scores of the overlapping bins. (D) The probability of being mRNA predicted by the supervised model is the coding potential score for a given transcript.