Table 1.
Parameter | Species | Relationship with expression |
Type of evidence (0) |
Proposed mechanism | Refs (computation) (1) |
Refs (function) |
---|---|---|---|---|---|---|
Codon adaptation | ||||||
CAI or FOP or tAI | All | Complex (see text) | c | Translation elongation rate/accuracy | CAI35, fop28, tAI128 | 2, 37, 55, 81, 94, 96 |
Rare codon stretches | Bacteria | − | c | Translation elongation rate/accuracy | 90 | |
Rare codons between protein domains |
Bacteria | Complex | c | Translation elongation/Protein folding | 91, 132 | |
Frequency of starvation-resistant codons |
Bacteria | + | c | Translation elongation rate/accuracy | 79, 81 | 81 |
Frequency of abundant codons | All | Complex | c | Unclear (2) | 133 | 134 |
mRNA folding | ||||||
Weak mRNA folding at start codon | Bacteria | + | d | Translation initiation rate | mfold135 | 53, 55, 97 |
mRNA stem-loop 15 nt downstream of start codon |
Bacteria, mammals | + | c | Translation initiation rate | 130, 131 | 130, 131 |
mRNA stem-loops further downstream | All | Complex | c | Translation elongation | 135 | 46, 84 |
Regulatory motifs | ||||||
Transcription terminators | Bacteria | − | b | Transcription | RNAMotif136 | |
RNAse E sites | Bacteria | − | b | mRNA stability | 137 | |
miRNA target sites | Eukaryotes | − | b | Translation, mRNA stability | TargetScan138 | |
Various mRNA regulatory elements | All | Complex | b | Translation, mRNA stability | TransTerm139 | |
Nucleotide bias | ||||||
High GC3 content, low A content | Mammals | + | c | Transcription, mRNA processing, mRNA export |
140, 141 | 82, 83, 101 |
High CpG content | Mammals | + | c | Transcription | 102 | |
Others sources of codon bias | ||||||
Codon pair bias | Bacteria, mammals | + | c | Translation elongation rate | 85, 104 | 85, 104 |
Codon ramp | All | Complex (3) | b | Translation initiation rate | 57 | |
Codon correlation | Eukaryotes | + | c | Translation elongation rate | 62 | 62 |
Unknown | All | Complex | c | Protein folding efficiency | 107 | |
Unknown | Eukaryotes | Complex | b | Splicing regulation | 4, 142 | |
Unknown | All | Complex | c | Protein posttranslational modification | 44 | 44 |
Replication strand nucleotide bias | Bacteria, Mitochondria |
unknown | a | Unknown | ||
CTAG avoidance | Bacteria | unknown | a | Restriction avoidance | 143 | 143 |
NOTE: This table lists coding-sequence derived parameters that can be changed by synonymous mutations. Other parameters may be important for expression (eg the identity of the N-terminal amino acid or the length of the sequence) but they require nonsynonymous changes.
Type of evidence supporting the influence of the parameter on expression. a: none; b: theoretical; c: anecdotal; d: systematic.
Generic tools to calculate some of these parameters: codonW140 (http://codonw.sourceforge.net/), INCA141, GeneDesigner144.
The frequency of abundant codons is highly correlated with GC content in mammals.
The codon ramp is predicted to decrease the cost of translation, which may indirectly influence expression levels.