TABLE 3.
The distribution of poly(A) sites on gene transcripts
| Category | Subcategory | No. of transcripts | % |
|---|---|---|---|
| Total poly(A) sites | — | 16,952 | 100 |
| Located in the transcript (full-length cDNA) | In CDS | 45 (12)ᶜ | 0.3 |
| | In introns | 588 (39)ᶜ | 3.5 |
| | In 5′-UTR | 86 (12)ᶜ | 0.5 |
| | In 3′-UTRᵇ | 11,011 | 65.0 |
| | Subtotal | 11,730 | 69.2 |
| Located in the intergenic regionᵃ | — | 5,222 | 30.8 |
ᵃ The intergenic regions are the areas 500 or 1000 nt downstream of the 3′-UTR (as defined in footnote b).
ᵇ To avoid genome annotation errors, the 3′-UTR defined here has been extended by 500 nt past the poly(A) site. For genes that do not have an annotated 3′-UTR in the current version of the genome, the range was extended to 1000 nt after the annotated stop codon.
ᶜ The numbers in parentheses are the high-confidence cases obtained using more stringent conditions and manual confirmation, as described in the main text.
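
The 3′-UTR extension rule in the footnotes can be illustrated with a minimal sketch. This is not the authors' pipeline. The function names, the plus-strand 1-based coordinates, and the inclusive boundary are all assumptions made for illustration. Only the 500 nt and 1000 nt window sizes come from the footnotes.

```python
from typing import Optional

# Window sizes taken from the table footnotes.
UTR_EXTENSION_NT = 500      # added past the annotated 3'-UTR end (poly(A) site)
NO_UTR_EXTENSION_NT = 1000  # added past the stop codon when no 3'-UTR is annotated


def extended_utr_end(stop_codon_end: int, annotated_utr_end: Optional[int]) -> int:
    """Return the last position counted as 3'-UTR (hypothetical helper).

    Assumes plus-strand coordinates, so "downstream" means a larger position.
    """
    if annotated_utr_end is not None:
        return annotated_utr_end + UTR_EXTENSION_NT
    return stop_codon_end + NO_UTR_EXTENSION_NT


def classify_downstream_site(
    site: int, stop_codon_end: int, annotated_utr_end: Optional[int]
) -> str:
    """Classify a poly(A) site located after the stop codon (hypothetical helper).

    The inclusive boundary (<=) is an assumption; the source does not state it.
    """
    if site <= stop_codon_end:
        raise ValueError("site is not downstream of the stop codon")
    if site <= extended_utr_end(stop_codon_end, annotated_utr_end):
        return "3'-UTR"
    return "intergenic"
```

For example, with a stop codon ending at position 1000 and an annotated 3′-UTR ending at 1200, a site at 1650 would count as 3′-UTR under these assumptions, while a site at 1701 would count as intergenic.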