Table 1.
Exon no. | cDNA position<sup>a</sup> | Exon first base<sup>b</sup> | Intron–exon boundary | Exon size | Exon–intron boundary | Intron size<sup>c</sup> |
---|---|---|---|---|---|---|
1 | 1 | 4708 | GAGGCG | 30 | CGTCAGgtcctggcct | 610 |
2 | 31 | 5348 | ttacttgcagGCCTGA | 103 | ATACAGgtggggtttg | 21847<sup>d</sup> |
3 | 134 | 27298 | tcattgccagCTTGCA | 115 | GAAAAGgtaagggcct | 6377<sup>e</sup> |
4 | 249 | 33790 | aattctgcagATGATA | 135 | AACCAGgtaatcttgt | 12596 |
5 | 384 | 46521 | tattttctagATGTGA | 220 | AAAAAGgtaacaataa | 5062<sup>f</sup> |
6 | 604 | 51803 | tcgtctgcagTTCCCG | 101 | GATCAGgtacggtccc | 457 |
7 | 705 | 52361 | gtgcgcccagGCGAGG | 157 | GACGGGgtgagttctt | 1286 |
8 | 862 | 53804 | ttcatttcagGGATGT | 111 | GCTGAGgtgagggctg | 507 |
9 | 973 | 54422 | tctcctttagCCAAAT | 172 | ATGCACgtgagtgtca | 1346 |
10 | 1145 | 55940 | tcctttctagCTTTTG | 174 | CATAAGgtgtgtgtgc | 1258 |
11 | 1319 | 57372 | tatcttacagGGATCA | 189 | ACGCTGgtgagtgttc | 631 |
12 | 1508 | 58192 | ccttctttagGCCCCA | 152 | CACTGTgtatgtatcg | 2490 |
13 | 1660 | 60834 | tctccccaagGCCTTT | 158 | GCCATGgtacgtctgc | 85 |
14 | 1818 | 61077 | ctgtttgaagGCTCCA | 114 | AGAACGgtacgtagag | 2448 |
15 | 1932 | 63639 | tattttttagGGCAAG | 252 | TGCAAGgtgagtgcaa | 1947 |
16 | 2184 | 65838 | tcccttgtagGGAAGA | 194 | GCCCAGgtactgaata | 3515 |
17 | 2378 | 69547 | ttcctcttagAGCTTT | 201 | CTTCAGgtattcatga | 743 |
18 | 2579 | 70491 | ctcctttcagTTGCAT | 229 | GCGCAGgtgggcctgg | 92 |
19 | 2808 | 70812 | tctatttcagTTTCAG | 125 | ATCCAGgtatggcttt | 1353 |
20 | 2933 | 72290 | tgaattacagGATATT | 179 | TCTTAGgtaaatcgta | 5603 |
21 | 3112 | 78072 | tcctgtatagAAACAT | 185 | TTTCTAgtaagttgct | 1653 |
22 | 3297 | 79910 | cttcttcaagGTCCAG | 156 | TTACTGgtaccttttg | 675 |
23 | 3453 | 80741 | ttaaatgtagGTGTTC | 186 | TAATGGgtacggcgtc | 7108<sup>g</sup> |
24 | 3639 | 88035 | gtatatacagAGTCAT | 171 | TTCTTGgtaagattac | 384 |
25 | 3810 | 88590 | tttgtttcagCTCAGT | 104 | TTGGAGgtgaggctgt | 1000 |
26 | 3914 | 89694 | atttttccagCCTGAC | 151 | GTGCCAgtaagaaaat | 2676 |
27 | 4065 | 92521 | cattttccagAATGGC | 215 | GTGAAGgtgagctagg | 273 |
28 | 4280 | 93009 | tctgttgcagGACTTT | 133 | ATTTAGgtaaggagct | 102 |
29 | 4413 | 93244 | ctttttacagGTCATG | 128 | ATTAAGgtgatagatt | 92 |
30 | 4541 | 93464 | aactttgcagACTCAT | 196 | AGAGAGgtaagaatgt | 2645 |
31 | 4737 | 96305 | taaaacacagTTCCTA | 134 | CCCAAGgtgcagtatt | 519 |
32 | 4871 | 96958 | cctttgttagGACAAA | 174 | AAACAGgtaacatttg | 77 |
33 | 5045 | 97209 | tattttgcagTTGGAG | 137 | TATAGGgtaaaacgtt | 113 |
34 | 5182 | 97459 | tatttgaaagGGAACC | 152 | AGCTTGgtgagtcaat | 785 |
35 | 5334 | 98396 | tgctcattagGTATCC | 192 | TGATTGgtaggtctgc | 6010<sup>h,i</sup> |
36 | 5526 | 104598 | gctccatcagGCCCCA | 188 | GATCAGgtactcagag | 1383 |
37 | 5714 | 106169 | gctttctcagGATGGG | 193 | ACTCTGgtgggtgact | 1780 |
38 | 5907 | 108142 | ttttttttagAAGCCG | 183 | AGACATgtatggaatg | 2686 |
39 | 6090 | 111011 | cctcctgcagCTTCTC | 182 | AGGCAGgtaatgtgct | 818 |
40 | 6272 | 112011 | cttttatcagATCTTA | 147 | TCAGAGgtgggtggcc | 383 |
41 | 6420 | 112541 | tcctcctcagAGTCCA | 197 | GAAGGGgtgggtttgt | 103 |
42 | 6617 | 112841 | tctgcactagGCCCAG | 231 | AAACCAgtaggtgaac | 1158 |
43 | 6848 | 114230 | tcctctccagCTCCCT | 139 | TTGCAGgtcagtacat | 1299 |
44 | 6987 | 115668 | ccttccgcagGACAAG | 144 | ACACAGgtgtcttttt | 4619 |
45 | 7131 | 120431 | ttaaattcagATGATG | 143 | CTTGAGgtacagccat | 3652 |
46 | 7274 | 124226 | gtctgcccagGCTGCT | 268 | TGCCTGgtacttcgtt | 97 |
47 | 7542 | 124591 | cctctcttagGTGTGG | 137 | TCCATGgtcagtgcct | 558 |
48 | 7679 | 125286 | tttcttgcagTCTACT | 99 | ATTCAGgtgagtaatt | 2686 |
49 | 7778 | 128071 | ttaattctagGTGGGA | 169 | TTATAGgtgagcacat | 97 |
50 | 7947 | 128337 | acatctgtagGCTATC | 126 | TGAAAGgtaatattat | 1808 |
51 | 8073 | 130271 | tctgtcctagCTTTCA | 109 | GGTTACgtgagttatt | 106 |
52 | 8182 | 130486 | ttacaaatagGTGTGA | 140 | AACCAGgtatggcaga | >2015 |
53 | 8322 | 348 | cacgaaatagGTCAGT | 191 | GGAAAGgtagcatcta | >800 |
54 | 8513 | | ttttctaaagCACTGG | 106 | TGTCAGgtaattcagt | 82 |
55 | 8619 | | tttttcctagGTGGAA | 92 | ACAGAGgtaagtagct | >1200 |
56 | 8711 | | ttgtgtctagTATCAC | 176 | CGGAAGgtgagaacca | >1200 |
57 | 8887 | | gtttggttagCCTCAT | 112 | TCCAAGgtacggcctg | >1150 |
58 | 8999 | | tctgttgtagATAAAG | 82 | TTGCAGgtatgattat | 110 |
59 | 9081 | | ttccttgcagTGACTG | 144 | ACTCAGgtacaagcca | >1100 |
60 | 9225 | | aatgttgcagGTGGCC | 91 | CAGAATgtaagggtac | 291 |
61 | 9316 | | cttgttgtagGAACTG | 178 | AAAATGgtgattatac | 181 |
62 | 9494 | | aattgagtagGTGAAA | 82 | ATGAAGgtgagtggct | 86 |
63 | 9576 | | ttaaacctagGTTTGG | 172 | GACATGgtacgtaaac | >1000 |
64 | 9748 | | cttctctcagGGGAAA | 145 | GGGCAGgtaaggctgc | 891 |
65 | 9893 | | tcgtaaacagGTGTAT | 226 | ACTTAGgtaacacaga | >1280 |
66 | 10119 | | cctcttatagGCGTGC | 172 | TGCCAGgtaggcttct | 893 |
67 | 10291 | | gtcctgacagAGATGC | 184 | AGAGAGgtaaaagcga | 578 |
68 | 10475 | | cttggactagATTGTT | 141 | AAAGAGgtgaagaggc | >1260 |
69 | 10616 | | tttgacttagGATGTT | 192 | CCACAGgtgagtctca | >1070 |
70 | 10808 | | tcctgaccagGTGGCA | 154 | TACCAGgtacaggggc | >6385 |
71 | 10962 | | tgtccttcagGTGCAG | 108 | GGTCAGgtaaggacag | 1433 |
72 | 11070 | | cctcccgcagGCCGAG | 132 | CTGCTGgtaaggaagg | 436 |
73 | 11202 | | tcattcacagGCCCTA | 159 | CCCTAGgtaaatgcca | 85 |
74 | 11361 | | ctcatttcagCTGCCA | 119 | GCTTTGgtatggagca | 904 |
75 | 11480 | | gtctccttagAGTTCT | 126 | TTTAAGgtaatgtttc | 434 |
76 | 11606 | | gtgcctgcagGTACTG | 156 | GATGAGgtattgcatg | 392 |
77 | 11762 | | ttttgtttagGTGGCT | 116 | GAACAGgttattatat | 101 |
78 | 11878 | | tgtattgtagGCGACC | 199 | GGGAAGgtaagagctc | >12000 |
79 | 12077 | | cttttcttagCTGTAT | 215 | CAGAAGgtatgatgtg | >4000 |
80 | 12292 | | ttttttgcagTCCGTG | 178 | AAGCTGgtgaggaggc | 391 |
81 | 12470 | | ttctctgaagGTGGAG | 162 | ATGAAGgtatttccca | 1507 |
82 | 12632 | | ccccctctagATTGAT | 92 | TACCTGgtacgcaaga | 200 |
83 | 12724 | | ttgtgtgcagGGGCAA | 140 | AGGATGgtaggtaggg | 4971 |
84 | 12864 | | cttcattcagGTGAGG | 188 | GCACAGgtgagtccgg | 770 |
85 | 13052 | | tgtgttctagGTCCCC | 198 | GGAAAGgtattcaagt | 2606 |
86 | 13250 | | tgtgttttagGAGGCG | 84 | ATCCAGgtagcacatg | >5000 |
87 | 13334 | | tcctctccagGTCAAA | 142 | TTGTGGgtgagaactt | >200 |
88 | 13476 | | tcccctgcagGTGAAT | 195 | TCCTGGgtgagctact | >2700 |
89 | 13671 | | gaatgatcagGTGTGT | 113 | AGTGAGgtaactccct | 627 |
90 | 13784 | | tccttaaaagGTTGAT | 191 | CTATAGgttggtggtc | 935 |
91 | 13975 | | tttcttccagACTCCA | 106 | ACGATGgtatgccgac | 290 |
92 | 14081 | | cctcccgcagGTGTGT | 213 | ATCCAGgtaggctcct | 1035 |
93 | 14294 | | tgctcaacagGTGTTG | 997 | TGACAT<u>aaaa</u>gtgtag<sup>j</sup> | |
<sup>a</sup> GenBank accession no. AF071172 (Ji et al. 1999).
<sup>b</sup> Exons 1–52, GenBank accession no. AC004583 (PAC 778A2); exon 53, GenBank accession no. AQ388928 (BAC R-142A11 end).
<sup>c</sup> (>) Minimum intron size based on contiguous flanking sequence.
<sup>d</sup> (TA)<sub>22</sub> begins at position 25031<sup>b</sup>.
<sup>e</sup> (CA)<sub>15</sub> begins at position 28775<sup>b</sup>.
<sup>f</sup> (TG)<sub>18</sub> begins at position 47153<sup>b</sup>.
<sup>g</sup> (PuT)<sub>66</sub> begins at position 81120<sup>b</sup>. Pu = purine, with (GT)<sub>20</sub> the longest pure repeat.
<sup>h</sup> (TCTG)<sub>5</sub> begins at position 102604<sup>b</sup>.
<sup>i</sup> (TG)<sub>20</sub> begins at position 102638<sup>b</sup>.
<sup>j</sup> Polyadenylation takes place within or just 5′ of the four underlined adenine residues.
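
The columns of Table 1 are tied together by simple arithmetic. Where the intron size is exact (no ">" prefix), each exon's cDNA position should equal the previous exon's cDNA position plus its size, and each exon's first base should equal the previous exon's first base plus its exon size and intron size. The Python sketch below illustrates this check on exons 1–4 copied from the table. It also extracts the intronic dinucleotides flanking each exon, assuming that lowercase letters in the boundary columns denote intron sequence and uppercase letters denote exon sequence. The function and variable names are hypothetical and are not taken from the original work.

```python
# Rows copied from Table 1 (exons 1-4), footnote markers removed:
# (exon no., cDNA position, exon first base,
#  intron-exon boundary, exon size, exon-intron boundary, intron size)
ROWS = [
    (1, 1, 4708, "GAGGCG", 30, "CGTCAGgtcctggcct", 610),
    (2, 31, 5348, "ttacttgcagGCCTGA", 103, "ATACAGgtggggtttg", 21847),
    (3, 134, 27298, "tcattgccagCTTGCA", 115, "GAAAAGgtaagggcct", 6377),
    (4, 249, 33790, "aattctgcagATGATA", 135, "AACCAGgtaatcttgt", 12596),
]


def coordinates_consistent(rows):
    """Return True if every exon starts where the previous exon (cDNA)
    or the previous exon plus intron (genomic) ends.

    Assumes all intron sizes in `rows` are exact, not ">" minimum values.
    """
    for prev, cur in zip(rows, rows[1:]):
        _, cdna_pos, first_base, _, exon_size, _, intron_size = prev
        if cur[1] != cdna_pos + exon_size:
            return False
        if cur[2] != first_base + exon_size + intron_size:
            return False
    return True


def intron_part(boundary):
    """Return only the lowercase (assumed intronic) bases of a boundary string."""
    return "".join(base for base in boundary if base.islower())


def splice_dinucleotides(row):
    """Return (acceptor, donor): the last two intronic bases before the exon
    and the first two intronic bases after it. The acceptor is an empty
    string when no upstream intron sequence is listed (exon 1)."""
    acceptor = intron_part(row[3])[-2:]
    donor = intron_part(row[5])[:2]
    return acceptor, donor


if __name__ == "__main__":
    print(coordinates_consistent(ROWS))
    for row in ROWS:
        print(row[0], splice_dinucleotides(row))
```

Rows whose intron size is given only as a minimum (">") cannot be checked this way and would need to be skipped or handled separately.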