Table 1. List of the ENS-1 like sequences and homology with the 3807 peptide.
Sequence | Position of the gene | Protein Size | AA | Promoter | |||
Name | Chrom. | Start | End | Amino Acids (AA) | kDa (1) | Identity (2) | activity (3) |
Seq.12 | chr1 | 153080201 | 153086086 | 490 | 54 | 16/16 | Y |
Seq.7 | chr1 | 105099609 | 105104886 | 490 | 54 | 16/16 | Y |
Seq.32 | chr2 | 95636316 | 95642444 | 488 | 54 | 16/16 | Y |
Seq.62 | chrUn_ran. | 23794802 | 23799438 | 487 | 54 | 16/16 | Y |
Seq.10 | chr1 | 146896807 | 146900813 | 490 | 54 | 16/16 | No |
Seq.31 | chr2 | 95607871 | 95610941 | 427 | 47 | 16/16 | Y |
Seq.38 | chr4 | 29062470 | 29069620 | 392 | 43 | 16/16 | Y |
Seq.41 | chr5 | 56151532 | 56156408 | 475 | 52 | 16/16 | Y |
Seq.46 | chr9 | 3145510 | 3148390 | 627 | 69 | 16/16 | Y |
Seq.52 | chrUn_ran. | 14717478 | 14720393 | 307 | 34 | 16/16 | Y |
Seq.30 (ENS-3) | chr2 | 72991670 | 73001955 | 698 | 77 | 16/16 | Y |
Seq.18 | chr1 | 166123942 | 166128607 | 490 | 54 | 15/16 | Y |
Seq.2 | chr1 | 49392030 | 49395290 | 490 | 54 | 15/16 | No |
Seq.69 | chrUn_ran. | 48116551 | 48120222 | 265 | 29 | 15/16 | Y |
Seq.54 | chrUn_ran. | 18990010 | 18992568 | 160 | 18 | 15/16 | Y |
Seq.15 | chr1 | 164856462 | 164859682 | 213 | 23 | N/A | Y |
Seq.34 | chr2 | 138621760 | 138623256 | 210 | 23 | N/A | Y |
Seq.26 | chr2 | 36219087 | 36223257 | 378 | 42 | N/A | Y |
Seq.24 | chr2 | 11703874 | 11708540 | 293 | 32 | N/A | Y |
Seq.40 | chr5 | 3991240 | 3997526 | 182 | 20 | N/A | Y |
Seq.8 | chr1 | 134745214 | 134749487 | 182 | 20 | N/A | Y |
Seq.22 | chr12 | 18117921 | 18121939 | 182 | 20 | N/A | Y |
Seq.35 | chr3 | 92605110 | 92611133 | 182 | 20 | N/A | Y |
Seq.20 | chr1 | 167194063 | 167197535 | 182 | 20 | N/A | No |
Seq.3 | chr1 | 49395084 | 49396779 | <10 | N/A | N/A | Y |
Among 78 copies detected in the chicken genome, 25 were very conserved compared to the reference sequence and are listed below. ORF Finder was used to determine the presence of ORF and to retrieve the subsequent protein sequence. Identity with the 3807 peptide (16 amino acids) defined three categories: total identity (16/16 AA), partial identity (15/16 AA) or no identity (N/A).
(1) The molecular weight of the protein was calculated with the formula: AA number X 110/1000.
In bold are indicated the protein sizes detected in CES cell lysates with the 3807 antibody in western-blotting.
(2) Identity between the ENS-1 like protein sequences and the 3807 peptide (DRIRVLQNEARTRAGK)
(3) Potential activity based on the presence (Y) or the absence (No) of the Nanog, Gata and Ets transcription factors binding sites controling promoter activity.