Table I. Summary of the sequence analysis of the 10q25.3 BAC contig.
Sequence motifs | CENP-A binding region | Non-binding region | p value | Genome average | (>64%) AT-rich average | |
---|---|---|---|---|---|---|
Base composition | AT, % | 65.4 | 58.3 | <0.001 | 58.0a | n/a |
Repeats | tandem repeatsb, % | 5.5 | 2.3 | <0.05 | n/a | n/a |
satellitesc, % | 0 | 0 | 1 | n/a | n/a | |
LINEs, % | 43.7 | 14.7 | <0.05 | 18.5a | 26.2a | |
L1, % | 40.6 | 11.0 | <0.05 | 15.0a | 23.2a | |
SINEs, % | 6.3 | 12.9 | <0.001 | 14.1a | 6.3a | |
Alus, % | 5.1 | 10.0 | <0.001 | 11.8a | 4.8a | |
LTRs, % | 9.8 | 6.1 | >0.05 | 7.8a | 7.2a | |
DNA elements, % | 2.9 | 2.9 | >0.05 | n/a | n/a | |
Protein binding motifs | HMG1d,e | 25.6 | 16.2 | <0.001 | 17.7f | n/a |
topo IIe,g | 9.3 | 6.2 | >0.05 | n/a | n/a | |
CENP-Be,h | 0 | 1 | n/a | n/a | n/a | |
Genes | predicted gene densityg,i | 0.30 | 1.1 | n/a | 2.6j | 1.0j |
CpG island densityg,i | 0 | 0.53 | n/a | 1.4j | 0.37j |
To facilitate statistical analysis of differences between the two regions, the 330 kb CENP-A binding region was split into four equal segments of 82.5 kb, which were analysed separately. Similarly, the non-binding regions were split into 32 segments of 100 kb and one of 70 kb. Results are averages for each region unless otherwise stated and p values from t-tests comparing both regions are also given. Where known, values for average genomic sequences and genomic AT-rich sequences are quoted in the final two columns.
aFrom Smit (1999).
bTandem repeats were classed as being two or more copies of any repeat of unit length ≥2 bp.
cSatellite repeats (with % homology to consensus): α-satellite (80%); β-satellite (100%); γ-satellite (100%); 48 bp repeat (100%); classical satellites Ia and Ib (100%); classical satellite III (100%, two or more copies in tandem).
dPer kb.
e100% homology to consensus.
fFrom Barry et al. (1999).
gPer 100 kb.
hNumber of matches.
iFigures were calculated for whole binding and non-binding regions only.
jPredicted gene density for the whole genome and for AT-rich G bands (Craig, 1995).
n/a, not applicable/attempted.