Skip to main content
letter
. 2004 Dec;14(12):2412–2423. doi: 10.1101/gr.2800104

Table 2.

Comparison of modern sequences to predicted ancestor

Deletions
Insertions
Substitutions
Species Size of region (kb) (a) Nonrepetitive %GC-content (b) % of ancestor lost (c) % of extant species' bases acquired (nonrepetitive only) (d) % of extant species' bases changed (expected # substitutions per site) (e)
Reconstructed Boreoeutherian ancestor 1124 37.0 N/A N/A N/A
Human 1274 37.1 11.3 21.7 (2.0) 8.6 (11.1)
Chimpanzee 1278 37.1 11.5 21.8 (1.8) 8.7 (11.1)
Gorilla 1247 37.1 12.9 21.6 (1.9) 8.7 (11.1)
Baboon 1260 37.3 12.6 21.2 (2.1) 9.1 (10.7)
Orangutan 1268 37.1 11.7 21.2 (1.8) 8.6 (11.2)
Vervet 1229 37.2 13.5 20.7 (2.0) 9.1 (11.8)
Macaque 1255 36.4 12.2 21.0 (2.0) 9.1 (11.7)
Lemur 1071 37.7 19.1 11.6 (2.8) 9.0 (10.9)
Mouse-lemur 1085 37.5 18.0 14.5 (3.8) 9.3 (11.6)
Mouse 1110 39.2 39.1 38.3 (12.0) 17.5 (34.3)
Rat 1239 39.5 38.8 44.4 (10.1) 15.9 (35.1)
Rabbit 1348 42.7 29.4 37.9 (28.9) 10.5 (21.3)
Cat 1206 37.2 24.5 29.6 (6.9) 11.3 (16.5)
Dog 1122 39.4 26.4 22.5 (6.4) 13.5 (19.2)
Cow 1324 37.1 30.9 41.5 (7.7) 11.1 (20.9)
Pig 1158 36.8 33.7 29.6 (7.5) 10.9 (19.7)
Horse 1102 38.5 20.2 17.5 (8.0) 12.1 (13.3)
Hedgehog 1379 39.7 50.0 48.9 (38.6) 8.9 (28.5)
Armadillo 1339 39.4 28.9 34.2 (18.1) 9.9 (20.2)

Listed are some properties of sequences of the extant species in the greater-CFTR locus and the predicted changes they incurred during evolution from the Boreoeutherian ancestral sequence. (a) Length of sequence. (b) Fraction of nonrepetitive bases that are G or C. (c) Deletions: percentage of the ancestral sequence lost in each species. (d) Insertions: percentage of extant species' sequence that was inserted since the reconstructed ancestor (in parentheses, percentage of extant species' sequence that resulted from insertions of nonrepetitive sequences, using RepeatMasker to identify repetitive sequences.) The high fraction of nonrepetitive inserted bases in rabbit and hedgehog is most likely due to lack of complete RepeatMasker libraries for the transposons specific to these species. (e) Substitutions: percentage of extant species' bases that were derived from an ancestral base but differ from that base (this is different from the standard percentage identity measure, where only aligned bases are considered). In parentheses, the expected number of substitutions per site under a Kimura 2-parameter model (Kimura 1980) is given, here using only the aligned bases.