Skip to main content
. 2002 Feb;12(2):349–354. doi: 10.1101/gr.206602

Table 1.

Evaluation of SMARTest Accuracy

Sequence Experimentally defined S/MARs Position (kb) SMARTest predictions Position (kb) MAR-Finder predictions Position (kb)




Description Length (kb)


Oryza sativa putative ADP-glucose pyrophosphorylase 30.034 0.0–1.2
 subunit SH2 and putative NADPH-dependent 5.4–7.4 6.5–7.0
 reductase A1 genes (U70541; Avramova et al. 15.2–15.7 15.7–15.9
 1998) 16.2–16.6
17.3–18.5 17.6–18.3 17.5–18.4
20.0–23.1 19.6–20.1 19.8–20.4
20.7–21.3 21.3–21.5
23.6–23.9 23.9–24.2
25.0–25.4 24.7–25.1
27.5–27.9
Sorghum bicolor ADP-glucose pyrophosphorylase 42.446 0.0–1.5
 subunit SH2, NADPH-dependent reductase A1-a 7.1–9.7
 and NADPH-dependent reductase A1-b genes. 21.3–21.9
 (AF010283; Avramova et al. 1998) 22.4–24.7 22.9–24.0 23.2–24.2
27.3–27.6 26.9–27.5
32.5–33.7
41.6–42.3
Sorghum bicolor BAC clone 110K5 (AF124045; 78.195 ∼0.9
Tikhonov et al. 2000) ∼5.8
∼6.3
∼9.3
∼15.0 15.1–15.8
∼18.5
∼21.9 21.7–22.0
∼23.3
∼25.6
∼29.1
∼34.6
∼44.1 44.1–44.5
∼48.5 47.9–49.5 47.9–49.4
∼57.9
∼62.9 63.1–63.7
∼67.1
∼69.3
∼73.7 74.3–74.7
Human alpha-1-antitrypsin and corticosteroid 30.461 2.6–6.3 5.5–6.0 3.0–3.2
 binding globulin intergenic region (AF156545; 5.1–6.0
Rollini et al. 1999) 22.0–30.4 25.7–26.2 24.9–25.3
27.5–27.8 25.5–25.8
26.2–26.4
27.5–28.2
Human protamine locus (U15422 & AC00247; 53.060 8.8–9.7 8.0–8.9*
Kramer et al. 1998) 32.6–33.6 33.9–34.8*
37.2–39.4 37.7–38.6*
51.8–53.0 *
Human beta-globin locus (22754 & U01317; Jarman 75.955 1.5–3.0 — 
 and Higgs 1988) 15.6–19.0 18.0–18.4 15.5–16.0
18.0–18.4
34.4–34.9
44.7–52.7 50.6–50.8
56.6–57.1 56.5–57.2
60.0–70.0 59.8–60.3 58.1–58.5
65.6–66.0 63.0–63.6
67.6–67.9 68.7–69.3
68.8–69.1
Sum (kb) 310.151 >56.1 14.5 13.8





Total numbers: 37 28 25  
True positives [number of experimentally defined S/MARs found] 19 [14] 20 [12]
False positives  9  5  
False negatives 23 25  
Specificity 19/28 = >68% 20/25 = >80%
Sensitivity 14/37 = >38% 12/37 = >32%

Six different genomic sequences, three plant and three human sequences, for which experimentally defined S/MARs are known were analyzed with SMARTest to evaluate the accuracy of in silico predictions. For comparison, the same sequences were analyzed with the MAR-Finder program. True positive matches are printed in bold, minus (−) indicates false negative matches. Some of the longer (up to 10 kb) experimentally defined S/MARs contained more than one in silico prediction, each of them was counted as true positive match. Therefore, the number of true in silico predictions is higher than the number of experimentally defined S/MARs found. Specificity is defined as the ratio of true positive predictions, whereas sensitivity is defined as the ratio of experimentally defined S/MARs found. 

*

A T-rich rule excluded using MAR-Finder