Table 8. COSMIC variant extraction coverage result.
The table shows the number of variants in the reference set (Total), the number of matched variants by the mutation extraction tool (Matched), the proportion of matched variants (Recall), the number of variants matched when the gene is not considered (M NG) and the proportion of matched variants when the gene is not considered (Rec NG). The data sets considered are MEDLINE abstracts (medline), Open Access PMC articles (pmc.ft), PDF articles when no Open Access PMC articles are available (pdf), PDF representation for all the articles (pdf.all), tables available from the Open Access PMC Articles’ XML (table), supplementary material (sup) and the combination from all the sources (all). The tools are Extractor of Mutations (EMU), OpenMutationMiner (OMM), MutationFinder (MF), tmVar and SNP Extraction Tool for Human Variations (SETH). The row with tool value as All indicates the result when the variants extracted by all the tools are merged.
Data set | Tool | Total | Matched | Recall | M NG | Rec NG |
---|---|---|---|---|---|---|
medline
medline medline medline medline |
EMU
OMM MF SETH tmVar |
33814
33814 33814 33814 33814 |
146
140 126 25 139 |
0.0043
0.0041 0.0037 0.0007 0.0041 |
157
147 137 26 145 |
0.0046
0.0043 0.0041 0.0008 0.0043 |
medline | All | 33814 | 156 | 0.0046 | 169 | 0.0050 |
pmc.ft
pmc.ft pmc.ft pmc.ft pmc.ft |
EMU
OMM MF SETH tmVar |
33814
33814 33814 33814 33814 |
726
697 632 141 655 |
0.0215
0.0206 0.0187 0.0042 0.0194 |
758
726 658 148 682 |
0.0224
0.0215 0.0195 0.0044 0.0202 |
pmc.ft | All | 33814 | 814 | 0.0241 | 853 | 0.0252 |
pdf
|
EMU
OMM MF SETH tmVar |
33814
33814 33814 33814 33814 |
34
1 5 6 4 |
0.0010
0.0000 0.0001 0.0002 0.0001 |
47
1 6 6 5 |
0.0014
0.0000 0.0002 0.0002 0.0001 |
All | 33814 | 34 | 0.0010 | 47 | 0.0014 | |
pdf.all
pdf.all pdf.all pdf.all pdf.all |
EMU
OMM MF SETH tmVar |
33814
33814 33814 33814 33814 |
1094
1132 989 246 1049 |
0.0324
0.0335 0.0292 0.0073 0.0310 |
1114
1137 996 247 1060 |
0.0329
0.0336 0.0295 0.0073 0.0313 |
pdf.all | All | 33814 | 1304 | 0.0386 | 1327 | 0.0392 |
table
table table table table |
EMU
OMM MF SETH tmVar |
33814
33814 33814 33814 33814 |
580
597 462 179 176 |
0.0172
0.0177 0.0137 0.0053 0.0052 |
681
699 564 207 233 |
0.0201
0.0207 0.0167 0.0061 0.0069 |
table | All | 33814 | 694 | 0.0205 | 831 | 0.0246 |
sup
sup sup sup sup |
EMU
OMM MF SETH tmVar |
33814
33814 33814 33814 33814 |
19177
20054 1286 21052 7763 |
0.5671
0.5931 0.0380 0.6226 0.2296 |
19217
20116 1308 21089 7782 |
0.5683
0.5949 0.0387 0.6237 0.2301 |
sup | All | 33814 | 22756 | 0.6730 | 22829 | 0.6751 |
all
all all all all |
EMU
OMM MF SETH tmVar |
33814
33814 33814 33814 33814 |
20203
20960 2087 21335 8724 |
0.5975
0.6199 0.0617 0.6310 0.2580 |
20284
21040 2133 21379 8762 |
0.5999
0.6222 0.0631 0.6323 0.2591 |
all | All | 33814 | 23859 | 0.7056 | 23969 | 0.7088 |