Table 3. Matrix overview of data quality issues identified per validation task and epidemiological Wikidata property.
Rows are the validation tasks defined in Table 2, and columns are the corresponding epidemiological Wikidata properties. Each cell gives the number of deficient statements that the row's task identified for the column's property as of August 8, 2020.
| Task | c | d | r | t | h | Overall |
|---|---|---|---|---|---|---|
| V1 | 18 | 9 | 10 | 2 | 1 | 40 |
| V2 | 2 | 91 | 6 | 0 | 0 | 99 |
| V3 | 660 | 92 | 6 | 5 | | 763 |
| V4 | 2,081 | 2,247 | 149 | 1 | | 4,478 |
| V5 | 0 | 0 | 0 | 0 | 0 | 0 |
| V6 | 8 | | | | 8 | 8 |
| V7 | 1 | | | 1 | | 1 |
| V8 | 9 | 9 | | | | 9 |
| V9 | 17 | | 17 | | | 17 |
| V10 | 60 | 19 | 1 | 0 | 1 | 81 |
| Overall | 2,856 | 2,467 | 189 | 9 | 10 | 5,496 |
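
The totals in Table 3 can be cross-checked with a short script. The sketch below is illustrative only: the names `ROWS`, `PAIRED`, `column_sums`, and `row_total` are hypothetical, and the column placement of the V3, V4, and V6 to V9 values is an assumption inferred from the "Overall" row rather than something the caption states. It also assumes that V6 to V9 each compare `c` with one other property, so that the same statements are listed under both columns but counted once in the row total.

```python
# Counts of deficient statements per validation task and property (August 8, 2020).
# None marks a cell with no value. Column placement for V3, V4 and V6-V9 is an
# ASSUMPTION inferred from the "Overall" row of the table.
PROPS = ["c", "d", "r", "t", "h"]

ROWS = {
    "V1": ([18, 9, 10, 2, 1], 40),
    "V2": ([2, 91, 6, 0, 0], 99),
    "V3": ([660, 92, 6, 5, None], 763),
    "V4": ([2081, 2247, 149, 1, None], 4478),
    "V5": ([0, 0, 0, 0, 0], 0),
    "V6": ([8, None, None, None, 8], 8),
    "V7": ([1, None, None, 1, None], 1),
    "V8": ([9, 9, None, None, None], 9),
    "V9": ([17, None, 17, None, None], 17),
    "V10": ([60, 19, 1, 0, 1], 81),
}
COLUMN_TOTALS = [2856, 2467, 189, 9, 10]
GRAND_TOTAL = 5496

# ASSUMPTION: these tasks compare two properties, so one finding fills two cells.
PAIRED = {"V6", "V7", "V8", "V9"}


def column_sums(rows):
    """Sum each property column, treating empty cells as zero."""
    sums = [0] * len(PROPS)
    for cells, _ in rows.values():
        for i, value in enumerate(cells):
            sums[i] += value or 0
    return sums


def row_total(task, cells):
    """Row total: paired tasks count their shared findings once."""
    values = [v for v in cells if v is not None]
    return values[0] if task in PAIRED else sum(values)


if __name__ == "__main__":
    print("column sums match:", column_sums(ROWS) == COLUMN_TOTALS)
    print("row totals match:",
          all(row_total(t, c) == o for t, (c, o) in ROWS.items()))
    print("grand total matches:",
          sum(o for _, o in ROWS.values()) == GRAND_TOTAL)
```

Under these assumptions, the five column totals add up to 5,531, which exceeds the overall figure of 5,496 by 35. That difference equals 8 + 1 + 9 + 17, the V6 to V9 counts that appear under two properties each.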