Table 3. The impact of automated filtering on occurrence records for 18 Neotropical taxa downloaded from http://www.gbif.org.
From column six onwards, the numbers show the percentage of records flagged by the respective test. Only tests that flagged at least 0.1% of the records in any group are shown. Because individual records can be flagged by multiple tests, the sum of percentages from all tests can exceed the total percentage flagged.
Summary | | | | | Errors | | | Unfit | | | | | | | | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Taxon | Total records | Fraction flagged [%] | Fraction error [%] | Fraction unfit [%] | Biodiversity Institutions [%] | Sea/land area [%] | Zeros [%] | Capitals [%] | Duplicates [%] | Political centroids [%] | Urban areas [%] | Basis of record [%] | Collection year [%] | Coordinate precision [%] | Id-level [%] | Individual count [%] |
Diogenidae | 13,840 | 68.7 | 44.3 | 38.2 | 0.0 | 44.3 | 0.0 | 0.7 | 33.8 | 0.2 | 1.3 | 1.7 | 2.5 | 0.0 | 0.0 | 0.0 |
Entomobryidae | 2,767 | 90.3 | 0.1 | 90.3 | 0.1 | 0.0 | 0.0 | 0.1 | 85.5 | 0.0 | 70.1 | 72.9 | 2.0 | 0.0 | 72.1 | 0.0 |
Neanuridae | 689 | 66.9 | 0.0 | 66.9 | 0.0 | 0.0 | 0.0 | 0.0 | 62.4 | 0.0 | 2.0 | 2.9 | 1.3 | 0.0 | 0.0 | 0.0 |
Tityus | 1,018 | 55.2 | 0.5 | 54.9 | 0.5 | 0.0 | 0.0 | 1.2 | 43.5 | 0.1 | 6.9 | 7.0 | 0.4 | 1.8 | 1.6 | 0.0 |
Arhynchobatidae | 14,633 | 38.5 | 3.8 | 37.4 | 0.0 | 3.8 | 0.0 | 0.0 | 35.4 | 0.0 | 1.9 | 1.7 | 1.3 | 0.0 | 0.9 | 0.0 |
Dipsadidae | 64,249 | 57.7 | 0.3 | 57.6 | 0.3 | 0.0 | 0.0 | 1.8 | 46.3 | 0.4 | 8.5 | 5.6 | 11.3 | 0.8 | 0.0 | 0.1 |
Harengula | 36,697 | 31.0 | 5.5 | 27.8 | 0.0 | 5.5 | 0.0 | 0.2 | 27.0 | 0.1 | 0.2 | 1.0 | 0.4 | 0.0 | 0.3 | 0.0 |
Thozetella | 51 | 35.3 | 23.5 | 29.4 | 0.0 | 23.5 | 0.0 | 0.0 | 27.5 | 0.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
Conchocarpus | 1,551 | 43.2 | 0.5 | 42.9 | 0.1 | 0.4 | 0.0 | 0.0 | 39.6 | 0.9 | 2.3 | 0.5 | 1.9 | 0.1 | 0.0 | 0.0 |
Gaylussacia | 3,998 | 47.2 | 0.1 | 47.1 | 0.1 | 0.1 | 0.0 | 3.2 | 41.8 | 1.1 | 5.2 | 0.7 | 4.4 | 0.6 | 0.0 | 0.0 |
Harpalyce | 870 | 33.1 | 0.0 | 33.1 | 0.0 | 0.0 | 0.0 | 1.0 | 26.0 | 1.3 | 3.8 | 0.5 | 5.5 | 0.7 | 0.0 | 0.9 |
Iridaceae | 23,127 | 33.6 | 0.5 | 33.5 | 0.4 | 0.1 | 0.0 | 1.0 | 17.1 | 0.4 | 12.3 | 0.9 | 4.7 | 0.1 | 0.0 | 1.3 |
Lepismium | 825 | 29.7 | 0.0 | 29.7 | 0.0 | 0.0 | 0.0 | 0.1 | 21.9 | 0.1 | 7.8 | 0.0 | 2.1 | 0.0 | 0.0 | 0.0 |
Oocephalus | 883 | 49.3 | 0.0 | 49.3 | 0.0 | 0.0 | 0.0 | 6.1 | 41.9 | 0.8 | 13.3 | 0.0 | 0.7 | 0.3 | 0.0 | 0.1 |
Pilosocereus | 1,940 | 25.9 | 0.2 | 25.9 | 0.2 | 0.0 | 0.0 | 0.5 | 16.8 | 0.5 | 2.1 | 1.8 | 7.0 | 0.0 | 0.0 | 0.9 |
Prosthechea | 6,617 | 31.5 | 0.1 | 31.5 | 0.0 | 0.0 | 0.1 | 0.4 | 19.6 | 1.7 | 0.9 | 5.0 | 8.3 | 0.1 | 0.0 | 0.2 |
Tillandsia | 42,222 | 35.3 | 0.4 | 35.2 | 0.3 | 0.0 | 0.0 | 0.7 | 19.8 | 0.7 | 9.2 | 4.9 | 5.1 | 0.1 | 0.0 | 1.0 |
Tocoyena | 2,922 | 37.6 | 0.3 | 37.4 | 0.0 | 0.2 | 0.0 | 0.8 | 32.3 | 0.8 | 5.0 | 0.1 | 1.9 | 0.2 | 0.0 | 0.5 |
Total | 218,899 | 44.3 | 4.2 | 41.7 | 0.2 | 4.0 | 0.0 | 1.0 | 32.3 | 0.4 | 7.1 | 4.2 | 5.6 | 0.3 | 1.0 | 0.4 |
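
The caption's note that per-test percentages can add up to more than the total fraction flagged follows from records being counted once per test but only once overall. For Diogenidae, for example, the error and unfit fractions (44.3% and 38.2%) sum to more than the 68.7% flagged. The following is a minimal sketch of that arithmetic on hypothetical toy data; the record sets, test names, and the `summarize` helper are illustrative assumptions and are not taken from the study or from any filtering software.

```python
# Hypothetical toy data: one entry per record, holding the names of the
# tests that flagged it (an empty set means the record passed every test).
records = [
    {"duplicates", "urban"},  # flagged by two tests
    {"duplicates"},
    set(),
    {"sea"},
    set(),
]


def percent(count, total):
    """Return count as a percentage of total."""
    return 100.0 * count / total


def summarize(records):
    """Return (per-test percentages, percentage flagged by at least one test)."""
    total = len(records)
    tests = sorted(set().union(*records))
    # Each test counts every record it flags, so a record flagged by two
    # tests contributes to two per-test percentages.
    per_test = {t: percent(sum(t in r for r in records), total) for t in tests}
    # The overall fraction counts each flagged record exactly once.
    flagged = percent(sum(bool(r) for r in records), total)
    return per_test, flagged


per_test, flagged = summarize(records)
print(per_test)
print("sum of per-test percentages:", sum(per_test.values()))
print("fraction flagged:", flagged)
```

In this toy case the per-test percentages sum to more than the overall fraction flagged because the first record is counted under both of its tests.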