Table 1. Example of a sentence from the dataset, annotated by 5 independent annotators (sentence 10835394_70).
Rows are the 3 fragments (A, B, C); columns are the 5 annotators (A1–A5). Reading across a row gives the per-fragment view; reading down a column gives the per-annotator view.

| Annotation | Fragment | A1 | A2 | A3 | A4 | A5 |
|---|---|---|---|---|---|---|
| Number of sentence fragments | | 1 | 2 | 1 | 2 | 2 |
| Evidence | A | E3 | E3 | E3 | E3 | E3 |
| Evidence | B | E3 | E1 | E3 | E3 | E3 |
| Evidence | C | E3 | E1 | E0 | E0 | E3 |
| Focus | A | S | S | S | S | G |
| Focus | B | S | S | S | S | G |
| Focus | C | S | S | S | S | G |
| Polarity-Certainty | A | P3 | P3 | P2 | P2 | P3 |
| Polarity-Certainty | B | P3 | P3 | P2 | P2 | P3 |
| Polarity-Certainty | C | P3 | P3 | P2 | P3 | P3 |
Annotations in the context of the real sentence are as follows:
The phenotypes of mxp19 (Fig 1B) |A2:**1SP3E3| and mxp170 (data not shown) homozygotes and hemizygotes (data not shown) are identical, |A3:**1SP3E3| |A4:**1SP3E3| |A5:**1GP3E3| suggesting that mxp19 and mxp170 are null alleles. |A1:**1SP3E3| |A2:**2SP3E1| |A3:**1SP2E0| |A4:**2SP2E0| |A5:**2GP2E3|
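The inline tags can be read mechanically. The following Python sketch extracts them from the annotated sentence. It rests on assumptions that are not stated in the text: that the digit after `**` numbers the annotator's fragment, and that the remaining characters are the Focus, Polarity-Certainty, and Evidence codes, in that order. The names `Tag`, `TAG_RE`, and `parse_tags` are hypothetical, and the pattern covers only the codes that occur in this example.

```python
import re
from typing import List, NamedTuple


class Tag(NamedTuple):
    annotator: str           # e.g. "A2"
    fragment: int            # assumed: the digit after "**" is the fragment number
    focus: str               # "S" or "G" in this example
    polarity_certainty: str  # e.g. "P3"
    evidence: str            # e.g. "E3"


# Matches tags such as |A2:**1SP3E3| (only the code values seen in this example).
TAG_RE = re.compile(r"\|(A\d+):\*\*(\d+)([SG])(P\d)(E\d)\|")


def parse_tags(text: str) -> List[Tag]:
    """Return every inline annotation tag in the order it appears in text."""
    return [Tag(a, int(n), f, p, e) for a, n, f, p, e in TAG_RE.findall(text)]


sentence = (
    "The phenotypes of mxp19 (Fig 1B) |A2:**1SP3E3| and mxp170 (data not shown) "
    "homozygotes and hemizygotes (data not shown) are identical, "
    "|A3:**1SP3E3| |A4:**1SP3E3| |A5:**1GP3E3| "
    "suggesting that mxp19 and mxp170 are null alleles. "
    "|A1:**1SP3E3| |A2:**2SP3E1| |A3:**1SP2E0| |A4:**2SP2E0| |A5:**2GP2E3|"
)

tags = parse_tags(sentence)
for tag in tags:
    print(tag)
```

Each tag follows the text it labels, so the position of a tag in the sentence also marks where that annotator closed a fragment.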
The minimum number of sentence fragments required to represent these annotations is three:
A = “The phenotypes of mxp19 (Fig 1B)”
B = “and mxp170 (data not shown) homozygotes and hemizygotes (data not shown) are identical,”
C = “suggesting that mxp19 and mxp170 are null alleles.”
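One way to see why three fragments are needed is to cut the sentence at the union of all annotators' fragment boundaries. The Python sketch below illustrates that idea; it is not necessarily the procedure used to build the dataset. The cut offsets are read off the tag positions in the annotated sentence (A2 closes a fragment after A; A3, A4, and A5 close one after B; A1 labels the sentence as a whole), and the function name `minimal_fragments` is hypothetical.

```python
from typing import Dict, List


def minimal_fragments(text: str, cuts_by_annotator: Dict[str, List[int]]) -> List[str]:
    """Split text at the union of every annotator's internal cut offsets."""
    cuts = sorted(
        {c for cuts in cuts_by_annotator.values() for c in cuts if 0 < c < len(text)}
    )
    bounds = [0, *cuts, len(text)]
    # Consecutive boundary pairs delimit the fragments; strip the joining spaces.
    return [text[start:end].strip() for start, end in zip(bounds, bounds[1:])]


a = "The phenotypes of mxp19 (Fig 1B)"
b = ("and mxp170 (data not shown) homozygotes and hemizygotes "
     "(data not shown) are identical,")
c = "suggesting that mxp19 and mxp170 are null alleles."
text = " ".join([a, b, c])

end_a = len(a)               # offset just after fragment A
end_b = len(a) + 1 + len(b)  # offset just after fragment B

# Internal cut points per annotator, as read from the tag positions (assumed).
cuts_by_annotator = {
    "A1": [],
    "A2": [end_a],
    "A3": [end_b],
    "A4": [end_b],
    "A5": [end_b],
}

fragments = minimal_fragments(text, cuts_by_annotator)
for label, fragment in zip("ABC", fragments):
    print(label, "=", fragment)
```

No single annotator used more than two fragments, but the two distinct cut points together yield three.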
Annotators' identities are concealed with codes A1, A2, A3, A4, and A5.