Author manuscript; available in PMC: 2018 Oct 8.

Published in final edited form as: Proc Conf Assoc Comput Linguist Meet. 2018 Jul;2018:197–207.

Table 3:

Precision, recall, and F-1 for aggregated Amazon Mechanical Turk (AMT) spans, evaluated against the union of expert span labels, for each of the three elements: Participants (P), Interventions (I), and Outcomes (O).

Participants	Precision	Recall	F-1
Majority Vote	0.903	0.507	0.604
Dawid Skene	0.840	0.641	0.686
HMMCrowd	0.719	0.761	0.698
Interventions	Precision	Recall	F-1
Majority Vote	0.843	0.432	0.519
Dawid Skene	0.755	0.623	0.650
HMMCrowd	0.644	0.800	0.683
Outcomes	Precision	Recall	F-1
Majority Vote	0.711	0.577	0.623
Dawid Skene	0.652	0.648	0.629
HMMCrowd	0.498	0.807	0.593
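
The evaluation summarized in Table 3 can be illustrated with a minimal sketch. It assumes token-level matching, with each annotation represented as a set of token indices within one document; the function names (`union_labels`, `precision_recall_f1`, `macro_average`) are hypothetical and not taken from the paper. It also assumes scores are averaged per document, in which case an averaged F-1 need not equal the harmonic mean of the averaged precision and recall.

```python
from typing import Iterable, List, Set, Tuple

Scores = Tuple[float, float, float]  # (precision, recall, F-1)


def union_labels(expert_annotations: Iterable[Set[int]]) -> Set[int]:
    """Reference labels: every token index marked by at least one expert."""
    gold: Set[int] = set()
    for annotation in expert_annotations:
        gold |= annotation
    return gold


def precision_recall_f1(predicted: Set[int], gold: Set[int]) -> Scores:
    """Token-level scores for one document (assumed matching granularity)."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


def macro_average(per_document: List[Scores]) -> Scores:
    """Average each metric over documents (assumed averaging scheme)."""
    n = len(per_document)
    return tuple(sum(s[i] for s in per_document) / n for i in range(3))


# Toy example: two experts, one aggregated crowd annotation.
gold = union_labels([{2, 3, 4}, {3, 4, 5}])   # tokens 2..5
crowd = {3, 4}                                # aggregated crowd span
precision, recall, f1 = precision_recall_f1(crowd, gold)
print(precision, recall, round(f1, 3))
```

In this toy case every crowd token lies inside the expert union, so precision is 1.0, while only half of the union is covered, so recall is 0.5. This mirrors the high-precision, low-recall pattern of the Majority Vote rows.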