. 2018 Aug 29;27(1):184–192. doi: 10.1055/s-0038-1667079

Table 2. List of shared tasks with data source, data size, sub-task descriptions, and best-performance score (metrics differ per challenge). The table also indicates whether the data remain available after the challenge, whether they have been de-identified, and whether a DUA must be signed to access them.

Category	Year	Challenge name	Task description	Data type	Data source	Data size	De-identification / anonymization	DUA	Currently Available?	Best Performance	Measure
Synthetic	2015	TREC Clinical Decision Support (CDS) 9	Patient-centered information retrieval	Medical case narratives	Synthetic, PubMed	30 topics, 730K articles	no	no	yes	38.21%	infNDCG
Synthetic	2017	TREC Precision Medicine 11: Track 1; Track 2	Patient-centered literature article retrieval; Patient-centered clinical trials retrieval	Semi-structured cases	Synthetic, PubMed, ClinicalTrials.gov	30 topics, 27M abstracts, 241K trials	no	no	yes	63.10%; 44.29%	P@10; P@10
Synthetic	2016	CLEF eHealth 12	Information extraction	Nursing handover notes	NICTA synthetic nursing handover notes	300 notes	no	no	yes	38.20%	F1 (macro avg.)
Prescription drug labels	2017	Text Analysis Conference (TAC) Adverse Drug Reaction Extraction from Drug Labels (ADR) 18: Track 1; Track 2; Track 3; Track 4	ADR mentions and modifiers extraction; Relation extraction; Positive ADR filtering; Positive ADR normalization	Drug labels	Drugs-Library.com	2309 labels	no	no	yes	82.48%; 49.00%; 82.19%; 85.33%	F1; F1; F1 (macro avg.); F1 (macro avg.)
Online social data	2015	CLPsych: Depression and PTSD on Twitter 22	Binary classification of depression and PTSD users	Social media	Twitter	7.8M tweets	yes	yes	yes	80.00%	Avg. Precision
Online social data	2016	Social Media Mining (SMM) 24: Track 1; Track 2; Track 3	ADR classification; Information extraction; Concept normalization	Social media	Twitter	10,882 tweets	no	no	yes	41.95%; 61.10%; -	F1; F1
Online social data	2017	Social Media Mining for Health Applications (SMM4HA) 29: Track 1; Track 2; Track 3	ADR classification; Classification of medication intake; Concept normalization	Social media	Twitter	15,777 tweets	no	no	yes	43.50%; 69.30%; 88.50%	F1; F1 (micro avg.); Accuracy
Online social data	2016	CLPsych: Triaging content in online peer-support forums 33	Classification of mental health severity in 4 levels	Forum	ReachOut	65,024 (1,227 annotated)	yes	yes	yes, on request	42.00%	F1 (macro avg.)
Online social data	2017	CLPsych: Triaging content in online peer-support forums 35	Classification of mental health severity in 4 levels	Forum	ReachOut	157,963 posts (1,588 annotated)	yes	yes	yes, on request	46.70%	F1 (macro avg.)
Online social data	2017	NTCIR-13 MedWeb 36	8-class classification of diseases and symptoms	Multilingual social media	Twitter	2560 tweets	yes	yes	yes, on request	-	
Clinical data	2015	Analysis of Clinical Text (ACT) 39: Track 1; Track 2a; Track 2b	Disorder NER and normalization; Template slot filling (given gold spans); Disorder recognition and template slot filling (end-to-end)	Clinical notes	ShARe corpus (MIMIC)	531 summaries	yes	yes	yes	75.70%; 88.60%; 80.80%	F1 (strict); F1 * weighted acc.; F1 * weighted acc.
Clinical data	2016	TREC Clinical Decision Support (CDS) 43	Patient-centered IR	Nursing admission notes	MIMIC, PubMed	30 notes, 1.25M abstracts				40.33%	P@10
Clinical data	2017	Medication and Adverse Drug Events (MADE1.0): Track 1; Track 2	Medication, ADE, sign and symptom identification; Relation extraction	Clinical notes	UMass Memorial Medical Center	1092 records	yes	yes	no	-	
Clinical data	2015	Clinical TempEval 45: Track 1; Track 2; Track 3	Time expression extraction; Event extraction; Relation extraction (wrt DCT); Relation extraction (wrt narrative containers)	Pathology reports	Mayo Clinic	600 notes	yes	yes	yes, on request	72.50%; 87.50%; 70.20%; 12.30%	F1; F1; F1; F1
Clinical data	2016	Clinical TempEval 46: Track 1; Track 2; Track 3	Time expression extraction; Event extraction; Relation extraction (wrt DCT); Relation extraction (wrt narrative containers)	Pathology reports	Mayo Clinic	600 notes	yes	yes	yes, on request	79.50%; 90.30%; 75.60%; 47.90%	F1; F1; F1; F1
Clinical data	2017	Clinical TempEval 48: Track 1; Track 2; Track 3	Time expression extraction (cross-domain); Event extraction (cross-domain); Relation extraction (wrt DCT); Relation extraction (wrt narrative containers)	Pathology reports, Clinical notes	Mayo Clinic	1216 notes	yes	yes	yes, on request	57.00%; 72.00%; 59.00%; 32.50%	F1; F1; F1; F1
Clinical data	2016	Centers for Excellence in Genomics N-GRID (CEGS-NGRID) 51: Track 1a; Track 1b; Track 2	De-identification (cross-domain); De-identification; Psychiatric symptom severity prediction	Psychiatric evaluation records	Partners Healthcare and Harvard Medical School	1000 records	yes	yes	yes, on request	79.85%; 91.43%; 86.30%	F1; F1; INMAE^M