[Table 1].
Overview of the shared task datasets and comparison to UMLS Metathesaurus
UMLS | TAC 2017 ADR track | FDA shared task | SMM4H 2019 | |
---|---|---|---|---|
Source of input data | - | SPL | SPL | Tweet |
Number of records | - | 200 | 100 | 684 |
Total number of AE descriptions | 428,673 | 13,436 | 16,427 | 1,122 |
Total number of unique AE descriptions | 241,096 | 4,475 | 5,156 | 746 |
Total number of unique MedDRA PTs | 22,774 | 1,941 | 1,946 | 248 |
Average # of tokens in unique AE descriptions | 4.07 | 3.79 | 3.64 | 3.39 |