Table 2.
Task description | Sample | Time taken |
---|---|---|
0.1% Sample extraction | Full corpus | 00:58 |
0.5% Sample extraction | Full corpus | 01:03 |
1% Sample extraction | Full corpus | 01:05 |
Keyword extraction | 0.1% | 05:57a |
Keyword extraction | 1% | 48:32a |
All tasks were run on an Intel Core i7 -7400 3.0 GHz CPU (4 cores) on Ubuntu Linux 20.04 Server 64-bit. We have only included those tasks that took a significant amount of time. All times are given in hh:mm format.
aDuring the keyword extraction process, we also filtered candidate keywords and extracted other relevant text items (entities, mentions, hashtags, and emojis), thus adding considerable processing time.