Table 1:
Study | Data source | Research aim | Size of training set | Number of entities / classes | Dataset domain |
---|---|---|---|---|---|
Rios et al., 2018 [16] | MIMIC II [29] MIMIC III [30] | Multi-label Text Classification | Frequent group (all labels that occur >5 times), the few-shot group (labels that occur between 1 and 5 times), and the zero-shot group (labels that never occur in the training dataset), reconstructed | Not mentioned 1 | Medical, discharge summaries annotated with a set of ICD-9 diagnosis and procedure labels |
Rios et al., 2018 [31] | MIMIC II [29] MIMIC III [30] | Multi-label Text Classification | Original dataset, with no reconstruction | Not mentioned 1 | Medical |
Hofer et al., 2018 [10] | i2b2 2009 32 i2b2 2010 33 i2b2 2012 34 CoNLL-2003 [35] BioNLP-2016 [36] MIMIC-III [30] UK CRIS [37, 38] | NER | 10-shot | Not mentioned 1 | Medical and nonmedical (e.g, news) |
Pham et al., 2018 [39] | The Europarl datasets [40] IWSLT17[41] The UFAL Medical Corpus HIML2017 dataset 3 | Neural Machine Translation (NMT) | One-Shot | N/A 2 | German→English: medical; English→Spanish, the proceedings of the European Parliament and data from TED |
Yan et al., 2018 [42] | Multigames dataset [43] HCR dataset [44] SS-Tweet dataset [45] SemEval-2013 Dataset (SemEval b) [46] | Text Classification | Few-shot, but reconstructed | Multi games: 3 HCR: 5 SS-Tweet: 3 SemEval-2013 Dataset: 3 | Tweets about sentiment and games, and health Care Reform (HCR) data |
Manousogiannis et al., 2019 [47] | Tweets (provided by SMM4H 2019) [48] | NER | Original dataset, with no reconstruction | 1, ADR with 319 MEDDRA codes | Medical (ADR) |
Gao et al., 2019 [49] | FewRel dataset [50] | Relation Classification | 5-Way 1-Shot / 5-Way 5-Shot / 10-Way 1-Shot / 10-Way 5-Shot | 25 | Wikipedia corpus 10 and Wikidata knowledge bases |
Lara-Clares et al., 2019 [51] | MEDDOCAN shared task dataset [52] | NER | 500 clinical cases, with no reconstruction | 29 | Clinical |
Ferré et al., 2019 [53] | BB-norm dataset [54] | Entity Nor mal izat io n | Original dataset with no reconstruction and zero-shot | Not mentioned 1 | Biological |
Hou et al., 2020 [55] | Snips dataset [56] | Slot Tagging (NER) | 1-shot and 5-shot | 7 | Six of Weather, Music, PlayList, Book (including biomedical), Search Screen (including biomedical), Restaurant and Creative Work. 11 |
Sharaf et al., 2020 [57] | Ten different datasets collected from the Open Parallel Corpus (OPUS) [58] | Neural Machine Translation (NMT) | Sizes ranging from 4k to 64k training words (200 to 3200 sentences), but reconstructed | N/A 2 | Bible, European Central Bank, KDE, Quran, WMT news test sets, Books, European Medicines Agency (EMEA), Global Voices, Medical (ufal-Med), TED talks |
Lu et al., 2020 [59] | MIMIC II [29] MIMIC III [30] EU legislation dataset [60] | Multi-label Text Classification | 5-shot for MIMIC II and III, 50-shot for EU legislation | MIMIC II: 9 MIMIC III: 15 EU legislation: 5 | Medical |
Jia et al., 2020 [61] | BioNLP13PC BioNLP-13CG [62] CoNLL-2003 dataset [35] Broad Twitter dataset [63] Twitter dataset [64] CBS SciTech News dataset [65] | NER | Four few-shot (reconstructed) and zero-shot | CoNLL: 4 Broad Twitter: 3 Twitter: 4 BioNLP13PC: >=3 BioNLP13CG: >=3 CBS News: 4 | For the BioNLP dataset, BioNLP13PC as the source domain dataset; In the Broad Twitter dataset, the CoNLL-2003 as the source domain dataset; In the Twitter dataset,the CoNLL- 2003 as the source domain dataset |
Chalkidis et al., 2020 [66] | EURLEX5TK [60] MIMIC III [30] AMAZON13K [67] | Multi-label Text Classification | The labels are divided into frequent (≥ 50), few-shot (≤ 50), and zero-shot | Not mentioned 1 | English legislative documents, English discharge summaries from US hospitals, English product descriptions from Amazon |
Lwowski et al., 2020 [68] | Tweets about COVID-19 [69] | Text Classification | 100 tweets, with no reconstructed | 4 | Tweets about COVID-19 |
Hou et al., 2020 [9] | Dialogue utterances from the AIUI open dialogue platform of iFlytek4 | Dialogue Language Understanding: includes two sub-tasks: Intent Detection (classification) and Slot Tagging (sequence labeling) | 1-shot, 3-shot, 5-shot and 10-shot | Train Domains: 45 Dev Domains: 5 Test Domains: 9 | General dialogue (including health domain) |
Chen et al., 2020 [70] | WIKIBIO dataset [71] | Natural Language Generation (NLG) | Dataset sizes: 50, 100, 200 and 500, with no reconstruction | N/A 2 | Books, Songs and Human domain (including biomedical) |
Vaci et al., 2020 [72] | UK-CRIS system that provides a means of searching and analysing deidentified clinical case records from 12 National Health Service Mental Health Trusts [37, 38] | NER | Original dataset, with no reconstruction | 7 | Clinical |
Huang et al., 2020 [73] | 10 public datasets | NER | 5-shot, 10%, 100% | CoNLL: 4 Onto: 18 WikiGold: 4 WNUT: 6 Movie: 12 Restaurant: 8 SNIPS: 53 AT IS: 79 Multiwoz: 14 I2B2: 23 | 10 public datasets, different domains |
Chen et al., 2020 [74] | MRI image dataset MRI text reports5 | Text Classifie ait ion | Original dataset, with no reconstruction | Not mentioned 1 | MRI data |
Yin et al., 2020 [75] | MLEE [76] BioNLP’13-GE [62] | Sequence Tagging (NER) | 5-way-10-shot, 5-way-15-shot, and 5-way-20-shot | 5 | Biological event |
Goodwin et al., 2020 [77] | Tensor Flow DataSets catalogue6 | Abstractive Summarization | Zero-shot and 10-shot | N/A 2 | 3 general domain & 1 consumer health |
Yang et al., 2020 [78] | OntoNotes 5.0 [79] CoNLL-2003 [35] I2B2 2014 [80] WNUT 2017 [81]11 | NER | 1-shot and 5-shot | Onto: 18 CoNLL: 4 I2B2–14: 23 WNUT: 6 | Three of general, news, medical and social media |
Hartmann et al., 2021 [82] | The IULA dataset [83] The NUBES dataset [84] The FRENCH dataset [85] Negation Scope Resolution datasets | NER | Zero-shot, with no reconstruction | 1, Negation | No training data for the clinical datasets |
Fi vez et al., 2021 [86] | SNOMED-CT17 ICD-10 | Name Normalization | Zero-shot, with no reconstructed | N/A 2 | Biomedical |
Lu et al., 2021 [87] | Constructed a dataset 8 based on Wei bo for the research of few-shot rumor detection, and use PHEME dataset [88] | Rumor Detection (NER) | For the Weibo dataset: 2-way 3-event 5-shot 9-query; for PHEME dataset: 2-way 2-event 5-shot 9-query | Weibo: 14 PHEME: 5 | Source posts and comments from Sina Weibo related to COVID-19 |
Ma et al., 2021 [89] | CCLE CERES-corrected CRISPR gene disruption scores G DSC1000 dataset PDTC dataset PDX dataset9 | Drug-response Predictions | 1-shot, 2-shot, 5-shot, and 10-shot | N/A 2 | Biomedical |
Kormilitzin et al., 2021 [90] | MIMIC-III [30] UK-CRIS datasets [37, 38] | NER | 25%, 50%, 75% and 100% of the training set, with no reconstruction | 7 | Electronic health record |
Guo et al., 2021 [91] | BioNLP Shared Task 2011 and 2019 [54] structured biological datasets | NER | 100%, 75%, 50%, 25%, 0% of training set, with no reconstructed | Not mentioned 1 | Biomedical entities |
Lee et al., 2021 [92] | COVID19-Scientific [93] COVID19-Social [94] FEVER [95] | Fact- Checki ng (close to Text Classification) | 2-shot, 10-shot, and 50-shot | Not mentioned 1 | Facts about COVID-19 |
Fi vez et al., 2021 [96] | ICD-10 SNOMED-CT7 | Name Nor mal izat io n | 15-shot | N/A 2 | Biomedical |
Xiao et al., 2021 [97] | FewRel dataset | Relation Classification | 5-Way-1-Shot 5-Way-5-Shot 10-Way-l-Shot 10-Way-5-Shot | Not mentioned 1 | Wikipedia and Wikidata |
Ziletti et al., 2021 [98] | MedDRA ontology | Medical Coding / classification | Zero-shot Few-shot | 26,000 distinct classes | Synonyms and biomedical text |
Ye et al., 2021 [99] | Huggingface Datasets | Cross-task Generalization | Few-shot More-shot | N/A 2 | 160 datasets |
Aly et al., 2021 [100] | OntoNotes-ZS MedMentions-ZS | NER and classification | zero-shot | Train: 19 classes Dev: 12 classes Test: 12 classes | General, Biomedical |
Wright et al., 2021 [101] | Curated a dataset of paired sentences from abstracts and associated press releases, labeled by experts for exaggeration based on their claim strength, and ScienceDaily | Information Extraction | 100-shot | N/A 2 | A science reporting website which aggregates and re-releases press releases from a variety of sources |
Lee et al., 2021 [102] | C0NLLO3 Ontonotes 5.0 BC5CDR | NER | 2 5-shot and 50-shot | Not mentioned 1 | General 12 |
Wang et al., 2022 [103] | i2b2 2010 dataset i2b2 2012 dataset MIMIC-III dataset BioScope NegEx Chia | Classification | Whole datasets but few-shot classes | Not mentioned 1 | Annotates a corpus 13 of assertions |
Yan et al., 2022[104] | 677 full-text articles were obtained as neuroimaging corpora | NER | Whole datasets | 10 categories of neuroimaging entities and 55 categories of neuroimaging interactions | Neuroimaging entities and their interactions |
Lin et al., 2022 [105] | Neuroimaging event mention set | Information Extraction | Whole datasets but few-shot classes | 788 “Activate” event mentions, 128 “Deactivate” event mentions, 1169 “Effect” event mentions, 665 “Perform Experiment” event mentions, 266 “Acquisition” event mentions, and 315 “Perform Analysis” event mentions. | Ne uro imaging event |
Riveland et al., 2022 [106] | Psychophysical tasks | Classfication | Zero-shot | 4 categories | Psychophysical tasks |
Navarro et al., 2022 [107] | 27 recorded conversations between general practitioners and patients at Primary Care facilities | Abstractive summarization | Zero-shot 10-shot 20-shot 50-shot | N/A 2 | Medical dialogues from various online chats |
Das et al., 2022 [108] | OntoNotes C0NLLO3 WNUTir GUM Few-NERD | NER | 1–2 shot-5-way 5–10 shot-5-way 1–2 shot-10-way 5–10 shot-10-way | Not Mentioned 1 | General (OntoNotes 5.0), Medical (I2B2), News (C0NLLO3), Social (WNUT17) 14 |
Ma et al., 2022 [109] | C0NLLO3 WNUT1T JNLPBA NCBI-disease 12B2–14 datasets | NER | 1-shot, 5-shot, 20-shot, 50-shot | Not mentioned 1 | General, Social, Biomedical |
Par mar et al., 2022 [110] | 32 datasets | Multi-Task Learning | 32-shot, 100-shot, 1000-shot, 2000-shot | Not mentioned 1 | Biomedical and health data |
Boulanger et al., 2022 [111] | I2B2 CoNLL | NER | 50-shot, 100-shot, 250-shot, 500-shot, 1000-shot | Not mentioned 1 | General and Biomedical data |
Yeh et al., 2022 [112] | ChemProt dataset | Relation Extraction | Zero-shot | Not mentioned 1 | Scientific paper abstracts annotated with 6 relation types between the chemicals and genes in sentences |
Pan et al., 2022 [113] | MoviesQA NewsQA BioQA CovidQA | Question Answering | Zero-shot | Not mentioned 1 | Movies, News, Biomedical, and COVID-19 domains |
Wadden et al., 2022 [114] | Scientific claim verification datasets | Scientific claim verification | Zero-shot few-shot | N/A 2 | SCIFACT, Health Ver, COVIDFact, FEVER, EVIDENCE- INFERENCE, PUBMEDQA |
Li et al., 2022[115] | FewRel 1.0 FewRel 2.0 | Relation classification | 5-way-1-shot, 5-way-5-shot, 10-way-1-shot, 10-way-5-shot | 100 relations split into training, validation and test sets with respectively 64, 16 and 20 relations without overlapping | Relations from PubMed articles |
Zhang et al., 2022 [116] | 7 STS tasks STS 2012–2016 STS Benchmark S ICK-Relatedness | Natural Language Inference (NLI) | 16 labeled instances per class | Not mentioned 1 | News, Biomedical, Search snippets and Social media data |
The research aim of this paper is text classification or NER, but the size of training set is Not mentioned in the paper.
The research aim of this paper is neither text classification nor NER.
UFAL Medical Corpus v.1.0 and HIML2017 dataset: http://aiui.xfyun.cn/index-aiui. Last accessed November 22, 2021.
iFlytek: http://aiui.xfyun.cn/index-aiui. Last accessed November 22, 2021.
Those datasets are not released.
TensorFlow DataSets: https://www.tensorflow.org/datasets. Last accessed November 22, 2021.
SNOMED-CT1: https://www.snomed.org. Last accessed November 22, 2021.
A novel dataset proposed by this paper: https://github.com/jncsnlp/Sina-Weibo-Rumors-for-few-shot-learning-research. Last accessed November 22, 2021.
Links are provided in the original paper.
Test data is biomedical literature with UMLS, a large-scale biomedical knowledge base.
The remaining one class is used at test time.
PubMed articles and chemical-disease texts are used as additional test data.
Patient eligibility data and 3 assertion types are used for test data: Present, Absent & Possible
GUM, Few-NERD used as test data.