. Author manuscript; available in PMC: 2024 Aug 1.

Published in final edited form as: J Biomed Inform. 2023 Jul 23;144:104458. doi: 10.1016/j.jbi.2023.104458

Table 1:

Articles published for few-shot learning on medical data, their publication years, data sources, search engine, which downstream tasks the literature focus, size of the training set (number of the shot), number of the entities (for few-shot NER tasks), number of the classes (for few-shot classification tasks), type of data in training and test domain.

Study	Data source	Research aim	Size of training set	Number of entities / classes	Dataset domain
Rios et al., 2018 [16]	MIMIC II [29] MIMIC III [30]	Multi-label Text Classification	Frequent group (all labels that occur >5 times), the few-shot group (labels that occur between 1 and 5 times), and the zero-shot group (labels that never occur in the training dataset), reconstructed	Not mentioned ¹	Medical, discharge summaries annotated with a set of ICD-9 diagnosis and procedure labels
Rios et al., 2018 [31]	MIMIC II [29] MIMIC III [30]	Multi-label Text Classification	Original dataset, with no reconstruction	Not mentioned ¹	Medical
Hofer et al., 2018 [10]	i2b2 2009 32 i2b2 2010 33 i2b2 2012 34 CoNLL-2003 [35] BioNLP-2016 [36] MIMIC-III [30] UK CRIS [37, 38]	NER	10-shot	Not mentioned ¹	Medical and nonmedical (e.g, news)
Pham et al., 2018 [39]	The Europarl datasets [40] IWSLT17[41] The UFAL Medical Corpus HIML2017 dataset ³	Neural Machine Translation (NMT)	One-Shot	N/A ²	German→English: medical; English→Spanish, the proceedings of the European Parliament and data from TED
Yan et al., 2018 [42]	Multigames dataset [43] HCR dataset [44] SS-Tweet dataset [45] SemEval-2013 Dataset (SemEval b) [46]	Text Classification	Few-shot, but reconstructed	Multi games: 3 HCR: 5 SS-Tweet: 3 SemEval-2013 Dataset: 3	Tweets about sentiment and games, and health Care Reform (HCR) data
Manousogiannis et al., 2019 [47]	Tweets (provided by SMM4H 2019) [48]	NER	Original dataset, with no reconstruction	1, ADR with 319 MEDDRA codes	Medical (ADR)
Gao et al., 2019 [49]	FewRel dataset [50]	Relation Classification	5-Way 1-Shot / 5-Way 5-Shot / 10-Way 1-Shot / 10-Way 5-Shot	25	Wikipedia corpus ¹⁰ and Wikidata knowledge bases
Lara-Clares et al., 2019 [51]	MEDDOCAN shared task dataset [52]	NER	500 clinical cases, with no reconstruction	29	Clinical
Ferré et al., 2019 [53]	BB-norm dataset [54]	Entity Nor mal izat io n	Original dataset with no reconstruction and zero-shot	Not mentioned ¹	Biological
Hou et al., 2020 [55]	Snips dataset [56]	Slot Tagging (NER)	1-shot and 5-shot	7	Six of Weather, Music, PlayList, Book (including biomedical), Search Screen (including biomedical), Restaurant and Creative Work. ¹¹
Sharaf et al., 2020 [57]	Ten different datasets collected from the Open Parallel Corpus (OPUS) [58]	Neural Machine Translation (NMT)	Sizes ranging from 4k to 64k training words (200 to 3200 sentences), but reconstructed	N/A ²	Bible, European Central Bank, KDE, Quran, WMT news test sets, Books, European Medicines Agency (EMEA), Global Voices, Medical (ufal-Med), TED talks
Lu et al., 2020 [59]	MIMIC II [29] MIMIC III [30] EU legislation dataset [60]	Multi-label Text Classification	5-shot for MIMIC II and III, 50-shot for EU legislation	MIMIC II: 9 MIMIC III: 15 EU legislation: 5	Medical
Jia et al., 2020 [61]	BioNLP13PC BioNLP-13CG [62] CoNLL-2003 dataset [35] Broad Twitter dataset [63] Twitter dataset [64] CBS SciTech News dataset [65]	NER	Four few-shot (reconstructed) and zero-shot	CoNLL: 4 Broad Twitter: 3 Twitter: 4 BioNLP13PC: >=3 BioNLP13CG: >=3 CBS News: 4	For the BioNLP dataset, BioNLP13PC as the source domain dataset; In the Broad Twitter dataset, the CoNLL-2003 as the source domain dataset; In the Twitter dataset,the CoNLL- 2003 as the source domain dataset
Chalkidis et al., 2020 [66]	EURLEX5TK [60] MIMIC III [30] AMAZON13K [67]	Multi-label Text Classification	The labels are divided into frequent (≥ 50), few-shot (≤ 50), and zero-shot	Not mentioned ¹	English legislative documents, English discharge summaries from US hospitals, English product descriptions from Amazon
Lwowski et al., 2020 [68]	Tweets about COVID-19 [69]	Text Classification	100 tweets, with no reconstructed	4	Tweets about COVID-19
Hou et al., 2020 [9]	Dialogue utterances from the AIUI open dialogue platform of iFlytek⁴	Dialogue Language Understanding: includes two sub-tasks: Intent Detection (classification) and Slot Tagging (sequence labeling)	1-shot, 3-shot, 5-shot and 10-shot	Train Domains: 45 Dev Domains: 5 Test Domains: 9	General dialogue (including health domain)
Chen et al., 2020 [70]	WIKIBIO dataset [71]	Natural Language Generation (NLG)	Dataset sizes: 50, 100, 200 and 500, with no reconstruction	N/A ²	Books, Songs and Human domain (including biomedical)
Vaci et al., 2020 [72]	UK-CRIS system that provides a means of searching and analysing deidentified clinical case records from 12 National Health Service Mental Health Trusts [37, 38]	NER	Original dataset, with no reconstruction	7	Clinical
Huang et al., 2020 [73]	10 public datasets	NER	5-shot, 10%, 100%	CoNLL: 4 Onto: 18 WikiGold: 4 WNUT: 6 Movie: 12 Restaurant: 8 SNIPS: 53 AT IS: 79 Multiwoz: 14 I2B2: 23	10 public datasets, different domains
Chen et al., 2020 [74]	MRI image dataset MRI text reports⁵	Text Classifie ait ion	Original dataset, with no reconstruction	Not mentioned ¹	MRI data
Yin et al., 2020 [75]	MLEE [76] BioNLP’13-GE [62]	Sequence Tagging (NER)	5-way-10-shot, 5-way-15-shot, and 5-way-20-shot	5	Biological event
Goodwin et al., 2020 [77]	Tensor Flow DataSets catalogue⁶	Abstractive Summarization	Zero-shot and 10-shot	N/A ²	3 general domain & 1 consumer health
Yang et al., 2020 [78]	OntoNotes 5.0 [79] CoNLL-2003 [35] I2B2 2014 [80] WNUT 2017 [81]¹¹	NER	1-shot and 5-shot	Onto: 18 CoNLL: 4 I2B2–14: 23 WNUT: 6	Three of general, news, medical and social media
Hartmann et al., 2021 [82]	The IULA dataset [83] The NUBES dataset [84] The FRENCH dataset [85] Negation Scope Resolution datasets	NER	Zero-shot, with no reconstruction	1, Negation	No training data for the clinical datasets
Fi vez et al., 2021 [86]	SNOMED-CT1⁷ ICD-10	Name Normalization	Zero-shot, with no reconstructed	N/A ²	Biomedical
Lu et al., 2021 [87]	Constructed a dataset ⁸ based on Wei bo for the research of few-shot rumor detection, and use PHEME dataset [88]	Rumor Detection (NER)	For the Weibo dataset: 2-way 3-event 5-shot 9-query; for PHEME dataset: 2-way 2-event 5-shot 9-query	Weibo: 14 PHEME: 5	Source posts and comments from Sina Weibo related to COVID-19
Ma et al., 2021 [89]	CCLE CERES-corrected CRISPR gene disruption scores G DSC1000 dataset PDTC dataset PDX dataset⁹	Drug-response Predictions	1-shot, 2-shot, 5-shot, and 10-shot	N/A ²	Biomedical
Kormilitzin et al., 2021 [90]	MIMIC-III [30] UK-CRIS datasets [37, 38]	NER	25%, 50%, 75% and 100% of the training set, with no reconstruction	7	Electronic health record
Guo et al., 2021 [91]	BioNLP Shared Task 2011 and 2019 [54] structured biological datasets	NER	100%, 75%, 50%, 25%, 0% of training set, with no reconstructed	Not mentioned ¹	Biomedical entities
Lee et al., 2021 [92]	COVID19-Scientific [93] COVID19-Social [94] FEVER [95]	Fact- Checki ng (close to Text Classification)	2-shot, 10-shot, and 50-shot	Not mentioned ¹	Facts about COVID-19
Fi vez et al., 2021 [96]	ICD-10 SNOMED-CT⁷	Name Nor mal izat io n	15-shot	N/A ²	Biomedical
Xiao et al., 2021 [97]	FewRel dataset	Relation Classification	5-Way-1-Shot 5-Way-5-Shot 10-Way-l-Shot 10-Way-5-Shot	Not mentioned ¹	Wikipedia and Wikidata
Ziletti et al., 2021 [98]	MedDRA ontology	Medical Coding / classification	Zero-shot Few-shot	26,000 distinct classes	Synonyms and biomedical text
Ye et al., 2021 [99]	Huggingface Datasets	Cross-task Generalization	Few-shot More-shot	N/A ²	160 datasets
Aly et al., 2021 [100]	OntoNotes-ZS MedMentions-ZS	NER and classification	zero-shot	Train: 19 classes Dev: 12 classes Test: 12 classes	General, Biomedical
Wright et al., 2021 [101]	Curated a dataset of paired sentences from abstracts and associated press releases, labeled by experts for exaggeration based on their claim strength, and ScienceDaily	Information Extraction	100-shot	N/A ²	A science reporting website which aggregates and re-releases press releases from a variety of sources
Lee et al., 2021 [102]	C0NLLO3 Ontonotes 5.0 BC5CDR	NER	2 5-shot and 50-shot	Not mentioned ¹	General ¹²
Wang et al., 2022 [103]	i2b2 2010 dataset i2b2 2012 dataset MIMIC-III dataset BioScope NegEx Chia	Classification	Whole datasets but few-shot classes	Not mentioned ¹	Annotates a corpus ¹³ of assertions
Yan et al., 2022[104]	677 full-text articles were obtained as neuroimaging corpora	NER	Whole datasets	10 categories of neuroimaging entities and 55 categories of neuroimaging interactions	Neuroimaging entities and their interactions
Lin et al., 2022 [105]	Neuroimaging event mention set	Information Extraction	Whole datasets but few-shot classes	788 “Activate” event mentions, 128 “Deactivate” event mentions, 1169 “Effect” event mentions, 665 “Perform Experiment” event mentions, 266 “Acquisition” event mentions, and 315 “Perform Analysis” event mentions.	Ne uro imaging event
Riveland et al., 2022 [106]	Psychophysical tasks	Classfication	Zero-shot	4 categories	Psychophysical tasks
Navarro et al., 2022 [107]	27 recorded conversations between general practitioners and patients at Primary Care facilities	Abstractive summarization	Zero-shot 10-shot 20-shot 50-shot	N/A ²	Medical dialogues from various online chats
Das et al., 2022 [108]	OntoNotes C0NLLO3 WNUTir GUM Few-NERD	NER	1–2 shot-5-way 5–10 shot-5-way 1–2 shot-10-way 5–10 shot-10-way	Not Mentioned ¹	General (OntoNotes 5.0), Medical (I2B2), News (C0NLLO3), Social (WNUT17) ¹⁴
Ma et al., 2022 [109]	C0NLLO3 WNUT1T JNLPBA NCBI-disease 12B2–14 datasets	NER	1-shot, 5-shot, 20-shot, 50-shot	Not mentioned ¹	General, Social, Biomedical
Par mar et al., 2022 [110]	32 datasets	Multi-Task Learning	32-shot, 100-shot, 1000-shot, 2000-shot	Not mentioned ¹	Biomedical and health data
Boulanger et al., 2022 [111]	I2B2 CoNLL	NER	50-shot, 100-shot, 250-shot, 500-shot, 1000-shot	Not mentioned ¹	General and Biomedical data
Yeh et al., 2022 [112]	ChemProt dataset	Relation Extraction	Zero-shot	Not mentioned ¹	Scientific paper abstracts annotated with 6 relation types between the chemicals and genes in sentences
Pan et al., 2022 [113]	MoviesQA NewsQA BioQA CovidQA	Question Answering	Zero-shot	Not mentioned ¹	Movies, News, Biomedical, and COVID-19 domains
Wadden et al., 2022 [114]	Scientific claim verification datasets	Scientific claim verification	Zero-shot few-shot	N/A ²	SCIFACT, Health Ver, COVIDFact, FEVER, EVIDENCE- INFERENCE, PUBMEDQA
Li et al., 2022[115]	FewRel 1.0 FewRel 2.0	Relation classification	5-way-1-shot, 5-way-5-shot, 10-way-1-shot, 10-way-5-shot	100 relations split into training, validation and test sets with respectively 64, 16 and 20 relations without overlapping	Relations from PubMed articles
Zhang et al., 2022 [116]	7 STS tasks STS 2012–2016 STS Benchmark S ICK-Relatedness	Natural Language Inference (NLI)	16 labeled instances per class	Not mentioned ¹	News, Biomedical, Search snippets and Social media data

The research aim of this paper is text classification or NER, but the size of training set is Not mentioned in the paper.

The research aim of this paper is neither text classification nor NER.

UFAL Medical Corpus v.1.0 and HIML2017 dataset: http://aiui.xfyun.cn/index-aiui. Last accessed November 22, 2021.

⁴

iFlytek: http://aiui.xfyun.cn/index-aiui. Last accessed November 22, 2021.

⁵

Those datasets are not released.

⁶

TensorFlow DataSets: https://www.tensorflow.org/datasets. Last accessed November 22, 2021.

⁷

SNOMED-CT1: https://www.snomed.org. Last accessed November 22, 2021.

⁸

A novel dataset proposed by this paper: https://github.com/jncsnlp/Sina-Weibo-Rumors-for-few-shot-learning-research. Last accessed November 22, 2021.

⁹

Links are provided in the original paper.

¹⁰

Test data is biomedical literature with UMLS, a large-scale biomedical knowledge base.

¹¹

The remaining one class is used at test time.

¹²

PubMed articles and chemical-disease texts are used as additional test data.

¹³

Patient eligibility data and 3 assertion types are used for test data: Present, Absent & Possible

¹⁴

GUM, Few-NERD used as test data.