2022 Nov 29;60(2):571–591. doi: 10.1007/s10844-022-00768-8

Table 2.

Training corpora and entity types of the ScispaCy entity recognition models

Training corpus Entity types
CRAFT GGP, SO, TAXON, CHEBI, GO, CL
JNLPBA DNA, CELL_TYPE, CELL_LINE, RNA, PROTEIN
BC5CDR DISEASE, CHEMICAL
BIONLP13CG AMINO_ACID, ANATOMICAL_SYSTEM, CANCER, CELL, CELLULAR_COMPONENT, DEVELOPING_ANATOMICAL_STRUCTURE, GENE_OR_GENE_PRODUCT, IMMATERIAL_ANATOMICAL_ENTITY, MULTI-TISSUE_STRUCTURE, ORGAN, ORGANISM, ORGANISM_SUBDIVISION, ORGANISM_SUBSTANCE, PATHOLOGICAL_FORMATION, SIMPLE_CHEMICAL, TISSUE
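
To make the table easier to query, the corpus-to-label mapping can be written as a small lookup structure. The following is a minimal sketch, not code from the paper: the names `ENTITY_TYPES` and `corpora_for` are hypothetical, and the BC5CDR entry assumes the DISEASE and CHEMICAL labels of the BC5CDR chemical-disease corpus.

```python
# Entity types per training corpus, transcribed from Table 2.
# Assumption: BC5CDR is listed with DISEASE and CHEMICAL, the two
# annotation types of the BC5CDR chemical-disease corpus.
ENTITY_TYPES = {
    "CRAFT": ["GGP", "SO", "TAXON", "CHEBI", "GO", "CL"],
    "JNLPBA": ["DNA", "CELL_TYPE", "CELL_LINE", "RNA", "PROTEIN"],
    "BC5CDR": ["DISEASE", "CHEMICAL"],
    "BIONLP13CG": [
        "AMINO_ACID", "ANATOMICAL_SYSTEM", "CANCER", "CELL",
        "CELLULAR_COMPONENT", "DEVELOPING_ANATOMICAL_STRUCTURE",
        "GENE_OR_GENE_PRODUCT", "IMMATERIAL_ANATOMICAL_ENTITY",
        "MULTI-TISSUE_STRUCTURE", "ORGAN", "ORGANISM",
        "ORGANISM_SUBDIVISION", "ORGANISM_SUBSTANCE",
        "PATHOLOGICAL_FORMATION", "SIMPLE_CHEMICAL", "TISSUE",
    ],
}


def corpora_for(label):
    """Return the training corpora whose label set contains `label`.

    The comparison is case-insensitive; corpora are returned in the
    order they appear in the table.
    """
    wanted = label.upper()
    return [corpus for corpus, labels in ENTITY_TYPES.items() if wanted in labels]
```

A call such as `corpora_for("protein")` returns the corpora whose models can emit that label, which helps when deciding which model to run for a given entity type.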