Skip to main content
. 2022 Jun 11;5(2):ooac043. doi: 10.1093/jamiaopen/ooac043

Table 2.

NLP libraries for MIDD

Library name Features
Programming language Pretrained neural network models Word embeddings Multi-language support Tokenization Part-of-speech tagging Stemming/lemmatization Named entity recognition Entity resolution Sentiment analysis Relation extraction Assertion status detection Topic modeling
Spacy43 Python x x x x x x x x x
Gensim44 Python x x x x x x x
NLTK45 Python x x x x x x x x
CoreNLP46 Java x x x x x x x x x
Scispacy47 Python x x x x x x x x x
SparkNLP48 Python, Java, Scala, R x x x x x x x x x x x x
SparkNLP for healthcare49 Python, Java, Scala, R x x x x x x x x x x
Torchtext50 Python x x x x
KoRpus51 R x x x
Tensorflow52 Python x x x x x x x x x x
Scikit learn53 Python x x x
Textblob54 Python x x x x
Pattern55 Python, R x x x x x
Hugging face56 Python x x x x x
Allen NLP57 Python x x x x x x x x x
Fasttext21 Python x x x x x x
Stanza58 Python x x x x x x x
Flair59 Python x x x x x x
Fastai60 Python x x x x
Spacyr61 R x x x x x x