Skip to main content
. 2024 Jan 11;6:1292466. doi: 10.3389/frai.2023.1292466

Table 3.

Long COVID diagnosis: recently applied data mining and NLP techniques.

Study Input data AI method Task Output (%)
Miao et al. (2022) Tweets NLP Analysis of reported Long COVID symptoms in terms of demographics, geographical and temporal parameters Accuracy demographic categories 89 symptom categories 95
Zhu et al. (2022) Clinical notes Pretrained BERT Identification of Long COVID and potential computational phenotypes Sensitivity score 88.1
Scarpino et al. (2022) Blogs LDA and BERT Extract discussion topics in the Italian narration of COVID-19 pandemic Accuracy of BERT 91.97
Matharaarachchi et al. (2022) Tweets Association rule mining Relationships between symptoms Confidence 77 for lung/breathing problems and loss of taste vs. loss of smell
Wang et al. (2022) Clinical notes PASCLex (NLP) model Identification of symptoms Precision 94 recall 84
Banda et al. (2021) Tweets NLP and SVM Identification of symptoms Accuracy 75 on a 20% random held-out test set
Déguilhem et al. (2022) Tweets Biterm Topic Modeling Identification and co-occurrence of symptoms Three major symptom co-occurrences: asthenia-dyspnea 35.3, asthenia-anxiety 22.5, asthenia-headaches 17.3

The division is between BERT (the first 3 rows) and other approaches (all reported measures have the same number of decimal digits as the original paper).