[Preprint]. 2024 Dec 31:2024.12.31.24319792. [Version 1] doi: 10.1101/2024.12.31.24319792

Table 2:

Evaluation results on orthogonally generated DMS data for BioBERT-base models trained with three different text processing and sampling methods (evidence-only, rule-based filtered, and raw ClinVar data) and for pre-trained BioBERT-base.

| Model | Accuracy | Precision | Recall | F1 Score | Pair-wise AUC: P/LP vs B/LB | Pair-wise AUC: P/LP vs VUS | Pair-wise AUC: B/LB vs VUS | Avg AUC-ROC |
|---|---|---|---|---|---|---|---|---|
| BioBERT-base + ClinVar (evidence-only) | 0.4753 | 0.4930 | 0.4753 | 0.4219 | 0.9272 | 0.8043 | 0.5470 | 0.7595 |
| BioBERT-base + ClinVar (rule-based) | 0.4891 | 0.5098 | 0.4891 | 0.4399 | 0.9096 | 0.7938 | 0.5377 | 0.7470 |
| BioBERT-base + ClinVar (raw-data) | 0.4840 | 0.5306 | 0.4840 | 0.4192 | 0.9037 | 0.7882 | 0.5826 | 0.7582 |
| BioBERT-base [9] | 0.2713 | 0.0736 | 0.2713 | 0.1158 | 0.3953 | 0.5428 | 0.3953 | 0.4503 |
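
The table does not state how the Avg AUC-ROC column is derived. A minimal sketch, assuming it is the unweighted mean of the three pair-wise AUCs, reproduces the reported value for the three ClinVar-trained rows to four decimal places. The helper name `mean_pairwise_auc` and the dictionary layout are illustrative only and do not come from the paper. The pre-trained baseline row is left out because its printed pair-wise values do not average to the reported figure under this assumption.

```python
# Pair-wise AUCs copied from Table 2, in the order
# (P/LP vs B/LB, P/LP vs VUS, B/LB vs VUS), followed by the reported Avg AUC-ROC.
rows = {
    "evidence-only": ((0.9272, 0.8043, 0.5470), 0.7595),
    "rule-based": ((0.9096, 0.7938, 0.5377), 0.7470),
    "raw-data": ((0.9037, 0.7882, 0.5826), 0.7582),
}


def mean_pairwise_auc(aucs):
    """Unweighted mean of the pair-wise AUCs (an assumed definition, not confirmed by the source)."""
    return sum(aucs) / len(aucs)


for name, (aucs, reported) in rows.items():
    computed = mean_pairwise_auc(aucs)
    print(f"{name}: computed={computed:.4f} reported={reported:.4f}")
```

Under this reading, the near-chance B/LB vs VUS column pulls each average well below the P/LP vs B/LB value, so the two columns are best read together rather than relying on the average alone.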