Skip to main content
. 2021 Dec 13;17(12):e1009650. doi: 10.1371/journal.pcbi.1009650

Table 1. Summary of Training and Validation for Image Preprocessing.

1[53], 2[30].

Steps Training data Validation Data Our Performance Benchmark
Compound Figure Detection ImageClef Medical 2016 Compound Figure Task ImageClef Medical 2016 Compound Figure Task Accuracy: 92% Accuracy: 92% (Top one team in ImageClef Medical 2016)
Subfigure Separation ImageClef Medical 2016 Compound Figure Task ImageClef Medical 2016 Subfigure Separation Task Score: 83% Score: 84% (Top one team of ImageClef Medical 2016)
Chart Classification Our generated charts Revision data (not fully available) Accuracy: 100% Accuracy: 80%1
Text Localization Localization from ArXiv papers through pdffigures Our generated charts F1: 76% F1: 88%2
Text Recognition No training data (Used an open-source fine-tuned model) Our generated charts F1: Exact 82% / Edit 90% F1: Exact 95%/ Edit 98%2
Text Role Classification No training data (Used an open-source fine-tuned model) Our generated charts F1: 80% F1: 100%2