Skip to main content
. Author manuscript; available in PMC: 2023 Mar 17.
Published in final edited form as: Nat Biomed Eng. 2022 Mar 17;6(3):267–275. doi: 10.1038/s41551-022-00860-y

Extended Data Fig. 8 ∣. Correlation of F-score and r2 of the biomarker prediction models with the relative feature importance of each spectroscopic variable.

Extended Data Fig. 8 ∣

For the binary classification models (top rows), samples were divided into two groups–abnormal vs. normal levels of serum biomarkers–based on the clinical references (CA125: 50 U/mL, HE4: 150 pM, YKL40: 1650 pM) and assessed the prediction accuracy of abnormal levels of each biomarker. Feature importance of the prediction models shows which spectral parameters most impacted the model performance using an ablation study. Biomarker dependent variables that were identified in Extended Data Fig. 4 are highlighted in bold. Vertical dashed lines indicate F-score when all four spectroscopic variables (dint, dint*, dwl, and dwl*) of the OCC-DNA were included as feature vectors in the model development.