[Preprint]. 2023 Aug 24:arXiv:2308.13035v1. [Version 1]

Table 5.

Comparative performance of three different CNNs in the identification of normal versus abnormal frames in VCE footage. The Residual Network (ResNet152) model incorporates only the frame in question into its prediction. The m0110 multi-frame model relies on the frame in question plus the frame immediately preceding it. The m1110 multi-frame model incorporates the frame in question and the two preceding frames.

| Model | Accuracy | Precision | Sensitivity | F1 score | Video Prediction Time (minutes) |
|---|---|---|---|---|---|
| ResNet152 | 96.6% | 90.3% | 91.1% | 90.7% | 2.0 |
| m0110 | 97.1% | 91.0% | 93.5% | 92.3% | 1.7 |
| m1110 | 97.5% | 91.5% | 94.8% | 93.1% | 2.2 |
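
The F1 column can be cross-checked against the precision and sensitivity columns. The sketch below assumes the standard definition of F1 as the harmonic mean of precision and sensitivity (recall); the source does not state which formula was used. The function name `f1_score` and the dictionary `table5` are illustrative, and the inputs are the rounded percentages from Table 5.

```python
def f1_score(precision: float, sensitivity: float) -> float:
    """Harmonic mean of precision and sensitivity, both given in percent.

    Assumption: the standard F1 definition; the source does not spell it out.
    """
    return 2 * precision * sensitivity / (precision + sensitivity)


# (precision %, sensitivity %, reported F1 %) copied from Table 5.
table5 = {
    "ResNet152": (90.3, 91.1, 90.7),
    "m0110": (91.0, 93.5, 92.3),
    "m1110": (91.5, 94.8, 93.1),
}

# Recompute F1 for each model from the rounded table values.
recomputed = {name: f1_score(p, s) for name, (p, s, _) in table5.items()}

for name, (_, _, reported) in table5.items():
    print(f"{name}: recomputed F1 = {recomputed[name]:.2f}%, reported = {reported}%")
```

Under that assumption, the recomputed values agree with the reported ones to within 0.1 percentage points. The m0110 row recomputes to roughly 92.2% against a reported 92.3%, a gap small enough to be explained by the precision and sensitivity inputs having been rounded to one decimal place.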