[Preprint]. 2023 Aug 24:arXiv:2308.13035v1. [Version 1]

Table 5.

Comparative performance of three CNNs in classifying frames of VCE footage as normal or abnormal. The Residual Network (ResNet152) model bases its prediction on the frame in question alone. The m0110 multi-frame model uses the frame in question plus the frame immediately preceding it. The m1110 multi-frame model uses the frame in question plus the two preceding frames.

Model	Accuracy	Precision	Sensitivity	F1 score	Video Prediction Time (minutes)
ResNet152	96.6%	90.3%	91.1%	90.7%	2.0
m0110	97.1%	91.0%	93.5%	92.3%	1.7
m1110	97.5%	91.5%	94.8%	93.1%	2.2
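
As a quick consistency check on the table, the F1 column can be recomputed from the precision and sensitivity columns. The sketch below assumes the conventional definition of F1 as the harmonic mean of precision and sensitivity (recall); the source does not state its formula. The names `f1_score` and `table5` are illustrative and not taken from the paper.

```python
def f1_score(precision, sensitivity):
    """Harmonic mean of precision and sensitivity (assumed F1 definition)."""
    return 2 * precision * sensitivity / (precision + sensitivity)


# (precision, sensitivity, reported F1), all in percent, copied from Table 5.
table5 = {
    "ResNet152": (90.3, 91.1, 90.7),
    "m0110": (91.0, 93.5, 92.3),
    "m1110": (91.5, 94.8, 93.1),
}

for name, (precision, sensitivity, reported) in table5.items():
    recomputed = f1_score(precision, sensitivity)
    print(f"{name}: recomputed F1 {recomputed:.1f}%, reported {reported}%")
```

Under that assumption, each recomputed value falls within 0.1 percentage point of the reported one. Differences of that size are what one would expect when the precision and sensitivity inputs have already been rounded to one decimal place.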