Skip to main content
. 2024 Feb 9;69:102466. doi: 10.1016/j.eclinm.2024.102466

Table 2.

The performance of Deep-VCUG in the internal and external testing sets for unilateral and bilateral VUR grading.

Accuracy (95% CI) Precision (95% CI) Sensitivity (95% CI) F1 Score (95% CI) AUROC (95% CI) Specificity (95% CI)
Unilateral VUR grading (N = 374)
Internal testing set (N = 141)
 Deep-VCUG 0.805 (0.738–0.866) 0.761 (0.688–0.841) 0.805 (0.738–0.866) 0.782 (0.710–0.851) 0.962 (0.943–0.978) 0.960 (0.941–0.975)
 MobileNetv2 0.779 (0.711–0.846) 0.749 (0.683–0.828) 0.779 (0.711–0.846) 0.762 (0.695–0.832) 0.948 (0.920–0.972) 0.957 (0.941–0.972)
 GoogLeNet 0.597 (0.523–0.671) 0.518 (0.417–0.633) 0.597 (0.523–0.671) 0.529 (0.438–0.618) 0.881 (0.845–0.914) 0.878 (0.856–0.900)
 ResNet 101 0.738 (0.671–0.805) 0.729 (0.652–0.809) 0.738 (0.671–0.805) 0.731 (0.658–0.800) 0.931 (0.899–0.962) 0.951 (0.935–0.968)
 DenseNet161 0.758 (0.691–0.826) 0.783 (0.724–0.850) 0.758 (0.691–0.826) 0.762 (0.693–0.826) 0.942 (0.916–0.967) 0.955 (0.938–0.970)
 EfficientNet-B0 0.570 (0.483–0.644) 0.539 (0.438–0.656) 0.570 (0.483–0.644) 0.533 (0.444–0.616) 0.900 (0.868–0.928) 0.893 (0.871–0.915)
External testing set (N = 233)
 Deep-VCUG 0.807 (0.755–0.858) 0.827 (0.788–0.873) 0.807 (0.755–0.858) 0.807 (0.756–0.858) 0.944 (0.921–0.964) 0.958 (0.946–0.970)
 MobileNetv2 0.785 (0.734–0.837) 0.792 (0.747–0.845) 0.785 (0.734–0.837) 0.786 (0.734–0.838) 0.952 (0.933–0.968) 0.954 (0.942–0.965)
 GoogLeNet 0.790 (0.734–0.841) 0.804 (0.760–0.852) 0.790 (0.734–0.841) 0.790 (0.736–0.839) 0.954 (0.935–0.971) 0.952 (0.938–0.964)
 ResNet 101 0.764 (0.704–0.820) 0.782 (0.731–0.834) 0.764 (0.704–0.820) 0.765 (0.706–0.820) 0.933 (0.907–0.953) 0.950 (0.936–0.962)
 DenseNet161 0.768 (0.717–0.820) 0.792 (0.751–0.839) 0.768 (0.717–0.820) 0.772 (0.718–0.823) 0.943 (0.924–0.963) 0.948 (0.934–0.961)
 EfficientNet-B0 0.326 (0.266–0.391) 0.295 (0.234–0.368) 0.326 (0.266–0.391) 0.290 (0.231–0.356) 0.725 (0.684–0.768) 0.837 (0.806–0.863)
Bilateral VUR grading (N = 74)
Internal testing set (N = 27)
 Deep-VCUG 0.796 (0.685–0.889) 0.833 (0.671–0.917) 0.796 (0.685–0.889) 0.775 (0.640–0.879) 0.960 (0.922–0.983) 0.936 (0.898–0.969)
 MobileNetv2 0.741 (0.630–0.852) 0.782 (0.604–0.882) 0.741 (0.630–0.852) 0.720 (0.585–0.847) 0.936 (0.888–0.975) 0.911 (0.861–0.954)
 GoogLeNet 0.741 (0.630–0.852) 0.766 (0.590–0.869) 0.741 (0.630–0.852) 0.716 (0.581–0.840) 0.932 (0.876–0.971) 0.912 (0.863–0.951)
 ResNet 101 0.741 (0.611–0.852) 0.804 (0.647–0.886) 0.741 (0.611–0.852) 0.721 (0.583–0.847) 0.960 (0.925–0.984) 0.904 (0.851–0.950)
 DenseNet161 0.778 (0.667–0.889) 0.795 (0.657–0.913) 0.778 (0.667–0.889) 0.762 (0.632–0.880) 0.943 (0.899–0.980) 0.946 (0.910–0.976)
 EfficientNet-B0 0.667 (0.537–0.778) 0.679 (0.560–0.817) 0.667 (0.537–0.778) 0.656 (0.522–0.777) 0.919 (0.863–0.959) 0.893 (0.836–0.935)
External testing set (N = 47)
 Deep-VCUG 0.745 (0.660–0.830) 0.766 (0.671–0.852) 0.745 (0.660–0.830) 0.720 (0.617–0.819) 0.924 (0.887–0.957) 0.808 (0.738–0.874)
 MobileNetv2 0.723 (0.628–0.809) 0.740 (0.640–0.827) 0.723 (0.628–0.809) 0.699 (0.587–0.788) 0.930 (0.888–0.963) 0.799 (0.728–0.859)
 GoogLeNet 0.628 (0.521–0.734) 0.646 (0.543–0.760) 0.628 (0.521–0.734) 0.616 (0.510–0.724) 0.878 (0.834–0.918) 0.787 (0.713–0.850)
 ResNet 101 0.691 (0.606–0.777) 0.691 (0.592–0.791) 0.691 (0.606–0.777) 0.676 (0.577–0.773) 0.900 (0.858–0.937) 0.803 (0.728–0.864)
 DenseNet161 0.670 (0.574–0.766) 0.688 (0.582–0.787) 0.670 (0.574–0.766) 0.639 (0.527–0.742) 0.916 (0.881–0.949) 0.769 (0.697–0.835)
 EfficientNet-B0 0.489 (0.383–0.596) 0.511 (0.407–0.634) 0.489 (0.383–0.596) 0.493 (0.391–0.604) 0.843 (0.795–0.889) 0.730 (0.644–0.805)