Experimental design for evaluating the repeatability of CNNs for (a) slice-level classification of clinically significant prostate cancer (csPCa; Gleason grade group (GGG) > 1) versus non-csPCa regions and (b) lesion-level detection and segmentation of csPCa regions. N = 112 patients scheduled for prostatectomy underwent two prostate MR examinations (SA and SB) on the same day, approximately 15 min apart. The scans SA and SB were divided into a training set (Atrain and Btrain; N = 78) and a test set (Atest and Btest; N = 34). Two network instances, NA and NB, were trained on Atrain and Btrain, respectively, and both were evaluated on the combined test set Stest (Atest + Btest). The outputs P1 and P2 of NA and NB, respectively, on Stest were used to calculate the repeatability of the CNNs.
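
The patient-level split described in the caption can be sketched as follows. This is only an illustration under stated assumptions: the caption does not say how patients were assigned to the training and test sets, so the seeded random shuffle, the helper name `split_patients`, and the placeholder patient IDs are all hypothetical. The sketch shows only that each patient contributes one scan to A and one to B, and that both scans of a test patient end up in the combined test set.

```python
import random


def split_patients(patient_ids, n_train, seed=0):
    """Split patient IDs into train/test at the patient level.

    Assumption: a seeded random shuffle; the caption does not state
    how the actual assignment was made.
    """
    ids = sorted(patient_ids)
    random.Random(seed).shuffle(ids)
    return ids[:n_train], ids[n_train:]


# Placeholder IDs for the 112 patients (hypothetical naming).
patients = [f"P{i:03d}" for i in range(1, 113)]
train_ids, test_ids = split_patients(patients, n_train=78)

# Every patient has two same-day scans, labelled "A" and "B".
a_train = [(pid, "A") for pid in train_ids]  # used to train instance NA
b_train = [(pid, "B") for pid in train_ids]  # used to train instance NB

# Combined test set: both scans of every test patient.
s_test = [(pid, scan) for pid in test_ids for scan in ("A", "B")]
```

Splitting by patient rather than by scan keeps both scans of a patient on the same side of the split, so no test patient is seen during training by either instance.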