Results of PKRCC DD Identification on lymph_lung and large_upper data sets
x-axis: types of sample pairs based on the similarities of their class and patient. y-axis: PKRCC (Pairwise Kendall Rank Correlation Coefficient) values of each sample pair. Dots labeled in gray are not PKRCC DDs (data doppelgängers), whereas dots labeled in purple are PKRCC DDs. PKRCC DDs are sample pairs in “Same Class Different Patient” with a PKRCC value greater than the cut-off. The cut-off is the maximum PKRCC of any sample pair in “Different Class Different Patient.” The cut-off PKRCC is higher in large_upper (B) than in lymph_lung (A). In sum, 1,719 PKRCC DDs were identified within lymph_lung (A), whereas 17 PKRCC DDs were identified within large_upper (B).