Skip to main content
. 2024 Oct 9;7:1487335. doi: 10.3389/frai.2024.1487335

Figure 2.

Figure 2

Statistics of the A2H dataset. (a) The spectral biclustering (Krow = 8 and Kcol = 59) plot of the A2H dataset with |δ|<0.5σ . The vertical and horizontal red dashed lines separate column and row clusters, respectively. The features on the x-axis with colons in their names represent categorical features after one-hot encoding, and the string after the colon corresponds to the original category when the encoded feature is 1. The features without colons in their names represent numerical features, and the Min-Max scaling is performed on each numerical feature independently. (b) The distribution of the survival rate from the preclinical trial. (c) The distribution of the recovery rate from the clinical trial. (d) The distribution of delta δ=rrrs , the difference between the clinical trial recovery rate and the preclinical trial survival rate. After fitting the normal distribution δ~Nμ,σ to the delta, we label the preclinical/clinical trial pairs translation success (label 1) if δ lies between ±0.5σ around, and translation failure (label 0) otherwise. (e) Top 10 features with the highest absolute Spearman correlation coefficients for thresholds δ<0.5σ , where δ is the different between clinical recovery and preclinical survival rates, and σ is the standard deviation of δ . All the features have adjusted p-value <0.001.