Skip to main content
. 2021 May 17;22:252. doi: 10.1186/s12859-021-04163-y

Fig. 5.

Fig. 5

Learning curves generated by using dGBDT for multiple data splits of each of the datasets in Table 1. a The entire set of learning curve scores, LCraw, where each data point is the mean absolute error of predictions computed on test set E as a function of the training set size mk. A subset of scores in which the sample size is above mkmin (dashed black line) was considered for curve fitting. b Three curves were generated to represent the fit: q0.1 (blue curve) and q0.9 (green curve) representing the variability of the fit, and y~ (black curve) representing the learning curve fit