Skip to main content
. 2018 Jun;227:18–29. doi: 10.1016/j.jfoodeng.2018.01.009

Table 5.

Coffee bean classification models for species identification (Arabica-Robusta) based on Linear Discriminant Analysis (LDA) and Support Vector Machine (SVM) for green and roasted whole coffee beans. Sample size = 510 for green; 340 for roasted coffee.

Classification method Function Pre-treatment Green
Roasted
Correct (%) Incorrect (%) Correct (%) Incorrect (%)
LDA
Linear Log (1/R) 98.39 1.61 98.53 1.47
1st derivative 99.29 0.71 100.00 0.00
SNV 98.57 1.43 98.53 1.47
MSC 98.75 1.25 98.53 1.47
2nd derivative 99.61 0.39 100.00 0.00
Quadratic
Log (1/R) 100.00 0.00 100.00 0.00
1st derivative 99.80 0.20 99.12 0.88
SNV 99.80 0.20 99.71 0.29
MSC 100.00 0.00 99.41 0.59
2nd derivative
99.80
0.20
98.82
1.18
Classification method
Function
Pre-treatment
Training accuracy (%)
Validation accuracy (%)
Training accuracy (%)
Validation accuracy (%)
SVM Polynomial Log (1/R) 74.90 74.12 87.06 84.42
1st derivative 72.55 72.55 64.71 64.71
SNV 88.43 87.06 94.12 91.76
MSC 82.75 82.16 95.00 88.24
2nd derivative 72.55 72.55 64.71 64.71
Radial basis function Log (1/R) 84.90 83.92 90.29 88.24
1st derivative 91.18 91.18 88.24 88.24
SNV 97.45 97.25 98.23 97.06
MSC 94.51 94.12 95.00 94.12
2nd derivative 89.02 88.82 84.71 84.42