Skip to main content
. 2014 May;135(5):2885–2901. doi: 10.1121/1.4870484

TABLE I.

Performance of the F0 estimation algorithms (synthetic speech signals). The evaluation of the F0 estimation algorithms uses all 117 synthetic speech signals, where for each signal we use 90 F0 estimates (thus N=117×90=10530). The results are in the form mean ± standard deviation. The last four rows are the approaches to combine the outputs of the F0 estimation algorithms using the median from all algorithms, OLS, IRLS, and adaptive KF. The best individual F0 estimation algorithm and the best combination approach are highlighted in bold. The median error (ME) in the second column is used to illustrate the bias of each algorithm.

Algorithm ME (Hz) MAE (Hz) MRE (%) RMSE (Hz)
dypsa 0.02 3.79 ± 5.57 3.30 ± 5.41 7.20 ± 13.44
praat1 0.00 10.73 ± 22.09 7.42 ± 14.64 12.46 ± 22.33
praat2 0.02 6.56 ± 15.46 4.68 ± 10.26 8.81 ± 17.43
rapt −3.98 9.20 ± 8.91 6.64 ± 6.17 19.95 ± 14.85
shrp −0.23 3.67 ± 7.06 2.83 ± 5.08 7.17 ± 10.34
swipe 0.18 2.88 ± 7.10 2.37 ± 5.57 3.59 ± 7.59
yin −10.71 17.41 ± 16.87 11.90 ± 10.76 29.90 ± 22.95
ndf 0.00 2.38±6.71 1.90±4.92 3.16±7.74
tempo 0.00 2.53 ± 6.64 2.01 ± 4.87 3.34 ± 7.53
xsx 0.01 3.00 ± 7.10 2.38 ± 5.55 3.73 ± 7.58
Median −0.39 3.00 ± 7.28 2.31 ± 5.23 4.27 ± 8.91
OLS 0.02 3.49 ± 5.63 2.72 ± 4.14 4.60 ± 6.49
IRLS 0.00 2.34 ± 7.06 1.89 ± 5.21 3.34 ± 9.43
KF 0.02 2.19±6.54 1.73±4.70 2.72±6.84