Author manuscript; available in PMC: 2019 Jun 20.
Published in final edited form as: IEEE/ACM Trans Audio Speech Lang Process. 2018 May 30;26(10):1702–1726. doi: 10.1109/TASLP.2018.2842159

TABLE II.

STOI Improvements (in %) for a List of Features Averaged on a Set of Test Noises (From [34])

| Feature | Matched noise, Anechoic | Matched noise, Sim. RIRs | Matched noise, Rec. RIRs | Unmatched noise, Anechoic | Unmatched noise, Sim. RIRs | Unmatched noise, Rec. RIRs | Cochannel, Anechoic | Cochannel, Sim. RIRs | Cochannel, Rec. RIRs | Average |
|---|---|---|---|---|---|---|---|---|---|---|
| MRCG | 7.12 | 14.25 | 12.15 | 7.00 | 7.28 | 8.99 | 21.25 (13.00) | 22.93 (13.19) | 21.29 (12.81) | 12.92 |
| GF | 6.19 | 13.10 | 11.37 | 6.71 | 7.87 | 8.24 | 22.56 (11.87) | 23.95 (12.31) | 22.35 (12.87) | 12.71 |
| GFCC | 5.33 | 12.56 | 10.99 | 6.32 | 6.92 | 7.01 | 23.53 (14.34) | 23.95 (14.01) | 22.76 (13.90) | 12.50 |
| LOG-MEL | 5.14 | 12.07 | 10.28 | 6.00 | 6.98 | 7.52 | 21.18 (13.88) | 22.75 (13.54) | 21.71 (13.18) | 12.08 |
| LOG-MAG | 4.86 | 12.13 | 9.69 | 5.75 | 6.64 | 7.19 | 20.82 (13.84) | 22.57 (13.40) | 21.82 (13.55) | 11.91 |
| GFB | 4.99 | 12.47 | 11.51 | 6.22 | 7.01 | 7.86 | 19.61 (13.34) | 20.86 (11.97) | 19.97 (11.60) | 11.75 |
| PNCC | 1.74 | 8.88 | 10.76 | 2.18 | 8.68 | 10.52 | 19.97 (10.73) | 19.47 (10.03) | 19.35 (9.56) | 10.78 |
| MFCC | 4.49 | 11.03 | 9.69 | 5.36 | 5.96 | 6.26 | 19.82 (11.98) | 20.32 (11.47) | 19.66 (11.54) | 10.72 |
| RAS-MFCC | 2.61 | 10.47 | 9.56 | 3.08 | 6.74 | 7.37 | 18.12 (11.38) | 19.07 (11.19) | 17.87 (10.30) | 10.44 |
| AC-MFCC | 2.89 | 9.63 | 8.89 | 3.31 | 5.61 | 5.91 | 18.66 (12.50) | 18.64 (11.59) | 17.73 (11.27) | 9.87 |
| PLP | 3.71 | 10.36 | 9.10 | 4.39 | 5.03 | 5.81 | 16.84 (11.29) | 16.73 (10.92) | 15.46 (9.50) | 9.46 |
| SSF-II | 3.41 | 8.57 | 8.68 | 4.18 | 5.45 | 6.00 | 16.76 (10.07) | 17.72 (9.18) | 18.07 (8.93) | 9.09 |
| SSF-I | 3.31 | 8.35 | 8.53 | 4.09 | 5.17 | 5.77 | 16.25 (10.44) | 17.70 (9.40) | 18.04 (9.35) | 8.97 |
| RASTA-PLP | 1.79 | 7.27 | 8.56 | 1.97 | 6.62 | 7.92 | 11.03 (6.76) | 10.96 (6.06) | 10.27 (6.28) | 7.46 |
| PITCH | 2.35 | 4.62 | 4.79 | 3.36 | 3.36 | 4.61 | 19.71 (9.37) | 17.82 (8.45) | 16.87 (6.72) | 7.03 |
| GFMC | −0.68 | 7.05 | 5.00 | −0.54 | 4.44 | 4.16 | 5.04 (0.07) | 6.01 (0.33) | 4.97 (0.28) | 4.40 |
| WAV | 0.94 | 2.32 | 2.68 | 0.02 | 0.99 | 1.63 | 11.62 (4.81) | 11.92 (6.25) | 10.54 (1.05) | 3.89 |
| AMS | 0.31 | 0.30 | −1.38 | 0.19 | −2.99 | −3.40 | 11.73 (5.96) | 10.97 (6.76) | 10.20 (4.90) | 1.71 |
| PAC-MFCC | 0.00 | −0.33 | −0.82 | 0.18 | −0.92 | −0.67 | 0.95 (0.15) | 1.25 (0.26) | 1.17 (0.09) | −0.17 |

“Sim.” and “Rec.” Indicate Simulated and Recorded Room Impulse Responses. Boldface Indicates the Best Scores in Each Condition. In Cochannel (Two-Talker) Cases, the Performance is Shown Separately for a Female Interferer and a Male Interferer (in Parentheses) with a Male Target Talker.