Skip to main content
. Author manuscript; available in PMC: 2016 Mar 9.
Published in final edited form as: IEEE/ACM Trans Audio Speech Lang Process. 2015 Jan 14;23(1):92–101. doi: 10.1109/TASLP.2014.2372314

Fig. 2.

Fig. 2

Example of masking: (a) Log-mel spectrogram of a clean utterance. (b) Log-mel spectrogram of the utterance with reverberation. (c) Log-mel spectrogram of the utterance with noise and reverberation. The SNR, with respect to reverberant speech, is −3 dB. (d) The ideal ratio mask. (e) The IRM estimated by the independently trained mask estimator. (f) The IRM estimated by the joint model. (g) The noisy log-mel spectrogram enhanced using the estimated IRM. (h) The noisy log-mel spectrogram enhanced using the IRM estimated by the joint model.