Skip to main content
. Author manuscript; available in PMC: 2021 Mar 18.
Published in final edited form as: IEEE/ACM Trans Audio Speech Lang Process. 2019 Aug 12;27(11):1839–1848. doi: 10.1109/taslp.2019.2934319

TABLE I.

Calculation of the estimated target spectrogram in different two-stage networks. G(1)() and G(2)() denote the first and second stage DNN. μO1 and σO1 denote normalization parameters for the output of the first stage, and μO1 and σO2 the parameters for the second stage.

Combination DNN formula
Mapping+Mapping |S^1|=exp(G(2)([G(1)(F¯(m))μo1σo1,F¯(m)])×σo2+μo2)
Mapping+Masking |S^1|=G(2)([G(1)(F¯(m))μo1σo1,F¯(m)])×|Y|
Masking+Mapping |S^1|=exp(G(2)([log(G(1)(F¯(m))×|Y|)μo1σo1,F¯(m)])×σo2+μo2)
Masking+Masking |S^1|=G(2)([log(G(1)(F¯(m))×|Y|)μo1σo1,F¯(m)])×|Y|