Skip to main content
. 2021 Mar 22;8(21):16035–16046. doi: 10.1109/JIOT.2021.3067605

Fig. 7.

Fig. 7.

Scheme of the end-to-end (e2e) learning approach. DL, in essence, is a series of nonlinear transformations of the input. In the paradigm of e2e learning, higher representations can be extracted directly from the raw audio signals. The architecture of the DL models are usually deep CNN and/or RNN models.