| HCI | Human–Computer Interaction |
| SER | Speech Emotion Recognition |
| ZCR | Zero-Crossing Rate |
| MFCC | Mel-Frequency Cepstral Coefficient |
| CNN | Convolutional Neural Network |
| STFT | Short-Time Fourier Transform |
| IEMOCAP | Interactive Emotional Dyadic Motion Capture |
| FCN | Fully Convolutional Network |
| SVM | Support Vector Machine |
| RML | Ryerson Multimedia Laboratory database |
| LIF | Local Invariant Features |
| SAVEE | Surrey Audio-Visual Expressed Emotion database |
| HMM | Hidden Markov Model |