| VR | Virtual reality |
| AR | Augmented reality |
| ITD | Interaural time difference |
| ILD | Interaural level difference |
| DSP | Digital signal processing |
| HRIRs | Head-related impulse responses |
| BRIRs | Binaural room impulse responses |
| HRTFs | Head-related transfer functions |
| CP | Common portion |
| DP | Differential portion |
| POSA | POS-ORI self-attention module |
| GCFM | Gated-Conv fusion module |
| DS | Downsampling operation |
| US | Upsampling operation |
| SA | Self-attention mechanism |
| CA | Cross-attention mechanism |
| Pos | Position |
| Ori | Orientation |
| MSE | Mean squared error |
| STFT | Short-time Fourier transform operation |