| CNNs | Convolutional Neural Networks |
| CAD | Computer-aided diagnosis |
| TM | Tympanic membrane |
| CSOM | Chronic Suppurative Otitis Media |
| OME | Otitis Media with Effusion |
| GM | Granular Myringitis |
| W-MSA | Window-based Multi-head Self-Attention |
| FC | Fully Connected Laye |
| LN | Layer Normalization |
| SW-MSA | Shifted Window-based Multi-head Self-Attention |
| MLP | Multi-Layer Perceptron |
| CI | Confidence Interval |
| SD | Standard Deviation |
| LDAM (DRW) | Label-Distribution-Aware Margin Loss with Deferred Re-Weighting training strategy |
| CB-Sampling | Class-Balanced Sampling |
| CB-Loss (Focal) | Class-Balanced Loss (based on Focal Loss) |
| Grad-CAM | Gradient-weighted Class Activation Mapping |
| STMC | Stable Tympanic Membrane Condition |