| ML | Machine Learning |
| SVM | Support Vector Machine |
| RF | Random Forest |
| NN | Neural Network |
| LR | Linear Regression |
| AUC-ROC | Area Under the Receiver Operating Characteristic Curve |
| CNN | Convolutional Neural Network |
| LSTM | Long Short-Term Memory |
| LLM | Large Language Model |
| PPI | Protein–Protein Interaction |
| XGBOOST | eXtreme Gradient Boosting |
| PCA | Principal Component Analysis |
| PSSM | Position Specific Scoring Matrix |
| MNC | Mononucleotide Composition |
| DNC | Dinucleotide Composition |
| TNC | Trinucleotide Composition |
| AAC | Amino Acid Composition |
| DPC | Dipeptide Composition |
| TPC | Tripeptide Composition |
| CTD | Composition, Transition, and Distribution |
| CT | Conjoint Triad |
| AC | Auto-Covariance |
| CC | Cross-Covariance |
| DAC | Dinucleotide Auto-Covariance |
| TAC | Trinucleotide Auto-Covariance |
| DCC | Dinucleotide Cross-Covariance |
| TCC | Trinucleotide Cross-Covariance |
| DACC | Dinucleotide Auto- and Cross-Covariance |
| Pse-PSSM | Pseudo Position Specific Scoring Matrix |
| SPSSC | Split Protein Secondary Structure Composition |
| TS | Triplet Structure |
| MLM | Masked Language Modeling |
| PLM | Protein Language Model |
| GO | Gene Ontology |
| TAPE | Tasks Assessing Protein Embeddings |
| SHAP | SHapley Additive exPlanations |
| LIME | Local Interpretable Model-agnostic Explanations |