The article by Plekhanova, Nuzhdin, Utkin, and Samsonova (2018) contains an error in the formula of weights of each sample from source data in the transductive transfer learning method. The correct text should read as:
Let be source data with N columns and M rows; be target data with N columns and L rows, xij – an element from the ith row and jth column of data ; ms, mT – vectors, whose elements are the medians of the columns of and , respectively. Then the weight of each sample from source data is
where is source data shifted towards the target data median Shifting towards the median accounts only for rotation of the source over the target data and not for the linear shift between them.
Additionally, in Table 3, the classification quality of mouse data for classifiers trained on HumVar without weights was presented incorrectly. The correct values are listed in Table 3 here.
Table 3.
Comparative analysis of classification quality (accuracy on dog and mouse data) for classifiers, trained on source data with weights (+TL) and without weights (—). The maximal accuracy values achieved are shown in bold
Dog | Mouse | |||||||
---|---|---|---|---|---|---|---|---|
Trained on HumDiv | Trained on HumVar | Trained on HumDiv | Trained on HumVar | |||||
Classifier | +TL | — | +TL | — | +TL | — | +TL | — |
Random Forest | 0.855 | 0.638 | 0.884 | 0.879 | 0.844 | 0.679 | 0.841 | 0.828 |
Polynomial SVM | 0.874 | 0.657 | 0.715 | 0.454 | 0.748 | 0.528 | 0.812 | 0.706 |
Gaussian SVM | 0.686 | 0.662 | 0.754 | 0.618 | 0.613 | 0.560 | 0.833 | 0.674 |
Logistic Regression | 0.908 | 0.667 | 0.855 | 0.700 | 0.772 | 0.565 | 0.875 | 0.851 |
Linear SVM | 0.672 | 0.672 | 0.715 | 0.715 | 0.578 | 0.578 | 0.851 | 0.851 |
REFERENCE
- Plekhanova, E. , Nuzhdin, S. V. , Utkin, L. V. , & Samsonova, M. G. (2018). Prediction of deleterious mutations in coding regions of mammals with transfer learning. Evolutionary Applications, 12(1), 18–28. 10.1111/eva.12607 [DOI] [PMC free article] [PubMed] [Google Scholar]