Overview of our matrix decomposition model for predicting effective drug-virus associations
Totals of 850 associations for different BSAs and distinct viruses were collected from the Andersen et al.17 database. The observed associations were arranged into an matrix Y by setting . Unobserved associations were encoded with zeros. Our algorithm decomposes the matrix Y into the product of two matrices, P (of size ) and Q (of size ). By multiplying the matrices P and Q, we obtain , which models Y, where all the entries are replaced with real numbers—these correspond to our predicted scores. Rows of P are the BSA feature vectors (or BSA signature); columns of Q are the virus feature vectors (virus signature). The lower illustration depicts how our model discovers a low-dimensional signature vector for the antiviral drug zanamivir, and a low-dimensional signature vector for SARS-CoV-2. The dot product of these two signatures is the predicted efficacy of zanamivir against SARS-CoV-2.