Skip to main content
. 2021 Nov 9;3(1):100396. doi: 10.1016/j.patter.2021.100396

Figure 2.

Figure 2

Overview of our matrix decomposition model for predicting effective drug-virus associations

Totals of 850 associations for n=126 different BSAs and m=80 distinct viruses were collected from the Andersen et al.17 database. The observed associations were arranged into an n×m matrix Y by setting yij=1. Unobserved associations were encoded with zeros. Our algorithm decomposes the matrix Y into the product of two matrices, P (of size n×k) and Q (of size k×m). By multiplying the matrices P and Q, we obtain Yˆ, which models Y, where all the entries are replaced with real numbers—these correspond to our predicted scores. Rows of P are the BSA feature vectors (or BSA signature); columns of Q are the virus feature vectors (virus signature). The lower illustration depicts how our model discovers a low-dimensional signature vector for the antiviral drug zanamivir, and a low-dimensional signature vector for SARS-CoV-2. The dot product of these two signatures is the predicted efficacy of zanamivir against SARS-CoV-2.