Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2023 Apr 12;18(4):e0282042. doi: 10.1371/journal.pone.0282042

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© 2023 Lee et al

This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

PMC Copyright notice

Fig 3 — (A) Training dataset construction. Transcriptome profiles were obtained from the L1000 array data and then aggregated to generate a representative target vector. A mol2vec method was used to generate representative vectors for compounds. DTIs with modes of action were collected from the TTD. The original dataset was constructed by selecting activatory and inhibitory DTI pairs that include a compound for which ECFPs can be calculated and an original target (i.e., a target for which genetically perturbed transcriptome data are available). The additional dataset was constructed by selecting activatory and inhibitory DTI pairs that include a compound for which ECFPs can be calculated and an additional target (i.e., a target for which inferred transcriptome data are available). (B) Independent dataset construction. Two independent datasets, Drugbank and LIT-PCBA datasets, were constructed to evaluate the reliability of predictions for unseen DTI in training datasets.