Skip to main content
. 2017 Jan 27;114(7):1708–1713. doi: 10.1073/pnas.1620645114

Table S1.

Screening summary

Library ID Library size, in millions Valid sequence reads* Minimum count/enrichment Total hits Resynthesized compounds
DL5c_260 96 1,011,059 ≥5/475-fold 39 4
DL7a_623 85 982,396 ≥5/433-fold 175 8
DL11a_629 9 866,825 ≥25/260-fold 180 4
*

Each full-length sequence read was matched against a sequence database representing all possible library encoding combinations to translate each DNA molecule into its corresponding set of chemical identifiers. Only those reads for which all encoding elements could be conclusively identified were considered valid (excluding reads that represented PCR-derived duplicates).

“Minimum Count/Enrichment” is an arbitrary cutoff used to increase the ratio of signal to noise in the postselection dataset by filtering out compounds that were observed infrequently and generally shared no structural similarities. “Count” represents the number of valid sequence reads observed for a given compound. “Enrichment” represents the calculated change in frequency from baseline for each compound, where baseline was assumed to equal 1/“library size.”