Skip to main content
. Author manuscript; available in PMC: 2023 Jun 15.
Published in final edited form as: J Mol Biol. 2022 Feb 25;434(11):167514. doi: 10.1016/j.jmb.2022.167514

Table 1.

Record counts in PubChem data collections as of November 23, 2021. Current statistics can be found at the PubChem Statistics page (https://pubchemdocs.ncbi.nlm.nih.gov/statistics).

Data Collection Live Count Description
Substance 277,195,271 Descriptions about chemical entities provided by PubChem contributors
Compound 111,050,895 Unique chemical structures extracted from PubChem Substance records
BioAssay 1,391,562 Biological assay descriptions and test results, provided by PubChem contributors
Proteins 97,652 Protein targets tested in PubChem BioAssays and those involved in PubChem Pathways
Genes 89,270 Gene targets tested in PubChem BioAssays and those involved in PubChem Pathways
Pathway 238,597 A series of actions among molecules (chemicals, genes, and proteins) in a cell that leads to a certain product or a change in a cell.
Taxonomy 8,841 Organisms of targets tested in PubChem BioAssays and those involved in PubChem Pathways