Skip to main content

PMC Open Access Subset

The PMC Open Access Subset includes millions of journal articles and preprints that are made available under license terms that allow reuse. Not all articles in PMC are available for text mining or other reuse; many are under copyright. Articles in the PMC Open Access Subset are made available under Creative Commons or similar licenses that allow more liberal redistribution and reuse than a traditionally copyrighted work. The PMC Open Access Subset is one part of the PMC Article Datasets.

Tip icon
  • Not all articles in PMC are available for text mining and other reuse.
  • The PMC Cloud Service, PMC OAI-PMH Service, PMC FTP Service, E-Utilities and BioC API are the only services that may be used for automated retrieval of PMC content. Systematic retrieval (or bulk retrieval) of articles through any other automated process is prohibited.
  • License terms vary. Please refer to the license statement in each article for specific terms of use.
  • Users of this dataset are directly and solely responsible for compliance with copyright restrictions and are expected to adhere to the terms and conditions defined by the copyright holder (see the PMC Copyright Notice).

File Packaging

Files for the PMC Open Access Subset are available for automated retrieval in several types of packages:

Details about the files and directory structure are available on the FTP Service page and the Cloud Service page.

Find all Open Access Subset articles in:

Learn about additional search filters that restrict results to certain license types.

Retrieval Methods

The PMC Open Access Subset articles and related metadata are available for retrieval via

Terms of Use

Within the PMC Open Access Subset, there are three groupings by terms of use:

  • Commercial Use Allowed - CC0, CC BY, CC BY-SA, CC BY-ND licenses;
  • Non-Commercial Use Only - CC BY-NC, CC BY-NC-SA, CC BY-NC-ND licenses; and
  • Other - no machine-readable Creative Commons license, no license, or a custom license. NOTE: Distribution of articles in this group is limited on the PMC Cloud Service. See the section on PMC COVID-19 Collection for more information.

CC0, CC BY, CC BY-NC, etc. are common abbreviations used to indicate a type of Creative Commons license.

To retrieve the complete PMC Open Access Subset, you must retrieve packages from all of these groupings.

PMC COVID-19 Collection Articles

Some articles in the PMC COVID-19 Collection were made available through the PMC Open Access Subset under license terms that expired at the end of the public health emergency declaration and are no longer available through the FTP Service and Cloud Service. To download a list of PMCIDs that are no longer available under license terms allowing for re-use, see the FAQ item “Where can I find a list of articles removed from PMC or the PMC Open Access subset at the end of the Public Health Emergency?".

How to Cite

  • PMC Open Access Subset [Internet]. Bethesda (MD): National Library of Medicine. 2003 - [cited YEAR MONTH DAY]. Available from https://pmc.ncbi.nlm.nih.gov/tools/openftlist/.