Skip to main content

PMC Author Manuscript Dataset

The PMC Author Manuscript Dataset (“Dataset”) consists of all author manuscripts that have been made available in PMC in compliance with the NIH Public Access Policy or similar policies of other funders since July 2008. The text of manuscripts in the Dataset may be retrieved in XML and plain text formats using the retrieval methods described below.

Tip icon
  • Not all articles in PMC are available for text mining and other reuse.
  • The PMC Cloud Service, PMC OAI-PMH Service, PMC FTP Service, E-Utilities and BioC API are the only services that may be used for automated retrieval of PMC content. Systematic retrieval (or bulk retrieval) of articles through any other automated process is prohibited.
  • License terms vary. Please refer to the license statement in each article for specific terms of use.
  • Users of this dataset are directly and solely responsible for compliance with copyright restrictions and are expected to adhere to the terms and conditions defined by the copyright holder (see the PMC Copyright Notice).

Retrieval Methods

February 12, 2026: Changes to PMC Article Datasets Distribution Services Coming in 2026

PMC will make major changes to our Article Dataset Distribution Services in 2026. In August 2026, you will need to access full text article data files through the PMC Cloud Service instead of the PMC FTP Service. This change will provide you with more reliable performance, faster retrieval times, and greater flexibility in retrieving only the types and number of files you wish to work with.

Since this may impact operational workflows, we are providing a transition period from February to August. During this time, the FTP Service, OA Web Service API, and the current PMC Cloud Service files will remain available concurrently with the updated PMC Cloud Service on AWS.

For complete details about this transition, please see the NCBI Insights blog post and our documentation on Accessing PMC Article Datasets Using Amazon Web Services

The Author Manuscript Dataset is available via:

Terms of Use

Author manuscripts with specific licenses may be used according to the terms of their licenses. All other author manuscript files are available for text mining. They may also be used consistent with the principles of applicable copyright law.

How to Cite

  • NIH NLM NCBI PubMed Central (PMC) Article Datasets - Full-Text Biomedical and Life Sciences Journal Articles on AWS was accessed on DATE from https://registry.opendata.aws/ncbi-pmc.