Do text mining / retrieving full text
The majority of articles in PMC are subject to traditional copyright restrictions, and are not available for downloading in bulk. However, we do have several large datasets of journal articles and other scientific publications made available for automated retrieval under license terms that generally allow for more liberal redistribution and reuse than a traditional copyrighted work. We provide multiple ways of programmatically retrieving the full text as described on the PMC Article Datasets page.
NLM provides cloud service access to the PMC Open Access Subset and the PMC Author Manuscript Dataset for faster retrieval. As part of this service, content from these datasets is accessible to users on Amazon Web Services (AWS), without charge, through either an HTTPS or S3 URL, and without any log-in requirement for retrieval. Cloud Service documentation is available on the PMC Cloud Service and Accessing PMC Article Datasets Using AWS pages.