Skip to main content
. 2021 Jun 30;26(11):2593–2607. doi: 10.1016/j.drudis.2021.06.009

Table 3.

Publicly available FDA documents for promoting AI-powered LMs in regulatory applications.

Data sets Descriptions Potential use in LMs URL of data files
Drug labeling Drug labeling comprises a summary of information for safe and effective use of the drug, which is proposed by manufacturer and approved by FDA Drug labeling could be a useful resource (>120 000 product labelings) to develop biomedical named-entity recognition/normalization, and relation extraction between drug and AEs, drug–drug interaction, etc. https://dailymed.nlm.nih.gov/dailymed/spl-resources-all-drug-labels.cfm
FAERS FAERS is a database that contains information on AE and medication error reports submitted to FDA FAERS is designed to support the post-marketing safety surveillance program for drug and therapeutic biologic products of the FDA. There are more than 19 million case reports in FAERS; AI-powered LMs could be applied to carry out AE detection, causal relationship extraction, etc. https://fis.fda.gov/extensions/FPD-QDE-FAERS/FPD-QDE-FAERS.html
Orange Book Orange Book identifies drug products approved on basis of safety and effectiveness by FDA under the Federal Food, Drug, and Cosmetic Act and related patent and exclusivity information Orange book provides crucial regulatory information, such as biological equivalence, reference listed drug (RLD), Reference Standard (RS), and patent status. This information could be included in AI-powered LMs to compare drug product information with RLD and RS to facilitate abbreviated new drug application (ANDA) submissions www.fda.gov/drugs/drug-approvals-and-databases/orange-book-data-files
Drugs@FDA Drugs@FDA includes most drug products approved since 1939. Most patient information, labels, approval letters, reviews, and other information are available for drug products approved since 1998 Drugs@FDA provides rich information on drug approval history, which could be used as AI-powered LMs to explore underlying reasons for labeling changes and increase business success www.fda.gov/drugs/drug-approvals-and-databases/drugsfda-data-files
FDA Guidance Documents Guidance documents describe FDA’s interpretation of policy on a regulatory issue (21 CFR 10.115(b)). These documents usually discuss more specific products or issues that relate to design, production, labeling, promotion, manufacturing, and testing of regulated products FDA Guidance Documents could be useful to implement AI-powered LMs for standardizing and monitoring crucial steps in drug discovery and development in terms of their consistency and alignment with regulatory requirements www.fda.gov/regulatory-information/search-fda-guidance-documents
FDA Acronyms and Abbreviations FDA Acronyms and Abbreviations database provides a quick reference to acronyms and abbreviations related to FDA activities Emphasis of FDA Acronyms and Abbreviations is on scientific, regulatory, government agency, and computer application terms. The database includes some FDA organizational and program acronyms. It is a useful resource to define vocabularies in AI-powered LMs and increase model generalization www.accessdata.fda.gov/scripts/cder/acronyms/index.cfm