Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Sep 3.
Published in final edited form as: FASEB J. 2019 Dec 10;34(1):41–65. doi: 10.1096/fj.201902143R

Improving Natural Product Research Translation: from Source to Clinical Trial

Barbara C Sorkin 1,*, Adam J Kuszak 2, Gregory Bloss 3, Naomi K Fukagawa 4, Freddie Ann Hoffman 5, Mahtab Jafari 6, Bruce Barrett 7, Paula N Brown 8, Frederic D Bushman 9, Steven Casper 10, Floyd H Chilton 11, Christopher S Coffey 12, Mario G Ferruzzi 13, D Craig Hopp 14, Mairead Kiely 15, Daniel Lakens 16, John B MacMillan 17, David O Meltzer 18, Marco Pahor 19, Jeffrey Paul 20, Kathleen Pritchett-Corning 21, Sara K Quinney 22, Barbara Rehermann 23, Kenneth DR Setchell 24, Nisha S Sipes 25, Jacqueline M Stephens 26, D Lansing Taylor 27, Hervé Tiriac 28, Michael A Walters 29, Dan Xi 30, Giovanna Zappalá 31, Guido F Pauli 32
PMCID: PMC7470648  NIHMSID: NIHMS1610706  PMID: 31914647

Abstract

While great interest in health effects of natural product (NP) including dietary supplements and foods persists, promising preclinical NP research is not consistently translating into actionable clinical trial (CT) outcomes. Generally considered the gold standard for assessing safety and efficacy, CTs, especially phase III CTs, are costly and require rigorous planning to optimize the value of the information obtained. More effective bridging from NP research to CT was the goal of a September, 2018 transdisciplinary workshop. Participants emphasized that replicability and likelihood of successful translation depend on rigor in experimental design, interpretation, and reporting across the continuum of NP research. Discussions spanned good practices for: NP characterization and quality control; use and interpretation of models (computational through in vivo) with strong clinical predictive validity; controls for experimental artefacts, especially for in vitro interrogation of bioactivity and mechanisms of action; rigorous assessment and interpretation of prior research; transparency in all reporting; and prioritization of research questions. NPCTs prioritized based on rigorous, convergent supporting data and current public health needs are most likely to be informative and ultimately affect public health. Thoughtful, coordinated implementation of these practices should enhance the knowledge gained from future NP research.

Keywords: rigor and replicability, model systems, clinical predictive validity, value of information, dietary supplements

INTRODUCTION

Background

Research on health effects of natural products (NPs) has often stemmed from traditional use, or from evidence that higher consumption of plant foods is associated with better health outcomes (see (1) and refs. therein). This research has resulted in the successful development of many active ingredient-based pharmaceuticals from NPs. In contrast to a typical drug development pathway, studies based on traditional uses or epidemiological data rarely begin with a precise understanding of the active molecules or their molecular mechanisms of action. A growing number of recent efforts to elucidate the mechanisms of action of traditionally used NPs have reported finding a complex mixture of bioactive constituents that act at multiple targets (25). Perhaps partly as a result of this still incompletely understood complexity, to date many NIH-supported clinical trials (CTs)1 of complex NPs (NPCTs), in particular the majority of large, randomized, controlled clinical trials (RCTs), have failed to reject the “null hypothesis,” i.e., they detected no significant difference between the intervention and negative controls (68). While CTs that fail to reject their null hypothesis can still provide invaluable guidance, the results of such trials are often met with concerns that different design choices – whether of product, dose, timing, participant eligibility criteria, outcomes, etc. – might have yielded a substantially different result. Each interventional trial represents a substantial investment, with well-powered Phase III CTs typically consuming more than $10 million and many person-years (9, 10). Additionally, as time and funds are limited, and participant pools may also be constrained, a decision to invest in a CT often precludes pursuing other research questions (11), which may provide more useful public health-relevant knowledge than an RCT with an insufficient evidence base.

Definitions

Natural products

NPs were defined as including materials such as animals, marine organisms, plants/botanicals/herbals, macroscopic fungi, bacteria, probiotics, and minerals, and materials derived from them, including isolates, extracts, vitamins, and amino acids. NPs may vary in chemical complexity, ranging from crude to extensively purified materials, and were considered to include dietary supplements, certain foods, traditional medicines, and other products derived from multiple natural sources.

Complex NPs

NP preparations which retain a large number of chemical constituents whose relative concentrations and chemistry (metabolome composition) reflect the inherent variability of the sources, as opposed to products consisting of a very limited number of highly purified NP-derived constituents.

Challenges to be addressed

For investigation of traditionally used NP, research design may be complicated by the chemical complexity and inherent variability of the NPs (1214). The often incomplete and/or conflicting information on mechanisms of action, appropriate uses, and expected health effects poses challenges for formulating hypotheses and designing CT. The ready availability of a variety of preparations of similar NPs and their occurrence in complex matrices such as foods or dietary supplements further increases the challenges for the design, conduct, and interpretation of NPCTs (1517). The variety of NPs in the marketplace poses challenges for product selection; in addition, participants must be monitored for inadvertent or otherwise unreported exposure to the intervention or components thereof (drop-ins). Here, consistent with the NIH definition, we use the term CT not only for efficacy trials, but also for any study in humans which assesses NP safety, bioavailability, PK, PD, dose-response, mechanisms of action, and biological markers thereof.

Focus and objectives

To discuss good practices for addressing these challenges, a two-day, public workshop on September 13 and 14, 2018, brought together speakers and attendees across disciplines and from academia, government, industry, and private practice at the NIH campus in Bethesda, Maryland. Past efforts to enhance rigor in NP research include the NCCIH Policy on Natural Product Integrity (18) as well as numerous guidance documents addressing identification, reproducibility, study design, and execution, as well as conflict of interest issues (6, 1927). As was done for a recent workshop focused on fatty acids (28), we assemble here relevant prior publications citing critical or state-of-the-art approaches drawn from a breadth of NP research and related disciplines, and address in greater depth NP- related topics that were specific foci of the workshop.

Reproducibility, replicability, and other critical characteristics of a solid evidential foundation, have been variously defined in different sub-disciplines (29, 30). Consistent with the 2019 NASEM Report on Reproducibility and Replicability in Science (31), this article uses the term reproducibility for “obtaining consistent results using the same input data, computational steps, methods, and code, and conditions of analysis,” and the term replicability for “obtaining consistent results across studies aimed at answering the same scientific question, each of which has obtained its own data.”

The workshop was predicated on the premise that adherence to a well-integrated set of good practices along the continuum from ethnobotany and epidemiology through basic biomedical research to CTs should enhance the effectiveness of translation from pre-clinical research and early phase CTs to information usable by consumers, manufacturers, practitioners, research funders, and other stakeholders. The good practices should also help reduce premature initiation of Phase III NPCTs, thus reducing instances producing strong grounds for post-trial concerns about the evidence base for a given trial. This article summarizes and updates the workshop discussions of such practices, which include those that apply to individual sub-disciplines, as well as productive combination of approaches across disciplines. The integration of all of these practices, along with the application of sound inductive reasoning, rigorous, appropriate, and transparent statistical analyses (Table 1) (3236), as well as targeting an effect size that is significant to clinicians and consumers (37) should increase the replicability and translational relevance of basic and pre-clinical NP research. Ultimately, this will serve to limit resources expended on flawed premises producing irreplicable data (3840), and to ensure optimal knowledge gained from every NPCT, regardless of whether the results are consistent with benefit, harm, or absence of an effect on the assessed outcome(s).

TABLE 1.

Highlights of broadly applicable good practices for enhanced replicability, transparency, and translational relevance of NP research

1. Models, by definition, differ from the actual subject of interest, however, when carefully chosen to address a clearly defined experimental question they can provide critical insights (236). Sound inductive reasoning, including careful consideration of differences between the model and the actual subject of interest, must be used in interpreting model system outcomes, for example, response in an experimental model may demonstrate a biologically relevant activity, but does not imply clinical efficacy.
2. Quality control. On-going, discipline-appropriate quality control is critical for all experimental assays and research components, from NP, through cell lines, microphysiological systems (58), in silico and whole-organism models, to CT procedures.
3. Replication is critical: within and across distinct model systems, with methods based on orthogonal principles, and in multiple labs. Results that replicate across models, methods, labs, are more likely to replicate in a CT.
4. Rigor. The same rigorous practices used in CT to avoid bias and enhance replicability should be applied in hypothesis-testing experiments that may be used in designing a CT. This includes randomization of wells/dishes/animals, masking/blinding of study personnel, and sample size calculations. Exploratory studies merit equally complete and transparent reporting but may accommodate less stringent procedures.
5. Reporting. Use approaches and venues that support more complete, transparent, and cumulative reporting.
 a. Registration: use registered reports and pre-publication registries for preclinical in vivo studies as well as CT, e.g., clinicaltrials.gov and preclinicaltrials.eu. Information needed to replicate research on complex NPs may go substantially beyond the intervention characterization required by these sites, see, e.g., nccih.nih.gov/research/policies/naturalproduct.htm
 b. Outlet: publish in journals that do not limit supplemental information.
 c. Publish exploratory studies (without forcing them into hypothesis-testing frameworks) and null results of hypothesis-testing studies.
 d. Openness: curate key information for scientific research in publicly accessible databases, e.g., curatescience.org.

This review summarizes areas of both consensus and contention that emerged from the workshop discussions. While the workshop focused on approaches to improve the knowledge gained from future randomized, controlled, NPCTs, some attendees questioned the appropriateness of and need for such trials, especially for traditionally used botanical preparations, arguing that beneficial effects of such practices cannot be detected in the context of a conventional CT.

Moving from a narrower to a broader and more applied focus, this review begins with good practices for the application of assays used to elucidate NP bioactive constituents, and continues through the specification and characterization of NP interventions, endogenous and exogenous factors that may influence the translational relevance of preclinical in vivo models, and the strengths and weaknesses of a broad range of state-of-the-art models. Subsequently, the review will address additional factors that influence the likelihood of successful translation to humans, including assessment of the supporting research, and concludes with approaches for evidence-based prioritization of this research. The review is organized into four main sections:

  • Good practices for preclinical NP studies

  • NP characteristics critical for replicability

  • Translational relevance of preclinical NP research

  • Good practices for moving to NPCTs

Figure 1 provides a graphical overview of the relationships among the covered topics, and Table 1 highlights broadly applicable good practices. The subsequent tables highlight good practices from each of the four main sections:

Figure 1. Critical information for the design, implementation and interpretation of natural product clinical trials.

Figure 1.

Elucidating details (light bulbs) of molecular mechanisms of action prior to initiating a NPCT can help “de-risk” the planned trial by both providing evidence for the hypothesized outcome and its underlying causes, and supporting the development of methods to test that hypothesis, e.g., by measuring in vivo target engagement. It is good practice for researchers to establish standard procedures for product preparation (A), that are optimized for bioactivities and stability, and methods, preferably validated, for thorough characterization of product chemistry and stability. Elucidation of the bioactive NP constituents or metabolites (B) is critical for quality control, for assessing bioavailability, PK and PD, and determining whether minimal effective concentrations of bioactives have been achieved in plasma or at the targets (C). These data can support the development of methods to test mechanistic hypotheses, e.g., by measuring in vivo engagement of the putative targets, proximate bioactivities, or PD, and/or assessing the relationships between these and (D) the primary and secondary NPCT outcomes, and any adverse events.

  • Table 2 highlights good practices for the use of in vitro and in silico models,

  • Table 3 highlights good practices for NP selection, formulation and characterization,

  • Table 4 highlights good practices for the use of preclinical in vivo models,

  • Table 5 highlights critical considerations in NPCT design,

  • Table 6 highlights issues to consider in NPCT decision-making.

GOOD PRACTICES FOR PRECLINICAL NP STUDIES

Optimizing the use of in vitro, invertebrate, and in silico models and ethnobotanical approaches

Good practices in model systems (in vitro, invertebrate, in silico) and in translating from traditional medical systems2 to clinical research can enhance the replicability and translational relevance of preclinical NP studies. To date, the application of invertebrate model systems and some newer in vitro models has been very limited in NP research.

The measurement of activity in in vitro assays is a typical starting point for research on potential clinical application(s) of a NP. Good practices in proceeding from other starting points are discussed below. In vitro assays are also commonly used for monitoring the stability and potency of NPs, as well as for identifying fractions or components responsible for the assayed bioactivities. The replicability of these assays depends on many inputs, beginning with the test article, which often is a chemically complex NP. Good practices to enhance the replicability of vertebrate cell culture-based assays have been described elsewhere (38, 39, 41), as have recommendations for the use of antibodies (42). The following, therefore, focuses on issues more specific to the use of NPs in cell-based assays, and on approaches to enhance the translational relevance of those assays.

Data generated by high-throughput screens (HTS) and in vitro bioactivity assays pose many challenges for interpretation, importantly including their high potential for false discovery. Small molecule NP constituents, their spontaneous breakdown products and/or metabolites (43, 44) have significant potential to generate in vitro assay artefacts via aggregation (45), fluorescence, light absorption, metal chelation, plasma or intracellular membrane disruption (46, 47), protein denaturation, and/or specific and non-specific reactivity with biological nucleophiles (Table 2). For example, HTS and other in vitro studies of extracts and other preparations derived from Curcuma longa (turmeric; often improperly designated as “curcumin”) are susceptible to confounding due to the low stability (44) and high reactivity of the curcuminoids, as well as the frequent lack of clarity as to the actual composition of a given preparation (48, 49). Workshop speakers emphasized the need to design and conduct in vitro experiments in ways that mitigate and/or reveal such artefacts (41, 50, 51), e.g., by time course and chemical analyses, as well as by comprehensive and transparent reporting of experimental parameters.

TABLE 2.

Highlights of good practices for the translational relevance of in vitro models assessing NP bioactivities and mechanisms of action

1. Artefacts. When using in vitro assays of NP activity, test for common types of NP assay interference, e.g., fluorescence or absorption, membrane disruption, protein denaturation, and specific and non-specific reactivity with nucleophiles (48).
2. Bioavailability. If in vitro activity is observed only at concentrations substantially higher than those achievable in vivo in the tissue of interest, then the probability of seeing a robust clinical effect of consuming the NP is likely insufficient to form the basis for a NPCT.
3. Non-specific molecular mechanism(s) of action. Include controls for non-classical mechanisms of action such as metal chelation (87), modulation of DNA modifying enzymes, or intercalation into plasma or subcellular membranes (46, 87).
4. Biomimetic in vitro models. Consider pros and cons of more morphologically detailed “3D” models using human-derived cells. Organoids, “organs-on-a-chip” (OoC), and multi-organ biomimetic models (e.g., human-derived intestine, liver, and kidney OoC coupled by appropriate microfluidics) can yield more translationally relevant high-throughput information than monolayer cell cultures, and are a powerful complement to preclinical in vivo models.

In vitro models

In vitro approaches leverage simplified models of more complex biological systems in the interests of higher throughput and increased ability to decipher causal mechanisms (Table 1). In vitro models that better reflect the biology of an intact organism have been developed, taking advantage of advances including the use of primary, patient-derived cells and stem cells, direct differentiation of induced pluripotent stem cells, more precise methods of gene replacement, and identification and quantification of heterogeneity in vivo and in vitro. In vitro models are also increasingly leveraging advances in tissue engineering, microfluidics, and real-time biosensors of physiological processes. These in vitro models range in the extent to which they mimic in vivo conditions from relatively simple spheroids (52), through patient tissue-derived organoids, self-organizing stem-cell-based organoids (52, 53), single “organs on chips” (OoC) under flow, and integrated, multi-OoC microphysiology systems (MPS)(5460). While still relatively high-throughput, these models from both healthy and diseased tissues have greater similarity to the chemistry, three-dimensional structure and component localization, environment, and possibly genetic stability of an intact organism than classical cell cultures grown on relatively flat substrates. Moreover, through the use of human-derived cells, these models may partially address the challenge of species differences in responses (Table 2) (6164). MPS also allow targeted assessment of cell-type- or organ-specific dose responses (Table 4). Some major challenges in the use of MPS and other complex in vitro systems have recently been addressed, including the development and implementation of metrics for replicability and approaches to prevent the artefactual binding of hydrophobic molecules to the devices (59, 65). Models incorporating elements of the human immune system are beginning to be developed (66). However, understanding how to translate results from in vitro systems such as MPS to CT, e.g., into calculated dose ranges remains challenging.

Only those effects observed in vitro at concentrations achievable in vivo following ingestion, or in systems that incorporate upstream components modeling the functions of the gastrointestinal tract (67), are likely to be clinically relevant to oral dosing. Activities observed only at higher concentrations than can be obtained following ingestion should not be over-interpreted in planning CTs of ingested products (Table 2). Notably, many NP constituents show very limited oral bioavailability. Ex vivo models may be used for extensive pharmacodynamic characterization of NP, but they do not fully recapitulate human physiology. For instance, a promising NP candidate in an in vitro cell screening assay may be metabolized by the liver into an ineffective or even toxic byproduct (68). In vivo validation of in vitro results remains a key component of pre-clinical studies.

Speakers described several approaches to manage the sometimes limited or conflicting information on NP biological activities and their underlying mechanisms, including determining which NP components and cellular and molecular targets are required for activity in animals. The application to chemically complex NPs of agnostic (or non-targeted) in vitro assays (69) combining cutting-edge computational approaches, optimized mass spectrometric (MS) data collection and analysis to elucidate chemical composition, and cytological profiling (70, 71) now enables the generation of strong and specific hypotheses regarding bioactive NP components and the mechanisms through which they affect gene expression or cell phenotype (7276). Such systems have the ability to interrogate multiple chemical components and their combinations simultaneously, without purification of individual constituents. This ability is critical, given accumulating evidence (consistent with tradition) that multiple chemical constituents in even a single botanical extract may contribute to bioactivity via distinct or similar molecular mechanisms (77, 78). These may include effects by constituents on each others’ solubility (or other aspects of bioavailability) or toxicity (5, 79). These novel, non-targeted platforms also have the advantage of generating strong, testable mechanism of action hypotheses without requiring assumptions or prior knowledge regarding those mechanisms.

Invertebrate in vivo models

Invertebrate models such as Caenorhabditis elegans (80, 81) and Drosophila melanogaster are popular in a variety of research areas for their ability to provide relatively high throughput, but in vivo platforms. The conservation of many human genes and signaling networks, combined with the small size and relatively short lifespans of these models facilitate the cost-effective evaluation of a variety of phenotypes (e.g., reproduction, locomotion, cognition) in a large number of individuals (Table 4) (80). This provides the ability to study phenotypes that are challenging to assess in most vertebrate animal models, such as healthspan (the part of an organism’s lifespan during which it is healthy) (82, 83). Invertebrate models can also be used in more focused interventional and mechanistic studies that explore specific aspects of an established phenotype (8486). For example, tea polyphenols were reported to modulate both stress responses and fertility in D. melanogaster strains (87) via a non-target-ligand mechanism of action (Table 2), by modulating iron homeostasis. Recent advances in omics technology and the use of outbred populations further expand opportunities for using D. melanogaster as a model system for NP pre-clinical studies.

In silico models

When information on NP kinetics, dynamics, bioavailability, clearance, and pharmacologic activities is limited, one useful approach is to apply computational or in silico methods to leverage information from products with known chemistry, mechanisms of action, and activities. Data from more highly characterized products can be integrated into dynamic models, and in vitro to in vivo extrapolation (IVIVE) can be used to predict biological responses (8892). These approaches, as outlined by the Tox21 Federal collaboration and the U.S. Environmental Protection Agency’s ToxCast program (9396), can be used to focus high-throughput in vitro and in vivo testing on compounds and concentrations that are more likely to be biologically active and appropriate for each model (97, 98). Such integrated approaches could be further developed to probabilistically model the pharmacological and toxicological kinetics and dynamics of complex NPs (99).

Speakers described several critical, often broadly applicable considerations for utilizing in silico models. First, the purpose of the model must be clearly defined, and the model appropriate and sufficient to fulfill the purpose. Second, as for any research, if experiments that provide the input data are not appropriately designed in the beginning, no amount of computational modeling will yield translatable information. Third, variability and uncertainty in the data that inform the model, as well as correlations between parameters must be considered. Uncertainty and variability in model outcomes are acceptable but must be quantified, and in optimized models they should be similar to real world variability. Preliminary models may be continuously refined, using a “learn and confirm” strategy, or as understanding, e.g., of NP pharmacology increases. Fourth, end users must understand the computational (or any other) model’s limitations, the extent to which it is similar or dissimilar to the biological system of interest, and how the outputs compare with population-derived data, other real-world data, or with data from other models. Finally, computational models and the data used in their development must be made findable, freely accessible, interoperable and reusable (FAIR) (100), just as information for in vitro and preclinical in vivo models must be findable and sufficiently transparent to support replication.

Ethnobotanical approaches

Another typical starting point for translational research is traditional NP use, particularly the use of plants in traditional medical systems. When basing NP research on traditional use, it is essential to consider and thoroughly document how the NP is sourced, identified, prepared, and used in the traditional contexts (Table 3). Intra-species variation and growing conditions strongly influence the metabolic profile of a plant and differences in harvest and preparation will also alter critical product characteristics (101, 102). Many different uses may be described for a given species, and traditional healers have frequently been reported to use only specific subtypes of a given species. Awareness of such practices has the potential to be highly informative for the identification of bioactive components (103). Analogous considerations apply to fungal and animal sources of NP. Good practices for reporting botanical NP sourcing, identification, characterization, vouchering and archiving have been previously described elsewhere (12, 18, 19, 104106). Comprehensiveness and transparency in reports of biomedical research on traditionally used plants have increased in recent years (107). There are cases, however, where, the available records predate the availability of geolocation data on growing sites, or where the need to protect ecologically or culturally sensitive information may limit what can be published (108, 109). Where a similar use or outcome is reported for the same or closely related species across cultures and geographical regions, those similarities may provide priority clues for translational research.

Translation from traditional to modern practice may be limited by important differences (110) (Table 3). Traditional practices commonly combine multiple NP sources (e.g., multi-botanical) into one intervention and are often associated with specific concurrent behaviors (e.g., rituals, behavior changes, possibly including dietary changes). Moreover, if documented traditional practices and the NP outcomes to be studied differ substantially, it may be inappropriate to claim the traditional or typical use as supporting data; the differences between modern medical diagnoses and practices and traditional principles of diagnosis and practice may be too divergent to base claims for modern application on traditional use. Moreover, where a planned NPCT is expected to use a highly processed (single) botanical, information from traditional use may also lack relevance, as the bioactivity and/or the ADME properties of the highly processed product may differ substantially from those of a (multi-component) traditional preparation. Where the NP and the study population are close to historical use and users, traditional practice may provide initial parameters for dose range to be used in research. In instances where no clear typical usage exists, for example with green tea extract capsules, speakers recommended that preclinical studies prioritize the determination of a therapeutic window and administer the lowest dose to achieve concentrations required for the biological and/or health effect of interest (9).

TABLE 3.

Highlights of good practices for NP integrity including standardization, formulation, characterization, and quality control

1. Quality control. Identity and purity of all materials used, regardless of the supplier, should be verified (independent of supplier) prior to use. Appropriate storage and handling conditions should be maintained and monitored by stability testing at least throughout the duration of the study (consult, e.g., the NCCIH Natural Product Integrity Policy (18) and the NSF Stability Testing Guidelines for Dietary Supplements). To the extent feasible, comprehensive chemical analysis of product should be performed; minor product components or impurities can be responsible for or affect measured activities.
2. Product specifications:
 a. Chemistry. Identification of product components required for the bioactivities studied provides critical information for studies of bioavailability, pharmacokinetics (PK), and pharmacodynamics (PD) and should enhance the replicability and clinical relevance.
 b. Formulation. Consider the effects of NP formulation and matrix (e.g., capsule, liquid, food) on bio-accessibility, bioavailability, PK, PD, and activity. These may differ significantly between an isolated NP constituent and the same compound in the context of a complex food matrix.
 c. Sensory characteristics. Sensory acceptability must be considered for all in vivo research, and effectiveness of masking of placebos or active comparators must be considered in CT.
3. Translation from ethnomedicine. If basing an intervention on ethnobotanical observations, consider replicating traditional practice as closely as is feasible. E.g., highly processed extracts will likely have different activities than a multi-botanical decoction (see 2a and 2b). Contextual differences (environment, behavior, genetics) may also affect the outcomes.

NP CHARACTERISTICS CRITICAL FOR REPLICABILITY

NP specifications and integrity

The ultimate biological effect of an ingested, complex NP results from a dynamic interplay of factors: NP chemical composition, physical characteristics (111, 112), bioaccessibility, bioavailability, metabolism, and target population characteristics including sex, age, genetics, background diet, and other environmental exposures (113). This subsection begins by summarizing issues inherent to the integrity of the NP intervention and subsequently addresses target-related sources of variability.

Replicability of outcomes depends on correct and clearly reported identification and archiving of source materials (1214, 17) and consistent preparation and activity of the NP (17, 22, 26, 114). Some attendees felt that since the effects of complex NP preparations may not be due to only the identifiable components, transparent reporting of methods, including consistent product preparation and quality control via an in vitro assay relevant to bioactivity, is sufficient for replicable NP research. Others argued that in many research context it is also critical to understand which NP components are responsible for the bioactivities of interest, as this information can inform NP quality control, as well as being utilized in assessments of bioavailability, pharmacokinetics and to inform comparisons of different studies (115) (Table 4).

TABLE 4.

Highlights of good practices for preclinical in vivo research on NP

1. Product translational relevance and replicability:
 a. Equivalent product formulation. NPs used in preclinical studies should use similar or analogous formulation(s) to the intended human studies.
 b. Biotransformation. Track the actual bioactives, which may be generated from precursors in the NP by the host or the host’s gut microbiota. Either or both may transform NP constituents to active or inactive compounds. These transformations may differ between models and humans, and amongst humans, and affect replicability of results.
 c. Dose-ranging. Doses used in models should be comparable to anticipated human dosage. Initial dose ranges may be based on traditional use or MPS dose-response studies, derived using PBPK modeling, or from preclinical in vivo studies using appropriate scaling approaches. A clinical study is still required to establish dose-response, assess safety, and determine PK and PD in the target population.
2. Model translational relevance:
 a. Model selection. Consider the translational relevance for the specific research question of different species and strains, humanized, outbred, wildling, and/or genetically modified models. Heterogeneity in the model organisms may contribute to the ability to predict responders vs. non-responders.
 b. Documentation. Document as much as possible. Endogenous (e.g., age, sex, physiological/health status, genetic), and external factors (e.g., environment, including background diet, exposure history, enclosure size and enrichment, circadian timing, season, duration and timing of exposure relative to outcome assessment, handlers and handling methods) may affect outcomes. Make these records findable, accessible, interoperable, reusable (FAIR) (100).
3. Validated methods for de-risking the NPCT (see also Fig. 1): Develop and use validated assays to assess achievement of go/no-go criteria based on as many components of the mechanistic model as possible (see Fig. 1), e.g, achievement of effective concentrations of the bioactive component(s) at in vivo targets, target occupancy, and modulation of relevant biological networks.
4. Specimen collection, analysis, archiving. Collect and appropriately document and store specimens needed for assays described in point 3, and as applicable, fecal or other relevant samples to facilitate assessment of any association of the gut microbiota with differential responses. In analyses, include approaches to control for known artefacts, such as reagent contamination that can strongly affect outcomes in low biomass microbiome specimens (195).

Minor product components, impurities, instability, and chemical changes due to instability, resulting in the occurrence of degradation products can be responsible for some or even all measured activity. Instability has been demonstrated for a variety of prominent bioactive botanical marker compounds. These include ligustilide (a constituent of dong quai), which is very unstable and prone to redox reactions and photo-induced dimerization (43, 116, 117); desmethylxanthohumol, a chalcone found in Humulus lupulus, or hops, that forms the phytoestrogen, 8-prenylnaringenin, via Michael addition (118); and curcumin, found in turmeric, which is known to form a myriad of degradation products and metabolites (44). These chemical changes can modify bioactivity through a variety of mechanisms (97, 119, 120) and represent factors that appear to be minor (Residual Complexity, see go.uic.edu/residualcomplexity), but may significantly alter biological activities and therefore, require monitoring. A number of organizations have published general guidelines for stability testing, which are relevant for dietary supplements and other NPs, including the NCCIH Policy on Natural Product Integrity (18) and guidelines published by the International Council for Harmonisation (ICH; ich.org).

Variations in growing conditions generate variations in relative levels of NP constituents; differences in constituent levels are also seen between closely related species interchangeably used in similar contexts NP (121). For example, the licorice source plants, Glycyrrhiza glabra L. and G. uralensis Fisch. ex DC differ significantly in the ratio of liquiritigenin to isoliquiritigenin, as well as in congeneric constituents, and are therefore expected to have different bioactivities (122, 123). Characterization sufficient for the detection of such minor components, known or unknown, is therefore critical for quality control and for monitoring batch-to-batch variation and stability (15, 124). Identity and purity of all materials used, regardless of the supplier, should be verified prior to use (14, 124). Descriptions of approaches to NP selection and characterization have been published recently (17, 22, 114, 125).

Chemical and metabolic factors

Absorption, distribution, metabolism, and excretion (ADME) is a multistep, dynamic biological process that may be modified by product characteristics as well as by host characteristics (115, 126129) including physiology and behavior. NP processing methods and final formulation can alter the chemical profile, stability, bioaccessibility, bioavailability, and metabolism of a NP (111, 130). For example, soy germ isoflavone glucosides incorporated into pasta can be hydrolyzed to aglycones by wheat β-glucosidase (131).

In contrast to most preclinical models, humans are generally diverse in genotypes, diets, other behaviors, and environmental exposures. Moreover, host genetics and cultural milieu profoundly affect flavor perception (132). If the NP will be delivered in or as part of a food, liquid, capsule or tablet, acceptability of flavor, texture, and or tablet size are essential. For both CT and generally available commercial products, consumer burden is another critical factor; this may include the number of capsules required, ease of swallowing capsules or tablets, cost, and portability. Thus, sensory testing and burden, as well as stability of the NP preparation are critical for successful translation of an intervention (Table 3).

Human microbiota are generally much more varied than those of lab animal models, reflective of both genetic diversity (133) and differences in past exposures (134). Run-in periods prior to pharmacokinetic and metabolic assessments can increase the relevance of preclinical animal models to long-term dietary patterns and provide important information for comparison with post-intervention metabolite profiles. A critical consideration is that, even where preclinical in vivo studies are designed to better model clinical trials, the models usually differ from humans in their greater uniformity of background diet, control of prior exposures to the relevant NP, dietary requirements, and metabolic capacities (Table 4). For example, taurine is an essential nutrient for cats (135), but synthesized by humans and rodents, while ascorbic acid is essential for humans, but synthesized by most other animals (136).

Human genetic differences and variations in gut microbiota cannot be considered to be independent, as both direct and indirect effects of host genetics on gastrointestinal ecology have been reported (133, 134, 137, 138). Both background diet and bioactive NPs affect host physiology directly, as well as through modulation of the microbiota, and the microbiota in turn contribute to modulation of NP processing (139, 140). Non-absorbed polymeric polyphenols may, for example, modulate absorption of polyphenol monomers in humans, such that history of exposure significantly influences pharmacokinetics (141). Effects on the host may in turn modulate microbial ecology and metabolism, while changes in microbial populations or gene expression may modulate host physiology, including ADME, collectively resulting in dynamic host-microbiota interactions that modulate or even generate the observed responses to ingested interventions (141146).

Gastrointestinal microbiota have been shown in many cases to be required for the generation of bioactive metabolites of NPs. For example, gut microbiota can convert grape constituents to phenolic acids such as 3-hydroxy benzoic acid and dihydrocaffeic acid, which have been reported to inhibit β-amyloid and α-synuclein oligomerization, respectively, and epigenetically modulate gene expression (3, 147). Speakers stressed the importance of a strong, though not necessarily complete, understanding of the pharmacokinetics and metabolism of a NP intervention for optimizing the design of large-scale CT, although opinions differed on whether such knowledge was necessary before pursuing early-phase trials.

Presenters highlighted cases where the biological activities of enantiomeric NPs differ significantly. For example, a cancer preventive effect of soy in rodents could be replicated using the R-(+) enantiomer of the isoflavone metabolite equol, but not the S-(−)isomer produced in vivo by the gut microbiota (148152). The microorganisms responsible for this conversion are widely present in rodents, but only found in only 20–30% of adult humans, a difference that may contribute to discrepancies between observations in rodents and humans. This finding emphasizes the importance of considering, characterizing, and fully reporting the chirality of NPs and their metabolites, as well as of understanding the role of the gut microbiota. It was pointed out that translational relevance of in vitro HTS assays might be substantially improved through the integrated use of replicable model systems able to mimic the generation of translationally relevant metabolites by the target animal or its gut microbiota.

TRANSLATIONAL RELEVANCE OF PRECLINICAL NP RESEARCH

General considerations

Preclinical in vivo models are widely used in biomedical research, and are often considered a prerequisite for CT. These models provide critical information on the biological response to a product that in silico and in vitro models cannot currently provide in a reliable fashion, including on bioavailability, metabolic activation and deactivation, PD and toxicology. In vivo testing also provides a basis for choosing dose ranges to be tested in CT.

In selecting any in vivo preclinical model system, investigators must keep in mind ways in which the differences between the model and the clinical application of interest are likely to affect inference from the study (20). The model selected should be appropriate for the experimental question based on the organism characteristics considered most critical to the target biology. The likelihood of translational relevance of a preclinical model to a clinical or other experimental question can be considered in terms of different types of model validity, including face validity (similar phenotype, lower probability of successful translation), construct validity (similar mechanism and phenotype, better probability of successful translation), and clinical validity (demonstrated clinical predictive success) (153156) (Table 4). For example, short-lived invertebrate models may be particularly advantageous to understand NP effects on healthspan, as these species conserve the target-of-rapamycin, insulin/IGF-1-like signaling, and sirtuin pathways, which have been shown to modulate healthspan in organisms ranging from yeast through C. elegans and D. melanogaster to mammals (82).

Many factors that differ between preclinical in vivo models and people may influence the biological effects of a NP. Background diet and dietary history can influence NP absorption and metabolism. For example, the absorption and PK of flavan-3-ols are altered by exposure history, as well as by host factors such as obesity and diabetes (112, 141, 157, 158). This demonstrates the critical importance in research on humans of considering, assessing, and reporting background diet and history, as well as standard biometric parameters. Assessing correlations among human genomic, transcriptomic, proteomic, or metabolomic data, and between these and outcomes following a standardized exposure may elucidate mechanisms that underlie differences in the absorption or generation of, or exposure to, bioactive constituents or bioactive metabolites (128), and allow prediction of responders versus non-responders. These approaches can enhance translation between epidemiological, pre-clinical, and clinical studies.

Selection of a mouse model

Mice are frequently the model species of choice for in vivo preclinical research, due to their relatively small size and short life spans, and the large number of strains and large body of literature available. Nevertheless, mouse and other in vivo preclinical models differ from humans in many known and unknown details, both intrinsic (e.g., genetic, immune (159164), surface area: volume ratio, lifespan (165)) and extrinsic (e.g., captivity, diet, history), which, depending on the research question, may substantially affect translational relevance (166).

It is critical to remember that each distinct inbred strain may respond differently to an intervention (126, 167), that the same strain supplied by a different vendor (or with different gut microbiota) may respond differently (168), and that sex differences in response may appear only in specific species or strains (169172).

A wide variety of approaches are used to increase the predictive validity of mouse models. Transgenic mice are commonly employed to enhance construct validity; guidelines for rigor and transparency in research using mouse mutants have been published (173). Transgenic constructs allow experimentation on specific human pathways and targets of interest. Where the genetic mechanism of interest is unknown, for example in toxicity testing (166, 174) or research on neuropathic pain (175), the genetic heterogeneity of outbred mice(174) may provide important information that is unavailable from inbred strains without otherwise affecting outcome variability (176). Many different approaches to “humanizing” mice have been utilized to enhance predictive validity for specific research disciplines or questions. Employing the severe combined immune deficiency mouse (SCID) is one such approach, whereby human peripheral blood cells, and cells of other human fetal immunological tissues such as bone marrow, thymus, or liver are introduced (usually an NOD-SCID-γ mouse) (177). Another approach involves developing transgenic and chimeric mice in which specific human genes are expressed. Various approaches have also been utilized to generate mice with “humanized” gut microbiota, which have been found to have significant clinical predictive validity in developing anti-infective, autoimmune disease, cancer, and transplant therapeutics (178180). Inbred mice reconstituted with microbiota from wild mice have also been reported to have increased translational value (181, 182). In their natural environment mice, like humans, are exposed to microbes, including pathogens, and thus should be regarded as metaorganisms, with their own specific microbiota (including bacteria, fungi, and viruses). Returning natural microbiota to standard laboratory mice was found to significantly improve the model’s ability to predict human immune responses in two preclinical studies (183).

Model- and context-related considerations

Optimization of preclinical research for relevance to future NPCTs as well as general human use requires not only careful consideration of the choice of models, but also of intervention dose (184), composition, and delivery (Table 4). As discussed above, when research and intervention are founded on a traditional use, available ethnobotanical and ethnopharmacological information may be used for guidance (Table 3). Clinical and preclinical interventions should be as similar or, where more appropriate, as equivalent as is feasible, bearing in mind species differences (185), as well as any age-, sex-, size-, or health status-related differences which may affect behavior, absorption, metabolism, and/or toxicity. Allometric or mechanistic modeling approaches can be used to scale to human dosing from preclinical in vivo data, and vice versa (151, 184, 186).

Background diet, as well as other environmental exposures, including medications, alcohol, smoking, and air quality may also have important effects on NP bioavailability, metabolism, and biological effects in both humans and animal models. In the context of NP-diet interactions, it is critical to consider species-specific dietary requirements and optimal diets (e.g., differences in ascorbic acid requirements (187), optimal amino acid or lipid compositions) (185), as well as species differences in biosynthetic abilities (188) and digestive anatomies (189). When comparing results across studies, it should be noted that differences in local regulation of vertebrate animal research (e.g., the European Union prohibition on feeding animal-derived ingredients to herbivores) result in substantial differences in commonly used basal diets or chows. Researchers must carefully consider, for a given research question, whether background diets should be the same for different species, or at similar levels of optimization for each, or whether both possibilities need to be included as controls. As discussed below, physiologically based pharmacokinetic models may be helpful in making dose adjustments for different subpopulations.

Diet and circadian rhythm interact in their effects on health, gut microbiota, and metabolism (190, 191). These effects are modulated by sex in organisms ranging from fish and mice through humans (192194). Through their effects on gut microbiota, NPs may have important effects on their own metabolism and on the generation of active metabolites which, in turn, may significantly affect both PK and PD. In addition, changes in the gut microbiota may exert separate direct or indirect effects on animal biology and health (168). Good practices for characterization of the gut microbiota have been discussed elsewhere (195).

Preclinical in vivo models are often used in environments that are optimized for the investigators and different from the species’ natural habitat. This context by itself is likely to affect experimental outcomes and may also modulate the effects of an experimental intervention. In either case, the experimental environment is likely to affect translational relevance. Investigators need to acknowledge these often unavoidable factors and consider them when inferring the translational relevance of studies (156). In general, given the range of interrelated factors that may modulate the effects of a dietary intervention, it is critical that supporting data include experiments rigorously designed to falsify the hypotheses tested, e.g., to test whether the proposed bioactive constituents or metabolite and/or proximate targets are indispensable and inducing for the effect studied (Figure 1) (35, 196).

The robustness of a result is optimally assessed by attempting to replicate the study not only in the same model, by those originally reporting the result and others, but also across models based on a range of species, genotypes, and phenotypes. Recording as much information as possible about the model and the environmental conditions under which it was used may help others to replicate the studies or facilitate exploring why a finding failed to replicate, or to translate to another model or species.

Genetic diversity and gene-NP interactions

One critical difference between most preclinical in vivo models and humans is that the former are usually inbred, except as discussed above. Where the bioactive compounds are known, and population differences in health response have been reported, interrogating those differences may lead to the elucidation of underlying gene-environment interactions. For example, common genetic variants in two fatty acid desaturase genes (FADS1 and FADS2) appear to modulate the response to dietary polyunsaturated fatty acids (PUFA), resulting in differences in resulting levels of pro- and anti-inflammatory lipid mediators and differences in risk for inflammation-associated health outcomes such as asthma, rheumatoid arthritis, and cancer (197). Tracking the effects of prevalence of these variants might shed light on inter-trial differences in NPCT outcomes with PUFA interventions (197, 198). Similarly, a variety of factors including biometrics and genetic polymorphisms appear to be responsible for observed differences between black and white American women in associations between serum 25-hydroxy vitamin D levels and bone fractures (197199).

In silico modeling approaches

Quantitative or computational modeling and simulation are beginning to show their utility in optimizing NPCT design. Quantitative systems pharmacology (QSP) encompasses computational methods including statistics, pharmacometrics, PK and PD modeling, mechanistic or physiologically based pharmacokinetic models (PBPK), in vitro-in vivo extrapolation, and machine learning approaches. QSP is often employed in drug development to integrate data, optimize experimental dosing regimens, identify sources of inter- and intra-individual variability, predict efficacy or toxicity, or estimate appropriate doses for different sub-populations (200202). PBPK models incorporate physicochemical data (e.g., molecular weight, pKa, lipophilicity), in vitro or preclinical data (e.g., on metabolism and transport), and population-specific data (e.g., organ volumes, tissue composition, blood flow) to model ADME. PBPK can be used to predict equivalent exposures for humans and other species, for both translational or reverse-translational studies (Table 4). Sensitivity analyses can allow comparison of the magnitudes of the effects of different parameters on the outcomes. These models may also provide insight into the interplay between metabolizing enzymes and transporters, inter-individual variability due to intrinsic (e.g., pharmacogenomics, age, sex) and extrinsic differences (e.g., drug, food, or other environmental exposures). Perhaps most importantly, these computational models can highlight the most significant gaps in the data (203205), as well as support evidence-based selection of initial dose ranges for CT.

The application of quantitative pharmacology to complex NPs is in its infancy. In addition to the inherent chemical complexity of NPs, understanding of the pharmacology and metabolism of the individual chemical components is mostly very limited. Nevertheless, application to NPs of modeling and simulation approaches that incorporate uncertainty and variability can provide estimates of differences among populations, inform dose selection and go/no-go decisions, and provide early estimates of the probability of rejection of the null hypothesis.

GOOD PRACTICES FOR MOVING TO A NPCT

As outlined above in the section “NP Characteristics Critical for Replicability”, speakers emphasized the importance of having rigorous and replicable data on the chemical components of the NPs required for the biological activities of interest, as well as evidence for the activity of interest itself from at least two distinct in vivo preclinical models. This information is critical for both NPCT planning and for post-trial interpretation of results (Figs. 1, 2).

Figure 2. Good practices for actionable results in future translation of Natural Products research to Clinical Trials (NP2CT).

Figure 2.

Translation is most informative and cost-effective when CT Phase (I vs. II) and design (e.g. RCT, adaptive) decisions are built on a solid and rigorous evidence base (pyramid and tabletop). The choice to proceed to an NPCT and the determination of the most appropriate trial type (examples on sphere) depends on the extent and strength of supporting data available (table legs and segments). If multiple supporting data elements are missing or weak, an earlier phase CT may be more informative and cost-effective than a Phase III CT. If a substantial number of CT design choices or parameters lack supporting evidence, a more useful next step may be research addressing those gaps via either preclinical research or a mechanistic or early phase NPCT.

NPCT planning

The foundational information should rigorously support a clear and specific definition of the question to be addressed by the NPCT, framed in quantitative terms, and matched to the study population, the evaluation period, and the primary and secondary outcomes (206, 207). When strong evidence supports the requirement for specific bioactive components and specific in vivo targets, this information can be used to develop approaches for de-risking a project, i.e., assessing whether the accumulating data are consistent with the hypotheses and support continuation. This information enriches the ability to design a NPCT that will allow evidence-based interpretation and planning of next steps regardless of the completed outcome, as shown schematically in Figure 1, by providing the ability to:

  • define minimal effective concentrations in plasma or at the targets;

  • develop methods to assess in vivo engagement of putative targets, especially methods which may be translationally relevant, such as non-invasive methods;

  • use those methods to assess the relationship between target engagement and outcomes;

  • elucidate causal cellular or molecular mechanisms mediating the generation of the biological or behavioral responses of interest (i.e., PD biomarkers).

This mechanistic information will be critical for clinical dose-ranging and dose optimization studies, as well as for detecting potential “drop-ins”: given the broad availability of many NPs, placebo group participants sometimes present with anomalously high levels of apparent off-protocol exposure to the intervention. Off-protocol exposures, whether accidental or intentional, may hamper the ability to interpret the results. Knowing which compounds are indispensable and inducing for the bioactivities of interest also allows investigators to ensure that sufficient time is allowed for washout between inventions in cross-over designs (26). Equally important, knowledge, e.g., of minimal effective concentrations and targets, or of metabolites indispensable for activity or genotypes required to generate those metabolites, can be used to generate minimal go/no-go criteria, which help identify CTs that are less likely to be informative, and ensure that only treatments with strong supporting evidence move forward to larger, confirmatory CT (Figure 2). Elucidation of such causal, molecular mechanisms of action may also be useful to predict which participants are likely to respond well or poorly to a specific intervention. Similarly, understanding the molecular mechanisms between an initial target and the final, assessed outcome should enable not only enhanced post-trial assessment of the validity of the pre-trial mechanistic hypotheses, but also contribute to the ability to better predict individual responses to the intervention.

Safety and dose selection

Safety issues that arise in preclinical testing must be carefully considered prior to CT, and safety outcomes must be included in all NPCTs (208). Safety in the context of a NPCT cannot be inferred from traditional nor contemporary use if the product, dose, route, schedule, and context of use differ (Table 3). Dosing in larger NPCT should be based on dose-response data from Phase I/II studies with safety data. Dose ranging in Phase I NPCT may use allometric or mechanistic models to build from in vivo preclinical studies (Table 4), as described above, or, where appropriate, may be based on current clinical practice or traditional use.

As described above and summarized schematically in Figure 2, a variety of rigorous foundational information provides critical input for translation to later-phase efficacy CT, with much of this information requiring input from other types of NIH-defined CT. This foundational information includes:

  • safety of the product to be used in the NPCT, in the population to be studied;

  • product specifications (for interventions and any controls), including chemistry, stability, consumer acceptability (including sensory qualities), packaging;

  • bioavailability, PK, PD of the product to be used in the population to be studied;

  • dose-response properties.

Post-trial interpretation of results

In the absence of the foundational information described above, and without meeting the resulting go/no-go criteria, it will be more challenging to interpret a failure to reject the null hypothesis, as lack of effect could be due to many factors (Figure 1). These include, but are not limited to an incorrect hypothesis regarding the relationship between target engagement and outcome, or less interestingly, failure to achieve sufficient levels of bioactive compounds or target engagement. As the following variables could have an important influence on the results, obtaining information on these for each participant can also be critical to the interpretation of NPCT outcomes:

  • Background diet – habitual and during the NPCT (the latter is potentially controllable).

  • Age, sex, race/ethnicity, body size and composition, health status (especially with respect to outcomes of interest), co-medications, smoking, geography.

  • Genetic and epigenetic data, including characterization of microbiota structure and gene expression, especially as they relate to ADME of the NP (134, 168, 195, 209). When known, relevant genetic variants exist, e.g., in metabolism, specimens for genetic analyses should at least be collected and saved.

  • Sensory characteristics (texture, flavor, odor; pill, capsule, etc., number and characteristics) of products to be used in a NPCT must be acceptable to trial participants, who should not be able to distinguish amongst intervention(s) and controls. Information on participants’ pre-trial beliefs about product efficacy, and post-trial conclusions with respect to the intervention they received should be collected as this may critically inform interpretation (Tables 3 and 5).

Addressing expectancy effects

Human responses in NPCTs are also influenced by expectations, often resulting in either placebo or nocebo effects. These effects may vary substantially for different populations and interventions (e.g., color, size, number of capsules, estimated cost)(210). Through conscious and subconscious processes, people (researchers as well as participants) can be biased when assessing and reporting symptoms, functional abilities, and other patient-oriented outcomes. Whenever feasible, blinding/masking and other approaches should be employed to minimize such biases, including in preclinical studies (Tables 1 and 5).

Conscious and subconscious mind-body processes can also lead to real positive health outcomes, including in apparently objective, quantitative outcomes such as blood pressure (211). Because blinded “placebo” comparison groups are likely to achieve real health benefits through these processes, the magnitude of benefit estimated by comparing conventionally blinded placebo and treatment (intervention) groups is likely to underestimate the amount of benefit that would be observed when comparing non-blinded treatment to no treatment (212). Therefore, when feasible, no treatment and open-label treatment comparison groups should be included in RCT design, so as to arrive at best estimates of real-world effect sizes, as well as effect size from conventional blinded comparison.

Sample and effect size

Sample size calculations must reflect simultaneous consideration of trial phase and design, estimated and targeted magnitude of treatment effects, and statistical power (207). The targeted effect size should be biologically plausible, and based on the degree of benefit that patients, consumers, or practitioners would consider sufficiently beneficial to be worthwhile, given other considerations such as cost, inconvenience, side effects, and potential for serious adverse effects ((213); Table 5). Effect sizes should not be based on estimates from the literature as a whole, which almost always leads to biased estimates (214), inasmuch as the published literature contains a biased selection of all empirical research that is performed. Meta-analytic evaluations of the literature have consistently found that studies yielding null or mixed results are substantially less likely to appear in the published scientific record (215, 216) than those studies with strong statistical support for the tested hypothesis. This occurs even where the amount of collected data renders these “negative” studies highly informative. As a result, statistically significant published results often provide inflated effect size estimates (217). Increased use of registered reports, e.g., as provided by preclinicalresearch.eu, may begin to address this issue and decrease the bias in the scientific literature.

TABLE 5.

Highlights of major considerations for clinical trial design

1. Statistical power. Base sample sizes on the smallest effect size of interest, in the context of existing options and consumer burden. Use best current meta-analytic approaches to assess bias in the literature, and to correct for bias in reported effect sizes (214).
2. Transparency and statistical analysis plans. Clearly describe and document statistical analysis plans before beginning the trial.
3. Comparison groups and expectancy/placebo. Base controls and interventions on the supporting data and the resulting hypotheses. The inclusion of no-treatment, and open-label treatment groups, as well as conventional blinded comparison groups, in RCTs provides best estimates of real-world effect sizes. Development of inert but effectively masked placebos for NPs with strong color or odor (e.g., with similar or masking odor or color) may be challenging. Assessing participant perceptions of product efficacy pre-trial, and of intervention allocation post-trial may provide critical information for interpretation of trial results.
4. Optimal clinical trial paradigm. Adaptive design (AD) NPCTs may be a more efficient approach to optimizing, and testing the efficacy of an intervention, if the evidence base for some CT parameters is lacking. AD CTs require statistical rigor to avoid bias in the interpretation of the results. ADs may be a good choice for Phase II trials.

Analysis of the literature

Beyond publication bias, two additional factors that challenge the translational relevance of the published literature have been widely agreed upon: analytic flexibility during the data analysis (218), and low statistical power (40, 218, 219). All of these factors need to be taken into account when building on published research. Nevertheless, it should be kept in mind that a substantial portion of the scientific literature does consist of reliable and replicable findings. An important challenge for researchers navigating this intrinsically biased body of literature is to determine which research is reliable enough to build on, and which research is more likely to not be replicable (220). Because it is practically impossible to determine whether a finding will replicate based on a single study (221), meta-analytic techniques have been developed to aggregate evidence across studies. However, analytic flexibility and selection for significant effects in small studies lead to upwardly biased effect size estimates. Novel bias detection techniques can identify different types of biases: P-curve analysis (222) can identify lines of research that consist of false positives, and under some circumstances PET-PEESE meta-regression (222, 223) can calculate bias-corrected effect size estimates.

Clinical trial design

In the design of a later Phase NPCT, it is important to recognize that estimated treatment effects from pilot studies are necessarily imprecise. Pilot studies may under- or overestimate the true effect size (but, as noted above, publication bias generally selects for overestimates). Effect size estimates and sample size calculations should, therefore, be conservative and based on prospective meta-analyses (https://methods.cochrane.org/pma/what-pma-0) or other rigorous meta-analyses, i.e., those which include all eligible studies designed and performed, or those which pass rigorous bias detection tests, such as the p-curve analysis described above, with PET-PEESE meta-regression used in estimating effect sizes. Results of pre-registered, rigorously performed and analyzed pilot studies may also be used to estimate effect sizes when convergent, corroborating, or similar results are obtained in independent research from separate groups, using approaches based on different assumptions (triangulation, or orthogonal approaches), such that any artefacts or sources of bias are likely to differ for the different approaches (224, 225). Adaptive designs (see below) and sequential analyses (https://methods.cochrane.org/pma/what-pma-0) can be used to design informative studies when uncertainty about the effect size estimate remains. An argument can be made that later Phase NPCT intended to test whether an intervention is beneficial should be undertaken only where the estimated effect size from rigorous methods such as those described above is at least as large as the estimated smallest important difference or smallest worthwhile effect (213) for the intervention and outcome to be tested. This would not apply if the hypothesis to be tested is that the intervention has no effect, for example, on an adverse outcome.

Where the goal is to determine the maximum potential of an intervention, Phase II studies should enroll those subpopulations predicted to have strong responses, a practice called study “enrichment.” While enrichment, and the inclusion of multiple endpoints are appropriate for Phase II NPCT, it is important to recognize that these designs tend to bias the results of Phase II studies towards the conclusion that moving forward is worthwhile, although positive findings could be due to chance.

Because NPCT are costly, time consuming (226), and dependent on voluntary human participation (11), approaches that can support making trial design and prioritization decisions which optimize acquisition of replicable, actionable results (Tables 5 and 6) are in high demand. Multilevel modeling can be used in conjunction with existing data to identify parameters most strongly associated with the outcomes of interest (227). Simulation modeling can also be used to identify trials that are unlikely to reject the null hypothesis (228).

TABLE 6.

Highlights of major issues to consider in clinical trial decision-making

1. Trial focus. Is the proposed trial well-justified? Are the evidence for and magnitude of the effect, the size of the relevant population, and the need for the knowledge to be generated commensurate with the anticipated cost of a CT?
2. Likelihood of actionable information. What CT paradigm or other next step is most likely to provide actionable information?
 a. Next step: Is the most informative next step likely to be a pilot trial, a larger CT, clinical research other than an efficacy study, or research other than a CT?
 b. NPCT protocol: Have trial/protocol design decisions been sufficiently de-risked? Is there sufficient understanding of the path(s) from initial target(s) engagement to outcome, and availability of non-invasive assessments of these, to support evidence-based decisions regarding the next test of activity whether or not the null hypothesis is rejected upon trial completion?
 c. Bioavailability: Are assays of bioavailability at the immediate in vivo target available and translationally relevant?
 d. Open ends: Can any remaining questions be sufficiently addressed via use of a rigorous, adaptive trial design?

Adaptive designs (AD) can provide an efficient yet rigorous approach to honing the parameters of a NPCT while obtaining preliminary data on clinical effect of the intervention(s). An AD uses accumulating information to implement changes to the design during the course of the trial (229)(230). Changes in the trial must be based on pre-specified decision rules. Changes made on an ad hoc basis may not preserve the type I error rate (false positives), and may lead to bias in parameter estimates or other problems with statistical properties. The many variations of AD include early-stage AD, which may focus on dosing, exposure, heterogeneous responses, or biomarker responses. As it is assumed that AD are exploratory studies, there is less concern about the control of type I error. AD methods are more accepted in the learning (exploratory) stages of CT. Some key variations of AD include:

  • Adaptive enrichment designs, which target therapies to participant subgroups most likely to benefit. In such designs, a trial initially enrolls a broad population. The initial study period (discovery phase) reveals participant groups most likely to benefit from treatment. In the second phase of the trial, only those subgroups most likely to benefit from treatment are randomized, providing increased power to detect an effect.

  • Sample size re-estimation designs refer to AD that allow for adjustment of sample size based on an interim review of the data. Sample size re-estimation based on revised estimates of treatment effects has been controversial for the reasons cited above. However, internal pilot designs that re-estimate only “nuisance” parameters, such as the variance of control group event rate, are more generally accepted.

  • Platform trials are useful for studying multiple targeted therapies in the context of a single outcome, with therapies allowed to enter or leave the platform on the basis of pre-specified stopping boundaries (229). This approach could be used to choose between several different products in a NPCT, e.g., different preparations of NP material from the same source, such as whole dried plant, different extracts, and/or material enriched for one or more putative bioactives (231). Such studies could be done under a master protocol (232), defined as a protocol encompassing multiple substudies, which may have different objectives. A master protocol supports coordinated efforts to evaluate one or more investigational interventions in one or more subpopulations within the overall trial structure.

Adaptive seamless designs combine objectives traditionally addressed in separate trials into a single CT. First-stage results inform the second stage, but final analyses use data from both stages. For example, the first stage may establish preferred dosage, while the second stage can assess the efficacy of that dose. To avoid bias in the interpretation of results, such designs require clear, comprehensive, pre-specified criteria and raise tricky statistical issues, as data from both stages must be combined in a way that preserves key statistical operating characteristics.

In general, most AD require slightly larger sample sizes than a corresponding, conventional Phase II CT, but provide increased opportunity to detect an effect, and can also lead to timelier identification of ineffective (or clearly more efficacious) interventions, improving efficiency by allowing early discontinuation of trials for less effective or ineffective products.

Value of information

Whether and how to pursue a NPCT depends on the value of the information that the CT can be expected to generate. Given the high costs of conducting a CT, it is of critical importance to prioritize the questions to be addressed (11). Value of Information (VOI) analysis provides a framework for generating quantitative estimates of the value of further research in a particular area. VOI estimates can serve as a guide for prioritizing research resources (233).

The basic framework for estimating the VOI is the product of several factors: (1) the value of the information to be gained for individuals who would be affected by the findings, including both changes in health status and changes in costs; (2) the number of affected individuals in the population; (3) the extent to which the information generated would be adopted or implemented; and (4) the degree to which the value of the knowledge gained will depreciate over time as additional knowledge becomes available. This estimated quantity is typically summed over the relevant time horizon, adjusted by an appropriate discount factor reflecting the preference for current over future improvements in health.

Generating estimated values for the elements of the VOI calculation can itself be a costly and complex process. Depending on the NPs under consideration and the effects to be tested, different levels of VOI analysis may be undertaken. An initial level of conceptual VOI can assess whether any of the factors in the product are close enough to zero that the overall VOI would be very small, indicating that substantial research investments would not be justified. Depending on other characteristics of the product under investigation, the health outcomes involved, and the specific data collection challenges to be addressed, progressively more detailed approaches to VOI exist, termed minimal modeling, full modelling, and maximal modeling, that provide more thorough estimates of the value of the information to be gained from pursuing additional research (233, 234).

The use of VOI analysis to support decisions regarding which NPCT to pursue (Table 6) depends significantly on the available research foundation. Assessing the VOI to be gained to a representative affected individual requires definition of the outcome(s), and understanding of the effect sizes to be achieved. Defining and estimating the size of the relevant affected population requires understanding of the dietary, environmental, and genetic context of the effects. Use of VOI in priority setting is still relatively new and will benefit from further exploration and input from researchers and decision-makers. In particular, VOI should be assessed to understand the extent to which it changes decision-making relative to conventional approaches, and the health impact and monetary value that result from such changes.

CONCLUSIONS

The multi-disciplinary approach

Rigorous, evidence-based decision-making with respect to the design and prioritization NPCTs ultimately requires bringing together myriad research sub-disciplines. These disciplines span from ethnobotany through NP chemistry, food science, a range of modeling approaches, biomedical specialties including CT design and statistics, to value of information research. Notably, researchers who specialize in one sub-discipline often do not have ready access to collaborators from other fields, or are unaware of relevant state-of-the-art approaches from these areas. However, the likelihood that an NPCT will be optimally informative depends not only on the rigor of the information contributed by each sub-discipline, but also the rigor with which the different threads are brought together, achieving collaborative synergy. Deploying and synergizing the good practices discussed by attendees at the September 2018 workshop, across the full range of this disciplinary continuum and from the earliest phases of developing data that may inform a future NPCT, should help focus translational NP research on objectives that are most likely to contribute to the design of, and increase the knowledge gained from future NPCTs (Figure 2).

Controversies

Two major areas of controversy at the workshop related to translation from traditional health practices. The first of these areas of disagreement was the appropriateness of the RCT approach for the study of traditional practices, with several attendees arguing that RCT methodology is not well suited to studying them. Most attendees, however, supported the need for NPCTs to assess the safety and efficacy of tradition-based approaches in a 21st century context, with the many associated differences, including in the interventions, and in individuals’ histories, environments, and behaviors. A second area of disagreement was the extent to which it is necessary to elucidate causal, molecular mechanisms of action, including determination of bioactive substituents and their in-animal targets, in order to support the design of an optimally informative NPCT. A case was made that, in some contexts, adherence to current good manufacturing practices, including rigorous adherence to standardized growing and processing procedures, and in conjunction with an appropriate assay of systemic exposure, and, in some cases, bioactivity, is sufficient to ensure NP replicability. However, many attendees felt that product specifications, PK assessments, eligibility criteria, and outcome assessments should, to the extent feasible, be based on strong mechanism of action evidence, preferably derived from conceptually independent (orthogonal, or triangulated) scientific approaches. Attendees considered such evidence to be critical both for optimal trial design and for the ability to interpret results after trial completion. Information on the mechanism of action, including compound(s) indispensable for inducing effects, as well as biological target(s) and response(s) indispensable for outcomes, may critically inform the design and/or interpretation of an NPCT, from product quality control through understanding the bases for individual differences in response, and may support the ability to develop biomarkers of proximate effects, or biomarkers useful to distinguish likely responders and non-responders.

NCPT good practices

In preclinical research aimed at informing the understanding of purported, causal, molecular mechanisms of action, good practices include the use of biologically relevant concentrations of biologically relevant compounds and/or their in vivo metabolites, in one or more relevant models. Participants also emphasized the importance of including appropriate controls for common artefacts encountered in in vitro assays for the activities of phytochemicals. Further approaches that were highlighted as good practices for strengthening the foundational data for NPCTs included: (a) high-throughput, non-targeted approaches that can generate strong, specific hypotheses regarding molecular mechanisms of action of chemically complex NP; (b) pre-registration of experiments at the hypothesis-testing stage, rigorous adherence to experimental protocols, randomization, and masking, which collectively minimize the risk that unconscious biases influence the results; (c) thoughtful consideration, when selecting model systems, of translational relevance and clinical validity for a specific research question; and (d) comprehensive and transparent reporting, combined with meaningful and FAIR data-sharing. Participants discussed these issues for models ranging from in silico through the burgeoning variety of cell-based models, from organoids through microphysiological systems, through in vivo models from invertebrates to transgenic, outbred, wildling, and humanized mouse strains.

Good practices for NPCT decision-making based on the available foundational data were extensively discussed and gave rise to the third major area of controversy at the workshop. Some attendees felt that n-of-1 trials (235) based on traditional use and/or current practitioner experience provide sufficient evidence for safety and efficacy, and are better suited to tests of these approaches than RCTs. Others felt that RCTs are critical, but that NPCTs should be undertaken only when there is a strong likelihood that the outcome will influence practice. The foundation for such an assessment would come from public health needs and a rigorous, evidence-based effect size estimate, that predicts an effect magnitude considered worthwhile by practitioners and consumers, collectively supported by multiple, distinct lines of evidence (225). The evidence base for such assessments is strongest when it includes experiments designed to falsify the hypotheses tested, such as chemical, genetic, or other knockdown studies (11, 35).

Outlook

While many of the practices discussed at the workshop are already employed by numerous NP researchers, few, if any reported NPCTs have combined the full breadth of state-of-the-art approaches when investigating the effect of a NP on a specific health outcome. Integrating these practices in the development, implementation and interpretation of NPCTs will require research teams to collaborate synergistically across disciplines. The anticipated benefit of combining the rigorous and trans-disciplinary approaches referenced herein increases the likelihood that, in the many instances where the outcome of a NPCT does not signal a change in practice, the trial will nevertheless have provided actionable new information, for example, about NP metabolism or the target biology.

ACKNOWLEDGEMENTS

The authors thank Paul M. Coates, Joseph M. Betz and F. Ellis McKenzie for inspiration and support, Wen Chen for help with the initial concept, Wendy P. O’Meara for insightful editing, Sharon Ross and Harold Seifried of the NIH National Cancer Institute for invaluable input into the workshop, Miriam Sander for workshop transcription and notes, Cara Lynch, James Hubley, Katy Fuller, and Joyce Merkel for outstanding technical support. This project was supported by staff and funding from the NIH’s Office of Dietary Supplements, National Cancer Institute, National Center for Complementary and Integrative Health, National Institute on Aging, National Institute on Alcohol Abuse and Alcoholism, and National Institute of Environmental Health Sciences. Support was also provided by staff from the Food and Drug Administration and the US Department of Agriculture.

MGF is a member of the Sensient Technologies Board of Directors and holds equity in Sensient Technologies.

KDRS holds equity in the following companies: Asklepion Pharmaceuticals, Baltimore MD, Ausio LLC, Cincinnati, Ohio, and Aliveris s.r.l, Italy; KDRS is a consultant to Retrophin, San Diego, and Sanitarium Health and Wellness Company, Berkleyvale, NSW, Australia and is an inventor on intellectual property related to equol.

ABBREVIATIONS

AD

adaptive (clinical trial) design

ADME

absorption, distribution, metabolism, excretion

CT

clinical trial

FADS

fatty acid desaturase

FAIR

findable, accessible, interoperable, reusable (desirable characteristics for archived data)

HTS

high-throughput screening

MPS

microphysiological system

NOD

non-obese, diabetic

NP

natural product

NPCT

natural product clinical trial

PBPK

physiologically based pharmacokinetic (a type of in silico model)

PD

pharmacodynamics

PK

pharmacokinetics

PUFA

poly-unsaturated fatty acid

QSP

quantitative systems pharmacology

RCT

randomized clinical trial

SCID

severe combined immune-deficiency

VOI

value of information

Footnotes

1

Per the 2014 NIH guidance, a clinical trial is defined here as a research study in which one or more human subjects are prospectively assigned to one or more interventions (which may include placebo or other control) to evaluate the effects of those interventions on health-related biomedical or behavioral outcomes [https://grants.nih.gov/policy/clinical-trials/definition.htm].

2

Traditional medical systems are defined by the WHO, in the WHO Global Report on Traditional and Complementary Medicine 2019 (https://apps.who.int/iris/bitstream/handle/10665/312342/9789241515436-eng.pdf?ua=1) as “the sum total of the knowledge, skill and practices based on the theories, beliefs and experiences indigenous to different cultures, whether explicable or not, used in the maintenance of health as well as in the prevention, diagnosis, improvement or treatment of physical and mental illness.”

Contributor Information

Barbara C. Sorkin, Office of Dietary Supplements, National Institutes of Health (NIH), Bethesda, MD, US.

Adam J. Kuszak, Office of Dietary Supplements, National Institutes of Health (NIH), Bethesda, MD, US

Gregory Bloss, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, US.

Naomi K. Fukagawa, USDA ARS, Beltsville, MD, US

Freddie Ann Hoffman, HeteroGeneity LLC, Washington, DC, US.

Mahtab Jafari, University of California, Irvine, Irvine, CA, US.

Bruce Barrett, University of Wisconsin, Madison, WI, US.

Paula N. Brown, British Columbia Institute of Technology, Burnaby, British Columbia, Canada

Frederic D. Bushman, University of Pennsylvania, Philadelphia, PA, US

Steven Casper, Office of Dietary Supplement Programs, Center for Food Safety and Applied Nutrition, Food and Drug Administration (FDA), Hyattsville, MD, US.

Floyd H. Chilton, Department of Nutritional Sciences and the BIO5 Institute, University of Arizona, Tucson, AZ, US

Christopher S. Coffey, University of Iowa, Iowa City, IA, US

Mario G. Ferruzzi, Plants for Human Health Institute, North Carolina State University, Kannapolis, NC, US

D. Craig Hopp, National Center for Complementary and Integrative Health, NIH, Bethesda, MD, US.

Mairead Kiely, Cork Centre for Vitamin D and Nutrition Research, School of Food and Nutritional Sciences, University College Cork, Ireland.

Daniel Lakens, Eindhoven University of Technology, Eindhoven, Netherlands.

John B. MacMillan, University of California, Santa Cruz, Santa Cruz, CA, US

David O. Meltzer, University of Chicago Medicine, Chicago, IL, US

Marco Pahor, University of Florida, Gainesville FL, US.

Jeffrey Paul, Drexel Graduate College of Biomedical Sciences, College of Medicine, Evanston, IL, US.

Kathleen Pritchett-Corning, Harvard University, Cambridge, MA, US.

Sara K. Quinney, Indiana University, Indianapolis, IN, US

Barbara Rehermann, National Institute of Diabetes and Digestive and Kidney Diseases, NIH, Bethesda, MD, US.

Kenneth D.R. Setchell, Cincinnati Children’s Hospital, Cincinnati, OH, US

Nisha S. Sipes, National Institute of Environmental Health Sciences, NIH, Research Triangle Park, NC, US

Jacqueline M. Stephens, Louisiana State University, Baton Rouge, LA, US

D. Lansing Taylor, University of Pittsburgh, Pittsburgh, PA, US.

Hervé Tiriac, University of California, San Diego, La Jolla, CA, US].

Michael A. Walters, Institute for Therapeutics Discovery and Development, University of Minnesota, Minneapolis, MN, US

Dan Xi, Office of Cancer Complementary and Alternative Medicine, National Cancer Institute, NIH, Shady Grove, MD, US.

Giovanna Zappalá, National Institute on Aging, NIH, Bethesda, MD, US.

Guido F. Pauli, CENAPT and PCRPS, University of Illinois at Chicago College of Pharmacy, Chicago, IL, US

REFERENCES

  • 1.Baden MY, Liu G, Satija A, Li Y, Sun Q, Fung TT, Rimm EB, Willett WC, Hu FB, and Bhupathiraju SN (2019) Changes in Plant-Based Diet Quality and Total and Cause-Specific Mortality. Circulation 140, 979–991 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Britton ER, Kellogg JJ, Kvalheim OM, and Cech NB (2018) Biochemometrics to Identify Synergists and Additives from Botanical Medicines: A Case Study with Hydrastis canadensis (Goldenseal). J. Nat. Prod. 81, 484–493 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Wang J, Hodes GE, Zhang H, Zhang S, Zhao W, Golden SA, Bi W, Menard C, Kana V, Leboeuf M, Xie M, Bregman D, Pfau ML, Flanigan ME, Esteban-Fernández A, Yemul S, Sharma A, Ho L, Dixon R, Merad M, Han M-H, Russo SJ, and Pasinetti GM (2018) Epigenetic modulation of inflammation and synaptic plasticity promotes resilience against stress in mice. Nat. Commun. 9, 477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Boudreau A, Poulev A, Ribnicky DM, Raskin I, Rathinasabapathy T, Richard AJ, and Stephens JM (2019) Distinct Fractions of an Artemisia scoparia Extract Contain Compounds With Novel Adipogenic Bioactivity. Frontiers in Nutrition, in press (DOI: 10.3389/fnut.2019.00018) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Munyangi J, Cornet-Vernet L, Idumbo M, Lu C, Lutgen P, Perronne C, Ngombe N, Bianga J, Mupenda B, Lalukala P, Mergeai G, Mumba D, Towler M, and Weathers P (2019) Artemisia annua and Artemisia afra tea infusions vs. artesunate-amodiaquine (ASAQ) in treating Plasmodium falciparum malaria in a large scale, double blind, randomized clinical trial. Phytomedicine 57, 49–56 [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 6.Hopp C Past and Future Research at National Center for Complementary and Integrative Health with Respect to Botanicals. HerbalGram 107, 44–51 [Google Scholar]
  • 7.Manson JE, Cook NR, Lee I-M, Christen W, Bassuk SS, Mora S, Gibson H, Gordon D, Copeland T, D’Agostino D, Friedenberg G, Ridge C, Bubes V, Giovannucci EL, Willett WC, Buring JE, and VITAL Research Group. (2019) Vitamin D Supplements and Prevention of Cancer and Cardiovascular Disease. N. Engl. J. Med. 380, 33–4430415629 [Google Scholar]
  • 8.Manson JE, Cook NR, Lee I-M, Christen W, Bassuk SS, Mora S, Gibson H, Albert CM, Gordon D, Copeland T, D’Agostino D, Friedenberg G, Ridge C, Bubes V, Giovannucci EL, Willett WC, Buring JE, and VITAL Research Group. (2019) Marine n-3 Fatty Acids and Prevention of Cardiovascular Disease and Cancer. N. Engl. J. Med. 380, 23–32 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Berman J (2011) Clinical development of dietary supplements: The perils of starting at phase III. Fitoterapia 82, 80–84 [DOI] [PubMed] [Google Scholar]
  • 10.Sertkaya A, Wong H-H, Jessup A, and Beleche T (2016) Key cost drivers of pharmaceutical clinical trials in the United States. Clin. Trials 13, 117–126 [DOI] [PubMed] [Google Scholar]
  • 11.Zarin DA, Goodman SN, and Kimmelman J (2019) Harms From Uninformative Clinical Trials. JAMA, published online July 25, 2019322(9):813–814. doi: 10.1001/jama.2019.9892 [DOI] [PubMed] [Google Scholar]
  • 12.Applequist WL and Miller JS (2013) Selection and authentication of botanical materials for the development of analytical methods. Anal. Bioanal. Chem. 405, 4419–4428 [DOI] [PubMed] [Google Scholar]
  • 13.Sorkin BC, Kuszak AJ, Williamson JS, Hopp DC, and Betz JM (2016) The Challenge of Reproducibility and Accuracy in Nutrition Research: Resources and Pitfalls . Adv. Nutr. 7, 383–389 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Simmler C, Graham JG, Chen S-N, and Pauli GF (2018) Integrated analytical assets aid botanical authenticity and adulteration management. Fitoterapia 129, 401–414 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Fenclova M, Novakova A, Viktorova J, Jonatova P, Dzuman Z, Ruml T, Kren V, Hajslova J, Vitek L, and Stranska-Zachariasova M (2019) Poor chemical and microbiological quality of the commercial milk thistle-based dietary supplements may account for their reported unsatisfactory and non-reproducible clinical outcomes. Sci. Rep. 9, 11118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rider CV, Walker NJ, and Waidyanatha S (2018) Getting to the Root of the Matter: Challenges and Recommendations for Assessing the Safety of Botanical Dietary Supplements. Clin. Pharmacol. Ther. 104, 429–431 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kellogg JJ, Paine MF, McCune JS, Oberlies NH, and Cech NB (2019) Selection and characterization of botanical natural products for research studies: a NaPDI center recommended approach. Nat. Prod. Rep. 36, 1196–1221 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.National Center for Complementary and Integrative Health (NCCIH). NCCIH Policy: Natural Product Integrity (https://nccih.nih.gov/research/policies/naturalproduct.htm). NCCIH/NIH [Google Scholar]
  • 19.Gagnier JJ, Boon H, Rochon P, Moher D, Barnes J, and Bombardier C (2006) Recommendations for reporting randomized controlled trials of herbal interventions: explanation and elaboration. Journal of Clinical Epidemiology 59, 1134–1149 [DOI] [PubMed] [Google Scholar]
  • 20.Kilkenny C, Browne WJ, Cuthill IC, Emerson M, and Altman DG (2010) Improving bioscience research reporting: the ARRIVE guidelines for reporting animal research. J. Pharmacol. Pharmacother. 1, 94–99 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Moher D, Hopewell S, Schulz KF, Montori V, Gøtzsche PC, Devereaux PJ, Elbourne D, Egger M, and Altman DG (2010) CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials. BMJ 340, c869. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Center for Drug Evaluation and Research. (2019) Botanical Drug Development: Guidance for Industry. U.S. Food and Drug Administration [Google Scholar]
  • 23.Kroeger CM, Garza C, Lynch CJ, Myers E, Rowe S, Schneeman BO, Sharma AM, and Allison DB (2018) Scientific rigor and credibility in the nutrition research landscape. Am. J. Clin. Nutr. 107, 484–494 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Smith AJ, Clutton RE, Lilley E, Hansen KEA, and Brattelid T (2018) Improving animal research: PREPARE before you ARRIVE. BMJ 360, k760. [DOI] [PubMed] [Google Scholar]
  • 25.Mebane CA, Sumpter JP, Fairbrother A, Augspurger TP, Canfield TJ, Goodfellow WL, Guiney PD, LeHuray A, Maltby L, Mayfield DB, McLaughlin MJ, Ortego LS, Schlekat T, Scroggins RP, and Verslycke TA (2019) Scientific integrity issues in Environmental Toxicology and Chemistry: Improving research reproducibility, credibility, and transparency. Integr. Environ. Assess. Manag. 15, 320–344 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.U.S. Food and Drug Administration. Considerations in Demonstrating Interchangeability With a Reference Product (PDF). U.S. FDA [Google Scholar]
  • 27.National Institutes of Health (NIH). Enhancing Reproducibility through Rigor and Transparency (https://grants.nih.gov/policy/reproducibility/index.htm). NIH [Google Scholar]
  • 28.Brenna JT, Plourde M, Stark KD, Jones PJ, and Lin Y-H (2018) Best practices for the design, laboratory analysis, and reporting of trials involving fatty acids. Am. J. Clin. Nutr. 108, 211–227 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Goodman SN, Fanelli D, and Ioannidis JPA (2016) What does research reproducibility mean? Sci. Transl. Med. 8, 341ps12. [DOI] [PubMed] [Google Scholar]
  • 30.Plesser HE (2017) Reproducibility vs. Replicability: a Brief History of a Confused Terminology. Front. Neuroinform. 11, 76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.National Academies of Sciences, Engineering, and Medicine. (2019) Reproducibility and Replicability in Science. Washington, DC: The National Academies Press; 10.17226/25303. [DOI] [PubMed] [Google Scholar]
  • 32.Morgan P, Van Der Graaf PH, Arrowsmith J, Feltner DE, Drummond KS, Wegner CD, and Street SDA (2012) Can the flow of medicines be improved? Fundamental pharmacokinetic and pharmacological principles toward improving Phase II survival. Drug Discov. Today 17, 419–424 [DOI] [PubMed] [Google Scholar]
  • 33.Karran E and De Strooper B (2016) The amyloid cascade hypothesis: are we poised for success or failure? Journal of Neurochemistry 139, 237–252 [DOI] [PubMed] [Google Scholar]
  • 34.Karran E and Hardy J (2014) A critique of the drug discovery and phase 3 clinical programs targeting the amyloid hypothesis for Alzheimer disease. Ann. Neurol. 76, 185–205 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Wood AC, Wren JD, and Allison DB (2019) The Need for Greater Rigor in Childhood Nutrition and Obesity Research. JAMA Pediatr. 173, 311–312 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Harrington D, D’Agostino RB, Gatsonis C, Hogan JW, Hunter DJ, Normand S-LT, Drazen JM, and Hamel MB (2019) New Guidelines for Statistical Reporting in the Journal. New England Journal of Medicine 381, 285–286 [DOI] [PubMed] [Google Scholar]
  • 37.Ferreira ML, Herbert RD, Ferreira PH, Latimer J, Ostelo RW, Grotle M, and Barrett B (2013) The smallest worthwhile effect of nonsteroidal anti-inflammatory drugs and physiotherapy for chronic low back pain: a benefit-harm trade-off study. J. Clin. Epidemiol. 66, 1397–1404 [DOI] [PubMed] [Google Scholar]
  • 38.Freedman LP, Gibson MC, Wisman R, Ethier SP, Soule HR, Reid YA, and Neve RM (2015) The culture of cell culture practices and authentication--Results from a 2015 Survey. Biotechniques 59, 189–190, 192 [DOI] [PubMed] [Google Scholar]
  • 39.Freedman LP, Venugopalan G, and Wisman R (2017) Reproducibility2020: Progress and priorities. F1000Res. 6, 604. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Button KS, Ioannidis JPA, Mokrysz C, Nosek BA, Flint J, Robinson ESJ, and Munafò MR (2013) Empirical evidence for low reproducibility indicates low pre-study odds. Nat. Rev. Neurosci. 14, 877. [DOI] [PubMed] [Google Scholar]
  • 41.Sitta Sittampalam G, Grossman A, Brimacombe K, Arkin M, Auld D, Austin C, Baell J, Bejcek B, Caaveiro JMM, Chung TDY, Coussens NP, Dahlin JL, Devanaryan V, Foley TL, Glicksman M, Hall MD, Haas JV, Hoare SRJ, Inglese J, Iversen PW, Kahl SD, Kales SC, Kirshner S, Lal-Nag M, Li Z, McGee J, McManus O, Riss T, Joseph Trask O., Jr, Weidner JR, Wildey MJ, Xia M, and Xu X. (2004) Assay Guidance Manual. https://www.ncbi.nlm.nih.gov/books/NBK53196/ [Google Scholar]
  • 42.Bradbury A and Plückthun A (2015) Reproducibility: Standardize antibodies used in research. Nature 518, 27–29 [DOI] [PubMed] [Google Scholar]
  • 43.Schinkovitz A, Pro SM, Main M, Chen S-N, Jaki BU, Lankin DC, and Pauli GF (2008) Dynamic nature of the ligustilide complex. J. Nat. Prod. 71, 1604–1611 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Schneider C, Gordon ON, Edwards RL, and Luis PB (2015) Degradation of Curcumin: from Mechanism to Biological Implications. J. Agric. Food Chem. 63, 7606–7614 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Duan D, Doak AK, Nedyalkova L, and Shoichet BK (2015) Colloidal aggregation and the in vitro activity of traditional Chinese medicines. ACS Chem. Biol. 10, 978–988 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Ingólfsson HI, Thakur P, Herold KF, Hobart EA, Ramsey NB, Periole X, de Jong DH, Zwama M, Yilmaz D, Hall K, Maretzky T, Hemmings H. C., Jr, Blobel C, Marrink SJ, Koçer A, Sack JT, and Andersen OS. (2014) Phytochemicals perturb membranes and promiscuously alter protein function. ACS Chem. Biol. 9, 1788–1798 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Baell JB (2016) Feeling Nature’s PAINS: Natural Products, Natural Product Drugs, and Pan Assay Interference Compounds (PAINS). Journal of Natural Products 79, 616–628 [DOI] [PubMed] [Google Scholar]
  • 48.Nelson KM, Dahlin JL, Bisson J, Graham J, Pauli GF, and Walters MA (2017) The Essential Medicinal Chemistry of Curcumin. J. Med. Chem. 60, 1620–1637 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Friesen JB, Liu Y, Chen S-N, McAlpine JB, and Pauli GF (2019) Selective Depletion and Enrichment of Constituents in “Curcumin” and Other Curcuma longa Preparations. J. Nat. Prod. 82, 621–630 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Dahlin JL and Walters MA (2014) The essential roles of chemistry in high-throughput screening triage. Future Med. Chem. 6, 1265–1290 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Dahlin JL, Baell J, and Walters MA (2015) Assay Interference by Chemical Reactivity. In Assay Guidance Manual [Internet] Eli Lilly & Company and the National Center for Advancing Translational Sciences [PubMed] [Google Scholar]
  • 52.Ramaiahgari SC, Waidyanatha S, Dixon D, DeVito MJ, Paules RS, and Ferguson SS (2017) Three-Dimensional (3D) HepaRG Spheroid Model With Physiologically Relevant Xenobiotic Metabolism Competence and Hepatocyte Functionality for Liver Toxicity Screening. Toxicol. Sci. 160, 189–190 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Tuveson D and Clevers H (2019) Cancer modeling meets human organoid technology. Science 364, 952–955 [DOI] [PubMed] [Google Scholar]
  • 54.Esch EW, Bahinski A, and Huh D (2015) Organs-on-chips at the frontiers of drug discovery. Nat. Rev. Drug Discov. 14, 248–260 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Dutta D, Heo I, and Clevers H (2017) Disease Modeling in Stem Cell-Derived 3D Organoid Systems. Trends Mol. Med. 23, 393–410 [DOI] [PubMed] [Google Scholar]
  • 56.Skardal A, Shupe T, and Atala A (2016) Organoid-on-a-chip and body-on-a-chip systems for drug screening and disease modeling. Drug Discov. Today 21, 1399–1411 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Ewart L, Fabre K, Chakilam A, Dragan Y, Duignan DB, Eswaraka J, Gan J, Guzzie-Peck P, Otieno M, Jeong CG, Keller DA, de Morais SM, Phillips JA, Proctor W, Sura R, Van Vleet T, Watson D, Will Y, Tagle D, and Berridge B (2017) Navigating tissue chips from development to dissemination: A pharmaceutical industry perspective. Exp. Biol. Med. 242, 1579–1585 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Low LA and Tagle DA (2017) Tissue chips - innovative tools for drug development and disease modeling. Lab Chip 17, 3026–3036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Vernetti L, Gough A, Baetz N, Blutt S, Broughman JR, Brown JA, Foulke-Abel J, Hasan N, In J, Kelly E, Kovbasnjuk O, Repper J, Senutovitch N, Stabb J, Yeung C, Zachos NC, Donowitz M, Estes M, Himmelfarb J, Truskey G, Wikswo JP, and Taylor DL (2017) Functional Coupling of Human Microphysiology Systems: Intestine, Liver, Kidney Proximal Tubule, Blood-Brain Barrier and Skeletal Muscle. Sci. Rep. 7, 42296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Vernetti LA, Vogt A, Gough A, and Taylor DL (2017) Evolution of Experimental Models of the Liver to Predict Human Drug Hepatotoxicity and Efficacy. Clin. Liver Dis. 21, 197–214 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Hartung T (2009) Toxicology for the twenty-first century. Nature 460, 208–212 [DOI] [PubMed] [Google Scholar]
  • 62.Seok J, Warren HS, Cuenca AG, Mindrinos MN, Baker HV, Xu W, Richards DR, McDonald-Smith GP, Gao H, Hennessy L, Finnerty CC, López CM, Honari S, Moore EE, Minei JP, Cuschieri J, Bankey PE, Johnson JL, Sperry J, Nathens AB, Billiar TR, West MA, Jeschke MG, Klein MB, Gamelli RL, Gibran NS, Brownstein BH, Miller-Graziano C, Calvano SE, Mason PH, Cobb JP, Rahme LG, Lowry SF, Maier RV, Moldawer LL, Herndon DN, Davis RW, Xiao W, Tompkins RG, and Inflammation and Host Response to Injury, Large Scale Collaborative Research Program. (2013) Genomic responses in mouse models poorly mimic human inflammatory diseases. Proc. Natl. Acad. Sci. U. S. A. 110, 3507–3512 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Baker LA, Tiriac H, Clevers H, and Tuveson DA (2016) Modeling pancreatic cancer with organoids. Trends Cancer Res. 2, 176–190 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Aberle MR, Burkhart RA, Tiriac H, Olde Damink SWM, Dejong CHC, Tuveson DA, and van Dam RM (2018) Patient-derived organoid models help define personalized management of gastrointestinal cancer. Br. J. Surg. 105, e48–e60 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Li X, George SM, Vernetti L, Gough AH, and Taylor DL (2018) A glass-based, continuously zonated and vascularized human liver acinus microphysiological system (vLAMPS) designed for experimental modeling of diseases and ADME/TOX. Lab Chip 18, 2614–2631 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Neal JT, Li X, Zhu J, Giangarra V, Grzeskowiak CL, Ju J, Liu IH, Chiou S-H, Salahudeen AA, Smith AR, Deutsch BC, Liao L, Zemek AJ, Zhao F, Karlsson K, Schultz LM, Metzner TJ, Nadauld LD, Tseng Y-Y, Alkhairy S, Oh C, Keskula P, Mendoza-Villanueva D, De La Vega FM, Kunz PL, Liao JC, Leppert JT, Sunwoo JB, Sabatti C, Boehm JS, Hahn WC, Zheng GXY, Davis MM, and Kuo CJ (2018) Organoid Modeling of the Tumor Immune Microenvironment. Cell 175, 1972–1988.e16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Minekus M, Marteau P, Havenaar R, and Huis in ‘t Veld JHH. (1995) Multicompartmental dynamic computer-controlled model simulating the stomach and small intestine. Altern. Lab. Anim 23 [Google Scholar]
  • 68.Peng Y-S, Liu B, Wang R-F, Zhao Q-T, Xu W, and Yang X-W (2015) Hepatic metabolism: a key component of herbal drugs research. J. Asian Nat. Prod. Res. 17, 89–106 [DOI] [PubMed] [Google Scholar]
  • 69.Subramanian A, Narayan R, Corsello SM, Peck DD, Natoli TE, Lu X, Gould J, Davis JF, Tubelli AA, Asiedu JK, Lahr DL, Hirschman JE, Liu Z, Donahue M, Julian B, Khan M, Wadden D, Smith IC, Lam D, Liberzon A, Toder C, Bagul M, Orzechowski M, Enache OM, Piccioni F, Johnson SA, Lyons NJ, Berger AH, Shamji AF, Brooks AN, Vrcic A, Flynn C, Rosains J, Takeda DY, Hu R, Davison D, Lamb J, Ardlie K, Hogstrom L, Greenside P, Gray NS, Clemons PA, Silver S, Wu X, Zhao W-N, Read-Button W, Wu X, Haggarty SJ, Ronco LV, Boehm JS, Schreiber SL, Doench JG, Bittker JA, Root DE, Wong B, and Golub TR (2017) A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles. Cell 171, 1437–1452.e17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Ochoa JL, Bray WM, Lokey RS, and Linington RG (2015) Phenotype-Guided Natural Products Discovery Using Cytological Profiling. J. Nat. Prod. 78, 2242–2248 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Caicedo JC, Cooper S, Heigwer F, Warchal S, Qiu P, Molnar C, Vasilevich AS, Barry JD, Bansal HS, Kraus O, Wawer M, Paavolainen L, Herrmann MD, Rohban M, Hung J, Hennig H, Concannon J, Smith I, Clemons PA, Singh S, Rees P, Horvath P, Linington RG, and Carpenter AE (2017) Data-analysis strategies for image-based cell profiling. Nat. Methods 14, 849–863 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Potts MB, Kim HS, Fisher KW, Hu Y, Carrasco YP, Bulut GB, Ou Y-H, Herrera-Herrera ML, Cubillos F, Mendiratta S, Xiao G, Hofree M, Ideker T, Xie Y, Huang LJ-S, Lewis RE, MacMillan JB, and White MA (2013) Using functional signature ontology (FUSION) to identify mechanisms of action for natural products. Sci. Signal. 6, ra90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Vaden RM, Oswald NW, Potts MB, MacMillan JB, and White MA (2017) FUSION-Guided Hypothesis Development Leads to the Identification of N6,N6-Dimethyladenosine, a Marine-Derived AKT Pathway Inhibitor. Mar. Drugs 15 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Das B, Neilsen BK, Fisher KW, Gehring D, Hu Y, Volle DJ, Kim HS, McCall JL, Kelly DL, MacMillan JB, White MA, and Lewis RE (2018) A Functional Signature Ontology (FUSION) screen detects an AMPK inhibitor with selective toxicity toward human colon tumor cells. Sci. Rep. 8, 3770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Litichevskiy L, Peckner R, Abelin JG, Asiedu JK, Creech AL, Davis JF, Davison D, Dunning CM, Egertson JD, Egri S, Gould J, Ko T, Johnson SA, Lahr DL, Lam D, Liu Z, Lyons NJ, Lu X, MacLean BX, Mungenast AE, Officer A, Natoli TE, Papanastasiou M, Patel J, Sharma V, Toder C, Tubelli AA, Young JZ, Carr SA, Golub TR, Subramanian A, MacCoss MJ, Tsai L-H, and Jaffe JD (2018) A Library of Phosphoproteomic and Chromatin Signatures for Characterizing Cellular Responses to Drug Perturbations. Cell Syst. 6, 424–443.e7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Wang Z, Lachmann A, Keenan AB, and Ma’ayan A (2018) L1000FWD: fireworks visualization of drug-induced transcriptomic signatures. Bioinformatics 34, 2150–2152 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Cech NB, Junio HA, Ackermann LW, Kavanaugh JS, and Horswill AR (2012) Quorum quenching and antimicrobial activity of goldenseal (Hydrastis canadensis) against methicillin-resistant Staphylococcus aureus (MRSA). Planta Med. 78, 1556–1561 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Elfawal MA, Towler MJ, Reich NG, Golenbock D, Weathers PJ, and Rich SM (2012) Dried whole plant Artemisia annua as an antimalarial therapy. PLoS One 7, e52746. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Caesar LK and Cech NB (2019) Synergy and antagonism in natural product extracts: when 1 + 1 does not equal 2. Nat. Prod. Rep. 36, 869–888 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Lu T, Aron L, Zullo J, Pan Y, Kim H, Chen Y, Yang T-H, Kim H-M, Drake D, Liu XS, Bennett DA, Colaiácovo MP, and Yankner BA (2014) REST and stress resistance in ageing and Alzheimer’s disease. Nature 507, 448–454 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Lithgow GJ, Driscoll M, and Phillips P (2017) A long journey to reproducible results. Nature 548, 387–388 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Jafari M (2010) Drosophila melanogaster as a model system for the evaluation of anti-aging compounds. Fly 4, 253–257 [DOI] [PubMed] [Google Scholar]
  • 83.Jafari M (2015) Healthspan Pharmacology. Rejuvenation Res. 18, 573–580 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Schriner SE, Abrahamyan A, Avanessian A, Bussel I, Maler S, Gazarian M, Holmbeck MA, and Jafari M (2009) Decreased mitochondrial superoxide levels and enhanced protection against paraquat in Drosophila melanogaster supplemented with Rhodiola rosea. Free Radic. Res. 43, 836–843 [DOI] [PubMed] [Google Scholar]
  • 85.Schriner SE, Lee K, Truong S, Salvadora KT, Maler S, Nam A, Lee T, and Jafari M (2013) Extension of Drosophila lifespan by Rhodiola rosea through a mechanism independent from dietary restriction. PLoS One 8, e63886. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Labachyan KE, Kiani D, Sevrioukov EA, Schriner SE, and Jafari M (2018) The impact of on the gut microbial community of Drosophila melanogaster. Gut Pathog. 10, 12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Lopez T, Schriner SE, Okoro M, Lu D, Chiang BT, Huey J, and Jafari M (2014) Green tea polyphenols extend the lifespan of male Drosophila melanogaster while impairing reproductive fitness. J. Med. Food 17, 1314–1321 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Judson RS, Magpantay FM, Chickarmane V, Haskell C, Tania N, Taylor J, Xia M, Huang R, Rotroff DM, Filer DL, Houck KA, Martin MT, Sipes N, Richard AM, Mansouri K, Setzer RW, Knudsen TB, Crofton KM, and Thomas RS (2015) Integrated Model of Chemical Perturbations of a Biological Pathway Using 18 In Vitro High-Throughput Screening Assays for the Estrogen Receptor. Toxicol. Sci. 148, 137–154 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Sipes NS, Wambaugh JF, Pearce R, Auerbach SS, Wetmore BA, Hsieh J-H, Shapiro AJ, Svoboda D, DeVito MJ, and Ferguson SS (2017) An Intuitive Approach for Predicting Potential Human Health Risk with the Tox21 10k Library. Environ. Sci. Technol. 51, 10786–10796 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Browne P, Judson RS, Casey WM, Kleinstreuer NC, and Thomas RS (2015) Screening Chemicals for Estrogen Receptor Bioactivity Using a Computational Model. Environ. Sci. Technol. 49, 8804–8814 [DOI] [PubMed] [Google Scholar]
  • 91.Bell SM, Chang X, Wambaugh JF, Allen DG, Bartels M, Brouwer KLR, Casey WM, Choksi N, Ferguson SS, Fraczkiewicz G, Jarabek AM, Ke A, Lumen A, Lynn SG, Paini A, Price PS, Ring C, Simon TW, Sipes NS, Sprankle CS, Strickland J, Troutman J, Wetmore BA, and Kleinstreuer NC (2018) In vitro to in vivo extrapolation for high throughput prioritization and decision making. Toxicol. In Vitro 47, 213–227 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Wambaugh JF, Hughes MF, Ring CL, MacMillan DK, Ford J, Fennell TR, Black SR, Snyder RW, Sipes NS, Wetmore BA, Westerhout J, Setzer RW, Pearce RG, Simmons JE, and Thomas RS (2018) Evaluating In Vitro-In Vivo Extrapolation of Toxicokinetics. Toxicol. Sci. 163, 152–169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Kavlock RJ, Austin CP, and Tice RR (2009) Toxicity testing in the 21st century: implications for human health risk assessment. Risk Anal. 29, 485–487; discussion 492–497 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Kavlock R, Chandler K, Houck K, Hunter S, Judson R, Kleinstreuer N, Knudsen T, Martin M, Padilla S, Reif D, Richard A, Rotroff D, Sipes N, and Dix D (2012) Update on EPA’s ToxCast program: providing high throughput decision support tools for chemical risk management. Chem. Res. Toxicol. 25, 1287–1302 [DOI] [PubMed] [Google Scholar]
  • 95.Thomas RS, Paules RS, Simeonov A, Fitzpatrick SC, Crofton KM, Casey WM, and Mendrick DL (2018) The US Federal Tox21 Program: A strategic and operational plan for continued leadership. ALTEX 35, 163–168 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Thomas RS, Bahadori T, Buckley TJ, Cowden J, Deisenroth C, Dionisio KL, Frithsen JB, Grulke CM, Gwinn MR, Harrill JA, Higuchi M, Houck KA, Hughes MF, Hunter ES, Isaacs KK, Judson RS, Knudsen TB, Lambert JC, Linnenbrink M, Martin TM, Newton SR, Padilla S, Patlewicz G, Paul-Friedman K, Phillips KA, Richard AM, Sams R, Shafer TJ, Setzer RW, Shah I, Simmons JE, Simmons SO, Singh A, Sobus JR, Strynar M, Swank A, Tornero-Valez R, Ulrich EM, Villeneuve DL, Wambaugh JF, Wetmore BA, and Williams AJ (2019) The Next Generation Blueprint of Computational Toxicology at the U.S. Environmental Protection Agency. Toxicol. Sci. 169, 317–332 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Hubbard TD, Hsieh J-H, Rider CV, Sipes NS, Sedykh A, Collins BJ, Auerbach SS, Xia M, Huang R, Walker NJ, and DeVito MJ (2019) Using Tox21 High-Throughput Screening Assays for the Evaluation of Botanical and Dietary Supplements. Appl. In Vitro Toxicol. 5, 10–25 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Ramaiahgari SC, Auerbach SS, Saddler TO, Rice JR, Dunlap PE, Sipes NS, DeVito MJ, Shah RR, Bushel PR, Merrick BA, Paules RS, and Ferguson SS (2019) The Power of Resolution: Contextualized Understanding of Biological Responses to Liver Injury Chemicals Using High-throughput Transcriptomics and Benchmark Concentration Modeling. Toxicol. Sci. 169, 553–566 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Fitzgerald JB, Schoeberl B, Nielsen UB, and Sorger PK (2006) Systems biology and combination therapy in the quest for clinical efficacy. Nat. Chem. Biol. 2, 458–466 [DOI] [PubMed] [Google Scholar]
  • 100.Wilkinson MD, Dumontier M, Jan Aalbersberg I, Appleton G, Axton M, Baak A, Blomberg N, Boiten J-W, da Silva Santos LB, Bourne PE, Bouwman J, Brookes AJ, Clark T, Crosas M, Dillo I, Dumon O, Edmunds S, Evelo CT, Finkers R, Gonzalez-Beltran A, Gray AJG, Groth P, Goble C, Grethe JS, Heringa J, Hoen P. A. C.‘t, Hooft R, Kuhn T, Kok R, Kok J, Lusher SJ, Martone ME, Mons A, Packer AL, Persson B, Rocca-Serra P, Roos M, van Schaik R, Sansone S-A, Schultes E, Sengstag T, Slater T, Strawn G, Swertz MA, Thompson M, van der Lei J, van Mulligen E, Velterop Jan, Waagmeester A, Wittenburg P, Wolstencroft K, Zhao J, and Mons B. (2019) Addendum: The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 6, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Božin B, Kladar N, Grujić N, Anačkov G, Samojlik I, Gavarić N, and Conić BS (2013) Impact of origin and biological source on chemical composition, anticholinesterase and antioxidant properties of some St. John’s wort species (Hypericum spp., Hypericaceae) from the Central Balkans. Molecules 18, 11733–11750 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Ortiz D, Rocheford T, and Ferruzzi MG (2016) Influence of Temperature and Humidity on the Stability of Carotenoids in Biofortified Maize (Zea mays L.) Genotypes during Controlled Postharvest Storage. J. Agric. Food Chem. 64, 2727–2736 [DOI] [PubMed] [Google Scholar]
  • 103.Milanowski DJ, Winter REK, Elvin-Lewis MPF, and Lewis WH (2002) Geographic distribution of three alkaloid chemotypes of Croton lechleri. J. Nat. Prod. 65, 814–819 [DOI] [PubMed] [Google Scholar]
  • 104.Hildreth J, Hrabeta-Robinson E, Applequist W, Betz J, and Miller J (2007) Standard operating procedure for the collection and preparation of voucher plant specimens for use in the nutraceutical industry. Anal. Bioanal. Chem. 389, 13–17 [DOI] [PubMed] [Google Scholar]
  • 105.Klein MA, Nahin RL, Messina MJ, Rader JI, Thompson LU, Badger TM, Dwyer JT, Kim YS, Pontzer CH, Starke-Reed PE, and Weaver CM (2010) Guidance from an NIH workshop on designing, implementing, and reporting clinical studies of soy interventions. J. Nutr. 140, 1192S–1204S [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Eisenman SW, Tucker AO, and Struwe L (2012) Voucher specimens are essential for documenting source material used in medicinal plant investigations. Journal of Medicinally Active Plants 1, 30–43 [Google Scholar]
  • 107.Gagnier JJ, Moher D, Boon H, Beyene J, and Bombardier C (2011) Randomized controlled trials of herbal interventions underreport important details of the intervention. J. Clin. Epidemiol. 64, 760–769 [DOI] [PubMed] [Google Scholar]
  • 108.McManis CR (1998) The interface between international intellectual property and environmental protection: biodiversity and biotechnology. Wash. ULQ 76, 255 [Google Scholar]
  • 109.Battiste M and Henderson JY (2018) Compulsory Schooling and Cognitive Imperialism: A Case for Cognitive Justice and Reconciliation with Indigenous Peoples. The Palgrave Handbook of Education Law for Schools 567–583 [Google Scholar]
  • 110.Cheng C-W, Wu T-X, Shang H-C, Li Y-P, Altman DG, Moher D, Bian Z-X, and CONSORT-CHM Formulas 2017 Group. (2017) CONSORT Extension for Chinese Herbal Medicine Formulas 2017: Recommendations, Explanation, and Elaboration (Traditional Chinese Version). Ann. Intern. Med. 167, W7–W20 [DOI] [PubMed] [Google Scholar]
  • 111.Gusev PA, Savarala S, Atkinson R, Dang PT, Oh L, McNeal M, Pehrsson P, Dwyer J, Betz JM, Kuszak AJ, Costello R, Douglass LW, Saldanha LW. (2018) Disintegration and Rupture Testing of Prescription Prenatal Multivitamin/Mineral Dosage Forms: A Pilot Study for the Dietary Supplement Ingredient Database (DSID) presented at the Current Developments in Nutrition, Boston, MA [Google Scholar]
  • 112.Neilson AP and Ferruzzi MG (2011) Influence of formulation and processing on absorption and metabolism of flavan-3-ols from tea and cocoa. Annu. Rev. Food Sci. Technol. 2, 125–151 [DOI] [PubMed] [Google Scholar]
  • 113.Chen S-H, Forrester W, and Lahav G (2016) Schedule-dependent interaction between anticancer treatments. Science 351, 1204–1208 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Catlin NR, Collins BJ, Auerbach SS, Ferguson SS, Harnly JM, Gennings C, Waidyanatha S, Rice GE, Smith-Roe SL, Witt KL, and Rider CV (2018) How similar is similar enough? A sufficient similarity case study with Ginkgo biloba extract. Food Chem. Toxicol. 118, 328–339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Waidyanatha S, Ryan K, Roe AL, Jia W, Paine MF, Ferguson S, Gurley BJ, Welch K, Chow MSS, Devito M, and Rider C (2018) Follow that botanical: Challenges and recommendations for assessing absorption, distribution, metabolism and excretion of botanical dietary supplements. Food Chem. Toxicol. 121, 194–202 [DOI] [PubMed] [Google Scholar]
  • 116.Sheng B, Vo Y, Lan P, Gardiner MG, Banwell MG, and Sun P (2019) Direct and Metal-Catalyzed Photochemical Dimerization of the Phthalide (Z)-Ligustilide Leading to Both [2 + 2] and [4 + 2] Cycloadducts: Application to Total Syntheses of Tokinolides A-C and Riligustilide. Org. Lett. 21, 6295–6299 [DOI] [PubMed] [Google Scholar]
  • 117.Duric K, Liu Y, Chen S-N, Lankin D, Nikolic D, McAlpine J, Friesen JB, and Pauli G (2019) Studying Mass Balance and the Stability of (Z)-Ligustilide from Angelica sinensis Helps to Bridge a Botanical Instability-Bioactivity Chasm. J. Nat. Prod. 82, 2400–2408 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Chen S-N, Lankin DC, Chadwick LR, Jaki BU, and Pauli GF (2009) Dynamic residual complexity of natural products by qHNMR: solution stability of desmethylxanthohumol. Planta Med. 75, 757–762 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Jia M, Chen L, Xin H-L, Zheng C-J, Rahman K, Han T, and Qin L-P (2016) A Friendly Relationship between Endophytic Fungi and Medicinal Plants: A Systematic Review. Front. Microbiol. 7, 906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Li W, Chan C-K, Liu Y, Yao J, Mitić B, Kostić EN, Milosavljević B, Davinić I, Orem WH, Tatu CA, Dedon PC, Pavlović NM, and Chan W (2018) Aristolochic Acids as Persistent Soil Pollutants: Determination of Risk for Human Exposure and Nephropathy from Plant Uptake. J. Agric. Food Chem. 66, 11468–11476 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Perry NB, Anderson RE, Brennan NJ, Douglas MH, Heaney AJ, McGimpsey JA, and Smallfield BM (1999) Essential oils from dalmatian sage (Salvia officinalis l.): variations among individuals, plant parts, seasons, and sites. J. Agric. Food Chem. 47, 2048–2054 [DOI] [PubMed] [Google Scholar]
  • 122.Simmler C, Jones T, Anderson JR, Nikolić DC, van Breemen RB, Soejarto DD, Chen S-N, and Pauli GF (2014) Species-specific Standardisation of Licorice by Metabolomic Profiling of Flavanones and Chalcones. Phytochem. Anal. 25, 378–388 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Hajirahimkhan A, Mbachu O, Simmler C, Ellis SG, Dong H, Nikolic D, Lankin DC, van Breemen RB, Chen S-N, Pauli GF, Dietz BM, and Bolton JL (2018) Estrogen Receptor (ER) Subtype Selectivity Identifies 8-Prenylapigenin as an ERβ Agonist from Glycyrrhiza inflata and Highlights the Importance of Chemical and Biological Authentication. J. Nat. Prod. 81, 966–975 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Narayanapillai SC, Leitzman P, Gerard O’Sullivan M, and Xing C (2014) Flavokawains A and B in Kava, Not Dihydromethysticin, Potentiate Acetaminophen-Induced Hepatotoxicity in C57BL/6 Mice. Chem. Res. Toxicol. 27, 1871–1876 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Catlin NR, Collins BJ, Auerbach SS, Ferguson SS, Harnly JM, Gennings C, Waidyanatha S, Rice GE, Smith-Roe SL, Witt KL, and Rider CV (2018) How similar is similar enough? A sufficient similarity case study with Ginkgo biloba extract. Food Chem. Toxicol. 118, 328–339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Barrington WT, Wulfridge P, Wells AE, Rojas CM, Howe SYF, Perry A, Hua K, Pellizzon MA, Hansen KD, Voy BH, Bennett BJ, Pomp D, Feinberg AP, and Threadgill DW (2018) Improving Metabolic Health Through Precision Dietetics in Mice. Genetics 208, 399–417 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Werz O, Gerstmeier J, Libreros S, De la Rosa X, Werner M, Norris PC, Chiang N, and Serhan CN (2018) Human macrophages differentially produce specific resolvin or leukotriene signals that depend on bacterial pathogenicity. Nat. Commun. 9, 59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Zeisel SH (2019) A Conceptual Framework for Studying and Investing in Precision Nutrition. Front. Genet. 10, 200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129.Fischbach MA (2018) Microbiome: Focus on Causation and Mechanism. Cell 174, 785–790 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130.Gurley BJ, Yates CR, and Markowitz JS (2018) “…Not Intended to Diagnose, Treat, Cure or Prevent Any Disease.” 25 Years of Botanical Dietary Supplement Research and the Lessons Learned. Clin. Pharmacol. Ther. 104, 470–483 [DOI] [PubMed] [Google Scholar]
  • 131.Clerici C, Setchell KDR, Battezzati PM, Pirro M, Giuliano V, Asciutti S, Castellani D, Nardi E, Sabatino G, Orlandi S, Baldoni M, Morelli O, Mannarino E, and Morelli A (2007) Pasta naturally enriched with isoflavone aglycons from soy germ reduces serum lipids and improves markers of cardiovascular risk. J. Nutr. 137, 2270–2278 [DOI] [PubMed] [Google Scholar]
  • 132.Dotson CD, Babich J, and Steinle NI (2012) Genetic Predisposition and Taste Preference: Impact on Food Intake and Risk of Chronic Disease. Current Nutrition Reports 1, 175–183 [Google Scholar]
  • 133.Korach-Rechtman H, Freilich S, Gerassy-Vainberg S, Buhnik-Rosenblau K, Danin-Poleg Y, Bar H, and Kashi Y (2019) Murine Genetic Background has a Stronger Impact on the Composition of the Gut Microbiota than Maternal Inoculation or Exposure to Unlike Exogenous Microbiota. Appl. Environ. Microbiol, in press (doi: 10.1128/AEM.00826-19) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.McBurney MI, Davis C, Fraser CM, Schneeman BO, Huttenhower C, Verbeke K, Walter J, and Latulippe ME (2019) Establishing What Constitutes a Healthy Human Gut Microbiome: State of the Science, Regulatory Considerations, and Future Directions. The Journal of Nutrition, in press (doi: 10.1093/jn/nxz154) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.Knopf K, Sturman JA, Armstrong M, and Hayes KC (1978) Taurine: an essential nutrient for the cat. J. Nutr. 108, 773–778 [DOI] [PubMed] [Google Scholar]
  • 136.Shen L (2011) More on vitamin C deficiency in mice and humans. Hepatology 53, 2139; author reply 2139–2140 [DOI] [PubMed] [Google Scholar]
  • 137.Spor A, Koren O, and Ley R (2011) Unravelling the effects of the environment and host genotype on the gut microbiome. Nat. Rev. Microbiol. 9, 279–290 [DOI] [PubMed] [Google Scholar]
  • 138.Goodrich JK, Waters JL, Poole AC, Sutter JL, Koren O, Blekhman R, Beaumont M, Van Treuren W, Knight R, Bell JT, Spector TD, Clark AG, and Ley RE (2014) Human genetics shape the gut microbiome. Cell 159, 789–799 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Klurfeld DM, Davis CD, Karp RW, Allen-Vercoe E, Chang EB, Chassaing B, Fahey GC, Jr. Hamaker BR, Holscher HD, Lampe JW, Marette A, Martens E, O’Keefe SJ, Rose DJ, Saarela M, Schneeman BO, Slavin JL, Sonnenburg JL, Swanson KS, Wu GD, and Lynch CJ (2018) Considerations for best practices in studies of fiber or other dietary components and the intestinal microbiome. Am. J. Physiol. Endocrinol. Metab. 315, E1087–E1097 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 140. How the Microbiome Influences Drug Action. The Scientist Magazine®.
  • 141.Novotny JA, Chen T-Y, Terekhov AI, Gebauer SK, Baer DJ, Ho L, Pasinetti GM, and Ferruzzi MG (2017) The effect of obesity and repeated exposure on pharmacokinetic response to grape polyphenols in humans. Mol. Nutr. Food Res. 61 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 142.Ursell LK, Clemente JC, Rideout JR, Gevers D, Caporaso JG, and Knight R (2012) The interpersonal and intrapersonal diversity of human-associated microbiota in key body sites. J. Allergy Clin. Immunol. 129, 1204–1208 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 143.Kleerebezem M (2018) Microbial metabolic gatekeeping in the jejunum. Nat. Microbiol. 3, 650–651 [DOI] [PubMed] [Google Scholar]
  • 144.Contijoch EJ, Britton GJ, Yang C, Mogno I, Li Z, Ng R, Llewellyn SR, Hira S, Johnson C, Rabinowitz KM, Barkan R, Dotan I, Hirten RP, Fu S-C, Luo Y, Yang N, Luong T, Labrias PR, Lira S, Peter I, Grinspan A, Clemente JC, Kosoy R, Kim-Schulze S, Qin X, Castillo A, Hurley A, Atreja A, Rogers J, Fasihuddin F, Saliaj M, Nolan A, Reyes-Mercedes P, Rodriguez C, Aly S, Santa-Cruz K, Peters L, Suárez-Fariñas M, Huang R, Hao K, Zhu J, Zhang B, Losic B, Irizar H, Song W-M, Di Narzo A, Wang W, Cohen BL, DiMaio C, Greenwald D, Itzkowitz S, Lucas A, Marion J, Maser E, Ungaro R, Naymagon S, Novak J, Shah B, Ullman T, Rubin P, George J, Legnani P, Telesco SE, Friedman JR, Brodmerkel C, Plevy S, Cho JH, Colombel J-F, Schadt EE, Argmann C, Dubinsky M, Kasarskis A, Sands B, and Faith JJ (2019) Gut microbiota density influences host physiology and is shaped by host and microbial factors. Elife 8:e40553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 145.Frolinger T, Sims S, Smith C, Wang J, Cheng H, Faith J, Ho L, Hao K, and Pasinetti GM (2019) The gut microbiota composition affects dietary polyphenols-mediated cognitive resilience in mice by modulating the bioavailability of phenolic acids. Sci. Rep. 9, 3546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 146.Zimmermann M, Zimmermann-Kogadeeva M, Wegmann R, and Goodman AL (2019) Mapping human microbiome drug metabolism by gut bacteria and their genes. Nature 570, 462–467 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 147.Ho L, Zhao D, Ono K, Ruan K, Mogno I, Tsuji M, Carry E, Brathwaite J, Sims S, Frolinger T, Westfall S, Mazzola P, Wu Q, Hao K, Lloyd TE, Simon JE, Faith J, and Pasinetti GM (2019) Heterogeneity in gut microbiota drive polyphenol metabolism that influences α-synuclein misfolding and toxicity. J. Nutr. Biochem. 64, 170–181 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 148.Setchell KDR, Brown NM, and Lydeking-Olsen E (2002) The clinical importance of the metabolite equol-a clue to the effectiveness of soy and its isoflavones. J. Nutr. 132, 3577–3584 [DOI] [PubMed] [Google Scholar]
  • 149.Setchell KDR, Clerici C, Lephart ED, Cole SJ, Heenan C, Castellani D, Wolfe BE, Nechemias-Zimmer L, Brown NM, Lund TD, Handa RJ, and Heubi JE (2005) S-equol, a potent ligand for estrogen receptor beta, is the exclusive enantiomeric form of the soy isoflavone metabolite produced by human intestinal bacterial flora. Am. J. Clin. Nutr. 81, 1072–1079 [DOI] [PubMed] [Google Scholar]
  • 150.Setchell KDR and Cole SJ (2006) Method of defining equol-producer status and its frequency among vegetarians. J. Nutr. 136, 2188–2193 [DOI] [PubMed] [Google Scholar]
  • 151.Setchell KDR and Clerici C (2010) Equol: history, chemistry, and formation. J. Nutr. 140, 1355S–62S [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 152.Brown NM, Belles CA, Lindley SL, Zimmer-Nechemias L, Witte DP, Kim M-O, and Setchell KDR (2010) Mammary gland differentiation by early life exposure to enantiomers of the soy isoflavone metabolite equol. Food Chem. Toxicol. 48, 3042–3050 [DOI] [PubMed] [Google Scholar]
  • 153.Belzung C and Lemoine M (2011) Criteria of validity for animal models of psychiatric disorders: focus on anxiety disorders and depression. Biol. Mood Anxiety Disord. 1, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 154.Garner JP (2014) The significance of meaning: why do over 90% of behavioral neuroscience results fail to translate to humans, and what can we do to fix it? ILAR J. 55, 438–456 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 155.McGonigle P and Ruggeri B (2014) Animal models of human disease: challenges in enabling translation. Biochem. Pharmacol. 87, 162–171 [DOI] [PubMed] [Google Scholar]
  • 156.Garner JP, Gaskill BN, Weber EM, Ahloy-Dallaire J, and Pritchett-Corning KR (2017) Introducing Therioepistemology: the study of how knowledge is gained from animal research. Lab Anim. 46, 103–113 [DOI] [PubMed] [Google Scholar]
  • 157.Redan BW, Buhman KK, Novotny JA, and Ferruzzi MG (2016) Altered Transport and Metabolism of Phenolic Compounds in Obesity and Diabetes: Implications for Functional Food Development and Assessment. Adv. Nutr. 7, 1090–1104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 158.Chen T-Y, Ferruzzi MG, Wu Q-L, Simon JE, Talcott ST, Wang J, Ho L, Todd G, Cooper B, Pasinetti GM, and Janle EM (2017) Influence of diabetes on plasma pharmacokinetics and brain bioavailability of grape polyphenols and their phase II metabolites in the Zucker diabetic fatty rat. Mol. Nutr. Food Res. 61 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 159.Beck JA, Lloyd S, Hafezparast M, Lennon-Pierce M, Eppig JT, Festing MF, and Fisher EM (2000) Genealogies of mouse inbred strains. Nat. Genet. 24, 23–25 [DOI] [PubMed] [Google Scholar]
  • 160.Mestas J and Hughes CCW (2004) Of mice and not men: differences between mouse and human immunology. J. Immunol. 172, 2731–2738 [DOI] [PubMed] [Google Scholar]
  • 161.Zschaler J, Schlorke D, and Arnhold J (2014) Differences in innate immune response between man and mouse. Crit. Rev. Immunol. 34, 433–454 [PubMed] [Google Scholar]
  • 162.Godec J, Tan Y, Liberzon A, Tamayo P, Bhattacharya S, Butte AJ, Mesirov JP, and Haining WN (2016) Compendium of Immune Signatures Identifies Conserved and Species-Specific Biology in Response to Inflammation. Immunity 44, 194–206 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 163.Eruslanov EB, Singhal S, and Albelda SM (2017) Mouse versus Human Neutrophils in Cancer: A Major Knowledge Gap. Trends Cancer 3, 149–160 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 164.Herati RS and Wherry EJ (2018) What Is the Predictive Value of Animal Models for Vaccine Efficacy in Humans? Consideration of Strategies to Improve the Value of Animal Models. Cold Spring Harb. Perspect. Biol. 11, doi: 10.1101/cshperspect.a031583 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 165.Talan M (1984) Body temperature of C57BL/6J mice with age. Exp. Gerontol. 19, 25–29 [DOI] [PubMed] [Google Scholar]
  • 166.Clark M and Steger-Hartmann T (2018) A big data approach to the concordance of the toxicity of pharmaceuticals in animals and humans. Regul. Toxicol. Pharmacol. 96, 94–105 [DOI] [PubMed] [Google Scholar]
  • 167.Becker KJ (2016) Strain-Related Differences in the Immune Response: Relevance to Human Stroke. Transl. Stroke Res. 7, 303–312 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 168.Velazquez EM, Nguyen H, Heasley KT, Saechao CH, Gil LM, Rogers AWL, Miller BM, Rolston MR, Lopez CA, Litvak Y, Liou MJ, Faber F, Bronner DN, Tiffany CR, Byndloss MX, Byndloss AJ, and Bäumler AJ (2019) Endogenous Enterobacteriaceae underlie variation in susceptibility to Salmonella infection. Nat Microbiol 4, 1057–1064 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 169.Svenson KL, Von Smith R, Magnani PA, Suetin HR, Paigen B, Naggert JK, Li R, Churchill GA, and Peters LL (2007) Multiple trait measurements in 43 inbred mouse strains capture the phenotypic diversity characteristic of human populations. J. Appl. Physiol. 102, 2369–2378 [DOI] [PubMed] [Google Scholar]
  • 170.Morgan SJ, Elangbam CS, Berens S, Janovitz E, Vitsky A, Zabka T, and Conour L (2013) Use of animal models of human disease for nonclinical safety assessment of novel pharmaceuticals. Toxicol. Pathol. 41, 508–518 [DOI] [PubMed] [Google Scholar]
  • 171.Justice MJ and Dhillon P (2016) Using the mouse to model human disease: increasing validity and reproducibility. Dis. Model. Mech. 9, 101–103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 172.Salinero AE, Anderson BM, and Zuloaga KL (2018) Sex differences in the metabolic effects of diet-induced obesity vary by age of onset. Int. J. Obes. 42, 1088–1091 [DOI] [PubMed] [Google Scholar]
  • 173.Crusio WE, Goldowitz D, Holmes A, and Wolfer D (2009) Standards for the publication of mouse mutant studies. Genes Brain Behav. 8, 1–4 [DOI] [PubMed] [Google Scholar]
  • 174.Schmidt CW (2015) Diversity outbred: a new generation of mouse model. Environ. Health Perspect. 123, A64–A67 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 175.Costigan M, Scholz J, and Woolf CJ (2009) Neuropathic pain: a maladaptive response of the nervous system to damage. Annu. Rev. Neurosci. 32, 1–32 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 176.Tuttle AH, Philip VM, Chesler EJ, and Mogil JS (2018) Comparing phenotypic variation between inbred and outbred mice. Nat. Methods 15, 994–996 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 177.Shultz LD, Ishikawa F, and Greiner DL (2007) Humanized mice in translational biomedical research. Nat. Rev. Immunol. 7, 118–130 [DOI] [PubMed] [Google Scholar]
  • 178.Turnbaugh PJ, Ridaura VK, Faith JJ, Rey FE, Knight R, and Gordon JI (2009) The effect of diet on the human gut microbiome: a metagenomic analysis in humanized gnotobiotic mice. Sci. Transl. Med 1, 6ra14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 179.Kashyap PC, Marcobal A, Ursell LK, Smits SA, Sonnenburg ED, Costello EK, Higginbottom SK, Domino SE, Holmes SP, Relman DA, Knight R, Gordon JI, and Sonnenburg JL (2013) Genetically dictated change in host mucus carbohydrate landscape exerts a diet-dependent effect on the gut microbiota. Proc. Natl. Acad. Sci. U. S. A. 110, 17059–17064 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 180.Hintze KJ, Cox JE, Rompato G, Benninghoff AD, Ward RE, Broadbent J, and Lefevre M (2014) Broad scope method for creating humanized animal models for animal health and disease research through antibiotic treatment and human fecal transfer. Gut Microbes 5, 183–191 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 181.Masopust D, Sivula CP, and Jameson SC (2017) Of Mice, Dirty Mice, and Men: Using Mice To Understand Human Immunology. J. Immunol. 199, 383–388 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 182.Rosshart SP, Vassallo BG, Angeletti D, Hutchinson DS, Morgan AP, Takeda K, Hickman HD, McCulloch JA, Badger JH, Ajami NJ, Trinchieri G, Pardo-Manuel de Villena F, Yewdell JW, and Rehermann B (2017) Wild Mouse Gut Microbiota Promotes Host Fitness and Improves Disease Resistance. Cell 171, 1015–1028.e13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 183.Rosshart SP, Herz J, Vassallo BG, Hunter A, Wall MK, Badger JH, McCulloch JA, Anastasakis DG, Sarshad AA, Leonardi I, Collins N, Blatter JA, Han S-J, Tamoutounour S, Potapova S, Foster St Claire MB, Yuan W, Sen SK, Dreier MS, Hild B, Hafner M, Wang D, Iliev ID, Belkaid Y, Trinchieri G, and Rehermann B (2019) Laboratory mice born to wild mice have natural microbiota and model human immune responses. Science 365 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 184.Nair AB and Jacob S (2016) A simple practice guide for dose conversion between animals and human. J. Basic Clin. Physiol. Pharmacol. 7, 27–31 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 185.Piper MDW, Soultoukis GA, Blanc E, Mesaros A, Herbert SL, Juricic P, He X, Atanassov I, Salmonowicz H, Yang M, Simpson SJ, Ribeiro C, and Partridge L (2017) Matching Dietary Amino Acid Balance to the In Silico-Translated Exome Optimizes Growth and Reproduction without Cost to Lifespan. Cell Metab. 25, 1206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 186.Sharma V and McNeill JH (2009) To scale or not to scale: the principles of dose extrapolation. British Journal of Pharmacology 157, 907–921 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 187.Tu H, Wang Y, Li H, Brinster LR, and Levine M (2017) Chemical Transport Knockout for Oxidized Vitamin C, Dehydroascorbic Acid, Reveals Its Functions in vivo. EBioMedicine 23, 125–135 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 188.Gabbay KH, Bohren KM, Morello R, Bertin T, Liu J, and Vogel P (2010) Ascorbate synthesis pathway: dual role of ascorbate in bone homeostasis. J. Biol. Chem. 285, 19510–19520 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 189.Selvaraj V, Zakroczymski MA, Naaz A, Mukai M, Ju YH, Doerge DR, Katzenellenbogen JA, Helferich WG, and Cooke PS (2004) Estrogenicity of the isoflavone metabolite equol on reproductive and non-reproductive organs in mice. Biol. Reprod. 71, 966–972 [DOI] [PubMed] [Google Scholar]
  • 190.Liang X, Bushman FD, and FitzGerald GA (2015) Rhythmicity of the intestinal microbiota is regulated by gender and the host circadian clock. Proc. Natl. Acad. Sci. U. S. A. 112, 10479–10484 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 191.Lévi F, Zidani R, and Misset JL (1997) Randomised multicentre trial of chronotherapy with oxaliplatin, fluorouracil, and folinic acid in metastatic colorectal cancer. International Organization for Cancer Chronotherapy. Lancet 350, 681–686 [DOI] [PubMed] [Google Scholar]
  • 192.Ruben MD, Smith DF, FitzGerald GA, and Hogenesch JB (2019) Dosing time matters. Science 365, 547–549 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 193.Liang X, Bushman FD, and FitzGerald GA (2015) Rhythmicity of the intestinal microbiota is regulated by gender and the host circadian clock. Proc. Natl. Acad. Sci. U. S. A. 112, 10479–10484 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 194.Davis DJ, Hecht PM, Jasarevic E, Beversdorf DQ, Will MJ, Fritsche K, and Gillespie CH (2017) Sex-specific effects of docosahexaenoic acid (DHA) on the microbiome and behavior of socially-isolated mice. Brain Behav. Immun. 59, 38–48 [DOI] [PubMed] [Google Scholar]
  • 195.Kim D, Hofstaedter CE, Zhao C, Mattei L, Tanes C, Clarke E, Lauder A, Sherrill-Mix S, Chehoud C, Kelsen J, Conrad M, Collman RG, Baldassano R, Bushman FD, and Bittinger K (2017) Optimizing methods and dodging pitfalls in microbiome research. Microbiome 5, 52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 196.Yoshihara M and Yoshihara M (2018) “Necessary and sufficient” in biology is not necessarily necessary – confusions and erroneous conclusions resulting from misapplied logic in the field of biology, especially neuroscience. Journal of Neurogenetics 32, 53–64 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 197.Chilton FH, Murphy RC, Wilson BA, Sergeant S, Ainsworth H, Seeds MC, and Mathias RA (2014) Diet-gene interactions and PUFA metabolism: a potential contributor to health disparities and human diseases. Nutrients 6, 1993–2022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 198.Rizos EC, Ntzani EE, Bika E, Kostapanos MS, and Elisaf MS (2012) Association between omega-3 fatty acid supplementation and risk of major cardiovascular disease events: a systematic review and meta-analysis. JAMA 308, 1024–1033 [DOI] [PubMed] [Google Scholar]
  • 199.Brown LL, Cohen B, Tabor D, Zappalà G, Maruvada P, and Coates PM (2018) The vitamin D paradox in Black Americans: a systems-based approach to investigating clinical practice, research, and public health - expert panel meeting report. BMC Proceedings 12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 200.Sorger PK, Allerheiligen SRB, Abernethy DR, Altman RB, Brouwer KLR, Califano A, D’Argenio DZ, Iyengar R, Jusko WJ, Lalonde R, and Others. (2011) Quantitative and systems pharmacology in the post-genomic era: new approaches to discovering drugs and understanding therapeutic mechanisms. [Google Scholar]
  • 201.Hansen J and Iyengar R (2013) Computation as the mechanistic bridge between precision medicine and systems therapeutics. Clin. Pharmacol. Ther. 93, 117–128 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 202.Stern AM, Schurdak ME, Bahar I, Berg JM, and Taylor DL (2016) A Perspective on Implementing a Quantitative Systems Pharmacology Platform for Drug Discovery and the Advancement of Personalized Medicine. J. Biomol. Screen. 21, 521–534 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 203.Obach RS, Walsky RL, Venkatakrishnan K, Gaman EA, Houston JB, and Tremaine LM (2006) The utility of in vitro cytochrome P450 inhibition data in the prediction of drug-drug interactions. J. Pharmacol. Exp. Ther. 316, 336–348 [DOI] [PubMed] [Google Scholar]
  • 204.Di L, Feng B, Goosen TC, Lai Y, Steyn SJ, Varma MV, and Obach RS (2013) A perspective on the prediction of drug pharmacokinetics and disposition in drug research and development. Drug Metab. Dispos. 41, 1975–1993 [DOI] [PubMed] [Google Scholar]
  • 205.Jones HM, Chen Y, Gibson C, Heimbach T, Parrott N, Peters SA, Snoeys J, Upreti VV, Zheng M, and Hall SD (2015) Physiologically based pharmacokinetic modeling in drug discovery and development: a pharmaceutical industry perspective. Clin. Pharmacol. Ther. 97, 247–262 [DOI] [PubMed] [Google Scholar]
  • 206.Deaton A and Cartwright N (2018) Understanding and misunderstanding randomized controlled trials. Soc. Sci. Med. 210, 2–21 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 207.Gallin JI, Ognibene FP, and Johnson LL (2017) Principles and Practice of Clinical Research. Academic Press [Google Scholar]
  • 208.National Center for Complementary and Integrative Health. (2012) Data and Safety Monitoring of NCCIH-Funded Clinical Research (https://nccih.nih.gov/grants/policies/data-safety-monitoring). NCCIH/NIH [Google Scholar]
  • 209.Nehlig A (2018) Interindividual Differences in Caffeine Metabolism and Factors Driving Caffeine Consumption. Pharmacol. Rev. 70, 384–411 [DOI] [PubMed] [Google Scholar]
  • 210.Staff WIRED. (2009) Placebos Are Getting More Effective. Drugmakers Are Desperate to Know Why. WIRED [Google Scholar]
  • 211.Preston RA, Materson BJ, Reda DJ, and Williams DW (2000) Placebo-Associated Blood Pressure Response and Adverse Effects in the Treatment of Hypertension. Archives of Internal Medicine 160, 1449. [DOI] [PubMed] [Google Scholar]
  • 212.Barrett B, Brown R, Rakel D, Rabago D, Marchand L, Scheder J, Mundt M, Thomas G, and Barlow S (2011) Placebo effects and the common cold: a randomized controlled trial. Ann. Fam. Med. 9, 312–322 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 213.Barrett B (2013) Sufficiently important difference: concepts, caveats, and challenges. Med. Decis. Making 33, 869–874 [DOI] [PubMed] [Google Scholar]
  • 214.Albers C and Lakens D (2018) When power analyses based on pilot data are biased: Inaccurate effect size estimators and follow-up bias. Journal of Experimental Social Psychology 74, 187–195 [Google Scholar]
  • 215.Scargle JD (1999) Publication Bias (The “File-Drawer Problem”) in Scientific Inference. [Google Scholar]
  • 216.Franco A, Malhotra N, and Simonovits G (2014) Social science. Publication bias in the social sciences: unlocking the file drawer. Science 345, 1502–1505 [DOI] [PubMed] [Google Scholar]
  • 217.Ioannidis JPA (2008) Why Most Discovered True Associations Are Inflated. Epidemiology 19, 640–648 [DOI] [PubMed] [Google Scholar]
  • 218.Simmons JP, Nelson LD, and Simonsohn U (2011) False-positive psychology: undisclosed flexibility in data collection and analysis allows presenting anything as significant. Psychol. Sci. 22, 1359–1366 [DOI] [PubMed] [Google Scholar]
  • 219.Wacholder S, Chanock S, Garcia-Closas M, El Ghormli L, and Rothman N (2004) Assessing the probability that a positive report is false: an approach for molecular epidemiology studies. J. Natl. Cancer Inst. 96, 434–442 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 220.Guyatt GH, Oxman AD, Vist GE, Kunz R, Falck-Ytter Y, Alonso-Coello P, and Schünemann HJ (2008) GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ 336, 924–926 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 221.Miller J and Schwarz W (2011) Aggregate and individual replication probability within an explicit model of the research process. Psychol. Methods 16, 337–360 [DOI] [PubMed] [Google Scholar]
  • 222.Simonsohn U, Nelson LD, and Simmons JP (2014) p-Curve and Effect Size: Correcting for Publication Bias Using Only Significant Results. Perspect. Psychol. Sci. 9, 666–681 [DOI] [PubMed] [Google Scholar]
  • 223.Stanley TD and Doucouliagos H (2014) Meta-regression approximations to reduce publication selection bias. Res Synth Methods 5, 60–78 [DOI] [PubMed] [Google Scholar]
  • 224.Kaelin WG Jr. (2017) Common pitfalls in preclinical cancer target validation. Nat. Rev. Cancer 17, 425–440 [DOI] [PubMed] [Google Scholar]
  • 225.Munafò MR and Davey Smith G (2018) Robust research needs many lines of evidence. Nature 553, 399–401 [DOI] [PubMed] [Google Scholar]
  • 226.Shulman LP, Banuvar S, Fong HHS, and Farnsworth NR (2011) Discussion of a well-designed clinical trial which did not demonstrate effectiveness: UIC center for botanical dietary supplements research study of black cohosh and red clover. Fitoterapia 82, 88–91 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 227.Cui Y, Murphy B, Gentilcore A, Sharma Y, Minasian LM, Kramer BS, Coates PM, Gohagan JK, Klenk J, and Tidor B (2014) Multilevel modeling and value of information in clinical trial decision support. BMC Syst. Biol. 8, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 228.Jayasekera J, Li Y, Schechter CB, Jagsi R, Song J, White J, Luta G, Chapman J-AW, Feuer EJ, Zellars RC, Stout N, Julian TB, Whelan T, Huang X, Shelley Hwang E, Hopkins JO, Sparano JA, Anderson SJ, Fyles AW, Gray R, Sauerbrei W, Mandelblatt J, Berry DA, and CISNET-BOLD Collaborative Group. (2018) Simulation Modeling of Cancer Clinical Trials: Application to Omitting Radiotherapy in Low-risk Breast Cancer. J. Natl. Cancer Inst. 110, 1360–1369 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 229.Center for Drug Evaluation and Research. (2019) Botanical Drug Development: Guidance for Industry U.S. Food and Drug Administration [Google Scholar]
  • 230.Gallo P, Chuang-Stein C, Dragalin V, Gaydos B, Krams M, and Pinheiro J (2006) Adaptive Designs in Clinical Drug Development—An Executive Summary of the PhRMA Working Group. Journal of Biopharmaceutical Statistics 16, 275–283 [DOI] [PubMed] [Google Scholar]
  • 231.Woodcock J and LaVange LM (2017) Master Protocols to Study Multiple Therapies, Multiple Diseases, or Both. N. Engl. J. Med. 377, 62–70 [DOI] [PubMed] [Google Scholar]
  • 232.Center for Drug Evaluation and Research. (2019) Expedite Development of Oncology Drugs and Biologics. U.S. Food and Drug Administration [Google Scholar]
  • 233.Meltzer DO, Hoomans T, Chung JW, and Basu A (2011) Minimal modeling approaches to value of information analysis for health research. Med. Decis. Making 31, E1–E22 [DOI] [PubMed] [Google Scholar]
  • 234.Hoomans T, Seidenfeld J, and Meltzer D (2012) CO3 Minimizing the Costs of Analyzing the Value of Health Research. Value in Health 15, A2 [Google Scholar]
  • 235.Mirza RD and Guyatt GH (2018) A Randomized Clinical Trial of n-of-1 Trials-Tribulations of a Trial. JAMA Intern. Med. 178, 1378–1379 [DOI] [PubMed] [Google Scholar]
  • 236.Box GEP and Draper NR (1987) Empirical model-building and response surfaces. John Wiley & Sons Inc. [Google Scholar]

RESOURCES