Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Oct 31.
Published in final edited form as: Adv Drug Deliv Rev. 2025 Oct 13;227:115716. doi: 10.1016/j.addr.2025.115716

Opportunities for machine learning and artificial intelligence in physiologically-based pharmacokinetic (PBPK) modeling

Anne M Talkington 1,2,*, Yanguang Cao 3, Anthony J Kearsley 1, Samuel K Lai 4,5,6,*
PMCID: PMC12573771  NIHMSID: NIHMS2119485  PMID: 41093240

Abstract

Physiologically-based pharmacokinetic (PBPK) modeling is a powerful tool for quantitating and understanding the fate of drug and drug carriers in complex living systems. It is particularly valuable in situations where data are difficult to obtain due to cost, time, or ethical constraints. Recent advances in PBPK modeling have greatly improved their accuracy in modeling in vivo and clinical data, especially in special populations (e.g., pediatric and geriatric subjects), which consequently enhanced their utility in drug development. Nevertheless, current PBPK models remain limited by our ability to ascertain complex biological mechanisms and/or physiological processes, often resulting in many critical but unknown parameters or parameters with large uncertainty. Machine learning (ML) and applications of broader artificial intelligence (AI) tools that facilitate parameter estimation, model learning, database mining, and uncertainty quantification not only offer the potential to address the shortcomings of PBPK modeling, but also introduce opportunities for enabling earlier use of PBPK modeling in the drug development process. Here, we summarize ML-influenced advances in PBPK modeling and discuss our expectations of the likely avenues for future ML/AI contributions to PBPK modeling.

Introduction

Physiologically-based pharmacokinetic (PBPK) modeling provides a mass-balanced means to track and predict the biodistribution of drugs and drug carrier systems (referred to collectively as “drugs”). By considering each organ or tissue type of interest as a compartment, a multi-compartment model can be constructed to recapitulate the fate of drugs in complex living systems, with the drug dosage form (including route of dosing) and the dose serving as the key inputs, and drug concentration profile over time at each organ/compartment serving as the key outputs1,2. By accounting for key processes in living systems, drug properties, and drug-environment interactions, PBPK modeling not only helps to ensure direct physiological relevance of its predicted drug concentrations but also yields valuable insights into drug interactions or exposures that cannot be easily established in vitro 1,3. Finally, with the incorporation of known pharmacological understanding of a drug’s potency, PBPK models can also predict a drug’s pharmacodynamic (PD) profile4,5.

By separately accounting for the various key underlying biological and physiological processes, different parameters and ordinary differential equations (ODEs) can be easily introduced, enabling PBPK models to more accurately scale and predict across species from small to large animals to eventually humans, as well as between different subpopulations (e.g., infants vs. adults)6. Not surprisingly, PBPK models have been used extensively to characterize and quantitatively predict the biodistribution and PK profile of various small molecule drugs7,8, therapeutic proteins and antibodies911, as well as nanoparticles1218 and other drug delivery systems1,1921. Each of these systems requires unique considerations accounting for whether the drug’s transport processes are primarily perfusion, diffusion or permeability-limited, as well as how the resulting drug distribution is affected by inherent properties of the drug, including polarity, hydrophobicity/solubility, charge, and chemical/biological stability. In addition to the development of new drugs, PBPK models have also been used to aid in the assessment of potential bioequivalence for generic drugs and biosimilars22. Recently, the utility of PBPK models has been extended to vaccine development, including evaluation of injected adjuvants and oral delivery of both antigen and adjuvant2325. Finally, PBPK models are increasingly utilized for evaluating potential toxicity, drug interactions, and how the body may process the drugs in special populations where data are often limited or unavailable (e.g., pediatrics or patients with chronic illness)2631.

There is an increasing push to minimize the use of preclinical animal models for assessing the safety and efficacy of drug candidates3234. Similarly, in silico drug discovery is also enjoying increasing popularity, with virtual chemical library screening offering the potential of reduced time and costs while improving the accuracy of candidate selection35,36. By definition, drug molecules must reach and persist in sufficient concentrations at the target disease cells or tissues over time to achieve therapeutic benefit, and they preferably do so at concentrations in target cells/tissues that exceed those in other cells or tissues to minimize systemic side effects. Thus, in addition to a differentiated pharmacological mechanism of action, an accurate accounting of the distribution and fate of drugs is undoubtedly a critical element in advancing computational medicine. We consider PBPK modeling to be the leading computational approach that bridges early-stage drug discovery through preclinical animal models to human studies.

While incorporating greater physiological and biological insights should improve the accuracy of PBPK modeling, doing so with an already large parameter space undoubtedly creates further challenges with parameter uncertainty. Among the strengths of artificial intelligence (AI) and machine learning (ML) techniques are their ability to inform ways to reduce the parameter space, which in turn reduces the complexity and increases the tractability of the problem, as well as their ability to increase our confidence in the estimated values of the most sensitive parameters. Below we discuss in detail the current limitations of the PBPK modeling space, techniques for applying AI/ML to PBPK, successful applications of these techniques, and the future of AI/ML in PBPK models from our perspective.

Challenges and Limitations of PBPK models

Despite the promise and success of PBPK modeling, the PBPK modeling field faces many challenges, which we elaborate below:

Dependence on known processes and quantities

A current limitation of PBPK models is their reliance on known parameters describing various physiological processes, as well as a drug’s physiochemical quantities that in turn influence its absorption, distribution, metabolism, and excretion (ADME). These parameters naturally vary not only for different drugs but also for different drug delivery systems as well as individuals, and thus are often not available a priori. Accurate measurements of many metabolic or transport processes are difficult to obtain, as they must either involve invasive experimentation or be subjected to the risks of poor in vitro to in vivo correlation 37. Often, measurements of metabolic or transport processes do not directly translate across different living species. Indeed, despite the extensive and broadly accepted use of allometry to translate from a mouse or large animal to a human system, and recent work suggesting improvements to allometric models by deviating from the traditional 3/4 power law for metabolic scaling, there remain frequent disparities between predictions derived from preclinical animal models and observed pharmacokinetics (PK) and biodistribution in humans3841. Experimentation also takes time and money, and is often ethically restrictive, which greatly reduces the rate new drugs can be advanced into a clinical setting. Recently, QSAR (quantitative structure-activity relationship) models have been used to address the challenge of obtaining clinically relevant parameters as inferred from their physiochemical properties. However, the QSAR process itself is inherently dependent on the availability of, and ability to leverage, large data sets to draw reliable, drug-specific conclusions from the underlying structure-function model. Many of these QSAR approaches are mainly for general exploration purposes4245 and thus frequently fail to achieve high accuracy46, although recent model iterations are yielding improved applicability47. There thus remains substantial room for improving its efficiency.

Rapidly compounding model complexity to capture relevant biological processes

To illustrate one example of the biological complexities faced with developing a PBPK model, we consider a PBPK model for a therapeutic immunoglobulin (IgG) as described by Garg and Balthasar11. Within each tissue compartment, the antibody first leaves the bloodstream (vascular space), then enters the endosomal vesicles in the endothelial cells via endocytosis or pinocytosis. From there, IgG may form high affinity bonds with the neonatal Fc-receptor (FcRn) as a result of the low endosomal pH, which allow FcRn-bound IgG to either be recycled back to the vascular space or transported into the interstitial space. IgG can also be transported directly between the vascular and interstitial space via two-pore paracellular transport4850. The IgG can leave the organ through lymphatic flow, leave through vascular flow, or be cleared from the system in a saturable target-specific manner (often referred as target-mediated drug disposition). Within each compartment, such a model requires knowledge of blood flow rate, lymph flow rate, residual blood volume, endosomal fraction of the organ, interstitial fraction of the organ, total tissue volume, dissociation constant for IgG:FcRn binding stoichiometry as a function of pH, expression level of FcRn, endocytosis rate, recycling rate for bound IgG, and transport rate for unbound IgG. Although many of these parameters have either been measured or estimated, the values used across different models can vary by orders of magnitude. For example, the fraction of lymph flow to blood flow has been estimated as both 0.02%49 and 2%51, due in part to varying tissue composition. Other considerations include tracking cell-associated fraction beyond the interstitial space, as proposed by Covell et al.51, Shah and Betts52, and Abuqayyas and Balthasar53, as well as focusing on preferential distribution due to target antigens53, approaches which further add to the number of unknowns per compartment. Extrapolating to a full model comprised of additional selected compartments – including plasma, heart, liver, spleen, kidney, muscle, lung, small intestine, large intestine, pancreas, gallbladder, bone, brain, adipose, thymus, thyroid, skin, and tumor – further increases the complexity of the parameter estimation by over an order of magnitude (Figure 1).

Figure 1.

Figure 1.

Schematic of PBPK system for therapeutic antibodies in circulation and within each organ compartment.

Further complexities with modeling different formulations and simultaneous use of multiple medications

To adapt or develop a PBPK model to predict the fate of a nanoparticle-based drug delivery system instead of a therapeutic antibody, we would need to first account for major discrepancies in known processes (e.g. extravasation, endocytosis, etc.). We would also then need to account for entirely new processes, such as (i) uptake by the mononuclear phagocytic system, which is frequently responsible for a high degree of inter-patient variability observed in clinical studies12, and (ii) specific and non-specific uptake by various cells. Accounting for passive and active transport processes of nanoparticles remains an area for further investigation for PBPK models12,5456. If the nanoparticle is a drug carrier, the release and separate tracking of the released small molecule drugs must also be layered on top of the model, further increasing the complexity of the system and number of required parameters.

It is possible to take another step and harness PBPK models to predict potential deleterious drug-drug interactions. Although most physiological parameters are translatable, a full set of drug-specific parameters for a second drug must be determined or estimated. One must also account for effects of either drug on specific biological processes, such as inhibition of metabolizing enzyme cytochrome P45057. Given that many patients, particularly the elderly, are chronically taking several medications simultaneously, the complexities of harnessing PBPK models to predict polypharmacy challenges can readily increase exponentially.

Limited data availability

We then consider limitations due to data availability, in the form of local drug concentrations among different cells and tissues. The aforementioned need to account for a large number of physiological processes by definition requires quantitative description of the various parameters that underpin the respective processes. To determine the predictive power of a PBPK model, it is also essential to validate whether the predictions are accurate. This necessitates measuring actual drug concentrations in various organs over time. This is generally accomplished by sacrificing animals and harvesting key organs at select time points, followed by processing of tissues and analyzing the resulting content using high-performance liquid chromatography, ELISA, or mass spectroscopy. Alternatively, to minimize the number of animals that must be sacrificed or to improve the accuracy of longitudinal assessment, relative concentrations at whole organ levels could be determined in large organs in live animals using in vivo detection techniques such as positron emission tomography/computed tomography (PET/CT). Regardless of the approaches, such experimentations range from very expensive to cost-prohibitive, generally yield only small batches of data, and typically do not provide insights at the sub-organ and cellular levels. Given differences in not only the assays but also inter-laboratory variability, it is often difficult to piece together data available from multiple published studies. Data available in the literature is also often scattered across sources, not readily available or accessible for modeling.

Another key challenge faced in PBPK model development is validating model predictions in patients early on, particularly in subpopulations where these models likely have the greatest potential for impact (e.g., pediatrics or rare disease patients). Notably, these subpopulations often lack suitable animal model substitutes, and thus are poorly supported by animal studies. The use of genetically identical animals also makes it difficult to capture the degree of inter-patient variability observed in clinical settings. Additionally, while existing PBPK models hold great value for modeling drugs administered by infusion or injection, there are far fewer models for alternative delivery systems and routes of dosing, such as topical application or inhalation. Understanding these systems and associated routes of delivery is an area with much room for PBPK models to expand in. PBPK modeling’s inherent capacity to incorporate the detailed physiology of the system and information on complex drugs makes them particularly apt for this purpose, whereas the inherent complexity of the system is well suited to harness AI to construct and validate such models.

Limited pre-existing information on processes and parameters in complex systems

Given the aforementioned difficulty of obtaining in vivo PK or ADME data, ascertaining the best model or scaling properties to capture the fate of new drugs in specific target organs must typically be accomplished without complete or accurate knowledge of all parameters. To overcome this uncertainty, one must consider a set of potential candidate models describing the drug distribution behavior, and corresponding parameters in each of the models, many of which include a wide range of potential values. Such a process necessitates a method to systematically and iteratively evaluate candidate models against known biodistribution data and narrow the ranges for each parameter. Over time, this allows us to infer the most important physiological process (i.e., those with greatest impact on PK predictions) or drug-specific parameters based on the learned model structure and parameter values, and simultaneously increases our confidence in the model predictions for drug concentrations over time.

AI/ML techniques applicable to PBPK modeling

While not novel techniques in and of themselves, many AI/ML algorithms have novel applications in PBPK modeling. We summarize them below:

Latin Hypercube Sampling (LHS) and other parameter estimation techniques

LHS is an optimization technique traditionally used in problems across engineering and biological systems 5863. LHS works by conducting “layers” of random sampling, i.e., sampling a range and then a point within the range. For example, let’s assume one wants to test combinations of values or “sweep” for one parameter that is expected to range from 0.1–0.5 (dimensionless units) and a second variable between 1.2–1.7. The LHS technique would divide this into partitions of 0.1–0.2, 0.2–0.3, etc. and 1.2–1.3, 1.3–1.4, and so on. One then shuffles the partitions, selects a value within each partition to test in combination, and repeats this process several times until arriving at a combination of parameters that yields predictions with the lowest error compared to experimental data. If potential parameter values span large ranges, one could sweep logarithmically to cover partitions across orders of magnitude. The LHS technique ensures more equitable sampling across the entire parameter search space, and simultaneously minimizes the chance of randomly picking points too close together.

The random sampling approach with LHS lends itself well to the parameterization of PBPK models to ensure a thorough search of parameter space. For example, a PBPK model parameterized with LHS has accurately recapitulated the biodistribution of PEGylated liposomes and proteins in mice and humans58,64. Strategically integrating a sensitivity analysis, such as Partial Rank Correlation Coefficients (PRCC), extended Fourier amplitude sensitivity test (eFAST), or one-at-a-time (OAT) sensitivity, with a sampling algorithm enables testing which components of the system most critically impact the behavior or property of interest, particularly in large multiscale systems 65. Iterating LHS or combining it with another optimization approach, such as Sobol Sequencing, Hammersley Sequence Sampling, or gradient descent, leads to increased confidence that the model converges to a solution with minimal error, particularly when a large number of unknown parameters are involved66,67. When combined with supervised learning, sampling methods such as LHS (i.e., through conditioned LHS (cLHS) or Progressive LHS (PLHS)) can yield large advantages in computational efficiency without sacrificing accuracy of parameterization68.

Monte Carlo (MC) parameter estimation and model learning

In their simplest sense, MC methods refer to random sampling of variables from a defined distribution or set of distributions. Applications include defining a distribution for each parameter in order to represent a range of potential drug properties, representing population variance, establishing a means of uncertainty quantification for risk analysis, and defining in silico experiments that assist in the design of preclinical studies and clinical trials6971. Such methods have been employed in population-level PBPK studies measuring exposure and bioaccumulation7275. Markov Chain Monte Carlo, or MCMC, is a variant of MC sampling in which the distribution of values sampled is dependent on previous samples76. MCMC is used in Bayesian or likelihood inference77, and represents a strategic way to iteratively refine estimated parameter values for PBPK modeling and simulation74. Notable subclasses of MC include importance sampling and particle filtering. Importance sampling leverages weighting or sampling from transformations of expected parameter distributions to ultimately achieve convergence at a parameter’s estimated value or distribution. Also known as sequential importance sampling, particle filtering is a method of assigning weights based on the inferred likelihood of a parameter value, given the intersection of multiple potential distributions78,79. While traditionally implemented in risk analysis or statistics, these techniques have been applied for parameter identification at the population level when there is low confidence at the individual level80,81, and are utilized in parameter calibration algorithms for systems biology82.

Sampling techniques, including MC and LHS, become “learning” when parameter values, distributions, or model structures are sampled, evaluated, and modified accordingly. In addition to its applications in parameterization, an MC approach to model learning can fundamentally influence our understanding of a biological system in which we must make assumptions about the most appropriate physiological processes and the corresponding functional form incorporated into PBPK (e.g., the relative importance of an organ subcompartment, or to discern whether convective or diffusive transport is dominant). In such cases, biodistribution data and an initial parameter set based on in vitro studies or a literature search for similar characterized drugs are used to initialize a suite of candidate models. The goal is to search not only for a set of parameters, but to arrive at a model or set of models that, when fitted with optimized parameters or well-characterized parameters, generates predictions most consistent with the data83,84. Commonly used in applications ranging from information theory to finance, these methods are robust in that they can potentially compensate for sparse data sets84,85.

In not only learning parameter values but also improving the model structure itself while attempting to minimize errors with model outputs, we can frequently gain insights into the mechanistic underpinnings of how specific drugs are distributed and processed in the body. For example, evaluation of competing PBPK models may emphasize the role of flow limitation within a compartment in providing more accurate predictions than the previously assumed well-mixed system86. Such insights inform more targeted experiments for future data collection, for instance, which compartments or organ systems should be prioritized for sampling in future studies. MC models are also valuable for assessing the degree and sources of variability between individuals71,87. As previously discussed in the context of LHS, implementing a global sensitivity analysis platform is an efficient means of refining the model where little is known about a system and data are difficult to obtain88,89. Bayesian techniques have been successfully incorporated and combined with MCMC to uncover sources of uncertainty at the population level71.

Ensemble modeling

Another valuable strategy, when data are especially sparse or mechanistic understanding is lacking for the system of interest, is consensus or ensemble modeling. In ensemble modeling, a large number of different models (generally more phenomenological than mechanistic) are tested and evaluated, with the goal of obtaining a set of candidate models that can produce an accurate readout of biodistribution or other metric of interest. While each top model candidate is individually incorrect, each model does capture some aspect of the system, and the models collectively produce the desired behavior. Since ensemble modeling is based on the assumption that we can leverage a set of models rather than one ground truth to describe the desired phenomena, the goal is typically more outcome-oriented than mechanistic. However, the candidate models could still inform our mechanistic understanding of the system.

The ensemble strategy is implemented when uncertainty regarding the model structure is not resolved, but we can infer a purely data-driven prediction and propose a large number of model candidates. Since we have limited understanding of the system in this scenario, using these models together provides the greatest predictive power. These concepts have previously been applied in fields as diverse as genomics, weather forecasting, predicting traffic flow patterns, and credit scoring9094. Expanding into the PBPK space holds valuable implications for accurately predicting drug properties or interactions with high confidence. For example, an ensemble strategy has been demonstrated to successfully predict not only parameter values but also the distribution and dissolution profiles of drugs in humans95,96. Implementing an ensemble ML approach can help us find the best model candidates while being sure that we are appropriately capturing the desired phenomenon with a greater degree of certainty.

Alternative iterative methods for model and parameter identification

Alternative algorithms for inferring model structures and parameter values include decision trees or random forests 9799, genetic or evolutionary algorithms100102, and gradient boosting103,104. Essentially, these are iterative methods that allow a user to sample a wide range of conditions and decide which set of parameters or model structure makes the most sense for the data. The accuracy of the model is improved with each iteration, a model is decided, and ultimately the model candidates are tested or validated on an independent data set. Applications of such algorithms to date have shown potential for refining the process of ADME parameter estimation and integration into a larger PBPK model105. However, all of these methods contain an inherent risk of overfitting. Even random forests, often considered to be among the most robust approaches, are still susceptible106,107. This issue of overfitting can be addressed manually, through restricting a model to a strictly mechanistic system108, or through the introduction of a penalty associated with increasing the size of the model (e.g., Least Absolute Shrinkage and Selection Operator (LASSO), marginal, or ridge regression with or without boosting)109,110.

Sometimes the best model candidate is a minimal PBPK (mPBPK) model. mPBPK modeling is traditionally used in the scenario of many unknowns as a means of obtaining a simplified model that captures the primary variable of interest, often plasma concentration9,111. Since the scope of the mPBPK model is more limited, the model structure and parameter values become far easier to learn. AI is invaluable for informing or deciding which elements are critical to a reduced system. When a model framework and parameter values are learned, decisions on the top model candidates are made based on reducing the error between a desired benchmark and the model prediction. Iterations of the model are then constructed based on error minimization, where parameters with the greatest sensitivity (and hence the greatest capacity to affect error) are the most critical components of the updated model.

In the best-case scenario, the proposed inferred model system can point to a potentially under-investigated mechanism of physiology, thus improving our mechanistic understanding. Regardless of whether the mechanism in the inferred model holds biologically, the benefits of a predictive model are invaluable for generating or validating difficult-to-obtain data with greater confidence.

Neural networks and large language models

Neural networks consist of layers of independent, interconnected nodes representing input, output, and a series of steps through which output is derived from the input112114. The sequence of events through which the model learns which outputs are associated with which inputs mimics associations formed by neurons firing in the brain114. During model training, correct associations are rewarded, while incorrect associations are penalized115. When multiple steps exist, as opposed to a single layer, between the input and output layers, the network is referred to as a deep neural network. The complexity of training multiple layers poses a model optimization tradeoff wherein a sufficient number of layers are needed to fully capture the problem at hand, but models with fewer layers are more reliably trained115. Traditionally, neural network models have been effective for classification and pattern recognition tasks. More recently, they have also been used to learn model structures in PK and other domains where there is insufficient data to fully inform a mechanistic model, for example, in extrapolating from sparse concentration-time profiles116.

Large language models (LLMs), a subclass of neural network-based work, have recently gained prominence with the advent of commercial tools such as ChatGPT, Gemini, Copilot, and others117119. LLMs are specifically optimized to learn patterns and make associations with language, in models where inputs and outputs consist of words and phrases, much like a human brain processes speech. In their current state, caution must be taken to avoid “hallucinations,” in which undiscerning LLMs report false statements as fact. However, they are valuable for summarizing information, parsing large data sets, sifting through disjointed reports of data, and even suggesting potential mechanisms for further investigation. These features contribute to the appeal of adopting LLMs for medical applications120.

Hybrid modeling

The concept of hybrid modeling generally refers to integration of multiple modeling techniques, e.g., combining mechanistic models (where a system is well characterized) with ML models (where the system is poorly characterized), or multiple layers of ML models to most effectively capture complex processes. Hybrid models have been successfully used in many pharmacological applications. For example, Boolean network models underlying a system of ODEs have informed quantitative systems pharmacology (QSP) models as a means of screening and identifying targets121, as well as more generally accounting for underlying signaling processes throughout systems medicine122,123. A deep learning model optimized through Bayesian methods has yielded drug concentration prediction results comparable to traditional NONMEM predictions124.

Additional examples of successfully integrating ML/AI to advance PBPK modeling, often in combination with other modeling techniques, are discussed below.

Specific examples of applications of ML/AI in PBPK modeling and future potential

The idea of harnessing the fundamentals of ML to parameterize PBPK models, understand sources of variability, and address current challenges in PBPK modeling is gaining traction and has already been applied successfully in several systems. Recalling our IgG example, Garg and Balthasar used maximum likelihood estimation to fit the unknown parameters for IgG endosomal compartment uptake rate, fraction recycled, and the rate of recycling in their antibody model11. They employed a normalized sensitivity analysis to reveal that the model predictions are most sensitive to IgG fraction recycled via FcRn. Based on the sensitivity of their model, they concluded that FcRn plays a larger role than previously thought in IgG distribution, in a tissue-specific manner. Krauss et al. employed MCMC to uncover population subgroups and sources of heterogeneity in response to pravastatin (Figure 2)71. By separating sources of variability into patient-specific parameters and drug-specific parameters, then examining the subpopulations of patients that resulted from their inferred parameter distributions, they concluded that variation in OATP1B1 transporter activity is responsible for inter-patient variability in pravastatin distribution. Both of these examples illustrate the role of learning aspects of a model system in facilitating a deeper understanding of the mechanisms at the molecular level and across patient populations.

Figure 2.

Figure 2.

Concentration of pravastatin obtained from patient data (open shapes) and predicted from Bayes-MCMC PBPK model (solid lines) in (A) all patients and (B) representative patients highlighting inter-patient variability. Figure obtained from ref [71] (CC BY 2.0 DEED).

Our previous work implemented an LHS-PBPK framework to parameterize an 8-compartment PBPK model for PEGylated liposomes and proteins, in mice and humans58,64. Our model predictions for concentration-time profile were in good agreement with the data for animals (Figure 3) as well as populations with and without anti-PEG antibodies (APA). Utilizing non-invasive PET/CT imaging over time, we were able to infer the parameters corresponding to organ-specific uptake and retention, with the model revealing that liver and splenic retention were significantly increased in animals with high levels of APA exhibiting accelerated blood clearance. Biologically, the most plausible explanation is sequestration of APA/drug immunocomplexes by liver sinusoidal endothelial cells (LSEC). In this instance, the learning aspects of the PBPK model for biodistribution of PEGylated drugs were instrumental in elucidating the role of APA in altered systemic clearance.

Figure 3.

Figure 3.

Comparison between PBPK model predictions (turquoise line) vs. biodistribution data obtained by in vivo PET/CT imaging (black dots) in (A) a representative naïve mouse and (B) a representative APA+ mouse. Concentrations are normalized as % injected dose/g, where 1 = 100% ID/g. Figure reprinted from ref [64], with permission from Elsevier.

Recently, more attention has been given to ML as a means of integrating multi-omics data modalities in parameter estimation125. A proposed “ML-PBPK” system by Li et al. boasted higher accuracy using learned physicochemical properties vs. in vitro data, supporting the combined approach126. Their model was constructed based on a “bottom up” approach using the predicted human PK properties (uptake, permeability, and clearance) of 40 small molecule drugs. It yielded an Area Under the Curve (AUC), or total drug exposure, prediction accuracy of 62.5% vs. the in vitro AUC accuracy of 47.5%. This work underscores both the potential for making predictions of anticipated PK in humans strictly based on in silico results, and the necessity of further understanding sources of variability in the complex human system. It also highlights the utility of previously obtained PK data and the importance of database management, such that relevant model training data is available for ML-PBPK.

Paixão et al. used an artificial neural networks ensemble approach to predict tissue-to-blood partition coefficients (Kt:b) for a variety of compounds and tissue types, which were then fed into a PBPK framework 95. They optimized their neural network structure for a selected set of molecular descriptors of these compounds, by accounting for redundancy in the descriptors and testing models with different numbers of layers and nodes. Their model yielded a partition coefficient prediction in strong agreement with observed in vitro and in vivo values for Kt:b in 13 tissue types in rats, as well as an approximation for the volume of distribution at steady state (Vss) for over 500 drugs in humans. The molecular descriptors, including pKa, molecular weight, lipophilicity, and others, together determined tissue distribution. The Vss prediction accuracy negatively correlated with the variability of the partition coefficients for each tissue, once again highlighting the importance of the complexity of modeling a human system. A high variability coefficient was interpreted as a drug being outside of the applicability domain, once again underscoring the importance of relevant training data. In this model, results were the least accurate for large molecule drugs, as the model was not sufficiently trained to handle drugs with these properties.

Mavroudis et al. employed ML to predict plasma exposure based on PK properties derived from the molecular structure of a new drug candidate (Figure 4)127. Their work again suggests a need for greater understanding of model variability to maximize the accuracy of this approach. Namely, the model was more variable and less accurate when a variable primary clearance mechanism existed, which highlights the necessity of a comprehensive training set in determining accurate predictions. PK parameters have also successfully been estimated based on chemical structure by the ML algorithms of Miljković et al128. and Murad et al129. AI methods have also been increasingly proposed as a means of increasing efficiency for gathering robust, reliable data and learning ADME and toxicity based on QSAR, including ADMET-AI130 and ADMETboost131 that leverage neural net and gradient boosting techniques. Such techniques seek to directly address the issue of limited and difficult-to-obtain data.

Figure 4.

Figure 4.

Representation of observed plasma exposure vs. PBPK model predictions using CL and Vss parameters determined by ML. (A) Observed vs. predicted plasma exposure in a rat model. (B) Observed vs. predicted AUC at the terminal time point. (C) Observed vs. predicted maximum concentration (Cmax). Figure obtained from ref [127] (CC BY 4.0 DEED).

An integrated ML and PBPK approach by Habiballah and Reisfeld highlighted a workflow in which crosstalk between learning and mechanistic PBPK can strengthen model predictions132. The model output was reported by the normalized summary pharmacokinetic metrics (SPKMs) Cmax/dose (peak concentration), tmax (time of peak concentration), and AUC/dose. In this workflow, rather than an underlying ML prediction feeding into a PBPK model, the PBPK model itself was implemented to generate predictions for sets of drugs with specific properties. The PK properties were then calculated for sets of drugs with similar properties based on a learned model, as a means of reducing the complexity and computational burden of a full mechanistic PBPK model pipeline. Although predictions from the combined ML approach were in good agreement with the purely PBPK approach, at times both approaches yield predictions that were relatively different from experimental data (Figure 5). While overall promising, this underscores the need for fundamentally understanding or being able to learn the PBPK model structure for our drug delivery process.

Figure 5.

Figure 5.

Representation of traditional PBPK (OpenCAT) predictions (blue circle), ML predictions (yellow triangle), and experimental data (bars), for Cmax/dose, tmax, and AUC/dose. Figure obtained from ref [132] (CC BY 4.0 DEED).

Natural language processing (NLP), a class of methods that include LLMs, has recently been adapted to support PBPK analyses of new molecular entities (NMEs). Specifically, Hsu et al. implemented a custom deep learning model that allowed for a system of manual checks and inputs, providing additional training input as needed. Among their case studies, their interactive and iterative model approach allowed them to identify dosing strategies and PK parameter covariates based on information gleaned via NLP and compare to manual curation (Figure 6)133. Such findings suggest that successfully leveraging NLP can allow one to broaden an initial literature search and expedite the process of defining, building, and implementing comprehensive PBPK models. Utilizing this more expansive approach to PBPK model development has the potential to increase the applicability of PBPK model predictions by improving efficiency and access to more comprehensive sources of data. In fact, their analyses of data extracted from the FDA website on the use of PBPK modeling in the evaluation of new molecular entity and biologic drugs provide further evidence of the increasing role of PBPK in the regulatory sphere and the utility of detecting adverse interactions based on broad AI-assisted surveys.

Figure 6.

Figure 6.

Representation of covariates identified by NLP in PK models of large (blue) and small (yellow) molecule oncology drugs. Figure obtained from ref [133].

ALB, albumin; ALT, alanine transaminase; AP, alkaline phosphatase; AST, aspartate aminotransferase; BILI, bilirubin; BSA, body surface area; BWT, body weight; CRCL, creatinine clearance; ECOG, Eastern Cooperative Oncology Group performance status; FDA, US Food and Drug Administration; IGG, immunoglobulin G; PK, pharmacokinetic; PopPK, population pharmacokinetic; PPI, proton pump inhibitor.

There is also recent precedent for AI models to streamline delivery predictions for nanomedicines, in which drug delivery behavior is learned from the properties of the nanoparticles134. This framework seeks to infer the delivery of a new nanomedicine to the tumor site by utilizing data from previous in vivo studies of nanomedicines, available from a Nano-Tumor Database18, rather than new in vivo studies. The time series for predicted tumor concentration from the AI-informed PBPK model, with parameters learned from the nanoparticle database, recapitulated previously published experimental data with high accuracy, thus potentially providing an alternative to animal studies as a screening method. Combining deep learning and ensemble models may further improve nanoparticle PK predictions by incorporating information across nanomedicine datasets where the data are limited, accounting more fully for features unique to the ADME of nanoparticles versus larger drugs135. It is important to note, however, that unlike small molecules and even biologics, nanoparticle properties and stability may differ substantially due to subtle differences in technique or quality control in manufacturing that are rarely described, not to mention actual physicochemical differences among different nanoparticle systems. This in turn can greatly influences observed in vivo behavior, and thus potentially limit the predictive power of any strictly AI approach.

Recent improvements to PK predictions span both modeling and treatment modalities, as evidenced by the variety of techniques and applications in the following examples. Neural networks, response surface methodology, and PBPK have been combined in the development and analysis of rivaroxaban controlled-release tablets with low solubility136. Specifically, a neural ordinary differential equations (neural-ODE) approach was able to predict PK measurements for new dosing regimens based on serum concentration levels from known regimens in humans receiving conjugated monoclonal antibody (the well-characterized trastuzumab emtansine) therapy, with greater accuracy than alternative models137. An ML model for predicting PK profiles of intravenously administered compounds in rats yielded results comparable to traditional methods when analyzing the PK profiles of thousands of small molecules138. An integrated AI-PBPK model has successfully predicted tissue distribution of over 600 oral and 71 intravenous drugs from the PK-DB database based on their physicochemical properties (enumerated as solubility, pKa values, crystal density, intrinsic dissolution rate, apparent permeability, protein unbound fraction, plasma clearance, and tissue partition coefficients for 15 organs), and proposed a method of leveraging allometric scaling to integrate data optimized across species139. By integrating available data sources to make predictions, such approaches reduce the gap between in vitro studies, early in vivo work, and the clinical result which often does not agree with expected behaviors based on preclinical results140.

Summary: Future directions of AI in PBPK modeling

Given the high costs of preclinical and early clinical development, advancing the correct drug candidate at the right doses is paramount, which in turn underscores the importance of accurately predicting a drug’s concentration profile over time in the body. Currently, PK and biodistributions are most frequently determined very late in the drug discovery process, only after a candidate has been selected. Indeed, PK are often measured and PK modeling performed as part of toxicokinetics assessments in the GLP toxicology studies necessary to enter human studies. In principle, a more accurate understanding of PK/biodistribution properties via modeling should enable researchers to eliminate, earlier in the drug discovery and development process, drug candidates with undesirable PK/biodistribution profiles that make them unlikely to be efficacious or safe. This in turn not only reduces unnecessary sacrifice of lives from animal studies, but by extension can enable more cost-effective and faster drug discovery and development. As such, we believe it is only a matter of time before PBPK modeling is firmly entrenched as part of in silico drug discovery.

Early adaptation of AI and ML techniques to PBPK modeling is already increasing the predictive power and confidence in PBPK models, while enhancing our understanding of the sources of uncertainty and reducing them where possible. To date, most applications of ML that are currently implemented in PBPK are used to estimate parameters or parameter ranges. While there is still room to develop in this area, implementing ML/AI in the context of model selection and model learning remains an area rich in opportunities for growth in the PBPK field. In parallel, advances in database management and web scraping technologies can improve our ability to more efficiently and/or more accurately obtain relevant ADME parameters. These trends have the potential to increase the role of PBPK in every step of the drug development process, from early drug discovery to formulation optimization to investigational new drug (IND) enabling studies to clinical development. Doing so could replace iterations of costly experimentation with iterations of a learned model, thereby reducing not only time and costs but also potentially improving both efficiency and accuracy throughout the drug discovery and development process.

We believe there is room to incorporate PBPK modeling at every stage of the drug discovery and development pipeline. ML/AI techniques should enable us to increasingly adopt sophisticated PBPK models even with only partial understanding of the PK parameters. For example, when targets are identified as part of the in silico drug discovery process, the algorithmic framework could be expanded at this stage to screen or test other critical aspects of drug design, such as anticipated delivery modalities or off-target interactions. Implementing a PBPK system in this context could inform the earliest phases of drug design, drug formulation, or engineering of drug carrier systems. As the drug design process continues, incorporating AI in a multiscale PBPK framework has the potential to become an invaluable tool for not only predicting where a drug is in the body, but also informing how we can best design a drug with the desired biodistribution pattern, as well as the most sensitive time points to sample in animal and human studies. Integrating such a model into a PD system would, in turn, both predict the outcome of a drug and inform the optimal design to maximize our ability to achieve that outcome. Over time, if PBPK predictions prove to be sufficiently accurate, it may one day take on an increasing role in informing the regulatory approval process141.

Acknowledgments.

The authors would like to thank the Applied and Computational Mathematics Division at NIST for their support of this work.

Financial Support.

This work was funded in part by the NIST National Research Council Associateship program (AMT), National Institutes of Health (R35 GM152449; YC), and startup funds from the UNC Eshelman School of Pharmacy (SKL) and UB School of Pharmacy and Pharmaceutical Sciences (AMT).

Footnotes

Declaration of Competing Interest. The authors have no conflict of interest to disclose.

References

  • 1.Zhuang X & Lu C PBPK modeling and simulation in drug research and development. Acta Pharmaceutica Sinica B vol. 6 430–440 Preprint at 10.1016/j.apsb.2016.04.004 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Andersen ME, Yang RSH, Clewell HJ & Reddy MB Introduction: A Historical Perspective of the Development and Applications of PBPK Models. in Physiologically Based Pharmacokinetic Modeling 1–18 (Wiley, 2005). doi: 10.1002/0471478768.ch1. [DOI] [Google Scholar]
  • 3.Sager JE, Yu J, Ragueneau-Majlessi I & Isoherranen N Physiologically Based Pharmacokinetic (PBPK) Modeling and Simulation Approaches: A Systematic Review of Published Models, Applications, and Model Verification. Drug Metabolism and Disposition 43, 1823 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Kuepfer L et al. Applied Concepts in PBPK Modeling: How to Build a PBPK/PD Model. CPT: Pharmacometrics and Systems Pharmacology vol. 5 516–531 Preprint at 10.1002/psp4.12134 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Chetty M et al. Applications of linking PBPK and PD models to predict the impact of genotypic variability, formulation differences, differences in target binding capacity and target site drug concentrations on drug responses and variability. Front Pharmacol 5, 1–29 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Espié P, Tytgat D, Sargentini-Maier M-L, Poggesi I & Watelet J-B Physiologically based pharmacokinetics (PBPK). Drug Metab Rev 41, 391–407 (2009). [DOI] [PubMed] [Google Scholar]
  • 7.Ince I et al. Predictive Performance of Physiology-Based Pharmacokinetic Dose Estimates for Pediatric Trials: Evaluation With 10 Bayer Small-Molecule Compounds in Children. J Clin Pharmacol 61, S70–S82 (2021). [Google Scholar]
  • 8.Wang X et al. Application of physiologically based pharmacokinetics modeling in the research of small-molecule targeted anti-cancer drugs. Cancer Chemotherapy and Pharmacology vol. 92 253–270 Preprint at 10.1007/s00280-023-04566-z (2023). [DOI] [PubMed] [Google Scholar]
  • 9.Cao Y & Jusko WJ Applications of minimal physiologically-based pharmacokinetic models. J Pharmacokinet Pharmacodyn 39, 711–723 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Eigenmann MJ et al. Interstitial IgG antibody pharmacokinetics assessed by combined in vivo- and physiologically-based pharmacokinetic modelling approaches. J Physiol 595, 7311–7330 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Garg A & Balthasar JP Physiologically-based pharmacokinetic (PBPK) model to predict IgG tissue kinetics in wild-type and FcRn-knockout mice. J Pharmacokinet Pharmacodyn 34, 687–709 (2007). [DOI] [PubMed] [Google Scholar]
  • 12.Yuan D, He H, Wu Y, Fan J & Cao Y Physiologically Based Pharmacokinetic Modeling of Nanoparticles. J Pharm Sci 108, 58–72 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Li M, Al-Jamal KT, Kostarelos K & Reineke J Physiologically Based Pharmacokinetic Modeling of Nanoparticles. ACS Nano 4, 6303–6317 (2010). [DOI] [PubMed] [Google Scholar]
  • 14.Li M, Zou P, Tyner K & Lee S Physiologically Based Pharmacokinetic (PBPK) Modeling of Pharmaceutical Nanoparticles. Aaps j 19, 26–42 (2017). [DOI] [PubMed] [Google Scholar]
  • 15.Bachler G, von Goetz N & Hungerbühler K A physiologically based pharmacokinetic model for ionic silver and silver nanoparticles. Int J Nanomedicine 8, 3365–3382 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Bachler G, von Goetz N & Hungerbuhler K Using physiologically based pharmacokinetic (PBPK) modeling for dietary risk assessment of titanium dioxide (TiO2) nanoparticles. Nanotoxicology 9, 373–380 (2015). [DOI] [PubMed] [Google Scholar]
  • 17.Siccardi M, Rannard S & Owen A The emerging role of physiologically based pharmacokinetic modelling in solid drug nanoparticle translation. Advanced Drug Delivery Reviews vol. 131 116–121 Preprint at 10.1016/j.addr.2018.06.016 (2018). [DOI] [PubMed] [Google Scholar]
  • 18.Cheng YH, He C, Riviere JE, Monteiro-Riviere NA & Lin Z Meta-Analysis of Nanoparticle Delivery to Tumors Using a Physiologically Based Pharmacokinetic Modeling and Simulation Approach. ACS Nano 14, 3075–3095 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Jones HM et al. Application of PBPK modelling in drug discovery and development at Pfizer. Xenobiotica 42, 94–106 (2012). [DOI] [PubMed] [Google Scholar]
  • 20.Jones HM, Gardner IB & Watson KJ Modelling and PBPK simulation in drug discovery. AAPS Journal 11, 155–166 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ferl GZ, Theil F-P & Wong H Physiologically based pharmacokinetic models of small molecules and therapeutic antibodies: a mini-review on fundamental concepts and applications. Biopharm Drug Dispos 37, 75–92 (2016). [DOI] [PubMed] [Google Scholar]
  • 22.K Y, Kollipara S, Ahmed T & Chachad S Applications of PBPK/PBBM modeling in generic product development: An industry perspective. Journal of Drug Delivery Science and Technology vol. 69 Preprint at 10.1016/j.jddst.2022.103152 (2022). [DOI] [Google Scholar]
  • 23.Tegenge MA & Mitkus RJ A first-generation physiologically based pharmacokinetic (PBPK) model of alpha-tocopherol in human influenza vaccine adjuvant. Regulatory Toxicology and Pharmacology 71, 353–364 (2015). [DOI] [PubMed] [Google Scholar]
  • 24.Saldanha L & Vale N The First Physiologically Based Pharmacokinetic (PBPK) Model for an Oral Vaccine Using Alpha-Tocopherol as an Adjuvant. Pharmaceutics 15, (2023). [Google Scholar]
  • 25.Saldanha L, Langel Ü & Vale N In Silico Studies to Support Vaccine Development. Pharmaceutics vol. 15 Preprint at 10.3390/pharmaceutics15020654 (2023). [DOI] [Google Scholar]
  • 26.Lutz JD et al. Physiologically-Based Pharmacokinetic Modeling of Remdesivir and its Metabolites to Support Dose Selection for the Treatment of Pediatric Patients with COVID-19. Clin Pharmacol Ther 10.1002/cpt.2176 (2021) doi:10.1002/cpt.2176. [DOI] [Google Scholar]
  • 27.He H et al. A Multiscale Physiologically-Based Pharmacokinetic Model for Doxorubicin to Explore its Mechanisms of Cytotoxicity and Cardiotoxicity in Human Physiological Contexts. Pharm Res 35, 174 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Thai H-T, Mazuir F, Cartot-Cotton S & Veyrat-Follet C Optimizing pharmacokinetic bridging studies in paediatric oncology using physiologically-based pharmacokinetic modelling: application to docetaxel. Br J Clin Pharmacol 80, 534–547 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Zhou W et al. Predictive Performance of Physiologically Based Pharmacokinetic and Population Pharmacokinetic Modeling of Renally Cleared Drugs in Children. CPT Pharmacometrics Syst Pharmacol 5, 475–483 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lang J, Vincent L, Chenel M, Ogungbenro K & Galetin A Impact of Hepatic CYP3A4 Ontogeny Functions on Drug-Drug Interaction Risk in Pediatric Physiologically-Based Pharmacokinetic/Pharmacodynamic Modeling: Critical Literature Review and Ivabradine Case Study. Clin Pharmacol Ther 10.1002/cpt.2134 (2020) doi:10.1002/cpt.2134. [DOI] [Google Scholar]
  • 31.Walsh C et al. Development of a physiologically based pharmacokinetic model of actinomycin D in children with cancer. Br J Clin Pharmacol 81, 989–998 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Nuwer R US agency seeks to phase out animal testing. Nature 10.1038/d41586-022-03569-9 (2022) doi:10.1038/d41586-022-03569-9. [DOI] [Google Scholar]
  • 33.Wewetzer H, Wagenknecht T, Bert B & Schönfelder G The fate of surplus laboratory animals. EMBO Rep 24, (2023). [Google Scholar]
  • 34.Wadman M FDA no longer has to require animal testing for new drugs. Science (1979) 379, 127–128 (2023). [Google Scholar]
  • 35.Rifaioglu AS et al. Recent applications of deep learning and machine intelligence on in silico drug discovery: Methods, tools and databases. Briefings in Bioinformatics vol. 20 1878–1912 Preprint at 10.1093/bib/bby061 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Shi J & Zha W Predicting Human Pharmacokinetics: Physiologically Based Pharmacokinetic Modeling and In Silico ADME Prediction in Early Drug Discovery. Eur J Drug Metab Pharmacokinet 44, 135–137 (2019). [DOI] [PubMed] [Google Scholar]
  • 37.Liu W et al. Developments in methods for measuring the intestinal absorption of nanoparticle-bound drugs. International Journal of Molecular Sciences vol. 17 Preprint at 10.3390/ijms17071171 (2016). [DOI] [Google Scholar]
  • 38.Glazier DS Variable metabolic scaling breaks the law: from ‘Newtonian’ to ‘Darwinian’ approaches. Proceedings of the Royal Society B: Biological Sciences vol. 289 Preprint at 10.1098/rspb.2022.1605 (2022). [DOI] [Google Scholar]
  • 39.Mahmood I Misconceptions and issues regarding allometric scaling during the drug development process. Expert Opin Drug Metab Toxicol 14, 843–854 (2018). [DOI] [PubMed] [Google Scholar]
  • 40.Huh Y, Smith DE & Rose Feng M Interspecies scaling and prediction of human clearance: comparison of small- and macro-molecule drugs. Xenobiotica 41, 972–987 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Ritschel WA, Vachharajani NN, Johnson RD & Hussain AS The allometric approach for interspecies scaling of pharmacokinetic parameters. Comparative Biochemistry and Physiology Part C: Comparative Pharmacology 103, 249–253 (1992). [Google Scholar]
  • 42.Poulin P & Theil F A Priori Prediction of Tissue:Plasma Partition Coefficients of Drugs to Facilitate the Use of Physiologically-Based Pharmacokinetic Models in Drug Discovery. J Pharm Sci 89, 16–35 (2000). [DOI] [PubMed] [Google Scholar]
  • 43.Poulin P, Schoenlein K & Theil F Prediction of adipose tissue: Plasma partition coefficients for structurally unrelated drugs. J Pharm Sci 90, 436–447 (2001). [DOI] [PubMed] [Google Scholar]
  • 44.Rodgers T, Leahy D & Rowland M Physiologically Based Pharmacokinetic Modeling 1: Predicting the Tissue Distribution of Moderate-to-Strong Bases. J Pharm Sci 94, 1259–1276 (2005). [DOI] [PubMed] [Google Scholar]
  • 45.Rodgers T & Rowland M Physiologically based pharmacokinetic modelling 2: Predicting the tissue distribution of acids, very weak bases, neutrals and zwitterions. J Pharm Sci 95, 1238–1257 (2006). [DOI] [PubMed] [Google Scholar]
  • 46.Utsey K et al. Quantification of the Impact of Partition Coefficient Prediction Methods on Physiologically Based Pharmacokinetic Model Output Using a Standardized Tissue Composition. Drug Metabolism and Disposition 48, 903–916 (2020). [DOI] [PubMed] [Google Scholar]
  • 47.Poulin P, Nicolas J-M & Bouzom F A New Version of the Tissue Composition-Based Model for Improving the Mechanism-Based Prediction of Volume of Distribution at Steady-State for Neutral Drugs. J Pharm Sci 113, 118–130 (2024). [DOI] [PubMed] [Google Scholar]
  • 48.Jagdale P, Sepp A & Shah DK Physiologically-based pharmacokinetic model for pulmonary disposition of protein therapeutics in humans. J Pharmacokinet Pharmacodyn 49, 607–624 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Baxter LT, Zhu H, Mackensen DG & Jain RK Physiologically based pharmacokinetic model for specific and nonspecific monoclonal antibodies and fragments in normal tissues and human tumor xenografts in nude mice. Cancer Res 54, 1517–28 (1994). [PubMed] [Google Scholar]
  • 50.Rippe B & Haraldsson B Transport of macromolecules across microvascular walls: the two-pore theory. Physiol Rev 74, 163–219 (1994). [DOI] [PubMed] [Google Scholar]
  • 51.Covell DG et al. Pharmacokinetics of Monoclonal Immunoglobulin G1, F(ab′)2, and Fab′ in Mice. Cancer Res 46, 3969–3978 (1986). [PubMed] [Google Scholar]
  • 52.Shah DK & Betts AM Towards a platform PBPK model to characterize the plasma and tissue disposition of monoclonal antibodies in preclinical species and human. J Pharmacokinet Pharmacodyn 39, 67–86 (2012). [DOI] [PubMed] [Google Scholar]
  • 53.Abuqayyas L & Balthasar JP Application of PBPK modeling to predict monoclonal antibody disposition in plasma and tissues in mouse models of human colorectal cancer. J Pharmacokinet Pharmacodyn 39, 683–710 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Ozbek O, Genc DE & O. Ulgen K Advances in Physiologically Based Pharmacokinetic (PBPK) Modeling of Nanomaterials. ACS Pharmacol Transl Sci 7, 2251–2279 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Henrique Silva A et al. A physiologically based pharmacokinetic model to predict the superparamagnetic iron oxide nanoparticles (SPIONs) accumulation in vivo. Eur J Nanomed 9, (2017). [Google Scholar]
  • 56.Moreno-Chaparro D, Moreno N, Usabiaga FB & Ellero M Computational modeling of passive transport of functionalized nanoparticles. J Chem Phys 158, (2023). [Google Scholar]
  • 57.Hashemian SMR et al. Paxlovid (Nirmatrelvir/Ritonavir): A new approach to Covid-19 therapy? Biomed Pharmacother 162, 114367 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Talkington AM et al. Experimental Data and PBPK Modeling Quantify Antibody Interference in PEGylated Drug Carrier Delivery. Bull Math Biol 83, 123 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Helton JC & Davis FJ Latin hypercube sampling and the propagation of uncertainty in analyses of complex systems. Reliab Eng Syst Saf 81, 23–69 (2003). [Google Scholar]
  • 60.Marino S, Hogue IB, Ray CJ & Kirschner DE A methodology for performing global uncertainty and sensitivity analysis in systems biology. J Theor Biol 254, 178–196 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.McKay MD, Beckman RJ & Conover WJ A Comparison of Three Methods for Selecting Values of Input Variables in the Analysis of Output from a Computer Code. Technometrics 21, 239 (1979). [Google Scholar]
  • 62.Hora SC & Iman RL Expert Opinion in Risk Analysis: The NUREG-1150 Methodology. Nuclear Science and Engineering 102, 323–331 (1989). [Google Scholar]
  • 63.Iman RL, Helton JC & Campbell JE An Approach to Sensitivity Analysis of Computer Models: Part I—Introduction, Input Variable Selection and Preliminary Variable Assessment. Journal of Quality Technology 13, 174–183 (1981). [Google Scholar]
  • 64.Talkington AM et al. A PBPK model recapitulates early kinetics of anti-PEG antibody-mediated clearance of PEG-liposomes. Journal of Controlled Release 343, 518–527 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Renardy M, Hult C, Evans S, Linderman JJ & Kirschner DE Global sensitivity analysis of biological multiscale models. Current Opinion in Biomedical Engineering vol. 11 109–116 Preprint at 10.1016/j.cobme.2019.09.012 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Dige N & Diwekar U Efficient sampling algorithm for large-scale optimization under uncertainty problems. Comput Chem Eng 115, 431–454 (2018). [Google Scholar]
  • 67.Renardy M, Joslyn LR, Millar JA & Kirschner DE To Sobol or not to Sobol? The effects of sampling schemes in systems biology applications. Math Biosci 337, (2021). [Google Scholar]
  • 68.Iordanis I, Koukouvinos C & Silou I On the efficacy of conditioned and progressive Latin hypercube sampling in supervised machine learning. Applied Numerical Mathematics 10.1016/j.apnum.2023.12.016 (2024) doi:10.1016/j.apnum.2023.12.016. [DOI] [Google Scholar]
  • 69.Bonate PL A Brief Introduction to Monte Carlo Simulation. Clin Pharmacokinet 40, 15–22 (2001). [DOI] [PubMed] [Google Scholar]
  • 70.Bois FY Analysis of PBPK Models for Risk Characterization. Ann N Y Acad Sci 895, 317–337 (1999). [DOI] [PubMed] [Google Scholar]
  • 71.Krauss M et al. Using Bayesian-PBPK modeling for assessment of inter-individual variability and subgroup stratification. In Silico Pharmacol 1, 6 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Weijs L, Yang RSH, Das K, Covaci A & Blust R Application of Bayesian Population Physiologically Based Pharmacokinetic (PBPK) Modeling and Markov Chain Monte Carlo Simulations to Pesticide Kinetics Studies in Protected Marine Mammals: DDT, DDE, and DDD in Harbor Porpoises. Environ Sci Technol 47, 4365–4374 (2013). [DOI] [PubMed] [Google Scholar]
  • 73.Weijs L et al. Lifetime PCB 153 bioaccumulation and pharmacokinetics in pilot whales: Bayesian population PBPK modeling and Markov chain Monte Carlo simulations. Chemosphere 94, 91–96 (2014). [DOI] [PubMed] [Google Scholar]
  • 74.Allen BC, Eric Hack C & Clewell HJ Use of Markov Chain Monte Carlo Analysis with a Physiologically-Based Pharmacokinetic Model of Methylmercury to Estimate Exposures in U.S. Women of Childbearing Age. Risk Analysis 27, 947–959 (2007). [DOI] [PubMed] [Google Scholar]
  • 75.McNally K et al. Reconstruction of Exposure to m-Xylene from Human Biomonitoring Data Using PBPK Modelling, Bayesian Inference, and Markov Chain Monte Carlo Simulation. J Toxicol 2012, 1–18 (2012). [Google Scholar]
  • 76.Brooks S Markov chain Monte Carlo method and its application. Journal of the Royal Statistical Society: Series D (The Statistician) 47, 69–100 (1998). [Google Scholar]
  • 77.Brooks S, Gelman A, Jones G & Meng X-L Handbook of Markov Chain Monte Carlo. (Chapman and Hall/CRC, New York, 2011). doi: 10.1201/b10905. [DOI] [Google Scholar]
  • 78.Djuric PM et al. Particle filtering. IEEE Signal Process Mag 20, 19–38 (2003). [Google Scholar]
  • 79.Elfring J, Torta E & van de Molengraft R Particle filters: A hands-on tutorial. Sensors (Switzerland) 21, 1–28 (2021). [Google Scholar]
  • 80.Wedagedera JR et al. Population PBPK modeling using parametric and nonparametric methods of the Simcyp Simulator, and Bayesian samplers. CPT Pharmacometrics Syst Pharmacol 11, 755–765 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Cox LA Causality for Risk Analysts: Improving Our Understanding of How the World Works. in Improving Risk Analysis (ed. Anthony Cox Louis Jr.) 3–58 (Springer New York, New York, NY, 2012). doi: 10.1007/978-1-4614-6058-9_1. [DOI] [Google Scholar]
  • 82.Nanda P & Kirschner DE Calibration methods to fit parameters within complex biological models. Front Appl Math Stat 9, (2023). [Google Scholar]
  • 83.Andrieu C, Djurić PM & Doucet A Model selection by MCMC computation. Signal Processing 81, 19–37 (2001). [Google Scholar]
  • 84.Newland MC An Information Theoretic Approach to Model Selection: A Tutorial with Monte Carlo Confirmation. Perspect Behav Sci 42, 583–616 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Xiao H & Sun Y On Tuning Parameter Selection in Model Selection and Model Averaging: A Monte Carlo Study. Journal of Risk and Financial Management 12, 109 (2019). [Google Scholar]
  • 86.Thompson MD & Beard DA Physiologically based pharmacokinetic tissue compartment model selection in drug development and risk assessment. J Pharm Sci 101, 424–435 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Strikwold M, Spenkelink B, Woutersen RA, Rietjens IMCM & Punt A Development of a combined in Vitro Physiologically Based Kinetic (PBK) and Monte Carlo modelling approach to predict interindividual human variation in phenol-induced developmental toxicity. Toxicological Sciences 157, 365–376 (2017). [DOI] [PubMed] [Google Scholar]
  • 88.Hsieh NH, Reisfeld B, Bois FY & Chiu WA Applying a Global Sensitivity Analysis Workflow to Improve the Computational Efficiencies in Physiologically-Based Pharmacokinetic Modeling. Front Pharmacol 9, 588 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.McNally K, Cotton R & Loizou GD A Workflow for Global Sensitivity Analysis of PBPK Models. Front Pharmacol 2:31, (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Anzar I, Sverchkova A, Stratford R & Clancy T NeoMutate: An ensemble machine learning framework for the prediction of somatic mutations in cancer. BMC Med Genomics 12, (2019). [Google Scholar]
  • 91.Fritsch JM, Hilliker J, Ross J & Vislocky RL Model Consensus. Weather Forecast 15, 571–582 (2000). [Google Scholar]
  • 92.Topchy A, Jain AK & Punch W Clustering ensembles: models of consensus and weak partitions. IEEE Trans Pattern Anal Mach Intell 27, 1866–1881 (2005). [DOI] [PubMed] [Google Scholar]
  • 93.Zhan H et al. Consensus Ensemble System for Traffic Flow Prediction. IEEE Transactions on Intelligent Transportation Systems 19, 3903–3914 (2018). [Google Scholar]
  • 94.Ala’raj M & Abbod MF A new hybrid ensemble credit scoring model based on classifiers consensus system approach. Expert Syst Appl 64, 36–55 (2016). [Google Scholar]
  • 95.Paixão P, Aniceto N, Gouveia LF & Morais JAG Prediction of drug distribution in rat and humans using an artificial neural networks ensemble and a PBPK model. Pharm Res 31, 3313–3322 (2014). [DOI] [PubMed] [Google Scholar]
  • 96.Gao H, Wang W, Dong J, Ye Z & Ouyang D An integrated computational methodology with data-driven machine learning, molecular modeling and PBPK modeling to accelerate solid dispersion formulation design. European Journal of Pharmaceutics and Biopharmaceutics 158, 336–346 (2021). [DOI] [PubMed] [Google Scholar]
  • 97.Abdulhafedh A Comparison between Common Statistical Modeling Techniques Used in Research, Including: Discriminant Analysis vs Logistic Regression, Ridge Regression vs LASSO, and Decision Tree vs Random Forest. OAlib 09, 1–19 (2022). [Google Scholar]
  • 98.Yun YE, Cotton CA & Edginton AN Development of a decision tree to classify the most accurate tissue-specific tissue to plasma partition coefficient algorithm for a given compound. J Pharmacokinet Pharmacodyn 41, 1–14 (2014). [DOI] [PubMed] [Google Scholar]
  • 99.Chen EP, Bondi RW & Michalski PJ Model-based Target Pharmacology Assessment (mTPA): An Approach Using PBPK/PD Modeling and Machine Learning to Design Medicinal Chemistry and DMPK Strategies in Early Drug Discovery. J Med Chem 64, 3185–3196 (2021). [DOI] [PubMed] [Google Scholar]
  • 100.Goldberg DE Genetic and evolutionary algorithms come of age. Commun. ACM 37, 113–119 (1994). [Google Scholar]
  • 101.Vikhar PA Evolutionary algorithms: A critical review and its future prospects. in 2016 International Conference on Global Trends in Signal Processing, Information Computing and Communication (ICGTSPICC) 261–265 (2016). doi: 10.1109/ICGTSPICC.2016.7955308. [DOI] [Google Scholar]
  • 102.Bartz-Beielstein T, Branke J, Mehnen J & Mersmann O Evolutionary Algorithms. WIREs Data Mining and Knowledge Discovery 4, 178–195 (2014). [Google Scholar]
  • 103.Natekin A & Knoll A Gradient boosting machines, a tutorial. Front Neurorobot 7, (2013). [Google Scholar]
  • 104.Adachi K, Shimizu M & Yamazaki H Updated in Silico Prediction Methods for Fractions Absorbed and Key Input Parameters of 355 Disparate Chemicals for Physiologically Based Pharmacokinetic Models for Time-Dependent Plasma Concentrations after Virtual Oral Doses in Humans. Biol Pharm Bull 45, b22–00502 (2022). [Google Scholar]
  • 105.Chou WC & Lin Z Machine learning and artificial intelligence in physiologically based pharmacokinetic modeling. Toxicological sciences : an official journal of the Society of Toxicology vol. 191 1–14 Preprint at 10.1093/toxsci/kfac101 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Hastie T, Tibshirani R & Friedman J Random Forests. in 587–604 (2009). doi: 10.1007/978-0-387-84858-7_15. [DOI] [Google Scholar]
  • 107.Segal MR Machine Learning Benchmarks and Random Forest Regression. (2004).
  • 108.Chao FC, Manaia EB, Ponchel G & Hsieh CM A physiologically-based pharmacokinetic model for predicting doxorubicin disposition in multiple tissue levels and quantitative toxicity assessment. Biomedicine and Pharmacotherapy 168, (2023). [Google Scholar]
  • 109.Genovese CR, Jin J, Wasserman L & Yao Z A Comparison of the Lasso and Marginal Regression. Journal of Machine Learning Research vol. 13 (2012). [Google Scholar]
  • 110.Tutz G & Binder H Boosting ridge regression. Comput Stat Data Anal 51, 6044–6059 (2007). [Google Scholar]
  • 111.Jusko WJ Moving from basic toward systems pharmacodynamic models. J Pharm Sci 102, 2930–2940 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Abdi H, Valentin D & Edelman B Neural Networks. (SAGE Publications, Inc.; 1999). doi: 10.4135/9781412985277. [DOI] [Google Scholar]
  • 113.Gurney K An Introduction to Neural Networks. (CRC Press, 1997). doi: 10.1201/9781315273570. [DOI] [Google Scholar]
  • 114.Abiodun OI et al. State-of-the-art in artificial neural network applications: A survey. Heliyon 4, (2018). [Google Scholar]
  • 115.Larochelle H, Bengio Y, Louradour J & Ca LU Exploring Strategies for Training Deep Neural Networks Pascal Lamblin. Journal of Machine Learning Research vol. 1 (2009). [Google Scholar]
  • 116.Bräm DS, Parrott N, Hutchinson L & Steiert B Introduction of an artificial neural network–based method for concentration-time predictions. CPT Pharmacometrics Syst Pharmacol 11, 745–754 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Naveed H et al. A Comprehensive Overview of Large Language Models. ACM Trans Intell Syst Technol 10.1145/3744746 (2025) doi:10.1145/3744746. [DOI] [Google Scholar]
  • 118.Arısoy E, Sainath TN, Kingsbury B & Ramabhadran B Deep Neural Network Language Models. in NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, 20–28 (Association for Computational Linguistics, Montreal, 2012). [Google Scholar]
  • 119.Mikolov T, Deoras A, Povey D, Burget L & Černocký J Strategies for training large scale neural network language models. in 2011 IEEE Workshop on Automatic Speech Recognition & Understanding 196–201 (2011). doi: 10.1109/ASRU.2011.6163930. [DOI] [Google Scholar]
  • 120.Thirunavukarasu AJ et al. Large language models in medicine. Nat Med 29, 1930–1940 (2023). [DOI] [PubMed] [Google Scholar]
  • 121.Bloomingdale P, Nguyen VA, Niu J & Mager DE Boolean network modeling in systems pharmacology. J Pharmacokinet Pharmacodyn 45, 159–180 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Hemedan AA, Niarakis A, Schneider R & Ostaszewski M Boolean modelling as a logic-based dynamic approach in systems medicine. Comput Struct Biotechnol J 20, 3161–3172 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Patidar K & Ford Versypt AN Logic-based modeling of inflammatory macrophage cross talk with glomerular endothelial cells in diabetic kidney disease. American Journal of Physiology-Renal Physiology 329, F202–F224 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Khusial R, Bies RR & Akil A Deep Learning Methods Applied to Drug Concentration Prediction of Olanzapine. Pharmaceutics 15, 1139 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Zhang T et al. Two heads are better than one: current landscape of integrating QSP and machine learning: An ISoP QSP SIG white paper by the working group on the integration of quantitative systems pharmacology and machine learning. J Pharmacokinet Pharmacodyn 49, 5–18 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Li Y et al. A Combination of Machine Learning and PBPK Modeling Approach for Pharmacokinetics Prediction of Small Molecules in Humans. Pharm Res 41, 1369–1379 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Mavroudis PD, Teutonico D, Abos A & Pillai N Application of machine learning in combination with mechanistic modeling to predict plasma exposure of small molecules. Frontiers in Systems Biology 3, (2023). [Google Scholar]
  • 128.Miljković F et al. Machine Learning Models for Human In Vivo Pharmacokinetic Parameters with In-House Validation. Mol Pharm 18, 4520–4530 (2021). [DOI] [PubMed] [Google Scholar]
  • 129.Murad N et al. Predicting Volume of Distribution in Humans: Performance of In Silico Methods for a Large Set of Structurally Diverse Clinical Compounds. Drug Metabolism and Disposition 49, 169–178 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130.Swanson K et al. ADMET-AI: A machine learning ADMET platform for evaluation of large-scale chemical libraries. 10.5281/zenodo.10372930 doi:10.5281/zenodo.10372930. [DOI] [Google Scholar]
  • 131.Tian H, Ketkar R & Tao P ADMETboost: a web server for accurate ADMET prediction. J Mol Model 28, (2022). [Google Scholar]
  • 132.Habiballah S & Reisfeld B Adapting physiologically-based pharmacokinetic models for machine learning applications. Sci Rep 13, (2023). [Google Scholar]
  • 133.Hsu JC et al. Applications of Advanced Natural Language Processing for Clinical Pharmacology. Clin Pharmacol Ther 10.1002/cpt.3161 (2024) doi:10.1002/cpt.3161. [DOI] [Google Scholar]
  • 134.Chou W-C et al. An artificial intelligence-assisted physiologically-based pharmacokinetic model to predict nanoparticle delivery to tumors in mice. Journal of Controlled Release 361, 53–63 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.Khakpour A et al. AI-Powered Prediction of Nanoparticle Pharmacokinetics: A Multi-View Learning Approach. http://arxiv.org/abs/2503.13798 (2025).
  • 136.Saleem MT, Shoaib MH, Yousuf RI & Siddiqui F RSM and AI based machine learning for quality by design development of rivaroxaban push-pull osmotic tablets and its PBPK modeling. Sci Rep 15, (2025). [Google Scholar]
  • 137.Lu J, Deng K, Zhang X, Liu G & Guan Y Neural-ODE for pharmacokinetics modeling and its advantage to alternative machine learning models in predicting new dosing regimens. iScience 24, (2021). [Google Scholar]
  • 138.Walter M et al. Predicting Pharmacokinetics in Rats Using Machine Learning: A Comparative Study Between Empirical, Compartmental, and PBPK-Based Approaches. Clin Transl Sci 18, e70150 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Wang W et al. An Integrated AI-PBPK Platform for Predicting Drug In Vivo Fate and Tissue Distribution in Human and Inter-Species Extrapolation. Clin Pharmacol Ther 10.1002/cpt.3732 (2025) doi:10.1002/cpt.3732. [DOI] [Google Scholar]
  • 140.Rostami-Hodjegan A Reverse Translation in PBPK and QSP: Going Backwards in Order to Go Forward With Confidence. Clin Pharmacol Ther 103, 224–232 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 141.Zhao P et al. Applications of Physiologically Based Pharmacokinetic (PBPK) Modeling and Simulation During Regulatory Review. Clin Pharmacol Ther 89, 259–267 (2011). [DOI] [PubMed] [Google Scholar]

RESOURCES