A Machine-Learning Based Drug Repurposing Approach Using Baseline Regularization

Zhaobin Kuang; Yujia Bao; James Thomson; Michael Caldwell; Peggy Peissig; Ron Stewart; Rebecca Willett; David Page

doi:10.1007/978-1-4939-8955-3_15

. Author manuscript; available in PMC: 2019 Jan 1.

Published in final edited form as: Methods Mol Biol. 2019;1903:255–267. doi: 10.1007/978-1-4939-8955-3_15

A Machine-Learning Based Drug Repurposing Approach Using Baseline Regularization

Zhaobin Kuang ^1,^*, Yujia Bao ², James Thomson ³, Michael Caldwell ⁴, Peggy Peissig ⁵, Ron Stewart ⁶, Rebecca Willett ⁷, David Page ⁸

PMCID: PMC6296259 NIHMSID: NIHMS974404 PMID: 30547447

Abstract

We present the baseline regularization model for computational drug repurposing using electronic health records (EHRs). In EHRs, drug prescriptions of various drugs are recorded throughout time for various patients. In the same time, numeric physical measurements (e.g. fasting blood sugar level) are also recorded. Baseline regularization uses statistical relationships between the occurrences of prescriptions of some particular drugs and the increase or the decrease in the values of some particular numeric physical measurements to identify potential repurposing opportunities.

Keywords: Electronic health records, computational drug repurposing, longitudinal data, self-controlled case series, silico repurposing

1 Introduction

With the increasing availability of electronic health record (EHR) data, there is an emerging interest in using EHRs from various patients for computational drug repurposing (CDR). Specifically, in EHRs, drug prescriptions of various drugs are recorded throughout time for various patients. In the same time, numeric physical measurements, such as fasting blood sugar (FBG) level, blood pressure, and low density lipoprotein are also recorded. By designing machine learning algorithms that can establish relationships between the occurrences of prescriptions of some particular drugs and the increase or the decrease in the values of some particular numeric physical measurements, we might be able to identify drugs that can be potentially repurposed to control certain numeric physical measurements. This chapter describes such a machine learning algorithm called baseline regularization [12] for CDR.

2 Materials

Figure 1 visualizes a set of electronic health records from two patients. Drug prescriptions of different types enter the EHRs of the two patients at different times. Fasting blood sugar (FBG) level measurements are also recorded at various times. In this chapter, we will consider how to identify drugs that can be potentially repurposed to control FBG level as an example to illustrate the use of baseline regularization. The idea is to formulate this problem as a machine learning problem by considering an FBG record as a response variable and using the drug prescriptions that occur before the FBG record as features to predict the value of the FBG record. If through the predictive model we notice that the prescription of a particular drug is associated with the decrease of FBG, then we can consider this drug as a potential candidate to be repurposed for glucose control. It should be noticed that while we are using FBG level control as an example for the ease of presentation, the proposed algorithm can also be used to identify drugs that can be potentially repurposed to control other numeric physical measurements.

Visualization of electronic health records (EHRs) from two patients. Fasting blood sugar (FBG) level measurements as well as drug prescriptions of various drugs are observed for the two patients over time.

2.1 Notation

Without loss of generality, we assume that only drug prescription records and FBG records are available for each patient. And we consider only patients with at least one FBG record throughout their observations. Let there be N patients and p drugs under consideration in total. Suppose that for the i^th patient, there are n_i drug prescription records and m_i FBG records in total, where i ∈ {1, 2, …, N}. We can use a 2-tuple (x_ij, t_ij) to represent the j^th drug prescription record of the i^th patient, where j ∈ {1, 2, …, n_i}, x_ij ∈ {1, 2, …, p} represents which drug among the p drugs is prescribed, and t_ij represents the timestamp of the drug prescription. Similarly, we can also use a 2-tuple (y_ik, τ_ik) to represent the k^th FBG measurement record from the i^th patient, where k ∈ {1, 2, …, m_i}, y_ik denotes the value of the FBG measurement, and τ_ik represents the measurement timestamp. Note that given i, $t_{i 1} \leq t_{i 2} \leq \dots \leq t_{i n_{i}}$ and $τ_{i 1} \leq τ_{i 2} \leq \dots \leq τ_{i n_{i}}$ . In this way, we can represent the EHR of each patient as a set of the aforementioned 2-tuples.

3 Methods

We first present how the potential influence of various drugs over time on the value of FBG measurements can be ascertained via the use of dyadic influence functions, directly from raw EHR data. We then present our baseline regularization model that combines the effects of time-varying patient-specific baselines and the effects from various drugs throughout time to predict FBG levels for CDR.

3.1 Dyadic Influence

We assume that drug prescriptions in the EHR of a patient have certain influences on the values of the FBG measurements that occur after the prescriptions. Since drug prescriptions occur throughout time for various patients, given an FBG measurement record, an intuition is that a drug prescription record that occurs long before has less effect, if any, on the value of the FBG measurement in question, compared with a more recent drug prescription occurrence. Based on this intuition, for t_ij ≤ τ_ik, we represent the effect of a drug prescription (x_ij, t_ij) on an FBG measurement (y_ik, τ_ik) through a weighted sum of a pre-defined set of dyadic influence functions ${ϕ_{l} (\cdot)}_{l = 0}^{L - 1}$ [3]. Specifically, let S > 0 and L ∈ ℕ⁺ be given. For l ∈ {0, 1, 2, …, L − 1}, we define

α_{l} ≜ {\begin{matrix} 2^{L - 1} / S & l = 0 \\ 2^{L - l} / S & l = 1, 2, \dots, L - 1 \end{matrix};

and the half-closed-half-open intervals,

I_{l} ≜ {\begin{matrix} [0, 1 / α_{l}) & l = 0 \\ [1 / α_{l}, 2 / α_{l}) & l = 1, 2, \dots, L - 1 \end{matrix} .

Then we define

ϕ_{l} (δ) ≜ α_{l} I (δ \in I_{l}),

where δ = τ_ik −t_ij is the time difference between the drug prescription and the FBG measurement, and $I (\cdot)$ is the indicator function. Note that these ϕ_l(·)’s all integrate to one and are orthogonal to one another.

Figure 2.1 visualizes the set of dyadic influence functions when S = 512 and L = 6. As can be seen, when the time difference between two events δ increases, the influence decays in exponential order. For δ ≥ S, the previous drug prescription is assumed not to have any influence on the value of the FBG measurement in question. Dyadic influence functions provide a flexible approach to ascertain influences of various drug prescriptions in the past on the value of FBG measurement records. This is in contrast to the drug era construction that is prevalent in the pharmacovigilance literature [15, 21, 20, 14], where ad-hoc heuristics are used to generate a consecutive time period during which the value of an FBG measurement is assumed to be under unattenuated influence.

Dyadic influence functions for S = 512 and L = 6.

3.2 Baseline Regularization

Baseline regularization assumes that an observed FBG value is due to the influences of various drug prescriptions that occur in the past as well as a hidden, intrinsic baseline FBG value that represents the FBG level that would have been observed if the patient were not under any other influences. Specifically, baseline regularization considers solving the optimization problem in (1):

\hat{b}, \hat{β} ≜ \arg min_{b, β} \frac{1}{2 M} \sum_{i = 1}^{N} \sum_{k = 1}^{m_{i}} {(y_{i k} - b_{i k} - \sum_{j = 1}^{n_{i}} \sum_{q = 1}^{p} \sum_{l = 0}^{L - 1} β_{q l} ϕ_{l} (τ_{i k} - t_{i j}) \cdot I (x_{i j} = q))}^{2} + λ_{1} \sum_{i = 1}^{N} \sum_{k = 1}^{m_{i} - 1} | b_{i k} - b_{i (k + 1)} | + λ_{2} {‖ β ‖}_{1},

(1)

where $M = \sum_{i = 1}^{N} m_{i}$ is the total number of FBG measurements under consideration, λ₁ > 0 and λ₂ > 0 are regularization parameters, and

b ≜ {[\begin{array}{l} b_{11} & b_{12} & \dots & b_{1 m_{1}} & \dots & b_{N 1} & b_{N 2} & \dots & b_{N m_{N}} \end{array}]}^{⊤} and β ≜ {[\begin{array}{l} β_{10} & b_{11} & \dots & β_{1 (L - 1)} & \dots & β_{p 0} & β_{p 1} & \dots & β_{p (L - 1)} \end{array}]}^{⊤}

are the parameters that we need to estimate. The baseline regularization problem is a regularized least square problem with a fused lasso penalty (controlled by λ₁) and a lasso penalty (controlled by λ₂).

The parameter b is a baseline parameter vector whose components represent the potentially different baseline FBG levels throughout time for different patients. Such time-varying and patient specific baselines are of great importance to provide flexibility to describe the intricate data generation process in reality. For example, diabetic patients tend to have higher FBG levels compared to a healthy person. Therefore, the fact that the baselines used are patient-specific helps to model such heterogeneity among different individuals in the data. Even for a particular patient, the FBG levels can also change dramatically over the years as the patient ages. Therefore, the time-varying nature of the baseline parameters also helps to capture the heterogeneity of the FBG levels over time. The baseline parameter b is regularized by a fused lasso penalty, without which b is flexible enough to explain any given FBG level observations. The intuition of using a fused lasso penalty is to minimize the difference between two adjacent baseline parameters. Since baseline parameters represent the FBG values that would have been observed if the patient were not under other influences, it is reasonable to assume that these baseline values are usually relatively stable over a certain period of time, and hence we encourage such stability via the use of fused lasso penalties.

The parameter β represents the effects of every drug on the value of the FBG level depending on the time difference between the drug prescription and the FBG measurement. A lasso penalty is used to encourage sparsity over the effect parameter β as we assume that only a small portion of drugs can have some effect on the value of an FBG measurement during a certain period of time.

The least square objective is hence to minimize the differences between the observed FBG values and the values given by the model that take into consideration both the time-varying patient-specific baseline parameters that change stably and the sparse effect parameters that describe effects of various drugs during various periods of time.

For the q^th drug, let ${{\hat{β}}_{q 0}, {\hat{β}}_{q 1}, {\hat{β}}_{q 2}, \dots, {\hat{β}}_{q (L - 1)}}$ be the set of effects learned from the baseline regularization model. We measure the overall effect of o_q on the FBG level as the average of the elements in the set: $o_{q} ≜ \frac{1}{L} \sum_{l = 0}^{L - 1} {\hat{β}}_{q l}$ .

Algorithm 1.

Baseline Regularization


Require: y, Z, D, λ₁, and λ₂.
Ensure: $\hat{b}$ and $\hat{β}$ .
1:	Initialize β⁽⁰⁾.
2:	u ←0.
3:	while true do
4:	${\overset{⌣}{y}}^{(u + 1)} \leftarrow y - Z β^{(u)}$ .
5:	$b^{(u + 1)} \leftarrow \arg {min}_{b} \frac{1}{2 M} {‖ {\overset{⌣}{y}}^{(u + 1)} - b ‖}_{2}^{2} + λ_{1} {‖ D b ‖}_{1}$ .	▹ b-step
6:	${\overset{⌣}{y}}^{(u + 1)} \leftarrow y - b^{(u + 1)}$ .	▹ β-step
7:	$β^{(u + 1)} \leftarrow \arg {min}_{β} \frac{1}{2 M} {‖ {\tilde{y}}^{(u + 1)} - Z β ‖}_{2}^{2} + λ_{2} {‖ β ‖}_{1}$ .
8:	if Stopping criteria met then
9:	$\hat{b} \leftarrow b^{(u + 1)}$ and $\hat{β} \leftarrow β^{(u + 1)}$ .
10:	return $\hat{b}$ and $\hat{β}$ .
11:	else
12:	u ← u + 1.
13:	end if
14:	end while

Open in a new tab

3.3 Optimization for Baseline Regularization

The baseline regularization problem in (1) is a convex optimization problem. Furthermore, b and β are separable in the optimization problem. Therefore, we can perform a blockwise minimization procedure that alternates between the minimization of b and β to achieve optimality [25]. When b is fixed, the optimization problem with respect to β is a lasso linear regression problem [22]. When β is fixed, the optimization problem with respect to b is a blockwise fused lasso signal approximator problem [24]. Both problems can be solved efficiently. The blockwise minimization algorithm is summarized in Algorithm 1. To see the two subproblems, let

z_{iql} ≜ \sum_{j = 1}^{n_{i}} ϕ_{l} (τ_{i k} - t_{i j}) \cdot I (x_{i j} = q) .

Then (1) can be rewritten as:

\hat{b}, \hat{β} ≜ \arg min_{b, β} \frac{1}{2 M} {‖ y - b - Z β ‖}_{2}^{2} + λ_{1} {‖ D b ‖}_{1} + λ_{2} {‖ β ‖}_{1},

(2)

where

y ≜ {[\begin{array}{l} y_{11} & y_{12} & \dots & y_{1 m_{1}} & \dots & y_{N 1} & y_{N 2} & \dots & y_{N m_{N}} \end{array}]}^{⊤},

Z is an M × (p × L) data matrix whose i^th row is:

y ≜ {[\begin{array}{l} z_{i 10} & z_{i 11} & \dots & z_{i 1 (L - 1)} & \dots & z_{i p 0} & z_{i p 1} & \dots & z_{i p (L - 1)} \end{array}]}^{⊤},

and D is the blockwise first difference matrix:

D ≜ [\begin{matrix} D_{m_{1}} \\ D_{m_{2}} \\ ⋱ \\ D_{m_{N}} \end{matrix}],

with an (m − 1) × m first difference matrix defined as D₁ = 0 and:

D_{m} ≜ [\begin{array}{l} - 1 & 1 \\ - 1 & 1 \\ ⋱ \\ - 1 & 1 \end{array}] .

Therefore, from (2), when β is fixed, let $\overset{⌣}{y} ≜ y - Z β$ ; then the blockwise fused lasso signal approximator problem with respect to b is:

\arg min_{b} \frac{1}{2 M} {‖ \overset{⌣}{y} - b ‖}_{2}^{2} + λ_{1} {‖ D b ‖}_{1} .

On the other hand, from (2), when b is fixed, let $\tilde{y} ≜ y - b$ , then the lasso linear regression problem with respect to β is:

\arg min_{β} \frac{1}{2 M} {‖ \tilde{y} - Z β ‖}_{2}^{2} + λ_{2} {‖ β ‖}_{1} .

(3)

In Algorithm 1 the two most computationally-intensive steps are Step 5 and Step 7. The former involves solving a fused lasso signal approximator problem, whose solution can be computed exactly by the dynamic programming algorithm proposed in [11]. The latter involves solving a lasso linear regression problem, which is achieved by the cyclic coordinate descent algorithm with variable screening proposed in [9] and [23].

4 Results

To demonstrate the utility of baseline regularization, we run our algorithm on the Marshfield Clinic EHR to identify drugs that can be potentially used to control FBG level. We consider patients with at least one FBG measurement throughout their observations. This leads to a total number of 333,907 FBG measurements from 75,146 patients.

To ascertain influences from drug prescriptions, we choose S to be half a year and L = 5 for the dyadic influence function. We only consider drugs that have at least one drug prescription that is at most S amount of time prior to the occurrence of at least one FBG measurement, yielding a total number of 5147 different drugs for consideration. λ₁ and λ₂ are chosen such that roughly 200 drugs will be selected eventually by the model. This is because we do not know in advance whether the drugs returned by the algorithm could potentially control FBG level or not, and we need to examine the findings of the algorithm manually. Therefore, the regularization parameters need to be carefully chosen so that the number of drugs selected by the model can be feasibly examined. Table 1 reports the top thirty drugs ranked by their overall effects among the 180 drugs generated by the baseline regularization using λ₁ = 86 and λ₂ = 2.841977 × 10⁻⁴. For more information about choosing the regularization parameters, please see Section 5.

Table 1.

Top Thirty Drugs Selected by Baseline Regularization Associated with FBG Decrease

INDX	CODE	DRUG NAME	SCORE
1	4132	GLUCOPHAGE	−82.388
2	7470	PIOGLITAZONE HCL	−36.869
3	8437	ROSIGLITAZONE MALEATE	−29.046
4	5786	METFORMIN	−18.867
5	4184	GLYBURIDE	−16.664
6	6382	NEEDLES INSULIN DISPOSABLE	−15.233
7	5787	METFORMIN HCL	−9.910
8	4806	INSULIN GLARGINE HUM.REC.ANLOG	−8.523
9	4497	HUM INSULIN NPH/REG INSULIN HM	−7.336
10	160	ACTOS	−6.006
11	7768	PREMARIN	−4.879
12	4106	GLIMEPIRIDE	−4.028
13	6656	NPH HUMAN INSULIN ISOPHANE	−3.613
14	4971	ISOSORBIDE MONONITRATE	−3.229
15	4561	HYDROCORTISONE	−3.084
16	4107	GLIPIZIDE	−3.007
17	9379	THIAMINE HCL	−2.968
18	1573	CAPTOPRIL	−2.871
19	5368	LIPITOR	−2.819
20	9152	SYRING W-NDL DISP INSUL 0.5ML	−2.380
21	1988	CIPROFLOXACIN HCL	−2.367
22	3937	FOSINOPRIL SODIUM	−2.252
23	5390	LISINOPRIL	−2.004
24	9994	VERAPAMIL HCL	−1.965
25	1216	BLOOD SUGAR DIAGNOSTIC	−1.900
26	7760	PREGABALIN	−1.708
27	6803	ONDANSETRON HCL	−1.678
28	4970	ISOSORBIDE DINITRATE	−1.575
29	6540	NITROGLYCERIN	−1.496
30	5571	MAGNESIUM	−1.266

Open in a new tab

As shown in Table 1, the drugs in green are drugs that are prescribed to control blood sugar level. The drugs in white are not normally used to control blood sugar level. However, there might be some potentially interesting findings based on a literature review. For example, thiamine HCL is reported to reduce the adverse effect of hyperglycemia by inhibiting certain biological pathways [27], and deficiency of thiamine is observed in diabetic patients [17]. Ciprofloxacin HCL could lead to hypoglycemia, according to the medication guide from the Food and Drug Administration (FDA) [5]. Lisinopril is also associated with hypoglycemia, according to the drug label from the FDA [7]. Verapamil HCL is reported to decrease blood sugar level as well as to have some hope in preventing pancreatic β cell loss. Such a loss is considered a pathological characteristic for diabetes [18]. Cases of hypoglycemia associated with the use of pregabalin have been reported [1, 19]. Premarin, fosinopril sodium, and hydrocortisone are potential false positives for our method, since they have been linked to hyperglycemia [4]. Drugs with mixed evidence are also found. For example, according to [4], both Lipitor and captopril are linked to hyperglycemia. Studies that suggest otherwise are also seen in the literature [6, 10, 16].

The baseline regularization algorithm is implemented with R. The blockwise fused lasso signal approximator problem is solved using a subroutine in the R package glmgen [2]. The lasso linear regression problem is solved using the R package glmnet [8].

5 Notes

5.1 Splitting Patient Records

In (1), we try to control the differences between two adjacent baseline parameters via the use of the fused lasso penalty. Consider the pair b_ik and b_i₍_k₊₁₎ that indicates the baseline FBG levels corresponding to two adjacent physical measurements. Although the two measurements are adjacent to each other in time, the actual time difference between the two measurements could be large, i.e. τ_ik ≪ τ_i₍_k₊₁₎. In this case, it might not be reasonable any more to regularize the difference between the two baselines as the FBG level could go through substantial changes during such a long period of time. Therefore, we consider splitting the records from the same patient into various subsets within which the records are close to each other in time, and just regularize the differences between adjacent baselines within the same subset. It remains to determine how far apart two adjacent records should be for us to consider them belonging to distinct subsets. We take a data-driven approach to determine this threshold. In detail, we compute the time differences of all adjacent pairs of FBG measurements for all patients. We then use Tukey’s method of outlier identification [26] to determine the smallest outlier. The distribution of the differences is heavy-tailed, and most of the differences are small. Therefore, the smallest outlier is a relatively large time difference value, and we set this value as our threshold. After splitting the FBG records of a patient into various subsets, each subset of FBG records can be considered as data from an independent patient. Therefore, the previously established formulation of the baseline regularization model can be naturally extended to handle this situation by simply modifying D in (2) accordingly. The threshold value identified in our dataset is 4.1 years.

5.2 Model Selection

Since in CDR, we do not know a priori what drugs returned by the algorithm can actually decrease or increase FBG levels, we manually review the drug list to identify potential repurposing opportunities. Therefore, model selection for baseline regularization not only needs to identify a model that explains the data well but also needs to generate a drug list of moderate size so that subsequent reviewing efforts are feasible.

To determine an appropriate λ₁, we start from identifying the minimum $λ_{1}^{*}$ such that all the baseline parameters are fused to its average in the following fused lasso signal approximator problem, where we only use the baseline parameter b to model the FBG measurements y:

\arg min_{b} \frac{1}{2 M} {‖ y - b ‖}_{2}^{2} + λ_{1} {‖ D b ‖}_{1} .

Define T_m as an m × m upper triangular matrix whose upper part and the diagonal are all ones, and whose entries are otherwise zeros. Then according to [28],

λ_{1}^{*} = min_{i \in {1, 2. \dots, N}} {‖ T_{m_{i}} (y_{i} - {\bar{y}}_{i} 1_{m_{i}}) ‖}_{\infty},

(4)

where 1_m is an m × 1 vector of all ones, and ${\bar{y}}_{i}$ is the mean of all the FBG measurements from the i^th patient. Upon the determination of $λ_{1}^{*}$ in (4), we can choose $λ_{1} = γ λ_{1}^{*}$ , where γ ∈ (0, 1) can vary to generate different models. The results reported in Table 1 are given by $λ_{1} = 0.05 λ_{1}^{*}$ .

To determine an appropriate λ₂, we first solve for the pathwise solution to a continuous self-controlled case series (CSCCS) problem [13], which is a lasso linear regression problem assuming a fixed baseline parameter for each patient:

\arg min_{β} \frac{1}{2 M} {‖ y - U \bar{y} - (X - U \bar{Z}) β ‖}_{2}^{2} + λ_{2} {‖ β ‖}_{1},

where

U ≜ [\begin{matrix} 1_{m_{1}} \\ 1_{m_{2}} \\ ⋱ \\ 1_{m_{N}} \end{matrix}], \bar{y} ≜ {(U^{⊤} U)}^{- 1} U^{⊤} y, \bar{Z} ≜ {(U^{⊤} U)}^{- 1} U^{⊤} Z .

In our experiments, we are aiming at selecting about 200 drugs in the end. Therefore, from the solution path, we choose an λ₂ whose solution selects about 250 drugs and we use this λ₂ for the baseline regularization problem. The solution to the CSCCS problem can also be used to initialize β⁽⁰⁾ in baseline regularization in Algorithm 1. Given the same λ₂, we notice that the baseline regularization problem usually will select fewer drugs compared to the corresponding CSCCS problem. Intuitively, this is because the introduction of time-varying and patient-specific baseline parameters in the baseline regularization problem help to explain the changes in the FBG measurements better. Therefore, fewer drugs are needed in order to explain the changes of FBG levels in the dataset, yielding a sparser drug effect parameterization.

When multiple configurations of λ₁’s and λ₂’s are provided, we can use Akaike information criterion (AIC) or Bayesian information criterion (BIC) for model selection. The degree of freedom of the baseline regularization model needed in the calculation is the summation of the degree of freedom of the baseline parameter b and the degree of freedom of the drug effect parameter β. The former is the total number of piecewise constant segments of b and the latter is the number of nonzero entries of β.

Since the dimension of the parameterization in baseline regularization is larger than the sample size of the data, caution needs to be paid when we choose regularization parameters. Essentially, we would like to choose large λ₁ and λ₂ to impose strong regularization to avoid overfitting. The degree of freedom of the learned model also needs to be monitored and controlled so that it is smaller than the sample size of the data.

5.3 Stopping Criteria

Since the baseline regularization problem is a convex optimization problem, we can verify the convergence of the optimization procedure in Algorithm 1 by checking the violation of the Karush–Kuhn–Tucker (KKT) conditions of the current iterate. Since when β^(u) is given, the update to b^(u+1) can be carried out exactly by Step 4 and Step 5 of Algorithm 1, we are interested in knowing the violation due to b^(u+1) and β^(u) via the KKT conditions of (3):

s^{(u)} = \frac{1}{n λ_{2}} Z^{⊤} (y - b^{(u + 1)} - Z β^{(u)}),

where s^(u) is the subgradient of ║β║₁. If b^(u+1) and β^(u) are optimal, then

{\hat{s}}_{d} {\begin{cases} = 1, & β_{d}^{(u)} > 0 \\ = - 1, & β_{d}^{(u)} < 0 \\ \in [- 1, 1], & β_{d}^{(u)} = 0 \end{cases},

(5)

where ${\hat{s}}_{d}$ and $β_{d}^{(u)}$ are the d^th component of $\hat{s}$ and β^(u), respectively. By measuring how much s^(u) violates the specification of $\hat{s}$ in (5) via ║v^(u)║₂, where the d^th component of v^(u) is

v_{d}^{(u)} {\begin{cases} s_{d}^{(u)} - 1, & β_{d}^{(u)} > 0 \\ s_{d}^{(u)} + 1, & β_{d}^{(u)} < 0 \\ max {0, | s_{d}^{u} | - 1}, & β_{d}^{(u)} = 0 \end{cases},

we know about how far away the current solution is to optimality. Such a measurement can be used as a stopping criterion. In our experiment, we set ║v^(u)║₂ ≤ 0.01 as our stopping criterion.

6 Conclusion

We have presented an algorithm to predict the effects of drugs on numeric physical measurements in the EHR such as fasting blood glucose. Drugs with a strong effect to decrease the measurement are potential repurposing targets. Our method inherits from self-controlled case series [13] the ability to take into account inter-patient variation. By addition of a time-varying baseline it can also address intra-patient variation over time. And by use of dyadic influence functions it can avoid the need to decide drug eras and can model different effect times for different drugs.

Acknowledgments

The authors would like to gratefully acknowledge the NIH BD2K Initiative grant U54 AI117924, the NIGMS grant 2RO1 GM097618, NIH CTSA at UW-Madison 1UL1TR002373, NSF grant CCF-1418976, and ARO grant W911NF-17-1-0357.

Contributor Information

Zhaobin Kuang, The University of Wisconsin, Madison.

Yujia Bao, The Massachusetts Institute of Technology.

James Thomson, The Morgridge Institute for Research.

Michael Caldwell, The Marshfield Clinic.

Peggy Peissig, The Marshfield Clinic.

Ron Stewart, The Morgridge Institute for Research.

Rebecca Willett, The University of Wisconsin, Madison.

David Page, The University of Wisconsin, Madison.

References

1.ABE M, NAKAMURA S, HIGA T, OKUBO J, KAKINOHANA M. Frequent hypoglycemia after prescription of pregabalin in a patient with painful diabetic neuropathy. Journal of Japan Society of Pain Clinicians, advpub. 2015 doi: 10.11321/jjspc.14-0035. [DOI] [Google Scholar]
2.Arnold T, Sadhanala V, Tibshirani RJ. Glmgen: Fast generalized lasso solver. 2014 [Google Scholar]
3.Bao Y, Kuang Z, Peissig P, Page D, Willett R. Hawkes process modeling of adverse drug reactions with longitudinal observational data. Machine Learning for Healthcare Conference. 2017:177–190. [Google Scholar]
4.DiabetesInControl. Drugs that can affect blood glucose levels. 2015 http://www.diabetesincontrol.com/wp-content/uploads/2010/07/www.diabetesincontrol.com_images_tools_druglistaffectingbloodglucose.pdf. Visited on 03/12/2018.
5.FDA. Cipro medication guide. https://www.fda.gov/downloads/Drugs/DrugSafety/UCM088572.pdf,. (Visited on 03/12/2018)
6.FDA. Lipitor (atorvastatin calcium) tablets. https://www.accessdata.fda.gov/drugsatfda_docs/label/2009/020702s057lbl.pdf,. (Visited on 03/12/2018)
7.FDA. Zestril (lisinopril) label. https://www.accessdata.fda.gov/drugsatfda_docs/label/2009/019777s054lbl.pdf,. (Visited on 03/12/2018)
8.Friedman J, Hastie T, Tibshirani R. Glmnet: Lasso and elastic-net regularized generalized linear models. R package version. 2009;1(4) [Google Scholar]
9.Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. Journal of statistical software. 2010;33(1):1. [PMC free article] [PubMed] [Google Scholar]
10.Girardin E, Raccah D. Interaction between converting enzyme inhibitors and hypoglycemic sulfonamides or insulin. Presse medicale (Paris, France: 1983) 1998;27(37):1914–1923. [PubMed] [Google Scholar]
11.Johnson NA. A dynamic programming algorithm for the fused lasso and l 0-segmentation. Journal of Computational and Graphical Statistics. 2013;22(2):246–260. [Google Scholar]
12.Kuang Z, Thomson J, Caldwell M, Peissig P, Stewart R, Page D. Baseline regularization for computational drug repositioning with longitudinal observational data. IJCAI: proceedings of the conference. 2016;2016:2521. NIH Public Access. [PMC free article] [PubMed] [Google Scholar]
13.Kuang Z, Thomson J, Caldwell M, Peissig P, Stewart R, Page D. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM; 2016. Computational drug repositioning using continuous self-controlled case series; pp. 491–500. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Kuang Z, Peissig P, Santos Costa V, Maclin R, Page D. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM; 2017. Pharmacovigilance via baseline regularization with large-scale longitudinal observational data; pp. 1537–1546. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Nadkarni PM. Drug safety surveillance using de-identified emr and claims data: issues and challenges. Journal of the American Medical Informatics Association: JAMIA. 2010;17(6):671. doi: 10.1136/jamia.2010.008607. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Neerati P, Gade J. Influence of atorvastatin on the pharmacokinetics and pharmacodynamics of glyburide in normal and diabetic rats. European Journal of Pharmaceutical Sciences. 2011;42(3):285–289. doi: 10.1016/j.ejps.2010.12.006. [DOI] [PubMed] [Google Scholar]
17.Page G, Laight D, Cummings M. Thiamine deficiency in diabetes mellitus and the impact of thiamine replacement on glucose metabolism and vascular disease. International journal of clinical practice. 2011;65(6):684–690. doi: 10.1111/j.1742-1241.2011.02680.x. [DOI] [PubMed] [Google Scholar]
18.Poudel RR, Kafle NK. Verapamil in diabetes. Indian journal of endocrinology and metabolism. 2017;21(5):788. doi: 10.4103/ijem.IJEM_190_17. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Raman P. Hypoglycemia induced by pregabalin. Journal of The Association of Physicians of India. 2016;64 [PubMed] [Google Scholar]
20.Ryan P. Establishing a drug era persistence window for active surveillance. Foundation for the national institutes of health, 2010. 2015 [Google Scholar]
21.Simpson SE, Madigan D, Zorych I, Schuemie MJ, Ryan PB, Suchard MA. Multiple self-controlled case series for large-scale longitudinal observational databases. Biometrics. 2013;69(4–8):93–902. doi: 10.1111/biom.12078. [DOI] [PubMed] [Google Scholar]
22.Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society Series B (Methodological) 1996:267–288. [Google Scholar]
23.Tibshirani R, Bien J, Friedman J, Hastie T, Simon N, Taylor J, Tibshirani RJ. Strong rules for discarding predictors in lasso-type problems. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2012;74(2):245–266. doi: 10.1111/j.1467-9868.2011.01004.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Tibshirani RJ, Taylor J. The solution path of the generalized lasso. The Annals of Statistics. 2011:1335–1371. [Google Scholar]
25.Tseng P. Convergence of a block coordinate descent method for nondifferentiable minimization. Journal of optimization theory and applications. 2001;109(3):475–494. [Google Scholar]
26.Tukey JW. Exploratory data analysis. Vol. 2. Reading, Mass: 1977. [Google Scholar]
27.vinh quoc Luong K, Nguyen LTH. The impact of thiamine treatment in the diabetes mellitus. Journal of clinical medicine research. 2012;4(3):153. doi: 10.4021/jocmr890w. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Wang J, Fan W, Ye J. Fused lasso screening rules via the monotonicity of subdifferentials. IEEE transactions on pattern analysis and machine intelligence. 2015;37(9):1806–1820. doi: 10.1109/TPAMI.2014.2388203. [DOI] [PubMed] [Google Scholar]

[R1] 1.ABE M, NAKAMURA S, HIGA T, OKUBO J, KAKINOHANA M. Frequent hypoglycemia after prescription of pregabalin in a patient with painful diabetic neuropathy. Journal of Japan Society of Pain Clinicians, advpub. 2015 doi: 10.11321/jjspc.14-0035. [DOI] [Google Scholar]

[R2] 2.Arnold T, Sadhanala V, Tibshirani RJ. Glmgen: Fast generalized lasso solver. 2014 [Google Scholar]

[R3] 3.Bao Y, Kuang Z, Peissig P, Page D, Willett R. Hawkes process modeling of adverse drug reactions with longitudinal observational data. Machine Learning for Healthcare Conference. 2017:177–190. [Google Scholar]

[R4] 4.DiabetesInControl. Drugs that can affect blood glucose levels. 2015 http://www.diabetesincontrol.com/wp-content/uploads/2010/07/www.diabetesincontrol.com_images_tools_druglistaffectingbloodglucose.pdf. Visited on 03/12/2018.

[R5] 5.FDA. Cipro medication guide. https://www.fda.gov/downloads/Drugs/DrugSafety/UCM088572.pdf,. (Visited on 03/12/2018)

[R6] 6.FDA. Lipitor (atorvastatin calcium) tablets. https://www.accessdata.fda.gov/drugsatfda_docs/label/2009/020702s057lbl.pdf,. (Visited on 03/12/2018)

[R7] 7.FDA. Zestril (lisinopril) label. https://www.accessdata.fda.gov/drugsatfda_docs/label/2009/019777s054lbl.pdf,. (Visited on 03/12/2018)

[R8] 8.Friedman J, Hastie T, Tibshirani R. Glmnet: Lasso and elastic-net regularized generalized linear models. R package version. 2009;1(4) [Google Scholar]

[R9] 9.Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. Journal of statistical software. 2010;33(1):1. [PMC free article] [PubMed] [Google Scholar]

[R10] 10.Girardin E, Raccah D. Interaction between converting enzyme inhibitors and hypoglycemic sulfonamides or insulin. Presse medicale (Paris, France: 1983) 1998;27(37):1914–1923. [PubMed] [Google Scholar]

[R11] 11.Johnson NA. A dynamic programming algorithm for the fused lasso and l 0-segmentation. Journal of Computational and Graphical Statistics. 2013;22(2):246–260. [Google Scholar]

[R12] 12.Kuang Z, Thomson J, Caldwell M, Peissig P, Stewart R, Page D. Baseline regularization for computational drug repositioning with longitudinal observational data. IJCAI: proceedings of the conference. 2016;2016:2521. NIH Public Access. [PMC free article] [PubMed] [Google Scholar]

[R13] 13.Kuang Z, Thomson J, Caldwell M, Peissig P, Stewart R, Page D. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM; 2016. Computational drug repositioning using continuous self-controlled case series; pp. 491–500. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] 14.Kuang Z, Peissig P, Santos Costa V, Maclin R, Page D. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM; 2017. Pharmacovigilance via baseline regularization with large-scale longitudinal observational data; pp. 1537–1546. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] 15.Nadkarni PM. Drug safety surveillance using de-identified emr and claims data: issues and challenges. Journal of the American Medical Informatics Association: JAMIA. 2010;17(6):671. doi: 10.1136/jamia.2010.008607. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] 16.Neerati P, Gade J. Influence of atorvastatin on the pharmacokinetics and pharmacodynamics of glyburide in normal and diabetic rats. European Journal of Pharmaceutical Sciences. 2011;42(3):285–289. doi: 10.1016/j.ejps.2010.12.006. [DOI] [PubMed] [Google Scholar]

[R17] 17.Page G, Laight D, Cummings M. Thiamine deficiency in diabetes mellitus and the impact of thiamine replacement on glucose metabolism and vascular disease. International journal of clinical practice. 2011;65(6):684–690. doi: 10.1111/j.1742-1241.2011.02680.x. [DOI] [PubMed] [Google Scholar]

[R18] 18.Poudel RR, Kafle NK. Verapamil in diabetes. Indian journal of endocrinology and metabolism. 2017;21(5):788. doi: 10.4103/ijem.IJEM_190_17. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] 19.Raman P. Hypoglycemia induced by pregabalin. Journal of The Association of Physicians of India. 2016;64 [PubMed] [Google Scholar]

[R20] 20.Ryan P. Establishing a drug era persistence window for active surveillance. Foundation for the national institutes of health, 2010. 2015 [Google Scholar]

[R21] 21.Simpson SE, Madigan D, Zorych I, Schuemie MJ, Ryan PB, Suchard MA. Multiple self-controlled case series for large-scale longitudinal observational databases. Biometrics. 2013;69(4–8):93–902. doi: 10.1111/biom.12078. [DOI] [PubMed] [Google Scholar]

[R22] 22.Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society Series B (Methodological) 1996:267–288. [Google Scholar]

[R23] 23.Tibshirani R, Bien J, Friedman J, Hastie T, Simon N, Taylor J, Tibshirani RJ. Strong rules for discarding predictors in lasso-type problems. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2012;74(2):245–266. doi: 10.1111/j.1467-9868.2011.01004.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] 24.Tibshirani RJ, Taylor J. The solution path of the generalized lasso. The Annals of Statistics. 2011:1335–1371. [Google Scholar]

[R25] 25.Tseng P. Convergence of a block coordinate descent method for nondifferentiable minimization. Journal of optimization theory and applications. 2001;109(3):475–494. [Google Scholar]

[R26] 26.Tukey JW. Exploratory data analysis. Vol. 2. Reading, Mass: 1977. [Google Scholar]

[R27] 27.vinh quoc Luong K, Nguyen LTH. The impact of thiamine treatment in the diabetes mellitus. Journal of clinical medicine research. 2012;4(3):153. doi: 10.4021/jocmr890w. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Wang J, Fan W, Ye J. Fused lasso screening rules via the monotonicity of subdifferentials. IEEE transactions on pattern analysis and machine intelligence. 2015;37(9):1806–1820. doi: 10.1109/TPAMI.2014.2388203. [DOI] [PubMed] [Google Scholar]

PERMALINK

A Machine-Learning Based Drug Repurposing Approach Using Baseline Regularization

Zhaobin Kuang

Yujia Bao

James Thomson

Michael Caldwell

Peggy Peissig

Ron Stewart

Rebecca Willett

David Page

Abstract

1 Introduction

2 Materials

Figure 1.

2.1 Notation

3 Methods

3.1 Dyadic Influence

Figure 2.

3.2 Baseline Regularization

Algorithm 1.

3.3 Optimization for Baseline Regularization

4 Results

Table 1.

5 Notes

5.1 Splitting Patient Records

5.2 Model Selection

5.3 Stopping Criteria

6 Conclusion

Acknowledgments

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

A Machine-Learning Based Drug Repurposing Approach Using Baseline Regularization

Zhaobin Kuang

Yujia Bao

James Thomson

Michael Caldwell

Peggy Peissig

Ron Stewart

Rebecca Willett

David Page

Abstract

1 Introduction

2 Materials

Figure 1.

2.1 Notation

3 Methods

3.1 Dyadic Influence

Figure 2.

3.2 Baseline Regularization

Algorithm 1.

3.3 Optimization for Baseline Regularization

4 Results

Table 1.

5 Notes

5.1 Splitting Patient Records

5.2 Model Selection

5.3 Stopping Criteria

6 Conclusion

Acknowledgments

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases