Abstract
Objective:
Gastric functional and motility disorders are highly prevalent, with gastroparesis (GP) and functional dyspepsia (FD), affecting 1.5-3% and 10% of the population, respectively. Multiple disease etiologies with overlapping symptoms, such as antral hypomotility, pylorospasm, autonomic dysfunction, and gastric myoelectric dysfunction underlie GP and FD. There is an unmet need to differentiate these etiologies non-invasively to tailor treatment strategies and predict treatment response.
Methods:
We performed cutaneous high-resolution electrogastrogram (HR-EGG) recordings on 32 human subjects (controls, GP, and FD) and computed gastric slow wave propagation patterns. We implemented robust regression and clustering methods to identify one group of patients with symptoms well explained by spatial slow wave features and another with symptom severity significantly exceeding predictions from spatial slow wave features. Five patients were re-assessed with validated symptom questionnaires after pyloric and prokinetic interventions.
Results:
A group of seven patients was identified whose spatial slow wave features lie within the same range as control subjects but whose symptom severity significantly exceeded what is predicted from spatial slow wave features. We hypothesize that gastric myoelectric dysfunction is not a prominent disease etiology in this group. A highly accurate regression holds in the other group of patients (r=0.8). Of the patients with repeat questionnaires, patients with symptom severity exceeding the regression line reported symptom improvement, whereas patients with symptoms in close proximity to the regression line experienced no improvement.
Conclusion:
These findings suggest that patients with symptom severity significantly exceeding the robust regression line have symptoms that cannot be explained by gastric myoelectric dysfunction alone, and vice versa.
Significance:
This methodology may provide clinicians with an opportunity to screen patients to determine when existing interventions will be effective, and on the flipside, when slow wave restoration interventions, such as gastric neuromodulation, may be most effective in improving symptoms and quality of life.
I. Introduction
A. Background and Unmet Need
Gastrointestinal (GI) problems are the second most common reason for someone to miss school or work, after the common cold [1]. Two of the most prevalent GI functional and motility disorders are functional dyspepsia (FD) and gastroparesis (GP), which affect 10% and 1.5-3% of the U.S. population, respectively [2], [3]. Multiple disease etiologies underlie the umbrella diagnoses of GP and FD, including but not limited to i) antral hypomotility, ii) pylorospasms, iii) gastric myoelectric dysfunction [4], and iv) autonomic dysfunction. Tailored treatments for such causes include i) prokinetic medications, ii) botulinum toxin injection of the pylorus [5], gastric neuromodulation [6], [7], [8], [9], [10], [11], [12], [13] and iv) vagus nerve stimulation [14], respectively. The common occurrence of treatment regimens tailored to the wrong etiology or to symptom mitigation results in repetitive visits to the clinic and adverse effects on quality of life with vast economic, social, and healthcare consequences. There is a desire and unmet need to sub-type GP and FD in terms of underlying etiologies to improve prediction of treatment response and enable targeted interventions [15], [16]. GP and FD are analyzed together as one group in this work, since subtyping is geared toward distinguishing between underlying etiologies (i.e. pylorospasms vs gastric myoelectric dysfunction), as opposed to subtyping between GP and FD (which is increasingly being questioned by key opinion leaders due to the indistinguishable nature of the two diagnoses [17]).
B. Previous Work
Current diagnostic tools are typically not etiology-specific. For instance, the PAGI-SYM questionnaire on foregut symptoms, which includes the gastroparesis cardinal symptom index, is used to manage treatments but shows a high degree of symptom overlap with underlying disease etiologies [4]. Alternatively, the gastric emptying test, a clinical gold standard that entails post-fasting ingestion of a meal containing a radioactive tracer and measuring the amount of tracer that has left the stomach at fixed intervals, shows inconsistent correlation with symptom severity [18], [19]. Further, some medications improve symptoms but not gastric emptying, and vice versa [20], [21], [22]. As such, there is an unmet need to develop widely-deployable screening tools that correlate with symptom severity and sub-type GP/FD in terms of disease etiology.
Gastric neuromuscular activity involves the interplay between electrical activity on the stomach’s surface and smooth muscle contractions that propel food from the esophagus to the intestines. Gastric pacemaker cells, or interstitial cells of Cajal (ICCs), receive modulatory inputs from enteric neurons and undergo spontaneous cyclical depolarization, similar to cardiac pacemaker cells in the heart. These oscillatory membrane potentials excite neighboring smooth muscle cells, causing contractions at regular intervals, and excite adjacent downstream ICCs in their network [23], [24]. This results in a traveling electrical wave, called the gastric slow wave [25], [26].
In healthy individuals, slow waves oscillate at 3 cycles per minute (0.05Hz), originate in the pacemaker region of the stomach, and propagate along the outer surface of the stomach in an anterograde direction toward the duodenum [26], [27]. Abnormalities in slow wave initiation and propagation patterns arise from loss or deterioration of ICCs [28], [29]. These pathological slow waves can spontaneously arise from locations other than the pacemaker region (i.e. the antrum) and/or propagate in aberrant directions (i.e. in a retrograde direction towards the esophagus or a bifurcated combination of anterograde/retrograde propagation). These abnormal propagation patterns disrupt the spatial coordination of smooth muscle contractions and have been observed in GP/FD patients [30].
High-resolution spatiotemporal mapping of the gastric slow wave has been performed with electrodes placed directly on the surface of the stomach during open-abdominal surgery. These findings indicated normal slow wave behavior in control subjects (n=12) [31] and abnormalities in slow wave initiation and propagation in > 80% of GP subjects (n=12) [30]. Whereas these studies demonstrated spatial slow wave abnormalities as possible explanations for foregut disorders, the involvement of surgical procedures limits the technique’s scalability.
A non-invasive alternative is the electrogastrogram (EGG), which involves electrodes placed on the surface of the abdomen (as opposed to the stomach) and measures electrical activity that reaches the abdominal surface via volume conduction. Conventional EGG is comprised of 3-4 electrodes and is analyzed via spectral methods (i.e. power-frequency) [26], [32]. However, it has been shown that i) spectral features do not reliably correlate with symptoms or disease [33], [34] and ii) spatial slow wave abnormalities (which do have high correlation with disease and symptoms) are undetectable by spectral analyses [30], [35], [29], [36].
The high-resolution EGG (HR-EGG) has recently been established to overcome the pitfalls of traditional EGG. This approach is comprised of a dense cutaneous electrode array and subsequent signal processing methods for artifact rejection [37] and estimation of spatial slow wave directions [38]. It has been shown, both in silico and in human subjects, that estimated wave directions orient in an anterograde direction towards the duodenum in healthy controls [38], [39].
HR-EGG was also recently used in human subjects with GP (n=18) and FD (n=7) to establish a correlation between incidence of non-anterograde slow wave propagation and symptom severity [35]. Specifically, a linear correlation (r=0.57) was established between the percentage of non-anterograde slow wave directions and the mean Gastroparesis Cardinal Symptom Index (GCSI) symptom score, a standardized foregut symptom severity score that takes into account nausea, vomiting, early satiety, bloating, and abdominal distention [40]. A more recent finding showcased that gastric slow wave spatial patterns derived from HR-EGG distinguish children with symptomatic chronic nausea from healthy subjects [41].
Although these findings suggest that gastric slow wave features determined from cutaneous multi-electrode recordings may provide an explanation for GP and FD symptoms, some key limitations arise. Since multiple disease etiologies underlie GP and FD, a ‘one size fits all’ regression paradigm is likely inadequate to explain symptom severity solely from spatial slow wave features. For instance, in patients with high symptom severity arising from etiologies other than gastric myoelectric dysfunction, spatial slow wave features would likely resemble those of healthy controls although their symptom severity score would exceed that of control subjects [35]. We hypothesize that if a patient’s symptom severity score exceeds what would be ‘expected’ from slow wave patterns, then the underlying cause of symptoms is other than gastric myoelectric dysfunction. In line with this reasoning, if we can isolate and exclude groups of patients for which symptoms exceed regression predictions, one can reasonably hypothesize that a higher-accuracy correlation can be drawn between abnormal wave propagation and symptoms in the remaining subjects.
C. Our Proposed Approach
The aforementioned drawbacks call for the implementation of a “robust regression” method which involves a mathematically rigorous approach to determine an inclusion criterion of data to be used in regression, and an exclusion criterion of data to be considered outliers. Robust regression has been widely used in the current literature to fit highly-accurate regressions in the presence of outliers by assuming the probability distribution of residuals has multiple modes [42], [43], [44], [45]. Optimization of model-fitting parameters typically involves maximizing a multi-modal likelihood function over the residual errors, with at least one mode near zero [42], [43], [44]. This method optimizes regression fit over the data assumed to represent ‘signal’ rather than ‘noise.’
We use robust regression in our work presented here in order to identify and exclude an “outlier” class of patients for which we hypothesize that their underlying GP or FD etiology is not gastric myoelectric dysfunction. This separation of patients whose symptoms are adequately explained by slow wave features, and those whose symptoms are not, may have the potential to predict treatment response to, for instance, gastric neuromodulation (a more invasive intervention) or medications (a more conservative approach). Prediction of treatment response has the potential to rapidly guide effective treatments, as well as reduce the risk of performing unnecessary procedures.
Another potential improvement to the preliminary correlation study in [35] involves estimating an entire probability distribution of wave directions for each recording. This provides more information, pertaining to modality and overall distributional structure, that may lead to improved characterizations of recordings. Furthermore, utilizing the full probability distributions enables the use of an optimal transport framework to build metrics between distributions. ‘Optimal transport’ refers to the process of minimizing the cost of transporting all of the mass in one probability distribution to another, and has powered state-of-the-art studies in broad applications of mathematics, data science, and economics [46], [47], [48], [49]. In this study, we make use of the Wasserstein metric between probability distributions to develop unique spatial slow wave features to be used downstream for robust regression.
We calculate Wasserstein barycenters [50] between probability distributions of control subjects, which serves as a ‘model’ of normalcy. We then compute the optimal transport cost between each subject’s probability distribution and the controls’ barycenter to quantify the deviation of subject’s probability distribution from ‘normalcy’. In addition, we compute distances between each subject’s probability distribution and a uniform distribution over directions as a secondary feature to quantify the deviation from chaotic or uncoordinated propagation. As previously mentioned, abnormal slow wave propagation can take on several forms. One such form is an abnormal initiation near the antrum with a large degree of coordinated retrograde propagation, in addition to some anterograde propagation [30]. Probability distributions of wave directions in these types of cases would differ in shape and modality from recordings of healthy controls, which are typically dominated by anterograde propagation. In addition, this physiology is distinct from chaotic or uncoordinated propagation, which would result in wave directions having probabilistic signatures more akin to a uniform distribution. As such, these features derived from a sophisticated optimal transport framework follow insights from slow wave physiology.
Lastly, previous HR-EGG methods [38], [35] required computed tomography (CT) scans for both accurate placement of electrodes over the stomach, as well as measurements related to anatomical landmarks in order to define ‘normal’ propagation. The need for a CT presents a current bottleneck in these existing HR-EGG methods because CT is costly for patients and requires otherwise unnecessary exposure to radiation. We have addressed this in our present work by taking into account the features and modalities of full probability distributions, which results in a method that is robust to an estimation of these anatomical measurements.
D. Key Findings
In summary, our contributions in this manuscript involve building a full probability distribution from measured HR-EGG wave directions, developing an optimal transport framework for features pertaining to distances between probability distributions, and performing robust regression to a) isolate a group of patients in which we hypothesize that gastric myoelectric dysfunction is not a prominent etiology in their disease and b) form a more accurate correlation between symptoms and features of wave directions in the remaining group of subjects. Our methods are robust to a barycenter-type estimate of the angle that defines ‘normal’ propagation and, if used in conjunction with recent methods in stomach identification with portable ultrasound [51], would eliminate the need for a CT scan. Overall, we propose a method to screen patients for which aggressive intervention, such as gastric neuromodulation, would be most effective in promoting good clinical outcomes.
II. Methods
A. Data Acquisition and Signal Processing
We enrolled healthy control subjects with no prior GI disease (n=7), GP patients (n=18), and FD patients (n=7), for a total of 32 subjects. We performed a PAGI-SYM questionnaire on all subjects, with repeat PAGI-SYM questionnaires on 5 patients (4 GP, 1 FD) after non-electrical treatments (pyloric or prokinetic interventions). Since both treatments do not directly target gastric myoelectric dysfunction, the distinction between pyloric and prokinetic interventions is not explored here. The PAGI-SYM questionnaire is a standardized, validated patient survey to rate symptoms such as nausea, bloating, early satiety, etc [52]. We performed 25-channel HR-EGG recordings on all subjects for 90 minutes (approximately 30 minutes preprandial and approx. 60 minutes postprandial). Each subject was asked to fast for 12 hours prior to ingestion of the meal at the 30 minute time point.
The ingestion of food activates excitatory neural responses that interface with the stomach to increase the magnitude of gastric smooth muscle contractions [53]. Given this fact, and since gastric contraction strength correlates with EGG power [54], [55], we solely use the postprandial portion of the recording the remainder of this section to operate in a regime of high signal-to-noise ratio (SNR) for our HR-EGG recordings.
HR-EGG recording and prepossessing methods were identical to those published here [35], which included filtering in the range of 0.05 Hz, artifact rejection, etc. For generalizability, we refer to the number of subjects as and the number of electrodes as throughout this manuscript. Ethical approval for this work was obtained from the institutional review board at the University of California, San Diego, USA, Protocol Number: 141069, Date of Approval: June 19, 2014.
B. Instantaneous Wave Direction at Every Sensor
We performed the Hilbert transform on mean-centered HR-EGG waveforms to extract the instantaneous wave phase, , at the channel and the subject .
| (1) |
In accordance with current HR-EGG analysis studies [35], [38], [39], we calculated the instantaneous wave direction, , at each channel by taking the inverse tangent of phase gradients.
| (2) |
In order to exclude segments of recordings dominated by noise, we defined a notion of a ‘sustained wave’ and compiled subsets of the recording in which the ‘sustained’ criterion was met, explained as follows. We defined the instantaneous phase gradient directionality (PGD) for the subject as:
| (3) |
where . The PGD has been used in neuroscience [56] as well as previous HR-EGG analyses [39], [38], [35] as a measure of cohesivity among wave directions. From Jensen’s inequality, it is upper bounded by 1 which occurs when a plane wave, which has the highest degree of direction agreement, is present. In order to reduce the false discovery of noise to below 1% with 25 channel arrays, we categorize a wave as satisfying the ‘sustained’ criterion if the PGD exceeds 0.5 for longer than 2 seconds [38]. As such, direction values calculated in (2) that indicate sporadic or otherwise uncoordinated wave propagation must have some inherent pattern and coordination in order to meet the sustained criterion, as is consistent with physiology (i.e. an ectopic event results in local coordination distinct from patterns elsewhere on the stomach [30]).
C. Direction Values Relative to the Duodenum
The aforementioned wave direction values are measured with respect to the cutaneous recording array. However, the stomach has a characteristic ‘J’ shape, with subject-to-subject anatomical variability, and further is not always perfectly in plane with the sensor array. Previous studies have addressed this by translating direction values by the subject-specific angle to the duodenum, derived from CT measurements [35], [38]. Thus, a translated direction value of zero radians would indicate wave propagation toward the duodenum (or anterograde propagation).
Instead of translating each subject’s direction values by their unique duodenum angle, we translated direction data by an average of all subjects’ duodenum angle. We then slightly adjusted the translation, such that the location of maximum incidence of the histogram of translated directions within a range of occurred at zero radians. There is precedent for this approach within the literature that gives a degree window of tolerance around the subject-specific angle to the duodenum in order to define ‘normal’ wave propagation [35]. Our adjustment to the translation operated on a more conservative window while still accounting for anatomical variability. Translating directions in this way eliminates the need for a CT going forward, since we can utilize measurements from CTs that have already been performed in calculation of averages, as well as incorporate data from large imaging databases to further improve the average if desired. Because the downstream analysis methods presented in this work estimate the entire probability distribution and exploit characteristics of the shape and modalities of the distribution, we obviate the need for high precision with the angle by which we translate direction values.
We collapsed sustained wave direction values across both spatial and temporal axes, giving rise to set , for the subject, as seen in (4). The set is shown for two human research subjects as a histogram in the circular domain in Fig. 2A.
Fig. 2.
A) Histograms of wave directions, . B) Estimated probability density function, from methods described in Section II-D, shown in polar axes. C) Estimated probability density function, from methods described in Section II-D, shown in Euclidean axes. Panels A-C are shown for a healthy control subject with normal slow wave physiology (top) and a symptomatic patient with indications of abnormal gastric slow wave function (bottom).
| (4) |
D. Probability Density Estimation with von Mises Mixtures
Since direction data in set lie on a circle, a natural family of probability density functions on which we can perform density estimation is the family of von Mises distributions, which are commonly referred to as the “Gaussians” of circular statistics [57], [58]. There is precedent established in the literature across multiple scientific fields for the von Mises mixture model approach used in this work [59], [60]. The von Mises probability density function is given by:
| (5) |
where is the mean of the distribution, is the inverse variance, and is the order Bessel function. See Fig. 1A for examples.
Fig. 1.
A) Probability density function of a unimodal von Mises distribution (5) . b) von Mises mixture of two distributions (6). Both unimodal and multimodal distributions shown in polar (left) and Euclidean (right) axes.
We performed maximum likelihood density estimation of von Mises mixtures (vMM), given by (6), in order to capture multiple modalities in , which could indicate multiple modes of wave propagation (i.e. bifurcated anterograde and retrograde propagation). An example of a von Mises mixture is shown in Figure 1B.
| (6) |
where , and
| (7) |
The log likelihood, given the set of vMM parameters, , and set is:
| (8) |
where is the length of , or . Optimal parameters, , are those which maximize (8).
| (9) |
E. Model Selection Procedure
Solving (9) with no restriction on the number of mixture distributions, , may cause the model to over-fit to variants arising from noise or human subject-to-subject variability, which results in a loss of generalizability. Adding a penalty to the number of distributions prevents over-fitting while still capturing multiple modalities in . Since all weights are non-negative and sum to 1, by virtue of (7), traditional penalized model fitting procedures are not applicable to this context. We implemented a pruning method to iteratively reduce the number of mixture distributions, , and subsequently perform dimensionality-based penalization.
We began the iterative pruning method by executing maximum likelihood optimization in (9), given the set and a large dictionary of pairs of von Mises density parameters, , to over-represent the problem space [61]. The parameter dictionary consisted of samples of within a range of , sampled in increments of . Similarly, was sampled within a range of [0, 15], in increments of 15/9. The complete dictionary consisted of a mesh grid of all possible combinations of and , with size . Since parameters and were specified by this dictionary, optimization in (9) solves for the mixture probabilities, , which maximize (8).
We then eliminated the set which was lowest-weighted, or
| (10) |
since contributed the least to the overall log likelihood. We repeated this pruning process until the dimensionality of the parameter dictionary, , was equal to 1. We computed the negative normalized log likelihood (NNLL) at each pruning iteration, given by:
| (11) |
where is the set of parameters that maximizes the log likelihood in (8) when the parameter dictionary is pruned to specific pairs. We implemented Bayesian Inference Criterion (BIC) [62] penalization where the penalized normalized negative log likelihood (PNNLL) is given below.
| (12) |
We obtained the optimal number of von Mises densities . which minimizes (12), and then obtained the corresponding set of optimal density parameters, to construct the density estimate.
F. Maximum Likelihood Estimation with the EM Algorithm
During each step of the pruning method, we used the Expectation Maximization (EM) for mixtures [63] to carry out the maximum likelihood estimation in (9).
The EM algorithm makes use of a latent variable, , for which encodes which distribution is drawn from in the von Mises mixture model. The complete log likelihood (conditioned on ) is given by:
| (13) |
where equals 1 if event occurs and 0 otherwise. For the sake of brevity we will use the notation , , to index specific parameters in the iteration of the pruning method in which there are mixture distributions. It is implied that refers to the specific , for instance. The posterior probabilities of latent variable, , conditioned on set are given by:
| (14) |
| (15) |
at the iteration of the EM algorithm. We define the expectation of the complete log likelihood given by (13) as:
| (16) |
where is the vector of parameters used to compute the posterior probability, , and vector houses each , , in (16). Maximizing this expectation yields parameter updates:
| (17) |
for where is a normalization constant to guarantee the sum of is equal to 1.
We performed 20 iterations of parameter updates.
G. Wasserstein Distances: Optimal Transport Theory
We used an approach based in Optimal Transport Theory, namely the Wasserstein distance, to define a metric between probability distributions. Such an approach is standard across multiple disciplines [46], [48], [49]. In this setting, to account for the circular nature of directional information, we utilize a Wasserstein distance with the base distance being a geodesic defined on the unit circle [64], [65]. Assuming that probability distributions and have densities and respectively, the Wasserstein squared distance between probability distributions and is defined as:
| (18) |
where is the set of all possible joint probability densities with marginals and , is the support of and is the support of . The function is a cost function pertaining to the squared distance between and .
In the work presented here, data are direction values, which lie on a circular manifold. As such, we computed a quadratic cost
| (19) |
operating on
| (20) |
which is the geodesic distance on the unit circle. We formulated the optimization in (18) as a linear program, discretized over the compact set with boundary points and subject to:
where the first two constraints hold because is a joint probability density, and the latter two hold because and thus it must preserve marginal densities and .
H. Wasserstein Barycenter of Probability Distributions from Control Subjects
We constructed a Wasserstein barycenter (denoted ) between healthy control subjects. This barycenter represents an ‘average’ distribution for the control group. Other aggregate methods, such as averaging of probability distributions, fail to capture modality and shapes of individual distributions (see Figure 3). This motivated our use of a Wasserstein barycenter approach, as distributional shape and modality are paramount to the downstream feature space in this work.
| (21) |
where is the number of control subjects (n=7) and refers to individual probability distributions of control subjects, solved for in Section II-D. For each subject (including controls) we computed Wasserstein squared distances, via (18), between i) each probability distribution, and the control barycenter, and ii) each probability distribution and a uniform distribution, denoted as , over the support . This first feature is a metric of how close each distribution is to an approximation of normalcy, whereas the second feature measures closeness to complete randomness, as a uniform distribution models the general distributional shape of aberrant wave propagation (see Fig. 4 for such an outlier). These features populate the regressor matrix, defined in the following section, and will be referred to throughout the remainder of this work as ‘slow wave features.’
Fig. 3.
Example highlighting the importance of using a Wasserstein barycenter to represent a model of healthy controls in Section II-H. A) Two unimodal von Mises probability distributions. B) The L-2 average of the two distributions is bi-modal, with one mode centered at each individual distribution’s mean. C) The Wasseerstein barycenter between the two distributions is unimodal and therefore better represents the structure of both of the original distributions. Panels A-C shown in uniform-radius polar (top) and Euclidean (bottom) axes.
Fig. 4.
Probability density function (found in Section II-D) of outlier within the control group, with uniform distribution along the support for reference. The outlier distribution lacks the high-energy mode near zero that is expected of healthy controls exhibiting primarily anterograde wave propagation (i.e. as seen in Fig. 2C), and instead exhibits overall structure similar to a uniform distribution. Shown in A) polar axis and B) Euclidean axis.
I. Robust Regression and EM Clustering
We identified an outlier in the control group. This outlier probability distribution deviated in modality from both a) the expected probability distribution of healthy controls with respect to physiology and b) the other control subjects (see Fig. 4). In fact, this probability distribution showed aberrant, chaotic slow wave propagation similar to distributions estimated from from GP and FD HR-EGG recordings. In this section, we have removed the outlier from calculation in the robust regression algorithm. It is plotted in the results figures but not used for computation of regression coefficients and residuals. As such, the indexing notation of human research subjects going forward is , where .
To carry out robust regression in this work, we defined a regressor matrix , of size , with the row given by:
| (22) |
We defined the response vector, as the Mean GCSI (Gastroparesis Cardinal Symptom Index) score for all subjects, which is a subset of the PAGI-SYM questionnaire [52], [40]. We did not study the effect of gastric myoelectric features on improvement of specific symptoms, as these changes may not have sufficient statistical power with the limited patient cohort size. This is a subject of future work and is fitting for a large cohort study. We modeled to linearly relate to such that:
| (23) |
where is the vector of residual errors. Since spatial gastric slow wave abnormalities are only one type of etiology of gastrointestinal foregut disorders, we hypothesize that this linear model will fit a subset of the data for which spatial slow wave abnormalities are a dominant cause of symptoms. For the rest of the subjects, however, we expect their symptom and HR-EGG features to be anomalous to the linear trend. For instance, this anomalous class would encompass patients who have normal slow waves, with probability distributions indistinguishable from healthy controls, but high symptom severity scores. Following this logic, we assume residual errors from the linear regression, , will follow a bi-modal distribution: with one mode near zero and another mode near some . This bi-modal distribution can be modeled as a Gaussian mixture model (GMM)
| (24) |
where and are non-negative weights that add to 1 and is the density of a Gaussian random variable with expectation and variance evaluated at :
| (25) |
Since we hypothesize that the residuals follow a bi-modal GMM, follows a bi-modal GMM, with one mode at and the other mode at . Because we hypothesize that the outlier class will be comprised of patients for which symptoms exceed what would be ‘expected’ from features of the slow wave, we aimed to identify outliers that lie above the regression line. As such, we defined . To find the optimal offset, , we created a dictionary of offset values, with entries (for ), on the interval (0, 1] in increments of . The EM algorithm was used to solve for the remainder of the density estimation parameters in (26). EM-model fitting was performed times, for each in the dictionary. The optimal offset, , was chosen to be the value of which maximized the log likelihood given in (26). We implemented this method with
We used methods similar to those presented in [44]. However, deviating from the approach in [44], where regression coefficients, , for each mode were distinct from one another, we solved for the regression coefficients only for the non-outlier class. This decision rests upon our hypothesis that the non-outlier class corresponds to patients where slow wave patterns are the dominant etiology for their symptoms, which implies that the outlier class pertains to patients where the dominant cause of symptoms is an etiology other than slow wave abnormalities (i.e. dysfunction of the pylorus, or autonomic dysfunction). Because there are multiple such etiologies, and given that our dataset contained subjects, we determined that this approach minimizes opportunities for overfitting or spurious calculations.
Similar to the von Mises mixture density estimation procedure discussed in Section II-D, optimization of the GMM problem can be solved with the EM algorithm. We initialized the weights as , since we can reasonably hypothesize that the majority of the data fits within Class 1. We initialized as the coefficients of the linear least squares regression of the entire dataset with symptoms. We initialized and as the variance of the residuals given by (23) with given by its initialized value.
We define a latent binary random vector for which if is drawn from a distribution and if is drawn from a distribution. As such, the joint distribution on for parameters is given by:
| (26) |
where equals 1 if event occurs and 0 otherwise.
For the E step, we define to be the posterior probability at the iteration that the sample is generated from the Gaussian component, given by [66]:
| (27) |
| (28) |
where
| (29) |
| (30) |
where . For the step, the standard EM parameter updates for the GMM problem at iteration , , are given as follows [66]. We first define as and then
| (31) |
| (32) |
| (33) |
We used (32) to find the regression coefficients by virtue of (29). We performed 100 iterations of the EM algorithm (for each ) and found the optimal offset, to be 0.8. We performed maximum likelihood classification [67] to assign each subject to either ‘Class 1’ or ‘Class 2,’ where Class 2 included data that lies far above the regression line. Subject was assigned to Class 2 if:
| (34) |
We performed clustering to isolate a subset of Class 2 with features that most closely resembled healthy controls. We refer to this subset of Class 2 as the ‘anomalous class.’ In total, this anomalous class involved subjects which a) lie far above the regression line (i.e. high symptom severity with respect to regression predictions from slow wave features) and b) had slow wave features that were indistinguishable from controls. We used two different clustering algorithms: i) DBSCAN [68], [69] with parameters: epsilon=0.2, minimum samples=2 and ii) K-Means [70] with K=3, to cluster the transformed features within Class 2. The DBSCAN clustering algorithm was not found to be sensitive to small perturbations in epsilon. As such, implementation of a more sophisticated algorithm, such as HDBSCAN [71], was not explored at this time but could be a source of future work to boost robustness. As shown in Fig 5, both algorithms isolated the same group of seven data points which met criteria i) and ii). In addition, these seven data are the only data whose linear combination of slow wave features fit within a range of the controls’ linear combination of slow wave features. Thus, we labeled these seven subjects as “anomalous” and hypothesize that their prominent disease etiology is not gastric myoelectric dysfunction. Lastly, we performed a final linear least squares regression on the complete dataset labeled as ‘non-anomalous’ (post anomaly-exclusion) to adjust the regression coefficients and compute r values.
Fig. 5.
DBSCAN (A) and K-Means (B) clustering of linear transformation of features. K-Means centers plotted in magenta (diamonds). Both clustering algorithms identify the same anomalous class (orange). The upper limit of transformed features within the control group (outlier excluded) is plotted as a dashed line to show that all anomalous class features fit within a boundary established by controls.
III. Results
Figure 6 shows the result of the robust regression and identification of the anomaly class. We show, in orange, the patients for which we hypothesize that spatial gastric slow wave abnormalities are not a prominent etiology of their disease. These patients exhibit slow wave patterns that closely resemble healthy controls, as is seen with feature overlap, but exhibit high symptom scores, which is not a characteristic of controls. We measured an r value of 0.8 from regression on only non-anomalous data.
Fig. 6.
Identification of anomalous class (orange) for which we hypothesize that the underlying GP/FD etiology was not spatial gastric slow wave abnormalities. Regression with non-anomalous data (blue). We show 95% confidence interval (shaded) and report r=0.8. Outlier in the control group is shown in red but was not used in regression.
Figure 7 shows the symptom prognosis of five patients after treatment. Two patients, shown in triangle markers, had no improvement, and three patients, shown in hexagon markers, did see improvements. Interventions included treatments of the pylorus and prokinetic medications. The three patients that experienced symptom improvement lie farther away from the regression line than the patients that did not improve. A natural hypothesis is that these treatments helped mitigate other etiologies responsible for the patients’ GP and FD. However, for patients that lie directly on the regression line, it is hypothesized that their symptoms are a direct cause of spatial gastric slow wave abnormalities and as such treating other etiologies may not result in improvement of symptoms.
Fig. 7.
Symptom progression after pyloric and prokinetic interventions (n=5). These interventions did not directly target spatial gastric slow wave abnormalities. Three patients reported improvement of symptoms (marked with a hexagon), and two patients saw no improvement (marked with a triangle). Patients who did not improve after treatment were in close proximity to the regrssion line, whereas patients who did improve were far above the regression line, with post-treatment scores lying closer to the regression line.
Figure 8 compares methods to previous work that pioneered linear regression of spatial gastric slow wave features with patient symptom scores [35]. These previous findings utilize raw direction values, where wave directions are calculated analogously to those described in (2), have an inclusion criteria given by (3) and are translated by the patient-specific angle to the duodenum, derived from the CT. ‘Normal’ wave directions are defined to be those which lie with in of the angle to the duodenum., and direction values are labeled as ‘abnormal’ otherwise. To create a comparison plot, we integrated the probability distributions found in Section II-D on the closed interval , and defined this as the percentage of ‘normal’ slow wave activity. Since the integral of a probability density function over its entire support is equal to one, this integration technique captures a percentage of ‘normal’ and ‘abnormal,’ and yields very similar results to [35, Fig. 4]. Fig. 8A shows the linear least squares (LLSQ) regression (blue line) with the shaded 95% confidence interval. Data closely followed trends from [35]. We report a r value of 0.56 and previous findings report r=0.57 [35, Fig. 4]. This firstly validates our approach of probability density estimation because it shows that probability distributions are able to capture signatures of the histogram well. Secondly, this provides a good basis to make comparisons between methods in [35] and the design of our new methods presented here.
Fig. 8.
Linear least squares (LLSQ) regression of percentage of slow waves propagating in an abnormal direction with mean GCSI symptom score (shown as blue line in panels A-D with 95% confidence interval shaded in lighter blue). Percentage of slow waves exhibiting abnormal propagation was found by integrating each probability distribution on the interval (same interval as in [35]). Panels A-D serve as a comparison to results published in [35]. Panels A-C are computed from a 90 minute recording, with approx. 30 minutes preprandial and approx. 60 minutes postprandial. (A) We calculated an r value of 0.56 if all 32 subjects were included (consistent with [35]) in the regression and an r value of 0.59 if control outlier was excluded. (B) Pink data show GP and FD subjects whose percentage of slow waves overlap with the range of controls (excluding the outlier). (C) Orange data show the seven subjects labeled as “anomalous” from the robust regression framework presented here. It is hypothesized that these subjects have GP/FD arising from an etiology distinct from gastric myoelectric dysfunction. (D) Percentage of slow waves propagating in an abnormal direction was calculated for only postprandial recording and subsequent regression was performed. Analogous to panel B, pink data show GP and FD subjects whose percentage of slow waves overlap with the range of controls (excluding the outlier).
Figure 8 B shows, in pink, the GP and FD subjects with percentages of abnormal slow waves that coincide within the range of healthy controls, excluding the control outlier. It is unlikely that a cluster of patients which have normally functioning slow waves but high symptom scores could be identified with these types of methods because the control group spans a large portion of the total data. For example, data marked as pink (within range of controls) are directly adjacent to data marked in purple (out of range of controls) and as such, there is no clear separation between a healthy and pathological percentage of abnormal slow waves.
Figure 8 C replicates the approach shown in Panel A, but we have marked the seven subjects that were flagged as ‘anomalous’ in the methods presented in this work. Some of these subjects lie either close to or directly on the regression line and thus do not appear anomalous at all in this previous approach. This shows that using the previous methodology, this group of patients for which it is hypothesized that spatial slow wave abnormalities are not a likely cause of disease would be missed.
In Fig. 8 D, we replicate panel B with only the postprandial period of the recordings. This was done to show that the paradigm shift within our methodology (i.e. looking at signatures of the full probability distribution and performing robust regression to account for different disease etiologies) was the key driver of our novel findings presented in this work. While we assert that performing HR-EGG studies during an immediate postprandial period is a best practice moving forward (with justification in II-A), isolating the postprandial portion of the recording alone was not the mechanism behind our results.
We show in Fig. 8, in red, the outlier within the control group that we identified after estimating probability distributions. Interestingly, this outlier does not appear to separate from the remainder of the control subjects in Fig. 8 panels A-D. We show the probability density function for this outlier subject in Fig. 4. This density lacks a cohesive single mode centered near zero and more closely resembles a probability distribution that would be expected from chaotic propagation. However, summing energy across the multiple modes within the range of , gives a false indication of normalcy since this total energy is closer to a range of the other controls. As such, investigating signatures and modalities of the entire probability distribution allowed us to recognize an outlier that would have otherwise gone undetected.
IV. Discussion
In this work, we were able to show a strong correlation (r=0.8) between features relating to spatial gastric slow wave abnormalities and standardized patient symptom scores, improving upon previous published findings’ r value by 0.23 [35]. We were also able to isolate a group of patients for which we hypothesize spatial gastric slow wave abnormalities are not likely a predominant etiology of their disease. We have shown results that allow us to hypothesize that treating a patient’s GP or FD with a treatment regimen that does not directly target gastric myoelectric dysfunction may only produce results in the case where other underlying etiologies are present, and patients who lie on the regression line (i.e. we hypothesize that their symptoms can be explained entirely based on spatial gastric slow wave abnormalities) will see limited success in such treatments. All patients in the anomalous group were diagnosed with GP. We hypothesize this was coincidental due to the larger population of GP patients in the cohort and further the 4:1 GP:FD ratio of patients that were evaluated after treatment. However, further investigation is warranted to explore this hypothesis, and its possible refutation could lead to findings of new distinction between GP and FD.
One limitation to this study is the size of the patient cohort evaluated after treatment. These preliminary results motivate the continuation of this work with a larger cohort of human subjects. In addition, this study was limited in that the evaluation of patients with treatment lacked a placebo group and double-blind control. However, it should be noted that the treatments given to the patients were not themselves experimental and not specific to diagnostic research [15], [16]. Another limitation of this work is the imbalance in the number of GP/FD patients, as well as control subjects. Though GP and FD subgroups were intentionally analyzed as one group, with justification provided earlier in this manuscript, this imbalance can nonetheless be mitigated in future work to strengthen the conclusions drawn.
Previous studies have used a patient CT to guide electrode array placement over the antrum of the stomach and determine the angle towards the duodenum [35], [38]. The advent of portable ultrasound devices and their usage to locate the antrum [72], [51] suggest placement of electrodes to cover the stomach need not require a CT. The methods presented here do not make use of the exact angle to the duodenum, from CT reconstructed geometry, but rather are robust to a barycenter of duodenum angles across several human subjects. This lays the groundwork for future studies to investigate the dependence of EGG-analysis methods on imaging. In addition, even if the electrode placement is less precise with the ultrasound localization method, we hypothesize, based on what has been shown here, that designing analysis methods to find patterns within the overall shape and modalities of probability distributions may provide robustness to slight deviations from optimal electrode placement, as these perturbations would not likely change distribution modalities. Since the HR-EGG device is non-invasive, eliminating the need for a CT will eliminate a significant barrier in bringing this technology to the clinic as a widely-deployable screening tool.
By virtue of using histograms and performing density estimation, we made the implicit assumption in this work that wave direction values can be treated as independent, identically distributed samples collapsed across temporal and spatial axes (see Section II-B). A subject of future work would be to solve for conditional probabilities in both the temporal and spatial axes using Markov modeling techniques.
It is well documented that the vulnerabilities of the EM algorithm arise from the non-convexity of the log likelihood surface and thus the presence of local minima/ maxima [73], [74]. Two of these vulnerabilities are: the need for appropriate parameter initialization as well as a proper stopping criterion to limit the number of extra iterations performed once convergence has been reached [73], [74], [75]. Failure to implement these regulations can result in parameters converging to one of their most extreme values. As such, it is typically standard to apply targeted parameter initializations [74]. In our work presented here, we initialized parameters with insights specific to the problem in the GMM implementation of the EM algorithm. However, when using the EM algorithm for the vMM problem, we initialized distribution weights randomly and we observed that convergence did not depend on proper initialization of weights, as was expected since this problem is convex. In addition, published studies have established precedent for enforcing a ‘stopping criterion’ for the EM algorithm [76]. As such, in both implementations of the EM algorithm (vMM and GMM problems), we chose a maximum number of parameter updates that allowed the algorithm to achieve convergence. More robust parameter initializations have been proposed [74], as well as competitive EM (CEM) [75] and other ‘split and merge’ algorithms [77], which offer an approach to address both the challenges of proper parameter initialization and developing a stopping criterion. A subject of future work could entail implementing one of these algorithms in lieu of the standard EM algorithm presented here.
Through the methods presented in this work, we identified an outlier within the control group. As discussed previously, identification of this outlier was only made possible once we estimated the full probability distribution of wave directions. The outlier, which deviates from from the ‘normal’ slow wave propagation pattern typically seen in controls, may be a result of recording noise or artifacts. However, this finding could instead suggest discovery of an asymptomatic class of patients that have spatial slow wave abnormalities. This suggestion is not meant to imply certainty of any kind, but rather to motivate a future inquiry in the GP and FD research community. Future work could entail enrolling a large cohort of healthy controls and performing HR-EGG with analysis methods analogous to those presented here to assess probability distributions of wave directions. Given that the cerebral cortex exerts influence over the stomach [78] and vice versa [79], it may also be sensible to perform simultaneous brain and gastric recordings to associate the two modalities, for instance from wave propagation [80], [81] or phase coupling [79], [82] perspectives.
V. Implications of This Work
Although recent findings suggest that gastric neuromodulation is able to entrain the slow wave, there is not a reliable method to determine which GP and FD patients may benefit from such treatment. Furthermore, we may be able to identify patients for which existing conservative treatments would yield symptom improvement, and for which patients may benefit from neuromodulation. Finally, we may be able to provide an explanation as to why certain treatments either only partially alleviate symptoms or fail to do so completely.
References
- [1].Brun R and Kuo B, “Functional dyspepsia,” Therapeutic advances in gastroenterology, vol. 3, no. 3, pp. 145–164, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].El-Serag H and Talley N, “The prevalence and clinical course of functional dyspepsia,” Alimentary pharmacology & therapeutics, vol. 19, no. 6, pp. 643–654, 2004. [DOI] [PubMed] [Google Scholar]
- [3].Jung H-K et al. , “The incidence, prevalence, and outcomes of patients with gastroparesis in Olmsted County, Minnesota, from 1996 to 2006,” Gastroenterology, vol. 136, no. 4, pp. 1225–1233, 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].Gikas A and Triantafillidis JK, “The role of primary care physicians in early diagnosis and treatment of chronic gastrointestinal diseases,” International Journal of General Medicine, vol. 7, p. 159, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].Lacy BE, Crowell MD, Schettler-Duncan A, Mathis C, and Pasricha PJ, “The treatment of diabetic gastroparesis with botulinum toxin injection of the pylorus,” Diabetes Care, vol. 27, no. 10, 2004. [DOI] [PubMed] [Google Scholar]
- [6].Abell TL et al. , “Gastric electrical stimulation in intractable symptomatic gastroparesis,” Digestion, vol. 66, no. 4, pp. 204–212, 2002. [DOI] [PubMed] [Google Scholar]
- [7].Atassi H and Abell TL, “Gastric electrical stimulator for treatment of gastroparesis,” Gastrointestinal Endoscopy Clinics, 2019. [DOI] [PubMed] [Google Scholar]
- [8].Lin Z, Sarosiek I, Forster J, Ross RA, Chen JD, and McCallum RW, “Two-channel gastric pacing in patients with diabetic gastroparesis,” Neurogast. & Motility, vol. 23, no. 10, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].McCallum RW, Chen JDZ, Lin Z, Schirmer BD, Williams RD, and Ross RA, “Gastric pacing improves emptying and symptoms in patients with gastroparesis,” Gastroenterology, vol. 114, no. 3, 1998. [DOI] [PubMed] [Google Scholar]
- [10].Lin Z, McCallum R, Schirmer B, and Chen J, “Effects of pacing parameters on entrainment of gastric slow waves in patients with gastroparesis,” American Journal of Physiology-Gastrointestinal and Liver Physiology, vol. 274, no. 1, pp. G186–G191, 1998. [DOI] [PubMed] [Google Scholar]
- [11].Alighaleh S et al. , “A novel gastric pacing device to modulate slow waves and assessment by high-resolution mapping,” IEEE Trans Biomed Eng, 2019. [DOI] [PubMed] [Google Scholar]
- [12].Paskaranandavadivel N, Avci R, Nagahawatte N, Farajidavar A, and Cheng LK, “Electroceutical approaches for gastroparesis,” in Obesity and Diabetes. Springer, 2020, pp. 967–982. [Google Scholar]
- [13].Perley A et al. , “Miniaturized wireless gastric pacing via inductive power transfer with non-invasive monitoring using cutaneous electrogastrography,” Bioelectronic Medicine, vol. 7, no. 1, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Gottfried-Blackmore A et al. , “Open-label pilot study: non-invasive vagal nerve stimulation improves symptoms and gastric emptying in patients with idiopathic gastroparesis,” Neurogast. & Motility, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Langworthy J, Parkman HP, and Schey R, “Emerging strategies for the treatment of gastroparesis,” Expert Review of Gastroenterology and Hepatology, vol. 10, no. 7, pp. 817–825, 2016. [DOI] [PubMed] [Google Scholar]
- [16].Irvine EJ et al. , “Design of treatment trials for functional gastrointestinal disorders,” Gastroenterology, vol. 150, no. 6, 2016. [DOI] [PubMed] [Google Scholar]
- [17].Pasricha PJ, Grover M, Yates KP, Abell TL, Bernard CE, Koch KL, McCallum RW, Sarosiek I, Kuo B, Bulat R et al. , “Functional dyspepsia and gastroparesis in tertiary care are interchangeable syndromes with common clinical and pathologic features,” Gastroenterology, vol. 160, no. 6, pp. 2006–2017, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Pasricha PJ et al. , “Characteristics of patients with chronic unexplained nausea and vomiting and normal gastric emptying,” Clinical Gastr. & Hepatology, vol. 9, no. 7, pp. 567–576, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Janssen P et al. , “The relation between symptom improvement and gastric emptying in the treatment of diabetic and idiopathic gastroparesis,” The American Journal of Gastroenterology, vol. 108, no. 9, 2013. [DOI] [PubMed] [Google Scholar]
- [20].Corinaldesi R et al. , “Effect of chronic administration of cisapride on gastric emptying of a solid meal and on dyspeptic symptoms in patients with idiopathic gastroparesis.” Gut, vol. 28, no. 3, pp. 300–305, 1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].McCallum R, Cynshi O, and Team I, “Clinical trial: effect of mitemcinal (a motilin agonist) on gastric emptying in patients with gastroparesis–a randomized, multicentre, placebo-controlled study,” Alimentary Pharmacology and Therapeutics, vol. 26, no. 8, 2007. [DOI] [PubMed] [Google Scholar]
- [22].Barton ME et al. , “A randomized, double-blind, placebo-controlled phase II study (MOT114479) to evaluate the safety and efficacy and dose response of 28 days of orally administered camicinal, a motilin receptor agonist, in diabetics with gastroparesis,” Gastroenterology, 2014. [Google Scholar]
- [23].Farrugia G, “Interstitial cells of Cajal in health and disease,” Neurogast. & Motility, vol. 20, 2008. [DOI] [PubMed] [Google Scholar]
- [24].Ward SM and Sanders KM, “Interstitial cells of Cajal: primary targets of enteric motor innervation,” The Anatomical Record: An Official Publication of the American Association of Anatomists, 2001. [DOI] [PubMed] [Google Scholar]
- [25].Van Helden DF, Laver DR, Holdsworth J, and Imtiaz MS, “Generation and propagation of gastric slow waves,” Clinical and Experimental Pharmacology and Physiology, vol. 37, no. 4, 2010. [DOI] [PubMed] [Google Scholar]
- [26].Koch KL and Stern RM, Handbook of Electrogastrography. Oxford University Press, 2004. [Google Scholar]
- [27].Hirst GDS, Garcia-Londoño AP, and Edwards FR, “Propagation of slow waves in the guinea-pig gastric antrum,” The Journal of physiology, vol. 571, no. 1, pp. 165–177, 2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [28].Ördög T, “Interstitial cells of Cajal in diabetic gastroenteropathy,” Neurogast. & Motility, vol. 20, no. 1, pp. 8–18, 2008. [DOI] [PubMed] [Google Scholar]
- [29].Angeli TR et al. , “Loss of interstitial cells of Cajal and patterns of gastric dysrhythmia in patients with chronic unexplained nausea and vomiting,” Gastroenterology, vol. 149, no. 1, pp. 56–66, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [30].O’Grady G et al. , “Abnormal initiation and conduction of slow-wave activity in gastroparesis, defined by high-resolution electrical mapping,” Gastroenterology, vol. 143, no. 3, pp. 589–598, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [31].—, “Origin and propagation of human gastric slow-wave activity defined by high-resolution mapping,” American Journal of Physiology-Gastrointestinal and Liver Physiology, vol. 299, no. 3, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [32].Parkman H et al. , “Electrogastrography: a document prepared by the gastric section of the american motility society clinical gi motility testing task force,” Neurogast.& Motility, vol. 15, no. 2, pp. 89–102, 2003. [DOI] [PubMed] [Google Scholar]
- [33].Holmvall P and Lindberg G, “Electrogastrography before and after a high-caloric, liquid test meal in healthy volunteers and patients with severe functional dyspepsia,” Scandinavian Journal of Gastroenterology, vol. 37, no. 10, pp. 1144–1148, 2002. [DOI] [PubMed] [Google Scholar]
- [34].Oba-Kuniyoshi A et al. , “Postprandial symptoms in dysmotility-like functional dyspepsia are not related to disturbances of gastric myoelectrical activity,” Brazilian J. Medical & Biological Research, 2004. [DOI] [PubMed] [Google Scholar]
- [35].Gharibans AA et al. , “Spatial patterns from high-resolution electrogastrography correlate with severity of symptoms in patients with functional dyspepsia and gastroparesis,” Clinical Gastr. & Hep, 2019. [DOI] [PubMed] [Google Scholar]
- [36].Berry R et al. , “Patterns of abnormal gastric pacemaking after sleeve gastrectomy defined by laparoscopic high-resolution electrical mapping,” Obesity surgery, vol. 27, no. 8, pp. 1929–1937, 2017. [DOI] [PubMed] [Google Scholar]
- [37].Gharibans AA et al. , “Artifact rejection methodology enables continuous, noninvasive measurement of gastric myoelectric activity in ambulatory subjects,” Scientific Reports, vol. 8, no. 1, p. 5019, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [38].—, “High-resolution electrogastrogram: a novel, noninvasive method for determining gastric slow-wave direction and speed,” IEEE Trans Biomed Eng, vol. 64, no. 4, pp. 807–815, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [39].Agrusa AS et al. , “A deep convolutional neural network approach to classify normal and abnormal gastric slow wave initiation from the high resolution electrogastrogram,” IEEE Trans Biomed Eng, 2019. [DOI] [PubMed] [Google Scholar]
- [40].Revicki DA et al. , “Gastroparesis cardinal symptom index (gcsi): development and validation of a patient reported assessment of severity of gastroparesis symptoms,” Quality of Life Research, 2004. [DOI] [PubMed] [Google Scholar]
- [41].Somarajan S et al. , “The effect of chronic nausea on gastric slow wave spatiotemporal dynamics in children,” Neurogastr. & Motility, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [42].Rousseeuw PJ and Leroy AM, Robust regression and outlier detection. John wiley & sons, 2005, vol. 589. [Google Scholar]
- [43].Andersen R, Modern methods for robust regression. Sage, 2008. [Google Scholar]
- [44].Chamroukhi F, “Unsupervised learning of regression mixture models with unknown number of components,” Journal of Statistical Computation and Simulation, vol. 86, no. 12, pp. 2308–2334, 2016. [Google Scholar]
- [45].Chen R and Paschalidis I, “Outlier detection using robust optimization with uncertainty sets constructed from risk measures,” ACM SIGMETRICS Performance Evaluation Review, vol. 45, no. 3, pp. 174–179, 2018. [Google Scholar]
- [46].Villani C, Optimal transport: old and new. Springer Science & Business Media, 2008, vol. 338. [Google Scholar]
- [47].Santambrogio F, “Optimal transport for applied mathematicians,” Birkäuser, NY, vol. 55, no. 58-63, p. 94, 2015. [Google Scholar]
- [48].Peyré G, Cuturi M et al. , “Computational optimal transport: With applications to data science,” Foundations and Trends® in Machine Learning, vol. 11, no. 5-6, pp. 355–607, 2019. [Google Scholar]
- [49].Galichon A, Optimal transport methods in economics. Princeton University Press, 2018. [Google Scholar]
- [50].Agueh M and Carlier G, “Barycenters in the wasserstein space,” SIAM Journal on Mathematical Analysis, vol. 43, no. 2, pp. 904–924, 2011. [Google Scholar]
- [51].Kurniawan JF et al. , “An adhesive-integrated stretchable silver-silver chloride electrode array for unobtrusive monitoring of gastric neuromuscular activity,” Advanced Materials Technologies, vol. 6, no. 5, 2021. [Google Scholar]
- [52].Rentz A et al. , “Development and psychometric evaluation of the patient assessment of upper gastrointestinal symptom severity index (PAGI-SYM) in patients with upper gastrointestinal disorders,” Quality of Life Research, 2004. [DOI] [PubMed] [Google Scholar]
- [53].Won K-J, Sanders KM, and Ward SM, “Interstitial cells of Cajal mediate mechanosensitive responses in the stomach,” Proceedings of the National Academy of Sciences, vol. 102, no. 41, 2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [54].Shimada Y et al. , “Electrogastrographic power ratio in humans is not related to changes in antrum-skin distance but to antral motility,” Journal of Gastroenterology, vol. 33, no. 3, pp. 310–317, 1998. [DOI] [PubMed] [Google Scholar]
- [55].Yin J and Chen JD, “Electrogastrography: methodology, validation and applications,” Neurogast. & motility, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [56].Rubino D, Robbins KA, and Hatsopoulos NG, “Propagating waves mediate information transfer in the motor cortex,” Nature neuroscience, vol. 9, no. 12, pp. 1549–1557, 2006. [DOI] [PubMed] [Google Scholar]
- [57].Mardia KV, “Statistics of directional data,” Journal of the Royal Statistical Society: Series B (Methodological), vol. 37, no. 3, 1975. [Google Scholar]
- [58].—, Statistics of directional data. Academic press, 2014. [Google Scholar]
- [59].McGraw T, Vemuri BC, Yezierski B, and Mareci T, “Von mises-fisher mixture model of the diffusion odf,” in 3rd IEEE International Symposium on Biomedical Imaging: Nano to Macro, 2006. IEEE, 2006, pp. 65–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [60].Mardia KV, Taylor CC, and Subramaniam GK, “Protein bioinformatics and mixtures of bivariate von mises distributions for angular data,” Biometrics, vol. 63, no. 2, pp. 505–512, 2007. [DOI] [PubMed] [Google Scholar]
- [61].Johnson TD, Coleman TP, and Rangel LM, “A flexible likelihood approach for predicting neural spiking activity from oscillatory phase,” Journal of neuroscience methods, vol. 311, pp. 307–317, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [62].Schwarz G et al. , “Estimating the dimension of a model,” The annals of statistics, vol. 6, no. 2, pp. 461–464, 1978. [Google Scholar]
- [63].Redner RA and Walker HF, “Mixture densities, maximum likelihood and the EM algorithm,” SIAM review, vol. 26, no. 2, pp. 195–239, 1984. [Google Scholar]
- [64].Rabin J, Delon J, and Gousseau Y, “Transportation distances on the circle,” Journal of Mathematical Imaging and Vision, vol. 41, no. 1, pp. 147–167, 2011. [Google Scholar]
- [65].Solomon J, Rustamov R, Guibas L, and Butscher A, “Wasserstein propagation for semi-supervised learning,” in International Conference on Machine Learning. PMLR, 2014, pp. 306–314. [Google Scholar]
- [66].Gupta MR and Chen Y, Theory and use of the EM algorithm. Now Publishers Inc, 2011. [Google Scholar]
- [67].Woolf B, “The log likelihood ratio test (the g-test),” Annals of human genetics, vol. 21, no. 4, pp. 397–409, 1957. [DOI] [PubMed] [Google Scholar]
- [68].Hahsler M, Piekenbrock M, and Doran D, “dbscan: Fast density-based clustering with R,” J. Statistical Software, vol. 91, no. 1, 2019. [Google Scholar]
- [69].—, “dbscan: Fast density-based clustering with r,” Journal of Statistical Software, vol. 91, no. 1, pp. 1–30, 2019. [Google Scholar]
- [70].Hartigan JA and Wong MA, “Algorithm as 136: A k-means clustering algorithm,” Journal of the royal statistical society. series c (applied statistics), vol. 28, no. 1, pp. 100–108, 1979. [Google Scholar]
- [71].McInnes L, Healy J, and Astels S, “hdbscan: Hierarchical density based clustering,” Journal of Open Source Software, vol. 2, no. 11, p. 205, 2017. [Google Scholar]
- [72].Kruisselbrink R, Gharapetian A, Chaparro LE, Ami N, Richler D, Chan VW, and Perlas A, “Diagnostic accuracy of point-of-care gastric ultrasound,” Anesthesia & Analgesia, vol. 128, no. 1, pp. 89–95, 2019. [DOI] [PubMed] [Google Scholar]
- [73].Baudry J-P and Celeux G, “EM for mixtures,” Statistics and computing, vol. 25, no. 4, pp. 713–726, 2015. [Google Scholar]
- [74].Panić B, Klemenc J, and Nagode M, “Improved initialization of the EM algorithm for mixture model parameter estimation,” Mathematics, vol. 8, no. 3, p. 373, 2020. [Google Scholar]
- [75].Zhang B, Zhang C, and Yi X, “Competitive EM algorithm for finite mixture models,” Pattern recognition, vol. 37, no. 1, pp. 131–144, 2004. [Google Scholar]
- [76].Coakley KJ, “A cross-validation procedure for stopping the EM algorithm and deconvolution of neutron depth profiling spectra,” IEEE Trans Nuclear Science, vol. 38, no. 1, pp. 9–15, 1991. [Google Scholar]
- [77].Ueda N, Nakano R, Ghahramani Z, and Hinton GE, “SMEM algorithm for mixture models,” Neural computation, vol. 12, no. 9, 2000. [DOI] [PubMed] [Google Scholar]
- [78].Levinthal DJ and Strick PL, “Multiple areas of the cerebral cortex influence the stomach,” Proc Nat Acad Sci, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [79].Rebollo I, Devauchelle A-D, Béranger B, and Tallon-Baudry C, “Stomach-brain synchrony reveals a novel, delayed-connectivity resting-state network in humans,” Elife, vol. 7, p. e33321, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [80].Muller L et al. , “Cortical travelling waves: mechanisms and computational principles,” Nature Reviews Neuroscience, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [81].Davis ZW, Muller L, Martinez-Trujillo J, Sejnowski T, and Reynolds JH, “Spontaneous travelling cortical waves gate perception in behaving primates,” Nature, vol. 587, no. 7834, pp. 432–436, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [82].Choe AS, Tang B, Smith KR, Honari H, Lindquist MA, Caffo BS, and Pekar JJ, “Phase-locking of resting-state brain networks with the gastric basal electrical rhythm,” PloS one, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]








