Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2020 Apr 17;12084:553–566. doi: 10.1007/978-3-030-47426-3_43

BRUNCH: Branching Structure Inference of Hybrid Multivariate Hawkes Processes with Application to Social Media

Hui Li 14,, Hui Li 15, Sourav S Bhowmick 14
Editors: Hady W Lauw8, Raymond Chi-Wing Wong9, Alexandros Ntoulas10, Ee-Peng Lim11, See-Kiong Ng12, Sinno Jialin Pan13
PMCID: PMC7206163

Abstract

Multivariate Hawkes processes (MHPs) are a class of point processes where an arrival in one dimension can affect the future arrivals in all dimensions. Existing MHPs are associated with homogeneous link functions. However, in reality, different dimensions may exhibit different temporal characteristics. In this paper, we augment MHPs by incorporating heterogeneous link functions, referred to as hybrid MHPs, to capture the temporal characteristics in different dimensions. Since the branching structure can be utilized to equivalently represent MHPs, we propose a novel model called BRUNCH via intensity-driven Chinese Restaurant Processes (intCRP) to identify the optimal branching structure of hybrid MHPs. Furthermore, we relax the constraint on the shapes of triggering kernels in MHPs. We develop a Monte Carlo-based inference algorithm called MEDIA to infer the branching structure. Experiments on real-world datasets demonstrate the superior performance of BRUNCH and its usefulness in social media applications.

Keywords: Branching structure, Hawkes process, Heterogeneous link functions, Social media

Introduction

Multivariate Hawkes processes (MHPs) are a class of point processes with mutually exciting components to model sequences of discrete events in continuous time, where an arrival in one dimension can affect future arrivals in all dimensions [5, 6]. Recently, MHPs have emerged in multiple fields to capture mutual excitation between dimensions, including high frequency trading [1], social influence analysis [16] and computational biology [13]. However, these MHPs are limited to a specific scenario where a past arrival can only excite the occurrence of future arrivals, and the corresponding link functions1 are linear (i.e., linear MHPs). However, in reality, inhibitory arrivals and non-additive aggregation of effects from past arrivals are present in several application domains [10]. For instance, negative feedbacks of online consumers may inhibit others’ purchasing behaviours. Consequently, MHPs associated with nonlinear link functions, namely nonlinear MHPs [10], have been proposed where effects from past events encompass both excitation and inhibition.

Prior work on MHPs also assume that all dimensions take the same link function i.e., either linear or nonlinear MHPs (homogeneous MHPs). That is, all dimensions follow roughly the same temporal characteristics. Note that a dimension may record actions of a user (e.g., tweet, retweet, comment, share) in social media, behavior of a customer (e.g., purchase, comment, return) in online shopping websites, and so on. In reality, different dimensions may exhibit different temporal characteristics. For example, in Twitter, one individual (i.e., dimension) may be extremely interested in a popular topic and interact with her followers frequently whereas another individual may take little interest in that topic and seldom respond to his followers. In such scenario, the cumulative influence from recent events on the former individual is clearly different from the one on the latter. Hence, homogeneous MHPs are insufficient to capture such diverse temporal characteristics. To address this problem, in this paper we augment MHPs by incorporating heterogeneous link functions, referred to as hybrid MHPs, allowing us to cope with diverse impact of past events on future events in different dimensions.

The cluster Poisson process interpretation [7] of MHPs separates the events into two categories, namely, immigrants and offspring. The offspring events are triggered by past events, while the immigrants arrive independently and thus do not have a existing parent event. Offsprings are structured into clusters associated with each immigrant event. This is called the branching structure [8], which is an useful representation of MHPs in various applications. For example, in social influence analysis it can construct the narrative of information diffusion to pave the way for strategies to encourage or limit individual behaviors [14]. Additionally, the branching structure is widely utilized as a strategy in the maximum likelihood estimation of MHPs [16]. Unfortunately, such cluster Poisson process representation can only be applied to linear MHPs due to the mutually exciting assumption. Nonlinear MHPs cover both mutual excitation and mutual inhibition stochastically. Consequently, existing approaches based on the cluster Poisson process representation cannot be adopted to infer the branching structure of hybrid MHPs. In this paper, we infer the branching structure of hybrid MHPs regardless of the shapes of the triggering kernel functions2.

We propose a novel probabilistic model called BRUNCH (Branching stRUcture iNferenCe of Hybrid multivariate Hawkes processes) to reveal the branching structure of hybrid MHPs without assuming homogeneity of link functions or shapes of triggering kernel functions (Sect. 3). It is important for our probabilistic model to emphasize the following two features mirrored by the event sequences in MHPs: (a) the chronological order of events is nonexchangeable; and (b) the triggering relations could distribute within or across dimensions stochastically. To this end, we propose intensity-driven Chinese Restaurant Process (intCRP), a novel extension of classical CRP [3] in which the random seating assignment of the customers depends on the triggering kernels between them. In particular, intCRP has a nested structure – inner intCRP to explore the possible triggering relations among events occurring in one dimension (i.e., event links), and outer intCRP to identify the collection of triggering relations between all events and their parents (if any) across dimensions (i.e., cluster links). Obviously, the changes to the triggering relations within and across dimensions are highly coupled, i.e., the inner intCRP and outer outCRP are strongly interlaced. Since there are countably infinite sets of triggering relations, we propose a novel inference approach called MEDIA (MontE Carlo-baseD Inference Approach) that leverages the triggering nature of MHPs to sample event links and cluster links alternatively (Sect. 4). Finally, we apply BRUNCH on real-world social media datasets, and the experimental study in Sect. 5 demonstrates its superior performance and usefulness. Formal algorithms and proofs of theorems and lemmas appear in [9]. List of key symbols used in this paper is given in Table 1.

Table 1.

Key notations.

Notation Definition Notation Definition
Inline graphic Event sequences Inline graphic k-th arrival in i-th dimension
Inline graphic i-th event sequence Inline graphic Event number until t in i-th dimension
Inline graphic Collection of event links Inline graphic Event link from event Inline graphic to event Inline graphic
Inline graphic Collection of cluster links Inline graphic Cluster link from cluster s to cluster g
Inline graphic Events triggered by Inline graphic Inline graphic Collection of cascades
Inline graphic Parent event of Inline graphic Inline graphic The cascade the event Inline graphic belongs to

Preliminaries

In this section, we introduce relevant concepts for understanding this paper.

Multivariate Hawkes Processes (MHPs)

The conditional intensity function of the i-th dimension for an M-dimensional MHP takes the following form [10]:

graphic file with name M18.gif 2.1

where Inline graphic is the base intensity capturing the arrival rate of exogenous events independent of historical events. The term Inline graphic represents the accumulation of endogenous intensity caused by history [4]. The coefficient Inline graphic measures the influence from the j-th dimension to the i-th dimension, allowing mutual excitation (Inline graphic) and mutual inhibition (Inline graphic). The triggering kernel function Inline graphic quantifies the triggering effect from the event Inline graphic (i.e., Inline graphic denotes the l-th arrival occurring in the j-th dimension) to the occurrence rate of the i-th dimension. Most of the existing work use predefined kernel functions with unknown parameters, such as the exponential kernels [16] and the power-law kernels [15]. The link function Inline graphic, recognizes the triggering pattern of the i-th dimension over the historical events. The linear MHPs [6] is the case Inline graphic with nonnegative Inline graphic for each dimension, while nonlinear MHPs apply various Inline graphic to guarantee the positive intensities. In hybrid MHPs, we allow each dimension to take a personalized link function to capture diverse temporal characteristics in real-world scenarios.

Branching Structure

Recall that the events in MHPs are classified as either immigrants or offsprings. An immigrant event arrives independently of other events, while an offspring event is triggered by a previous event. In the sequel, we refer to an immigrant together with its offsprings as a cascade. The collection of triggering relations in cascades is called the branching structure [8]. The cluster Poisson processes [7] could equivalently represent the branching structure of linear MHPs. Briefly, each immigrant starts one cascade, which consists of offspring events of the 1Inline graphic, 2Inline graphic, 3Inline graphic, Inline graphic generations, controlled by the endogenous intensity in Eq. 2.1 [11]. Due to the non-linear link functions, the above branching structure representation based on the cluster Poisson processes is inapplicable for nonlinear MHPs and hybrid MHPs.

Suppose that Inline graphic denotes the i-th event sequence during a time window [0, t]. Then Inline graphic is the collection of events from M dimensions during [0, t]. While modeling the sequences Inline graphic via hybrid MHPs, the corresponding branching structure could be represented by the variable set Inline graphic mathematically, such that

  • Inline graphic if Inline graphic is an immigrant; and Inline graphic if event Inline graphic triggers Inline graphic; and

  • Inline graphic if there are some events Inline graphic triggered by Inline graphic; otherwise, Inline graphic.

Our objective is to infer Inline graphic for each event. Consider Fig. 1(a). Directed links sketch an example of branching structure. Inline graphic indicates the event Inline graphic is an immigrant, and Inline graphic shows Inline graphic triggers Inline graphic. Inline graphic and Inline graphic leads to Inline graphic, denoting that Inline graphic triggers Inline graphic and Inline graphic successively.

Fig. 1.

Fig. 1.

(a) An example of branching structure of MHPs and the events on the time axis; (b) Branching structure construction using BRUNCH.

The BRUNCH Model

While modeling the asynchronous time-stamped event sequences via hybrid MHPs, we aim to reveal the underlying branching structure. Although the traditional Chinese Restaurant Process (CRP) [3] provides a flexible class of distributions that is amenable for modeling dependencies between elements, the exchangeability assumption here is problematic for elements with temporal dependencies. This is because events in MHPs occur at different time points, and are nonexchangeable. In addition, in our problem setting the influence that determines the pairwise dependencies between events is not homogeneous within and across different sequences. Intuitively, it is natural to quantify the influence via the triggering kernels in Eq. 2.1. In order to tackle these issues, we present the BRUNCH model, which is based on intensity-driven Chinese restaurant processes (intCRP), a new variant of CRP that allows a number of intensity-driven distributions as priors on triggering relations between events.

The events in one cascade could stem from different dimensions. So, beyond the triggering relations obtained from single-sequence intCRP, we need to capture the cross-sequence triggering relations among events. To this end, BRUNCH allows us to identify the possible dependencies among multi-dimensional event sequences in hybrid MHPs. Specifically, it presents a class of prior distributions over branching structure according to intCRP, which has nested structure, inner intCRP and outer intCRP. Briefly, the inner intCRP identifies the possible triggering relations in each sequence independently, which are referred to as event links. Linked events in each sequence form one cluster. Subsequently, outer intCRP captures the potential cross-sequence triggering relations, referred to as cluster links, which connects parent events with children from cross-sequence clusters.

We resort to the Chinese Restaurant metaphor to describe the generative process of event links and cluster links in BRUNCH. Imagine a collection of event sequences as a collection of restaurants, and the events in each sequence as customers entering a restaurant. The linked events in each sequence compose one cluster, and such clusters correspond to tables. Figure 1(b) illustrates the process. Note that in traditional CRP, the probability of a customer sitting at a table is computed from the number of other customers already sitting at that table.

Event Link Construction

In each sequence, if one event is an immigrant, there exists one self-link with itself; otherwise, there exists a triggering dependency for the event. That is, if event Inline graphic generates event Inline graphic, the link from Inline graphic to Inline graphic is created spontaneously. The inner intCRP assigns event link Inline graphic in a biased way, according to the following cluster-specific distribution:

graphic file with name M65.gif 3.1

where self-affinity Inline graphic yields new immigrant events, and larger self-affinity favors more cascades. Inline graphic describes how the affinity between a pair of events affects the probability for Inline graphic triggering Inline graphic. In accordance with the propagation characteristics described by hybrid MHPs, child events tend to be generated by preceding events with stronger triggering effect. We define Inline graphic as the product of two parts: Inline graphic and Inline graphic. For a window size W, we set the decay function Inline graphic such that for Inline graphic, and zero otherwise. It determines the probability to link with events that are at most W timespan away, and disregards the historical events as time progresses. Intuitively, the possible links existing between events that are far away from each other are negligibly rare. For Inline graphic, we apply the self-triggering kernel Inline graphic which decays the probability of connecting events along with the timespan to the current one. Consequently, the event links Inline graphic in i-th dimensional sequence assign events into clusters, where two events are assigned to the same cluster if one is reachable from the other by traversing the directed links. Once the event link Inline graphic is confirmed, we can obtain the branching structure Inline graphic within sequences.

Cluster Link Construction

Accordingly, the collection of event links Inline graphic will divide the event sequences Inline graphic into clusters, denoted by Inline graphic. By involving the mutual-triggering kernels, we could measure the pairwise affinity between cross-sequence events. Hence, given two clusters, s and g, the outer intCRP assigns the cluster link Inline graphic according to the following cascade-specific distribution:

graphic file with name M84.gif 3.2

where Inline graphic. Only if one event (i.e., Inline graphic) in cluster s generates the earliest event in cluster g, there exists a cluster link Inline graphic. In particular, self-loop cluster link is non-existent. Once the cluster link Inline graphic is determined, we could construct the equivalent branching structure Inline graphic. In summary, we construct the complete branching structure Inline graphic via scanning the obtained event links Inline graphic and cluster links Inline graphic.

Intuitively, a collection of cluster links will divide clusters into cascades. We represent one cascade as one set of events. Let Inline graphic denote the cascade associated with event Inline graphic, and Inline graphic records the final cascade assignments. Notice that events Inline graphic and Inline graphic belong to one cascade (i.e., Inline graphic) if and only if they are reachable via combinations of event links and cluster links. Given the collection of event links Inline graphic and cluster links Inline graphic, we denote Inline graphic as the final collection of cascades. We initialize the Hawkes likelihood parameters Inline graphic where Inline graphic are the hyper-parameters, and Inline graphic is the collection of distributions.

The central goal of BRUNCH is to infer the posterior distribution of the latent links Inline graphic, given a collection of time-stamped events. It places a prior distribution over a combinatorial number of possible event links and cluster links, according to inner intCRP (Eq. 3.1) and outer intCRP (Eq. 3.2), respectively. Intuitively, applying Bayes’s Theorem, the posterior distribution takes the form Inline graphic Unfortunately, we cannot compute Inline graphic. Hence, the posterior inference of links is intractable. To address this, in the following section we present a strategy that approximately infers the posterior distribution.

Model Inference

As the number of event links and cluster links varies with the observed events, we need to undertake Bayesian inference over a link set of unknown cardinality. Moreover, changes over event links may induce subsequent changes to other event links and current cluster links. To this end, we propose the Monte Carlo-based inference approach (MEDIA) that leverages the triggering nature of hybrid MHPs to sample event links and cluster links.

We aim to construct a Markov chain whose stationary distribution is the target posterior distribution Inline graphic. The state of the chain is represented by Inline graphic, a collection of links over event sequences. Furthermore, event links Inline graphic and cluster links Inline graphic are strongly coupled, that is, sampling event links could trigger a chain of merges and splits to the current structure, as shown in Fig. 2. In view of this, we design the Monte Carlo-based inference approach, which involves two key phases: (a) sampling event links Inline graphic via a Metropolis-Hastings rule, which could possibly bring changes to current cluster links Inline graphic, and then (b) update cluster links Inline graphic via a Gibbs sampler. We elaborate on them in turn.

Fig. 2.

Fig. 2.

Sampling event links in Panel (a) leads a chain of changes to current branching structure, and constructs new one in Panel (b).

Sampling Event Links. Let current links be Inline graphic. After sampling one event link, the reconstructed links are denoted as Inline graphic. The iterative procedure runs until approaching the stationary distribution as follows:

  1. Using the current links Inline graphic, sample a candidate link set Inline graphic from the transition probability Inline graphic;

  2. Sampled Inline graphic leads to cluster links Inline graphic. Calculate the acceptance probability Inline graphic for the candidate set Inline graphic,
    graphic file with name M124.gif
    Notice that we are considering the ratio of Inline graphic under two different structures, so the denominator Inline graphic is eliminated.

Theorem 4.1

The above Metropolis-Hastings rule satisfies detailed balance.

Based on Theorem 4.1, we can guarantee the resulting Markov chain converges to a stationary distribution uniquely [2]. BRUNCH provides a joint distribution for a collection of events and current links as following:

graphic file with name M127.gif 4.1

If one event is an offspring, it has only one parent event. So once the affinity functions in inner intCRP are predefined, the generated event links are conditionally independent. Furthermore, event links divide all events into clusters. Hence, once the event links and the affinity functions in outer intCRP are predefined, the generated cluster links are conditionally independent. That is to say, event links Inline graphic are conditionally independent given Inline graphic, and cluster links Inline graphic are conditionally independent given Inline graphic, thereby causing the independent cascades. As a consequence, the joint distribution on events and links equals to:

graphic file with name M132.gif 4.2

The activities belonging to cascade I are represented as Inline graphic, hence,

graphic file with name M134.gif 4.3

where the parameter set Inline graphic could be drawn from pre-determined distributions associated with the hyper-parameters Inline graphic. Hence, the above integral is tractable. Based on the hybrid MHPs associated with conditional intensity (Eq. 2.1), we derive the conditional probability density [8] that an event occurs at time Inline graphic is:

graphic file with name M138.gif 4.4

where the integral Inline graphic is not always analytically integrable w.r.t. various link functions of hybrid MHPs. To address the issue, we propose the approximation: Inline graphic.

We describe how replacing an event link affects the current links Inline graphic, and there are four cases:

1. Split. If adding event link Inline graphic breaks the existing link Inline graphic and divides Inline graphic and Inline graphic into two clusters, it shows that event Inline graphic is an offspring. Moreover, there are no existing clusters to be merged after sampling event link Inline graphic. Thus, the new cluster including Inline graphic has the same incoming cluster links as the previous cluster containing both Inline graphic and Inline graphic. We assume that the incoming links are independently linked to the new cluster with equal probability. New cluster s including Inline graphic collects its outgoing cluster links Inline graphic according to the distribution Inline graphic where Inline graphic represents the set of cluster links excluding the ones Inline graphic. Then, the transition probability is:

graphic file with name M156.gif 4.5

where Inline graphic records the number of incoming cluster links for the old cluster containing both Inline graphic and Inline graphic. Calculate Inline graphic according to Eq. 3.1.

2. Split and Merge. Adding event link Inline graphic breaks existing link Inline graphic. Moreover, the cluster including event Inline graphic and the cluster including event Inline graphic are merged after sampling event link Inline graphic. Thus, the outgoing cluster links of new cluster retain the outgoing cluster links of the old cluster including Inline graphic, and the incoming cluster links of new cluster combine the incoming cluster links connecting to the cluster including Inline graphic and the cluster including Inline graphic. The transition probability is:

graphic file with name M169.gif 4.6

Also, the incoming cluster links are assumed to be assigned to the new merged cluster equally. When the sampling link Inline graphic merges two clusters from one cascade, Inline graphic. Otherwise, when Inline graphic merges two clusters from different cascades, it combines the two cascades. Hence,

graphic file with name M173.gif 4.7

wherein Inline graphic and Inline graphic represents the events in the cascade Inline graphic excluding the events in the cascade Inline graphic. Similarly, Inline graphic and Inline graphic denotes the events in the cascade Inline graphic excluding the events in the cascade Inline graphic.

3. Merge. After adding event link Inline graphic, if there is no new cluster to appear, it shows that event Inline graphic is an immigrant. Moreover, sampling event link Inline graphic causes two existing clusters to be merged. Hence, the transition probability is:

graphic file with name M185.gif 4.8

Considering the two merged clusters come from one cascade or two cascades, Inline graphic is analogous to the analysis of Inline graphic.

4. No Change. There is no new cluster after setting Inline graphic, and also no merge after sampling event link Inline graphic. In this case, the corresponding transition probability is: Inline graphic.

Obviously, after sampling each event link, the resulting split (i.e., case 1) and merge (i.e., case 3) are inverse of each other, meanwhile, the other two cases (i.e., split and merge vs. no change) are inverse transform. Accordingly, by taking the inverse pairs, we derive the corresponding acceptance ratios Inline graphic, where the Hastings ratio Inline graphic.

As aforementioned, sampling an event link may lead four possible changes to current links Inline graphic. Hence, the corresponding Hastings ratio is:

  • A single offspring event becomes an immigrant and the previous cluster is split into two clusters. Thus the candidate partition structure Inline graphic is generated. The transitions corresponding to Inline graphic and Inline graphic are the inverse of each other. Substituting Eq. 4.1, 4.5 and 4.8, the Hastings ratio works out to be
    graphic file with name M197.gif 4.9
    When the offspring event changes its parent event from one cluster to another cluster under the condition that the two clusters locate in different cascades, the reverse transition Inline graphic leads to the merger of two cascades. Consequently, the transition probability is:
    graphic file with name M199.gif
    Similarly,
    graphic file with name M200.gif
    . The corresponding transition probability becomes: Inline graphic
  • A single offspring event switches to an immigrant, and sampling a new event link leads to the combination of two local clusters. In this case, the transitions corresponding to Inline graphic and Inline graphic are the inverse of each other. If the two merged clusters come from one cascade,
    graphic file with name M204.gif 4.10
    If the sampling new event link causes two cascades to be merged, we obtain Inline graphic.
  • Sampling Inline graphic leads the immigrant event Inline graphic to trigger new offspring. Similar to Inline graphic, we can calculate the Hastings ratios: Inline graphic.

  • Samplings which change event links but cause no change to the cluster links always have Inline graphic.

Sampling Cluster Links. Once the event links Inline graphic are sampled via the aforementioned Metropolis-Hasting rule, we update the cluster links Inline graphic again via Gibbs sampler as follows: [i)] Scan each cluster Inline graphic; [ii)] Draw Inline graphic Repeat the above steps until convergence. Inline graphic represents the outgoing cluster links for cluster s, and Inline graphic records the set of cluster links excluding the ones Inline graphic. We calculate Inline graphic according to Eq. 3.2.

We keep all event links in an adjacency matrix Inline graphic, wherein rows and columns are indexed by ordered events from all sequences, value 1 or 0 is recorded in entry Inline graphic according to whether Inline graphic triggers Inline graphic or not. The formal description of MEDIA, is given in [9]. The time complexity of MEDIA is Inline graphic [9].

Experiments

In this section, we investigate the performance of our model and inference algorithm and report the key results. All experiments are performed on a machine with 16 GB RAM with Intel(R) Core(TM) E5-1620V2 CPU@3.70 GHz processor running on Windows 8.1 Pro.

Competitors. Recall that there is no existing work that infers the branching structure of nonlinear MHPs. Hence, we are confined to compare our proposed frameworks to techniques that infer the branching structure of linear MHPs in [12, 17]. In particular, we consider the following strategies for our study. (a) Cluster-L: Based on the alternative representation of the linear Hawkes process in terms of cluster Poisson processes, [12, 17] propose the cluster-based method. (b) MEDIA-L: Use MEDIA to infer the branching structure of linear MHPs. (c) MEDIA-E: Apply MEDIA to infer the branching structure of exponential MHPs (i.e., the corresponding link function Inline graphic). (d) MEDIA-H: The odd dimensions of MHPs take linear link function, and the even dimensions adopt exponential one. Then we use MEDIA to infer the branching structure of hybrid MHPs.

As mentioned earlier, BRUNCH is not sensitive to shapes of the triggering kernel functions in hybrid MHPs. Hence, we adopt an exponential kernel function Inline graphic in the experiments (see [9] for the performance associated with other kernel functions). For each dimension, the hyper-parameter Inline graphic is sampled by a uniform distribution Inline graphic. The base intensity Inline graphic is set varying over dimensions and is sampled from Inline graphic, then the coefficient Inline graphic is sampled from Inline graphic, and the decay parameter Inline graphic has the form of Inline graphic where c is sampled from Inline graphic. The initial parameters Inline graphic need satisfy the stability and uniqueness conditions (see details in [9]). Additionally, we set the time window size to Inline graphic h.

Datasets and Ground Truth. We fit the aforementioned models on two real-world social media datasets: (1) Facebook (Fa): 43, 679, 231 events from 109, 211 individuals during March 2018 to May 2018; (2) Twitter (Tw): 51, 622, 139 events from 123, 972 individuals from March 2018 to May 2018. While crawling the triggering relations of events (i.e., which event triggers which events) as ground truth via Facebook Graph API and Twitter Streaming API, we crawl the social network structure in advance. For each immigrant event in each individual’s sequence, we grab the triggered events (i.e., offsprings) starting from the individual’s followers via a depth-first search algorithm. Then, while modeling the observed timestamped events via MHPs, we aim to infer the branching structure without the knowledge of social network structure.

Inferring Branching Structure. We convert the branching structure to a binary matrix S as mentioned in Sect. 4. Given the estimated links Inline graphic, we update Inline graphic according to Inline graphic, and then derive the across-dimensional branching structure Inline graphic from Inline graphic before filling in the final Inline graphic. Afterwards, comparing the estimated matrix Inline graphic with the ground truth S, we evaluate the effectiveness for all the aforementioned strategies in terms of F1-Score.

Figures 3(a) and 3(b) plot the results. Clearly, our proposed model BRUNCH with MEDIA outperforms the baseline Cluster-L, obtaining higher inference F1-Score. That is, the triggering relations among events (i.e., branching structure of MHPs) identified by the MEDIA approach are more reliable. While applying the same inferring procedure MEDIA, hybrid MHPs (mixing exponential MHPs and linear MHPs) show superior inference performance compared to other alternatives. In particular, MEDIA-H is superior to MEDIA-L and MEDIA-E. This further verifies the justifiability of our proposed hybrid MHPs.

Fig. 3.

Fig. 3.

Experimental results.

Convergence. We compare the convergence rate of our proposed techniques in Fig. 3(c) on Facebook data. The results on Twitter are qualitatively similar (see [9]). Clearly, MEDIA-H is of higher likelihood than MEDIA-L and MEDIA-E. This further validates the usefulness of our hybrid MHPs.

Scalability. Figures 3(d) plots the scalability of our algorithms with increasing events on Facebook data. The results are qualitatively similar on Twitter (see [9]). We run the inference methods on different sizes of datasets (i.e., slice different percentages of events in datasets as input data for BRUNCH). Observe that the average inference time of MEDIA stabilizes with increasing number of events. Since the number of event links and cluster links grows significantly as events increases, we expect the average runtime of MEDIA becomes relatively stable when more than 70% input data are utilized.

Conclusions

In this paper, we propose a novel probabilistic model called BRUNCH to infer the branching structure of hybrid MHPs. It bridges a significant chasm between hybrid MHPs (nonlinear MHPs as well) and branching structure inference. We handle the inferencing procedure via the MEDIA method, which provides a heuristic to make coordinated changes to both event links within clusters and cluster links within cascades. Empirically, our model demonstrates good performance and application potential in the real world.

Acknowledgments

This work is partly supported by the National Natural Science Foundation of China (No. 61672408, 61972309) and National Engineering Laboratory of China for Public Safety Risk Perception and Control by Big Data (PSRPC).

Footnotes

1

Link functions describe the dynamics of the comprehensive effects from previous events.

2

Triggering kernel functions describe the dynamics of how previous events trigger future events and may vary widely across different applications, e.g., the triggering patterns in social media can be very different from the ones in high frequency trading.

Contributor Information

Hady W. Lauw, Email: hadywlauw@smu.edu.sg

Raymond Chi-Wing Wong, Email: raywong@cse.ust.hk.

Alexandros Ntoulas, Email: antoulas@di.uoa.gr.

Ee-Peng Lim, Email: eplim@smu.edu.sg.

See-Kiong Ng, Email: seekiong@nus.edu.sg.

Sinno Jialin Pan, Email: sinnopan@ntu.edu.sg.

Hui Li, Email: HLI019@e.ntu.edu.sg.

Hui Li, Email: hli@xidian.edu.cn.

Sourav S. Bhowmick, Email: assourav@ntu.edu.sg

References

  • 1.Bacry E, Muzy JF. Hawkes model for price and trades high-frequency dynamics. Quant. Finance. 2014;14:1147–1166. doi: 10.1080/14697688.2014.897000. [DOI] [Google Scholar]
  • 2.Bishop CM. Pattern Recognition and Machine Learning. Heidelberg: Springer; 2006. [Google Scholar]
  • 3.Blei DM, Frazier PI. Distance dependent Chinese restaurant processes. JMLR. 2011;12:2461–2488. [Google Scholar]
  • 4.Farajtabar, M., Du, N., Rodriguez, M.G., Valera, I., Zha, H., Song, L.: Shaping social activity by incentivizing users. In: NIPS (2014) [PMC free article] [PubMed]
  • 5.Hawkes AG. Point spectra of some mutually exciting point processes. J. Roy. Stat. Soc.: Ser. B (Methodol.) 1971;33:438–443. [Google Scholar]
  • 6.Hawkes AG. Spectra of some self-exciting and mutually exciting point processes. Biometrika. 1971;58:83–90. doi: 10.1093/biomet/58.1.83. [DOI] [Google Scholar]
  • 7.Hawkes AG, Oakes D. A cluster process representation of a self-exciting process. J. Appl. Probab. 1974;11:493–503. doi: 10.2307/3212693. [DOI] [Google Scholar]
  • 8.Lee, Y., Lim, K.W., Ong, C.S.: Hawkes processes with stochastic excitations. In: ICML (2016)
  • 9.Li, H., Li, H., Bhowmick, S.S.: Brunch: branching structure inference of hybrid multivariate Hawkes processes with application to social media. Technical report http://www.ntu.edu.sg/home/assourav/TechReports/BRUNCH-TR.pdf
  • 10.Mei, H., Eisner, J.M.: The neural Hawkes process: A neurally self-modulating multivariate point process. In: NIPS (2017)
  • 11.Møller J, Rasmussen JG. Perfect simulation of Hawkes processes. Adv. Appl. Probab. 2005;37:629–646. doi: 10.1239/aap/1127483739. [DOI] [Google Scholar]
  • 12.Rasmussen JG. Bayesian inference for Hawkes processes. Methodol. Comput. Appl. Probab. 2013;15:623–642. doi: 10.1007/s11009-011-9272-5. [DOI] [Google Scholar]
  • 13.Reynaud-Bouret P, Rivoirard V, Grammont F, Tuleau-Malot C. Goodness-of-fit tests and nonparametric adaptive estimation for spike train analysis. J. Math. Neurosci. (JMN) 2014;4(1):1–41. doi: 10.1186/2190-8567-4-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Wang, Y., Theodorou, E., Verma, A., Song, L.: A stochastic differential equation framework for guiding online user activities in closed loop. In: AISTATS (2018)
  • 15.Zhang C. Modeling high frequency data using hawkes processes with power-law kernels. Procedia Comput. Sci. 2016;80:762–771. doi: 10.1016/j.procs.2016.05.366. [DOI] [Google Scholar]
  • 16.Zhou, K., Zha, H., Song, L.: Learning social infectivity in sparse low-rank networks using multi-dimensional Hawkes processes. In: AISTATS (2013)
  • 17.Zhou, K., Zha, H., Song, L.: Learning triggering kernels for multi-dimensional Hawkes processes. In: ICML (2013)

Articles from Advances in Knowledge Discovery and Data Mining are provided here courtesy of Nature Publishing Group

RESOURCES