Skip to main content
. 2018 Nov 22;18(12):4093. doi: 10.3390/s18124093
Symbol Description Data Structure Supporting Process
T Number of Weibo posts Int T w-LDA
U Number of Weibo users Int U
K Number of topics Int K
V Number of words in the vocabulary Int V
V Number of uers profiles Int V
Np Number of words in p-th user profile Int N[P]
α K-dimensional prior weight vectors of topics in a document, Float a[K]
β V-dimensional vector prior weight of words in a topic Float b[V]
φz V-dimensional vector of probabilities, represents distribution of words in topic z Double phi [Z][V]
ϑp K-dimensional vector of probabilities, represents distribution of topics in user profile p Double theta [P][K]
zi Identity of current topic of word wi in user profile pi Int 1…K
wi Identity of current word in user profile pi Int 1…V
pi Identity of current user profiles Int 1…P
ni,j(pi) Document-Topic matrix, the number of times topic j has been assigned to words in user profile pi. int npt [P][K]
ni,j(wi) Topic-Word matrix, Number of times that word wi has been assigned to topic j int ntw [K][V]
Wi Identity of current word vector (200 dimensions) trained by traffic word2vec Double [200] Similarity measure
Tw(i=1K) Identity of current topic word cluster detected from Weibo Double tw[K]
Tn(j=1K) Identity of current topic word cluster detected from News Double tn[K]
WEw(m,i) Word embedding- cluster tensor, identity of the current word embedding in i-th cluster detected from Weibo Double cew[K] [Wi]
WEn(n,j) Word embedding- cluster tensor, identity of the current word embedding in i-th cluster detected from News Double cen [K] [Wi]
disR(WEw(m,i),WEn(n,j)) Words similarity, measure the similarity between word embedding WEw(m,i) and WEn(n,j) Double wd
DR( Tw(i), Tn(j)) Topic similarity matrix, measure the distances between each words in the given topic cluster Tw(i) and Tn(j) Double td[K][K] Event fusion
μR(Tw(i),Tn(j)) Average shortest distance between Tw(i) and Tn(j) Double atd
μR*(Tw(i),Tn(j)) Normalized average shortest distance between Tw(i) and Tn(j) Double natd
σR*(Tw(i),Tn(j)) The standard deviation of normalized topic distances Double sd