Algorithm 1 SE-KIA (offline component)
1: Input: Query logs , search corpus , co-occurrence threshold , search method Search(keywords, ) that returns an ordered list of documents ranked by relevance
2: Output: Keyword co-occurrence word graph
3: Step 1: Data Extraction
4:   Parse to extract all queries containing exactly two keywords
5:   Filter out pairs with frequency , obtaining set
6: Step 2: Posteriori Search Result Retrieval
7:   for each do
8:
9:
10:
11:   end for
12: Step 3: Semantic Entropy Calculation
13:   for each do
14:     Compute intersections:
15:
16:
17:     Calculate occurrence counts:
18:       number of documents in
19:       number of documents in
20:     Compute associated probabilities:
21:
22:
23:     Compute semantic entropy:
24:
25:
26:   end for
27: Step 4: Relative Importance Determination
28:   for each do
29:     if then
30:
31:     else if then
32:
33:     else
34:
35:     end if
36:   end for
37: Step 5: Graph Construction
38:   Initialize where ,
39:   for each do
40:
41:
42:   end for
43: return
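Step 1 of the algorithm (data extraction) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function name `extract_keyword_pairs`, the whitespace tokenization, the treatment of a pair as unordered, and the choice to keep pairs whose frequency is at least the threshold are all assumptions made for illustration, since the listing does not specify them.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def extract_keyword_pairs(
    query_log: Iterable[str], threshold: int
) -> Dict[Tuple[str, ...], int]:
    """Count two-keyword queries and keep the sufficiently frequent pairs.

    Assumptions (not taken from the source): keywords are separated by
    whitespace, a pair is unordered, and a pair is kept when its
    frequency is greater than or equal to `threshold`.
    """
    counts: Counter = Counter()
    for query in query_log:
        keywords = query.split()
        # Keep only queries containing exactly two keywords.
        if len(keywords) != 2:
            continue
        # Sort so that "red shoes" and "shoes red" count as the same pair.
        counts[tuple(sorted(keywords))] += 1
    # Discard pairs whose frequency falls below the threshold.
    return {pair: n for pair, n in counts.items() if n >= threshold}


# Hypothetical query log used only to demonstrate the function.
log = ["red shoes", "shoes red", "red shoes sale", "blue hat", "red shoes"]
print(extract_keyword_pairs(log, threshold=2))  # {('red', 'shoes'): 3}
```

The resulting dictionary plays the role of the filtered pair set that the later steps iterate over. If keyword order within a query matters for the importance comparison, the `sorted` call would need to be dropped.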