Skip to main content
. 2013 Jan 16;14:10. doi: 10.1186/1471-2105-14-10

Table 2.

Collocations found in redundant and non-redundant corpora

  All informative (redundant) Last informative (non-redundant)
Word Types
81,928
40,774
Words
3,641,031
545,231
Collocations
15,814
2,527
Collocations/Word
0.004
0.004
Avg. number of patients per collocation
18.2
66
% collocations that appear in notes of 3 patients or less 36 % 1 %

Collocations were extracted using a stringent cutoff of 0.001 PMI.