Table 7.
Subcategorization of semantic similarity classification discrepancies.
| Subcategory | Human error (n = 63) |
ParAlg error (n = 511) |
||
|---|---|---|---|---|
| Proportion | Count | Proportion | Count | |
| Associated | .30 | 19 | .12 | 62 |
| Category coordinate | .03 | 2 | .04 | 20 |
| Diminutive | .00 | 0 | .00 | 1 |
| No relationship | .38 | 24 | .69 | 352 |
| Rater disagreement a | .19 | 12 | .10 | 52 |
| Related proper name | .00 | 0 | .00 | 0 |
| Shared morpheme | .08 | 5 | .01 | 3 |
| Subordinate | .00 | 0 | .01 | 6 |
| Superordinate | .00 | 0 | .03 | 13 |
| Synonym | .00 | 0 | .00 | 2 |
Note. An additional 235 cases were categorized as uncertain, where the three rater judgments were not in exact agreement with regard to the presence or absence of semantic similarity. There were two cases of data missing at random for rater judgments of semantic similarity and semantic subcategorization. There were another five cases of data missing at random for semantic subcategorization only, four of which were categorized as uncertain. The remaining case was categorized as a human error; as such, it was included in the numeric total but not listed in the table.
Cases where the three raters' subcategorization of a human- or ParAlg-related error was not in exact agreement.