Table 3.
Orthographic features.
Feature name | Regular expression |
---|---|
ALLCAPS | ∧[A − Z] + $ |
CAPSMIX | ∧[A − z] ∗ ([A − Z] [a − z]∣[a − z] [A − Z]) [A − z] ∗ $ |
INITCAP | ∧[A − Z] |
PUNCTUATION | ∧[∖.:]$ |
Orthographic features.
Feature name | Regular expression |
---|---|
ALLCAPS | ∧[A − Z] + $ |
CAPSMIX | ∧[A − z] ∗ ([A − Z] [a − z]∣[a − z] [A − Z]) [A − z] ∗ $ |
INITCAP | ∧[A − Z] |
PUNCTUATION | ∧[∖.:]$ |