Table 7. C3 training dataset document length analysis.
| Average length | Max length | Exceeding ratio* | Samples |
|---|---|---|---|
| 152 | 1,540 | 12.3% | 5,856 |
| 290 | 1,274 | 15.9% | 6,013 |
| 222 | 1,540 | 14.2% | 11,869 |
Note: The exceeding ratio is the percentage of samples whose document length exceeds 512.
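
As an illustration of how the columns in Table 7 relate to one another, the following is a minimal sketch that computes the same four statistics from a list of document lengths. The function name `length_stats`, the `limit` parameter, and the toy input are hypothetical and are not taken from the original work; the unit of length (tokens or characters) is whatever the input list uses.

```python
from typing import Dict, Sequence, Union


def length_stats(lengths: Sequence[int], limit: int = 512) -> Dict[str, Union[int, float]]:
    """Summarize document lengths (hypothetical helper, not the authors' code).

    The exceeding ratio is the percentage of samples whose length is
    strictly greater than `limit`.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    # Count samples that are strictly longer than the limit.
    exceeding = sum(1 for n in lengths if n > limit)
    return {
        "average_length": sum(lengths) / len(lengths),
        "max_length": max(lengths),
        "exceeding_ratio": 100.0 * exceeding / len(lengths),
        "samples": len(lengths),
    }


# Toy data for illustration only; these are not C3 documents.
stats = length_stats([100, 300, 512, 600, 1540])
print(stats)
```

In this toy input, a length of exactly 512 does not count as exceeding, since the definition uses a strict comparison.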