PeerJ Comput Sci. 2021 Apr 6;7:e443. doi: 10.7717/peerj-cs.443

Table 2. Dataset sizes.

The human text datasets used in our experiments were taken from the samples of human text published with the respective language models and resized to match the size of the machine text datasets.

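| Model | Dataset full name | Short name | Full: Train | Full: Valid | Full: Test | Filtered: Train | Filtered: Valid | Filtered: Test |
|---|---|---|---|---|---|---|---|---|
| Machine datasets | | | | | | | | |
| GPT2 | Small-117M | s | 250,000 | 5,000 | 5,000 | 185,622 | 3,732 | 3,722 |
| GPT2 | xl-1542M | xl | 250,000 | 5,000 | 5,000 | 193,052 | 3,868 | 3,851 |
| GPT2 | Small-117M-k40 | s-k | 250,000 | 5,000 | 5,000 | 201,236 | 4,062 | 4,082 |
| GPT2 | xl-1542M-k40 | xl-k | 250,000 | 5,000 | 5,000 | 214,202 | 4,312 | 4,243 |
| GPT3 | 175B | GPT3 | 1,604 | 201 | 201 | 886 | 122 | 101 |
| Grover | Grover-Mega | Grover | 8,000 | 1,000 | 1,000 | 7,740 | 964 | 961 |
| Human datasets | | | | | | | | |
| GPT2 | Webtext | | 250,000 | 5,000 | 5,000 | 190,503 | 3,813 | 3,834 |
| GPT3 | GPT3-webtext | | 1,604 | 201 | 201 | 1,235 | 160 | 155 |
| Grover | realNews | | 8,000 | 1,000 | 1,000 | 7,725 | 972 | 976 |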
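
The share of each dataset that remains in the Filtered columns can be read off Table 2 with simple arithmetic. The following is a minimal sketch rather than code from the paper: the sizes are the machine-dataset training splits copied from the table, and the names `TRAIN_SIZES`, `retained_fraction`, and `retention_table` are hypothetical helpers introduced only for illustration.

```python
# Machine-dataset training split sizes copied from Table 2,
# keyed by short name: (full, filtered).
TRAIN_SIZES = {
    "s": (250_000, 185_622),
    "xl": (250_000, 193_052),
    "s-k": (250_000, 201_236),
    "xl-k": (250_000, 214_202),
    "GPT3": (1_604, 886),
    "Grover": (8_000, 7_740),
}


def retained_fraction(full: int, filtered: int) -> float:
    """Return the share of the full split that remains in the filtered split."""
    if full <= 0:
        raise ValueError("full split size must be positive")
    return filtered / full


def retention_table(sizes: dict) -> dict:
    """Map each dataset short name to its retained fraction."""
    return {
        name: retained_fraction(full, filtered)
        for name, (full, filtered) in sizes.items()
    }


if __name__ == "__main__":
    # Print one line per dataset, formatted as a percentage.
    for name, fraction in retention_table(TRAIN_SIZES).items():
        print(f"{name:>6}: {fraction:.1%}")
```

For instance, the Grover training split keeps 7,740 of 8,000 samples (96.75%), whereas the GPT3 training split keeps 886 of 1,604 (about 55%), so the filtered GPT3 experiments rest on a much smaller share of an already small dataset.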