Skip to main content
. 2022 May 12;22(10):3683. doi: 10.3390/s22103683

Table 3.

Specifications of the Uzbek language corpus dataset.

Category of Dataset Training Validation Testing Total
Duration (h) 180.36 16.2 10.8 207.36
#Utterances 162.02 10.98 7.81 180.81
#Words 811.98k 46.95k 45.3k 904.23k
#Unique Words 90.36k 20.16k 18.34k 128.86k
#Speakers 1082 107 93 1282