2022 May 12;22(10):3683. doi: 10.3390/s22103683

Table 2.

Specifics of the Common Voice and SpeechOcean datasets. The ‘training’ set is used to train the ASR model, and the ‘testing’ set is used to evaluate its accuracy.

Dataset                                Total Words  Total Unique Words  Total Duration (h)  Total Utterances  Average Duration (s)  Average Words per Utterance
Common Voice (training)                497,326      78,604              127.41              117,526           4.27                  4
Common Voice (testing)                 23,450       14,108              11.69               8,840             4.23                  5
SpeechOcean                            406,904      50,256              79.95               63,284            5.93                  8
Common Voice (training) + SpeechOcean  904,230      128,860             207.36              180,810           4.80                  6
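
As a quick consistency check, the totals in the combined row can be recomputed from the Common Voice (training) and SpeechOcean rows. The sketch below is illustrative only: the variable names are hypothetical, the numbers are copied from the table, and the two average columns are left out because they cannot be obtained by simple addition.

```python
# Values copied from Table 2, in the order:
# (total words, total unique words, total duration in hours, total utterances)
common_voice_train = (497_326, 78_604, 127.41, 117_526)
speechocean = (406_904, 50_256, 79.95, 63_284)
combined_reported = (904_230, 128_860, 207.36, 180_810)

# Add the two rows column by column; rounding to two decimals
# avoids floating-point noise in the duration column.
combined_computed = tuple(
    round(a + b, 2) for a, b in zip(common_voice_train, speechocean)
)

# True when every recomputed total equals the value reported in the table.
totals_match = combined_computed == combined_reported
print(combined_computed, totals_match)
```

All four totals agree with the reported row. The combined unique-word count equals the plain sum of the two rows, which suggests (as an assumption, not something the table states) that it was obtained by addition rather than by deduplicating words across the two datasets.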