2021 Jun 25;52(3):3002–3017. doi: 10.1007/s10489-021-02572-3

Table 8.

Extracted patterns from 5 serum metagenomic datasets, which act as features to predict viral and non-viral genomes

| Viral pattern | Frequency | Repeated in no. of filters | Filter numbers | Non-viral pattern | Frequency | Repeated in no. of filters | Filter numbers |
|---|---|---|---|---|---|---|---|
| AAGAAAA | 1610 | 27 | 4-9, 12-32 | AAAGAAA | 2753 | 32 | 1-32 |
| TAAAAAA | 1171 | 14 | 1-2, 14-22, 30-32 | ACACACA | 3446 | 17 | 16-32 |
| AAAACAA | 791 | 12 | 6-9, 14-21 | AAAAATA | 2379 | 14 | 2-15 |
| CAGAAAA | 867 | 6 | 4-9 | AAAAAAC | 1679 | 5 | 2, 7-10 |
| AAAAAAA | 3143 | 32 | 1-32 | AAAAAAA | 18473 | 32 | 1-32 |
| AAAAGAA | 3796 | 32 | 1-32 | AAAAGAA | 2806 | 23 | 1, 11-32 |
| TTTTTTT | 1919 | 31 | 2-32 | TTTTTTT | 6023 | 30 | 3-32 |

AAAAAAA, AAAAGAA and TTTTTTT are neutral patterns, present in both viral and non-viral genome sequences.
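
The table's internal consistency can be checked mechanically. The following is a minimal sketch, not part of the original study: it expands each "Filter numbers" entry into individual filter indices, confirms that the count matches the "Repeated in no. of filters" column, and recovers the neutral patterns as those listed in both the viral and non-viral columns. The helper name `expand_filter_ranges` and the dictionary layout are illustrative assumptions; the values are copied from Table 8.

```python
def expand_filter_ranges(spec):
    """Expand a spec such as '1-2, 14-22, 30-32' into a sorted list of filter indices."""
    filters = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            lo, hi = (int(x) for x in part.split("-"))
            filters.update(range(lo, hi + 1))  # ranges in the table are inclusive
        else:
            filters.add(int(part))
    return sorted(filters)


# pattern -> (frequency, reported filter count, filter numbers), copied from Table 8
viral = {
    "AAGAAAA": (1610, 27, "4-9, 12-32"),
    "TAAAAAA": (1171, 14, "1-2, 14-22, 30-32"),
    "AAAACAA": (791, 12, "6-9, 14-21"),
    "CAGAAAA": (867, 6, "4-9"),
    "AAAAAAA": (3143, 32, "1-32"),
    "AAAAGAA": (3796, 32, "1-32"),
    "TTTTTTT": (1919, 31, "2-32"),
}
non_viral = {
    "AAAGAAA": (2753, 32, "1-32"),
    "ACACACA": (3446, 17, "16-32"),
    "AAAAATA": (2379, 14, "2-15"),
    "AAAAAAC": (1679, 5, "2, 7-10"),
    "AAAAAAA": (18473, 32, "1-32"),
    "AAAAGAA": (2806, 23, "1, 11-32"),
    "TTTTTTT": (6023, 30, "3-32"),
}

# Every reported filter count should equal the number of filters listed.
for table in (viral, non_viral):
    for pattern, (_freq, reported, spec) in table.items():
        assert len(expand_filter_ranges(spec)) == reported, pattern

# Neutral patterns are those that appear in both columns.
neutral = sorted(set(viral) & set(non_viral))
print(neutral)
```

Under these assumptions every row passes the count check, and the intersection contains exactly the three patterns named in the table footnote.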