Skip to main content
. 2022 Nov 18;11:giac104. doi: 10.1093/gigascience/giac104

Table 3.

Ultra-large dataset properties

Family No. of sequences %id Sequence length
Combined Seed Min Max Avg
PF00005 3,489,586 55 26 18 683 146
PF07690 1,861,106 192 13 37 577 284
PF00096 1,783,511 159 41 12 34 23
PF00072 1,767,045 52 25 28 156 110
PF00400 1,594,257 1,465 24 12 101 35
PF00069 1,154,714 38 21 24 511 227
PF12796 945,198 184 24 27 153 78
PF13855 766,271 62 28 26 73 57
PF00041 666,310 98 20 27 139 81
PF07679 579,519 48 21 25 149 83

Sequence identity is based on full alignment. Sequence lengths are given for the combined dataset.