Skip to main content
. 2021 Dec 1;50(D1):D236–D245. doi: 10.1093/nar/gkab1089

Table 4.

Partial comparison of the proportion and detailed classification of detected repeats generated based on two databases of the Glycine max genome

Combination of RepBase and Dfam [bases masked: 36.11%] msRepDB [bases masked: 41.54%]
Repeat types Number of elements Length occupied Percentage of sequences Number of elements Length occupied Percentage of sequences
Retroelements 199 220 289 032 002 bp 29.52% 244 764 328 295 871 bp 33.54%
+SINEs 0 0 bp 0.00% 0 0 bp 0.00%
+Penelope 0 0 bp 0.00% 0 0 bp 0.00%
+LINEs 12 626 10 304 690 bp 1.05% 13 156 10 432 965 bp 1.07%
++CRE/SLACS 0 0 bp 0.00% 0 0 bp 0.00%
+++L2/CR1/Rex 0 0 bp 0.00% 0 0 bp 0.00%
+++R1/LOA/Jockey 0 0 bp 0.00% 0 0 bp 0.00%
+++R2/R4/NeSL 0 0 bp 0.00% 0 0 bp 0.00%
+++RTE/Bov-B 3 790 2 001 199 bp 0.20% 3 945 2 017 968 bp 0.21%
+++L1/CIN4 8 836 8 303 491 bp 0.85% 9 211 8 414 997 bp 0.86%
+LTR elements 186 594 278 727 312 bp 28.47% 231 608 317 862 906 bp 32.47%
++BEL/Pao 0 0 bp 0.00% 0 0 bp 0.00%
++Tyl/Copia 58 199 80 563 666 bp 8.23% 83 194 87 429 549 bp 8.93%
++Gypsy/DTRS1 126 690 195 309 037 bp 19.95% 140 926 225 546 399 bp 23.04%
+++Retroviral 0 0 bp 0.00% 340 206 126 bp 0.02%
DNA transposons 58 468 41 514 301 bp 4.24% 61 139 42 789 484 bp 4.37%
+hobo-Activator 7 612 2 233 822 bp 0.23% 5 901 1 964 869 bp 0.20%
+Tc1-IS630-Pogo 117 56 379 bp 0.01% 321 75 504 bp 0.01%
+En-Spm 0 0 bp 0.00% 0 0 bp 0.00%
+MuDR-IS905 0 0 bp 0.00% 0 0 bp 0.00%
+PiggyBac 0 0 bp 0.00% 0 0 bp 0.00%
+Tourist/Harbinger 923 564 171 bp 0.06% 1 006 582 191 bp 0.06%
+Other 0 0 bp 0.00% 0 0 bp 0.00%
Rolling circles 538 252 405 bp 0.03% 967 740 481 bp 0.08%
Unclassified 0 0 bp 0.00% 46 116 9 214 511 bp 0.94%
Total interspersed repeats 330 546 303 bp 33.77% 378 050 943 bp 38.62%
Small RNA 2 223 902 022 bp 0.09% 2 221 901 834 bp 0.09%
Satellites 19 885 2 175 759 bp 0.22% 9 389 6 367 996 bp 0.65%
Simple repeats 323 670 15 236 633 bp 1.56% 307 769 14 416 738 bp 1.47%
Low complexity 82 139 4 344 053 bp 0.44% 75 689 3 964 123 bp 0.40%

The test results were obtained by using RepeatMasker based on the msRepDB database and the combination of Dfam and RepBase respectively under the default parameter settings.