Table 2. BK polyomavirus (BKPyV) integrations sites and microhomology.
ID | Human sequence match | Virus sequence match | Maximum MH length | MH sequence | Chromosome | Position | Nearest gene (Symbol) | Nearest gene (Ensembl ID) | Distance to Nearest Gene | Nearest RE | Distance to nearest RE |
---|---|---|---|---|---|---|---|---|---|---|---|
TBC02 | CATCATGATGATGGG | GATGGGCAGCCTA | 5 | ATGGG | chr2 | 120378301 | INHBB | ENSG00000163083 | –26499 | MIRb | –45 |
TBC02 | CTCCTGCTCATGAA | CATGAAGGT TAAGCATGCTA |
5 | ATGAA | chr4 | 145732354 | C4orf51 | ENSG00000237136 | 0 | AluSq2 | –474 |
TBC02 | ACCATTTAATTCCCAA | AGTGGAAATTAC | 2 | AC | chr4 | 145732375 | C4orf51 | ENSG00000237136 | 0 | AluSq2 | –495 |
TBC03.1 | GCCTTTCTTG TGGACTGGGT |
ATTTTCATTTCT ACTGGGGTCAGGA |
0 | No overlap | chr1 | 93693546 | BCAR3 | ENSG00000137936 | 0 | MIRb | 377 |
TBC03.1 | TCTGTTTCT TATTTCAGAA |
GGGTTCTCCTG TTTATAAGGTC |
2 | TC | chr1 | 93693570 | BCAR3 | ENSG00000137936 | 0 | MIRb | 353 |
TBC03.1 | AGAGCCTTG GTGGTGG |
GGTGGCAAA CAGTGCAG |
5 | GGTGG | chr1 | 93693890 | BCAR3 | ENSG00000137936 | 0 | MIRb | 33 |
TBC03.1 | GATACTTTTT AGACATGC |
AACCATGACC TCAGGAAGGA |
4 | CATG | chr1 | 93694075 | BCAR3 | ENSG00000137936 | 0 | MIRb | 0 |
TBC03.1 | CCTCAAAGC CACCCACTCC |
TTTCCATGA GCCCCAAA |
5 | CCAAA | chr1 | 93694843 | BCAR3 | ENSG00000137936 | 0 | MER5A | –92 |
TBC03.1 | CAATTTTTTTTTTTT | TTTTTTTATT TGTAAGGGTG |
7 | TTTTTTT | chr12 | 50449935 | LARP4 | ENSG00000161813 | 0 | AluSc | 0 |
TBC03.1 | TGCAAGGTG CTTCATGTAT |
AGGGGGCTTA AAGGATGCA |
4 | TGCA | chr14 | 95764390 | ENSG00000257275 | –6735 | MIRb | 0 | |
TBC03.1 | TAGCCAAAA AAAAAAAGG |
AAAAAAAAA GGCCACAG |
11 | AAAAAAAAAGG | chr20 | 8525269 | PLCB1 | ENSG00000182621 | 0 | MamSINE1 | 154 |
TBC03.1 | CAATTTGGA AAACAAT |
ATGCAAGGG CAGTGCACA |
2 | AT | chr3 | 73059264 | PPP4R2 | ENSG00000163605 | 0 | MER103C | 69 |
TBC03.1 | TAAAAAGTGTCA | AAGTGTCAA TAGAGAAAAA |
8 | AAGTGTCA | chr4 | 142307350 | INPP4B | ENSG00000109452 | 0 | L2a | 0 |
TBC03.1 | TCACACAAT TT-TACTCCTCT |
ACACTTTTTAC ACTCCTCTA |
8 | ACTCCTCT | chr8 | 140923993 | PTK2 | ENSG00000169398 | 0 | L2a | 0 |
TBC03.2 | GTTGAGTT GGAGCA |
CATCTAAATAA TCTCTCAAACT |
2 | CA | chr1 | 93693160 | BCAR3 | ENSG00000137936 | 0 | MER5A1 | –10 |
TBC03.2 | ACCCAGTCCA CAAGAAAGGC |
CCAGTAGAA ATGAAAAT |
0 | No overlap | chr1 | 93693546 | BCAR3 | ENSG00000137936 | 0 | MIRb | 377 |
TBC03.2 | TCTGTTTCT TATTTCAG |
GTTCTCCTGT TTATAAGGTC |
2 | TC | chr1 | 93693570 | BCAR3 | ENSG00000137936 | 0 | MIRb | 353 |
TBC04 | GAGTGAGT TCATAG |
CAACACTGTG GTGAG-TGAGTT |
4 | GAGT | chr3 | 5202593 | EDEM1 | ENSG00000134109 | 0 | L2b | –466 |
TBC06 | CAGACATT -AGGA |
TGAGGACC TAACCTGT |
4 | AGGA | chr2 | 201676427 | MPP4 | ENSG00000082126 | 0 | MIR1_Amn | 0 |
TBC08 | TCCACTTT CAGTACTT |
TGCAAAA AATCAAAT |
1 | T | chr6 | 148535326 | SASH1 | ENSG00000111961 | 0 | AluSq | 995 |
TBC09.1 | GGGGCGG TAACTAGAAG |
ACTAGAAG CTTGTCGT |
8 | ACTAGAAG | chr17 | 61340185 | BCAS3 | ENSG00000141376 | 0 | L2-3_Crp | 0 |
TBC09N | GAGAAAAT AGGACTCGG |
AAGATTCGC CTGAGAAAA |
7 | GAGAAAA | chr18 | 8169205 | PTPRM | ENSG00000173482 | 0 | MER127 | –648 |
TBC09N | TCCATCC TCCTCTAC |
CTCCTCT ACATTGT |
9 | CTCCTCTAC | chr3 | 34028749 | LINC01811 | ENSG00000226320 | 130585 | L2b | 0 |
TBC09N | ATGTAAT ATAAAACT |
CATGATT TTAACCCAG |
0 | No overlap | chr3 | 117678477 | ENSG00000239268 | 0 | L2c | 0 |
MH: microhomology; RE: Repeat element.