>gi|322417944|ref|YP_004197167.1|_extraction RNAdirected DNA polymerase %5BGeobacter sp. M18%5D -------------------------------------------------------------------GIDGVTFEAVEEK E-GVS--AFI-----AELEDALR-------N------KTY----QPD--------------------------------- -----------------PVKRV-MI--PK------------SD--G-----------------SQRPL------------ ------GIPT----IR-DRVAQMAVKLV-IE-PIFEADF-------C--------------------------------- ---ESSYGFRP------------------KR----------------------------SAHDA---VDDV--------- ----AYS--MNT----------------------GYTE-VI-DA--------DLSKYF-DTIP----------------- -HAN--LMAV-I-AERI---------CDGA------------------ILHLI---------QMWLKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------M-----E-V--------------- -------------------------------------------------------------------------------- ---------------DKDGTKRNIGG--GKG-NR-K-----------GTP-----QGGVI------------------S- PLLANLYL-----------------------------------------------------------HI----------- --------LD--RI------W-----------------E-RGN------------------------------------- ----------------------LQQR-LGAR------------IVR----------YADDIVI-------------LCR- ------RAK----AD---------------------------KAMATLRY---------------------VL-E----- ----R--L----GLSL------N-E------AK----T---TTVNAY------------K-------------------- D---------------KFDFLGFTI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Pe.ca.I3/CP000142.2/2649551..2651540/Pelobacter_extraction carbinolicus/Bacterial F/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGVTFAAIEER E-GVS--ALI-----AELEEALR-------S------KTY----KPD--------------------------------- -----------------PVKRV-MI--PK------------AD--G-----------------SQRPL------------ ------GIPT----IR-DRVAQMAVKLV-VE-PIFEADF-------C--------------------------------- ---DTSYGFRP------------------KK----------------------------SAHDA---VDDV--------- ----AYA--MNI----------------------GYTE-VI-DA--------DISKYF-DTIP----------------- -HTN--LMAV-V-AERI---------CDGA------------------ILHLI---------QMWLKS------------ -------------------------------------------------------------------------------- ------------------SV------------------------------------M-----E-V--------------- -------------------------------------------------------------------------------- ---------------GKDGKKRNVGG--GKG-NR-R-----------GTP-----QGGVI------------------S- PLLANLYL-----------------------------------------------------------HI----------- --------LD--RI------W-----------------E-RRN------------------------------------- ----------------------LQQR-LNAR------------IVR----------YADDTVL-------------LCR- ------RNK----SD---------------------------EAMAVLRQ---------------------IL-E----- ----R--L----GLTL------N-E------AK----T---KVVNGY------------K-------------------- G---------------GFDFLGFSI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|350554847|ref|ZP_08923874.1|_extraction RNAdirected DNA polymerase %28Reverse transcriptase%29 %5BThiocystis violascens DSM 198%5D -------------------------------------------------------------------GSDGVSFEAIEQG E-GVE--GFL-----KGLAEELR-------E------KRY----RAQ--------------------------------- -----------------PVRRA-MI--PK------------GD--G-----------------RERPL------------ ------GIPT----IR-DRVVQMAVKLV-IE-PIFEADF-------T--------------------------------- ---PHSYGFRP------------------QR----------------------------SAHDA---IDDI--------- ----ANA--LWA----------------------GHTH-VI-DA--------DLSSYF-DTIP----------------- -HAN--LMTV-V-AERM---------TDGA------------------ILALL---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------I-----G-V--------------- -------------------------------------------------------------------------------- ---------------DDQGKRRTVGG--GKA-NR-V-----------GTP-----QGGVI------------------S- PLLSNLYL-----------------------------------------------------------HL----------- --------LD--RI------W-----------------D-RHR------------------------------------- ----------------------LKDK-LGAH------------IVR----------YADDFVV-------------LCK- ------QG-----VE---------------------------EPLKVVRH---------------------VT-D----- ----R--L----GLTL------N-E------TK----T---HVVDAK------------E-------------------- T---------------GFHFLGFTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|345870111|ref|ZP_08822065.1|_extraction RNAdirected DNA polymerase %28Reverse transcriptase%29 %5BThiorhodococcus drewsii AZ1%5D -------------------------------------------------------------------GIDGVTFTAIEAG I-GKD--AYV-----AALREELE-------Q------KTY----RAD--------------------------------- -----------------GVRRV-WI--PK------------PD--G-----------------SERPL------------ ------GIPT----IR-DRIVQMAFKLV-VE-PIFEADF-------C--------------------------------- ---EHSYGFRP------------------QR----------------------------SAHDA---IDAI--------- ----AEA--LLR----------------------GHTQ-VI-DA--------DLSKYF-DTIP----------------- -HAK--LMGV-I-AERL---------VDGP------------------VLGLI---------RQWLKA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------I-----E-E--------------- -------------------------------------------------------------------------------- ---------------DERGQHRPT-G--GKG-NR-R-----------GTP-----QGGVA------------------S- PLLANLYL-----------------------------------------------------------HL----------- --------LD--RI------W-----------------V-RHD------------------------------------- ----------------------LERR-LGAR------------LVR----------YADDAVI-------------LCR- ------HS-----TE---------------------------KPMAVFTA---------------------VL-E----- ----K--L----DLTL------N-V------QK----T---HVVDAR------------A-------------------- D---------------GFEFLGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cKu.st.I1/CT573074.1/62738..64755/Candidatus_extraction Kuenenia stuttgartiensis/Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GADGITFEDVES- Y-GVE--KFL-----GEIIEELE-------N------KTY----EPQ--------------------------------- -----------------PVLRV-YI--PK------------TN--G-----------------KTRPL------------ ------GIPV----IK-DRVVQMSVKLV-IE-PIFEADF-------E--------------------------------- ---DSSYGFRP------------------GR----------------------------SAGDA---VRKI--------- ----KEK--LRE----------------------GKTE-VF-DA--------DLSSYF-DTIP----------------- -HKE--LLLL-I-GMRI---------SDKN------------------VLHLI---------KMWLKA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------I-----E-E--------------- -------------------------------------------------------------------------------- ---------------GKPG------G--GRK-NK-I-----------GTP-----QGSVI------------------S- PLLANIYL-----------------------------------------------------------HM----------- --------LD--KA------V-----------------N-RENG------------------------------------ ----------------------VFYK-YGIT------------IIR----------YADDWVL-------------MAK- ------RI-----PR---------------------------EALDYLNR---------------------LL-K----- ----K--L----KLSL------N-E------DK----S---KIVKAE------------E-------------------- E---------------SFDFLGHTI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|354960451|dbj|BAL13130.1|_extraction hypothetical protein BJ6T_78840 %5BBradyrhizobium japonicum USDA 6%5D -------------------------------------------------------------------GVDGITFEQIDA- S-GLE--AWL-----AGLRDELV-------T------KTY----RPD--------------------------------- -----------------PVRRV-MI--PK------------PG--G-----------------GERPL------------ ------GIPT----IR-DRVVQAAAKIV-LE-PIFEADF-------E--------------------------------- ---DGAYGYRP------------------RR----------------------------NAVDA---VKEV--------- ----HRL--MCR----------------------GYTD-VV-DA--------DLSKYF-DTIP----------------- -HSD--LLKS-V-ARRI---------VDRN------------------VLRLI---------KLWLRV------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------E-----E-R--------------- -------------------------------------------------------------------------------- ---------------DSNGKRRMS-G--GKS-NK-C-----------GTP-----QGGVI------------------S- PLLSVIYM-----------------------------------------------------------NR----------- --------FL--KH------W-------------------RLSG------------------------------------ ----------------------RCEA-FHGQ------------IIS----------YADDFVI-------------LSR- ------GH-----AE---------------------------DALTWTKA---------------------VM-T----- ----K--L----GLTL------N-E------TK----T---SVKNAR------------L-------------------- E---------------SFDFLGYTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ni.ha.I1/CP000320/75444..77354/Nitrobacter_extraction hamburgensis/Bacterial F/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGMTFGQIEG- A-GVD--AWL-----AGLREDLV-------S------KTY----QPD--------------------------------- -----------------PVRRV-MI--PK------------PG--G-----------------GERPL------------ ------GIPT----IR-DRVVQAAAKIV-LE-PIFEAGF-------E--------------------------------- ---DSAYGYRP------------------RR----------------------------SAIDA---VKET--------- ----HRL--LCR----------------------GYTD-VV-DA--------DLSKYF-DTIP----------------- -HAD--LLRS-V-ARRV---------LDRN------------------VLRLI---------KLWLQV------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------E-----E-R--------------- -------------------------------------------------------------------------------- ---------------DGDGKRHMS-G--GKS-ST-R-----------GTP-----QGGVA------------------S- PLLSVIYM-----------------------------------------------------------NR----------- --------FL--KH------W-------------------RLTG------------------------------------ ----------------------RGEV-FHAH------------VIS----------YADDFVI-------------LSR- ------GH-----AE---------------------------EALTWTRA---------------------VM-T----- ----K--L----GLTL------N-E------AK----T---SVKNAR------------R-------------------- E---------------GFDFLGYTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|115293161|locus|VBIRhiTro150571_4165|_extraction Mobile element protein [Rhizobium tropici CIAT 899] -------------------------------------------------------------------GVDGVTFTQIEA- S-GVD--AWL-----AGLREELV-------S------KTY----RPD--------------------------------- -----------------PVRRV-MI--PK------------PG--G-----------------GERAL------------ ------GIPS----IR-CRVIQTAAKLV-LE-PIFEADF-------E--------------------------------- ---DGAYGYRP------------------RR----------------------------SAVDA---VKET--------- ----HRL--MCR----------------------GYTD-VV-DA--------DLSKYF-DTIP----------------- -HSD--LLKS-V-ARRI---------VDRS------------------VLRLI---------RLWLRA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------E-----E-R--------------- -------------------------------------------------------------------------------- ---------------DGDGKRRMT-G--GKS-ST-H-----------GTP-----QGGVV------------------S- PLLSVIYM-----------------------------------------------------------NR----------- --------FL--KH------W-------------------RLSG------------------------------------ ----------------------LGEE-FRAH------------VIS----------YADDFVI-------------LSR- ------DH-----AA---------------------------EALAWTRT---------------------VM-T----- ----K--L----GLSL------N-E------AK----T---SVKDAR------------R-------------------- E---------------HFDFLGYSLG------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >gi|296163794|ref|ZP_06846488.1|_extraction RNAdirected DNA polymerase %28Reverse transcriptase%29 %5BBurkholderia sp. Ch11%5D -------------------------------------------------------------------GVDRQDFAEVEA- Y-GVQ--KWL-----GELALALR-------L------ETY----RPD--------------------------------- -----------------SIRRV-FI--PK------------AN--G-----------------KLRPL------------ ------GIST----LR-DRVCMTAAMLV-LE-PIFEADL-------P--------------------------------- ---PEQYAYRP------------------GR----------------------------NAQQA---VIEV--------- ----EER--LHR----------------------GQTD-VV-DA--------DLADYF-GSIP----------------- -HAE--MMLS-L-ARRI---------VDRR------------------VLHLI---------KMWLEC------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------E-----E-T--------------- -------------------------------------------------------------------------------- ---------------DDRGRQKRTTE--ARD-SR-R-----------GIP-----QGSPI------------------S- PLLANVYM-----------------------------------------------------------RR----------- --------FV--LA------W-------------------KKLG------------------------------------ ----------------------LQRS-LGSR------------IVT----------YADDLVI-------------LCK- ------KGK----AE---------------------------EALLNLRQ---------------------IM-G----- ----K--L----KLTV------N-E------EK----T---RICKVP------------E-------------------- G---------------EFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D.a.I1/CP000089/759875..761862/Dechloromonas_extraction aromatica/Bacterial F/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDRQDFEDVEA- Y-GVR--RWL-----EELALALK-------E------ESY----RPD--------------------------------- -----------------PIRRV-FI--PK------------AN--G-----------------KLRPL------------ ------GIST----LH-DRVCMTAAMLV-LE-PIFEADL-------P--------------------------------- ---DEQYAYRP------------------GR----------------------------NAQQA---AEEV--------- ----KNR--LYL----------------------GQTD-VV-DA--------DLSDYF-GSIP----------------- -HSE--LMKS-L-ARRI---------VDRR------------------VLHLI---------KMWLEC------------ -------------------------------------------------------------------------------- ------------------AV------------------------------------E-----E-T--------------- -------------------------------------------------------------------------------- ---------------DQRGRKKRTTE--AKD-QG-R-----------GIP-----QGSPI------------------S- PLLSNLYM-----------------------------------------------------------RR----------- --------FV--LA------W-------------------KKLG------------------------------------ ----------------------LERS-LGSR------------IVT----------YADDLVI-------------LCK- ------CGK----AE---------------------------EALQWMRT---------------------IM-G----- ----K--L----KLTV------N-E------EK----T---RICQVP------------A-------------------- G---------------TFDFLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|344200432|ref|YP_004784758.1|_extraction RNAdirected DNA polymerase %5BAcidithiobacillus ferrivorans SS3%5D -------------------------------------------------------------------GVDGERFEDVEA- Y-GVE--RWI-----GELAETLR-------K------KMY----QPQ--------------------------------- -----------------AVKRV-YI--PK------------PG--G-----------------KMRPL------------ ------GIPT----LR-DRVVQTATMMV-IE-PIFEADL-------Q--------------------------------- ---PEQYAYRA------------------GR----------------------------NALTA---VREV--------- ----HSL--LKT----------------------GHKQ-VV-DA--------DLSSYF-DTIP----------------- -HAE--LMKS-V-ARRI---------VDRH------------------LLHLI---------KMWLDA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------E-----E-G--------------- -------------------------------------------------------------------------------- ---------------DGRGNMQRTTV--NRD-QG-R-----------GTP-----QGAPI------------------S- PLLSSLYM-----------------------------------------------------------RR----------- --------FI--LG------W-------------------KQRG------------------------------------ ----------------------YEER-FGSR------------IVC----------YADDLVI-------------CCR- ------W-Q----AE---------------------------QAMAAMQD---------------------MM-G----- ----R--L----KLTV------N-A------EK----T---RICRVP------------E-------------------- A---------------YFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >So.us.I2/CP000473/3231872..3233814/Solibacter_extraction usitatus/Bacterial F/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVTIEEIMKT DQGVA--GFL-----EGIENSLR-------R------KTY----RPE--------------------------------- -----------------AVQRV-YI--EK------------EN--G-----------------KLRPL------------ ------GIPT----VR-DRVVQMATLLI-LE-PIFEADF-------L--------------------------------- ---DCSYGFRP------------------GR----------------------------SAHQA---LEEI--------- ----RGH--VEA----------------------GYQA-VY-DA--------DLKGYF-DSIP----------------- -HTQ--LLAC-V-RMRV---------VDRS------------------VLKLI---------RMWLEA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------V-----E-R--------------- -------------------------------------------------------------------------------- ---------------EEGGGGS---K--WSR-PE-K-----------GTP-----QGGVA------------------S- PLLANLYL-----------------------------------------------------------HW----------- --------FD--AL------F-----------------Y-GPEG------------------------------------ ----------------------PGGK-ADAK------------LVR----------YADDFVV-------------MAK- ------QM-----GT---------------------------ETIEFIES---------------------RLEG----- ----K--F----QLEI------N-R------EK----T---RVVDLR------------E------------------EG A---------------SLDFLSHTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ge.ur.I1/CP000698/1525569..1527641/Geobacter_extraction uraniireducens/Bacterial F/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVSIESIEVR ADGIS--GYL-----DEIQESLR-------T------KNY----KPS--------------------------------- -----------------PVRRV-YI--TK------------PN--G-----------------KLRPL------------ ------GIPC----VR-DRIVQAAVLLI-LE-PIFEVDF-------L--------------------------------- ---DCSHGFRP------------------KR----------------------------RPHGA---LDQV--------- ----GNN--LQL----------------------GRQE-VY-DA--------DLSSYF-DSIP----------------- -HEH--LIVE-L-ERRI---------ADRS------------------VLKLI---------RQWLHS------------ -------------------------------------------------------------------------------- ------------------PV--------------------------------------------R--------------- -------------------------------------------------------------------------------- ---------------EEDG------S--ISR-PK-Q-----------GTP-----QGGVI------------------S- PLLANIYL-----------------------------------------------------------HR----------- --------LD--RA------F-----------------HEEADS------------------------------------ ----------------------PYHF-ARAR------------MVR----------FADDFVV-------------MAR- ------HM-----GN---------------------------RITGWLEE---------------------KLET----- ----D--L----GLSI------N-R------DK----T---GIVRMN------------K------------------K- E---------------SLNFLGFTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Pe.th.I2/AP009389/2519125..2521096/Pelotomaculum_extraction thermopropionicum/Bacterial F/ORF Sequence %28a.a%29 -------------------------------------------------------------------GADGQSFKDIEEK V-GVE--RFL-----KEIAEELR-------N------GTY----RPM--------------------------------- -----------------PVRRV-YI--LK------------PD--G-----------------SQRPL------------ ------GIPT----IK-DRIAQMACLTV-IQ-PIFEADF-------L--------------------------------- ---DCSYGFRP------------------KR----------------------------NAHQA---IGAI--------- ----TEN--IKQ----------------------GFTA-VY-DA--------DLTKCF-DSIQ----------------- -HRL--IMDS-L-AERI---------TDGK------------------VLRLI---------KGWLEA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------V----------------------- -------------------------------------------------------------------------------- ----------------EPGGPK---Q--GRK-NY-Q-----------GTP-----QGGVI------------------S- PLLANIVL-----------------------------------------------------------NR----------- --------LD--RL------W-----------------H-RPGG------------------------------------ ----------------------PRER-YNAR------------LVR----------YADDFVV-------------LAR- ------FI-----GE---------------------------PIKNELES---------------------II-T----- ----S--M----GLNL------N-E------KK----T---RILDLN------------K------------------G- D---------------ILNFLGYSI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >S.ag.I2/AE014217/10188..12210/Streptococcus_extraction agalactiae/Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDDFTIEEIEA- Y-GVQ--KFL-----DEIEDQLR-------N------KKY----QPK--------------------------------- -----------------AVKRV-YI--PK------------AN--G-----------------KKRPL------------ ------GIPT----VR-DRVVQTAVKIV-IE-PIFEADF-------Q--------------------------------- ---EFSYGFRP------------------KR----------------------------SANQA---IREI--------- ----YKY--LNY----------------------GCEW-VI-DA--------DLKGYF-DTIP----------------- -HDK--LLLL-V-KERV---------TDKS------------------IIKLL---------SLWLEA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------NQ--------V--RSN-I--L-----------GTP-----QGGVI------------------S- PLLANIYL-----------------------------------------------------------NA----------- --------LD--RY------W-----------------K-NNRL------------------------------------ ----------------------EGRG-HDAH------------LIR----------YADDFVI-------------LCS- -------NN----PK---------------------------KYYQYAKQ---------------------RI-D----- ----K--L----GLTL------N-E------EK----T---RIV---------------H------------------AT E---------------GFDFLGYTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|61054282|locus|VBICloCla155345_1776|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium clariflavum DSM 19732] -------------------------------------------------------------------GIDKVSIDDVKA- Y-GEE--KLL-----DEIAEDLR-------A------EKY----RCK--------------------------------- -----------------PVRRT-YI--PK------------QD--G-----------------RKRAL------------ ------GIPT----IK-DRIVQMAAKIV-IE-PVFEADF-------Q--------------------------------- ---PCSYGFRP------------------KR----------------------------NAKQA---MDRI--------- ----YEM--ADKG---------------------GALW-VI-DA--------DIRDYF-GSIN----------------- -HDK--LLLL-V-KQRI---------TDRR------------------VLKLI---------KGWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----E-D--------------- -------------------------------------------------------------------------------- ---------------GQ--------Y--SES-T--V-----------GAP-----QGGVI------------------S- PLLSNIYL-----------------------------------------------------------NY----------- --------FD--VC------W-------------------SKRF------------------------------------ -------------------------G-HLGE------------LVR----------YADDFVI-------------LCK- -----KLSQ----AE---------------------------EALRAVKW---------------------IM-K----- ----K--L----ELTL------H-S------EK----T---RLVDMY------------F------------------GK D---------------SFDFLGFNN------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|115613588|locus|VBIDehSp228777_0963|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Dehalobacter sp. CF] -------------------------------------------------------------------GIDKQTLSDIEE- M-GVE--KFL-----LTCQRSLK-------E------NNY----RPM--------------------------------- -----------------PVRRQ-YI--PK------------KD--G-----------------KMRPL------------ ------GIPV----IR-DRVIQMAVKLV-IE-PIFEADF-------H--------------------------------- ---ESSYGFRP------------------KR----------------------------SAKQA---LDRV--------- ----RKA--CNRK---------------------G-NW-VC-DV--------DIQSYF-DNIN----------------- -QEK--LMKL-V-EMRI---------SDKK------------------VLKLI---------RKWFKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-E--------------- -------------------------------------------------------------------------------- ---------------GV--------I--TRT-D--I-----------GTP-----QGGVI------------------S- PLLSNIYL-----------------------------------------------------------NV----------- --------LD--LL------W-----------------E-KHG------------------------------------- -------------------------K-ESGE------------LTR----------YADDFVI-------------ICK- -----TKKD----AD---------------------------KAMVIVQA---------------------IM-K----- ----R--L----DLTL------H-P------TK----T---RLVGMW------------T------------------GE E---------------GFDFLGMHH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >UA.I4/AY714820/20258..22206/uncultured_archaeon_extraction /Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDDVTIDEFER- --NLE--QNL-----NEIQRLLR-------Q------DRY----VPK--------------------------------- -----------------PVKRV-YI--PK------------PD--G-----------------KQRPL------------ ------GIPT----IR-DRVVQQALKNV-IE-PIFEAEF-------L--------------------------------- ---DSSFGYRP------------------GK----------------------------SAKQA---IEQI--------- ----ETV--RDE----------------------GHEW-VV-DA--------DIKAFF-DTVN----------------- -HEK--LIDA-V-AERI---------SDGR------------------VLGLI---------RAFLEA------------ -------------------------------------------------------------------------------- ------------------DI------------------------------------M-----E-Q--------------- -------------------------------------------------------------------------------- ---------------GQ--------G--RAK-NV-V-----------GTP-----QGGVI------------------S- PLLANIYL-----------------------------------------------------------H------------ -------------Y------F-----------------D-ERM------------------------------------- -------------------------A-LGFE------------VVR----------YADDVLV-------------LCG- -----SEEE----AE---------------------------EAISHVKE---------------------IL-E----- ----E--L----ELTL------H-P------QK----T---KIKN--------------F------------------S- E---------------GVDFLGFTV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42840809|locus|VBICloCf158569_1553|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium cf. saccharolyticum K10] -------------------------------------------------------------------GMDGITFEMIEE- Y-GVE--EYL-----LDIQEDLQ-------N------KQY----RPK--------------------------------- -----------------PVKRV-YI--PK------------PD--G-----------------KQRPL------------ ------GIPT----IR-DRIVQQACKIV-IE-PIFEANF-------L--------------------------------- ---DSSYGFRP------------------KR----------------------------DAKQA---TEKV--------- ----K----KEL----------------------YKNWYAV-DA--------DIQGYF-DNIN----------------- -HEI--LLGL-L-KRRI---------SDRR------------------VIKLC---------RQWLQA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------I-----E-N--------------- -------------------------------------------------------------------------------- ---------------GK--------Y--YPT-E--K-----------GSP-----QGGVI------------------S- PLLANIYL-----------------------------------------------------------HV----------- --------LD--SY------W-----------------K-NHK------------------------------------- -------------------------E-LGV-------------IVR----------YADDAVI-------------VCR- -----TRKD----AE---------------------------LAFEHLKR---------------------MM-T----- ----K--L----KLTL------N-P------QK----T---KIVD--------------M------------------NK E---------------SFDFLGFRY------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >My.va.I1/CP000511/2360134..2362120/Mycobacterium_extraction vanbaalenii/Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDRVTLVAVEE- Y-GVD--RML-----RELRHDLR-------E------GVY----CPA--------------------------------- -----------------PARRV-EI--PK------------PR--G-----------------GTRPL------------ ------GIPT----VR-DRVAQAAAKIV-LE-PIFEADF-------M--------------------------------- ---SCSYGFRP------------------KR----------------------------SATQA---MERL--------- ----RVG--FIE----------------------GSQF-VV-EF--------DIANFF-GEID----------------- -HDR--LLAE-V-SRRV---------SDRR------------------VLKLL---------RLWLQA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----V-D--------------- -------------------------------------------------------------------------------- ---------------GV--------V--SRT-V--A-----------GTP-----QGGVI------------------S- PLLANIYL-----------------------------------------------------------HV----------- --------LD--TE------L-----------------A-RRN------------------------------------- ----------------------------VGE------------LVR----------YADDGVV-------------LCR- -----SAAQ----AE---------------------------HALAAVGE---------------------IL-A----- ----S--L----GLRL------H-P------DK----T---KVVDLR------------E------------------GG E---------------GLDFLGCHF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Rh.sp.I1/CP000432/23005..25058/Rhodococcus_extraction sp./Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDRITLEEVEE- Y-GVA--RLL-----DELAVELK-------E------GSY----RPL--------------------------------- -----------------PARRV-FI--PK------------PG--T-------------V---EQRPL------------ ------SIPS----VR-DRIVQAAWKLV-AE-PVFEADF-------L--------------------------------- ---PCSFGFRP------------------RR----------------------------GAHDA---LQVL--------- ----IDE--SWR----------------------GCRW-VV-ET--------DIANCF-EAIP----------------- -IEK--LMQA-V-EERV---------CDQP------------------FLKLL---------RVMLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-E--------------- -------------------------------------------------------------------------------- ---------------G---------Q--VRR-PV-T-----------GTP-----QGGVA------------------S- ALLCNVYL-----------------------------------------------------------HR----------- --------LD--RA------W-----------------DVDEHG------------------------------------ ----------------------V--------------------LVR----------YADDALV-------------MCR- -----SRRQ----AE---------------------------AALTRLRE---------------------LL-A----- ----D--L----GLEP------K-E------AK----T---RIVHLR------------V------------------GG E---------------GVDFLGFHH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|115244729|locus|VBIMycCan270121_2577|_extraction RNAdirected DNA polymerase [Mycobacterium canettii CIPT 140070010] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------------------------------------------------------------ML--------- ----IDE--SWQ----------------------GKRW-VV-ET--------GIANCF-SGIP----------------- -QEK--LMQA-I-EERV---------SDQG------------------VLRLL---------RAMLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----Q-D--------------- -------------------------------------------------------------------------------- ---------------G---------S--VRR-EA-S-----------GTP-----QGGPL------------------S- PLLYNVYL-----------------------------------------------------------HR----------- --------MD--RV------W-----------------DTEEHG------------------------------------ ----------------------V--------------------LVR----------YCDDLVV-------------MCR- -----SREQ----AE---------------------------AALQRLTV---------------------LL-G----- ----D--L----GLAP------K-A------SK----T---RIVHLV------------E------------------GG Q---------------GVDFLGFHNR------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Pe.th.I1/AP009389/2583061..2585155/Pelotomaculum_extraction thermopropionicum/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGETVEAFGQ- --NLG--QRL-----IQLHHELK-------T------GTY----EPQ--------------------------------- -----------------PVKRV-EI--PK------------PD--G-----------------STRPL------------ ------GIPT----VR-DRVVQQALLNI-LQ-PIFEPGF-------H--------------------------------- ---PSSYGYRP------------------GR----------------------------SCHQA---VAKA--------- ----ERF--MNKY---------------------GLEY-VV-DM--------DLSKCF-DRLD----------------- -HEL--ILEE-V-NRKI---------SDGS------------------VLKLI---------KKFLTA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----K-D--------------- -------------------------------------------------------------------------------- ---------------G---------Q--WDE-ID-T-----------GSP-----QGGVI------------------S- PLLANIYL-----------------------------------------------------------DR----------- --------FD--QA------M-----------------K----------------------------------------- -------------------------S-RGIR------------IVR----------YADDILV-------------FAR- -----TRKE----AG---------------------------NYRQVATQ---------------------ILEG----- ----E--L----KLEV------N-K------EK----T---HLTSVH--------------------------------- E---------------GVAYLGFII------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|38271879|locus|VBITheSp141296_0259|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Thermincola potens JR] -----------------------------------------------------------------------MTVEAFGQ- --NLQ--EEL-----RQLHHELK-------T------GIY----EPQ--------------------------------- -----------------PVLRV-EI--PK------------VD--G-----------------SKRPL------------ ------GIPT----VR-DRVVQQALLNI-LQ-PIFEPDF-------H--------------------------------- ---PSSYGYRP------------------GR----------------------------SCHQA---VAKA--------- ----EMF--INKY---------------------GLSH-VV-DM--------DLSKCF-DRLN----------------- -HDL--ILEG-V-NRKV---------SDGS------------------VLKLI---------KKFLTA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----K-D--------------- -------------------------------------------------------------------------------- ---------------G---------A--WEE-TD-L-----------GSP-----QGGVI------------------S- PLLTNIYL-----------------------------------------------------------DS----------- --------FD--QE------M-----------------K----------------------------------------- -------------------------E-RGIR------------MVR----------YADDILL-------------FAA- -----TYQD----AK---------------------------KYQRIATD---------------------FLEQ----- ----E--L----KLTV------N-R------EK----T---HLTDNR--------------------------------- K---------------GVAYLGFVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|22670018|locus|VBINatThe92436_0719|_extraction Mobile element protein [Natranaerobius thermophilus JW/NMWNLF] -------------------------------------------------------------------GIDQVTVEAYGS- --NLE--ENL-----ETLHHDLK-------I------GAY----KPQ--------------------------------- -----------------PVRRV-KI--PK------------PD--G-----------------STRPL------------ ------GIPT----VK-DRVVQQATLNI-LQ-PIFDPDF-------H--------------------------------- ---PSSYGYRP------------------NR----------------------------SCHKA---IAKS--------- ----EQF--INKY---------------------NLRH-VV-DM--------DLSKCF-DKLN----------------- -HEL--IIEE-V-AKKV---------SDGS------------------VLKLI---------KKFLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------A--IED-TE-I-----------GSP-----QGGVI------------------S- PLLTNIYL-----------------------------------------------------------DR----------- --------FD--KE------M-----------------K----------------------------------------- -------------------------S-RNIR------------IVR----------YADDILI-------------FAY- -----TPRQ----AK---------------------------RYKDIATE---------------------ILED----- ----E--L----KLTV------N-K------EK----T---HITNDR--------------------------------- K---------------GVPYLGVII------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Al.or.I2/CP000853/2108190..2110275/Alkaliphilus_extraction oremlandii/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGETVFNFHL- --NLE--LNI-----EFLHDKLK-------T------NGY----EPS--------------------------------- -----------------PVRRV-EI--QK------------PD--G-----------------GVRLL------------ ------GIPT----VK-DRVVQQAIVNI-IE-PIFDKTF-------H--------------------------------- ---PSSYGYRP------------------NH----------------------------SQHGA---VAKA--------- ----ERF--MNKY---------------------GLEH-VV-DM--------DLSKCF-DTLD----------------- -HEI--MMKA-V-SERI---------SDGR------------------VLKLI---------EKFLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----H-S--------------- -------------------------------------------------------------------------------- ---------------D---------N--FSR-TE-V-----------GSP-----QGGVI------------------S- PLLSNIYL-----------------------------------------------------------NQ----------- --------FD--QR------M-----------------M----------------------------------------- -------------------------S-KGIR------------IVR----------FADDILI-------------FAK- -----DKKT----AG---------------------------NYKAYATQ---------------------VLEN----- ----E--L----KLKV------N-N------EK----T---KLTNVN--------------------------------- E---------------GVEFLGFVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D.p.I1/CR522871/6124..8213/Desulfotalea_extraction psychrophila/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGQSVKDFAE- --SLD--VNL-----DRLLTELR-------E------KSY----QPQ--------------------------------- -----------------PVRRV-EI--PK------------EN--G-----------------GIRLL------------ ------GIPA----VR-DRVVQQALLDI-LQ-PIFDPDF-------H--------------------------------- ---PSSYGYRP------------------GR----------------------------SCHQA---ITKA--------- ----TMF--IRKY---------------------DRKW-VV-DM--------DLSKCF-DTLD----------------- -HDL--ILSS-L-SRRI---------KDGS------------------ILGLL---------KKILKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----T-D--------------- -------------------------------------------------------------------------------- ---------------E---------G--WQA-SE-V-----------GSP-----QGGVI------------------S- PLIANIYL-----------------------------------------------------------DQ----------- --------FD--QF------M-----------------K----------------------------------------- -------------------------K-RGHR------------IVR----------YADDILI-------------LCS- -----SKSA----AK---------------------------NALLQASC---------------------FLEK----- ----G--L----LLTV------N-R------EK----T---HICHSW--------------------------------- S---------------GVAFLGVSI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20854494|locus|VBIAliSal95923_2257|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Aliivibrio salmonicida LFI1238] -------------------------------------------------------------------GIDGQRIDDFTQ- --NLE--VEL-----RKLLLELQ-------E------KRY----QAR--------------------------------- -----------------PVKRV-EI--AK------------DD--G-----------------GIRLL------------ ------GIPT----VR-DRIVQQCLTNI-MT-PIFDPNF-------H--------------------------------- ---PSSYGYRV------------------GR----------------------------SCHQA---ISKA--------- ----TLF--IRKY---------------------NKQH-VV-DM--------DLSKCF-DMLD----------------- -HDL--IIKF-V-RKRI---------VDGS------------------ILGLI---------RQFLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-G--------------- -------------------------------------------------------------------------------- ---------------E---------N--WQN-SV-I-----------G------------------------------S- PLLANIYL-----------------------------------------------------------DE----------- --------FD--QE------M-----------------M----------------------------------------- -------------------------R-RKHR------------IVR----------YADDILI-------------FCT- -----SKKG----AE---------------------------NALKVASH---------------------ILEV----- ----T--L----KLKV------N-E------RK----T---HIAHSD--------------------------------- T---------------GIKFLGVEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20872676|locus|VBIAltMac49397_0697|_extraction Mobile element protein [Alteromonas macleodii str. 'Deep ecotype'] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------MRV---------------------------------------------------------- ----------------------------------------------Y--------------------------------- ---YSLYGH------------------------------------------------------L---LNKA--------- ----RLFKDLKRY---------------------SKR------------------------------------------- -------------KRGR---------IDGQ------------------SLSAF---------A----------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------SA-TE-T-----------GSP-----QGGVI------------------S- PLIANIYL-----------------------------------------------------------DA----------- --------FD--EE------M-----------------K----------------------------------------- -------------------------Q-RGHR------------IVR----------YADDILI-------------LCC- -----SRTA----AE---------------------------NAKAQATH---------------------ILEG----- ----K--L----KLSV------N-T------EK----T---HITHSD--------------------------------- D---------------GVKFLGVEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|91201518|emb|CAJ74578.1_extraction -------------------------------------------------------------------GADGVTIERYEG- --NLD--LNL-----RIMRKELT-------E------QTY----FPL--------------------------------- -----------------PLLRI-LV--DK------------GN--G-----------------EARAL------------ ------CIPS----VR-DRIVQAAVLQL-IE-PVLEKEF-------E--------------------------------- ---ECSFAYRK------------------GR----------------------------SVKQA---VYKV--------- ----REY--YEQ----------------------GYQW-VV-DA--------DIDAFF-DSVD----------------- -YSL--LLLK-F-KCYI---------HDPC------------------IQNLV---------GLWLKG------------ -------------------------------------------------------------------------------- ------------------EV------------------------------------W-----D-G--------------- -------------------------------------------------------------------------------- ---------------K---------T--VTT-LK-K-----------GIP-----QGSPI------------------S- PILANLYL-----------------------------------------------------------DE----------- --------FD--EE------L-----------------T----------------------------------------- -------------------------R-NGYK------------LVR----------FSDDFII-------------LCK- -----NSGM----AK---------------------------ESLKLTKK---------------------IL-E----- ----K--L----LLEL------D-E------E---------QVINFD------------Q-------------------- ----------------GFKFLGVIF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|54439799|locus|VBIDesAce170587_1406|_extraction RNAdirected DNA polymerase( EC:2.7.7.49 ) [Desulfobacca acetoxidans DSM 11109] -------------------------------------------------------------------GVDGVSLGGFKE- --DLA--VNL-----AILGEELR-------S------GEY----APL--------------------------------- -----------------PLLRF-LV--AK------------RD--G-----------------SPRPL------------ ------SVPT----VR-DRVAQAAVLNS-IE-PIFEAQF-------E--------------------------------- ---EVSFAYRK------------------GR----------------------------SVRQA---AYRI--------- ----KEL--RDQ----------------------GYRF-VV-DA--------DLDAFF-DNIN----------------- -HEL--LLAK-V-ANII---------TDPD------------------ILRLI---------GLWVQA------------ -------------------------------------------------------------------------------- ------------------EV------------------------------------Y-----D-G--------------- -------------------------------------------------------------------------------- ---------------E---------K--IYM-ME-K-----------GIP-----QGAVI------------------S- PVLANLFL-----------------------------------------------------------DE----------- --------LD--EG------L-----------------I----------------------------------------- -------------------------R-KGYA------------LVR----------YADDFVI-------------LAR- -----TRPE----AE---------------------------AAMAFTEE---------------------IL-E----- ----K--M----NLAL------D-M------ED----T---EITDFK------------R-------------------- ----------------GFTYLGLIF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|43053635|locus|VBIHalPra106773_0208|_extraction Mobile element protein [Halanaerobium praevalens DSM 2228] -------------------------------------------------------------------GVDRIDTVEFKE- --NYA--VHM-----RELYREFL-------E------DRY----QPK--------------------------------- -----------------PALRV-FI--PK------------SD--G-----------------RQRPL------------ ------GIPT----VK-DRIAQAAVRGI-LE-PIYEKEF-------C--------------------------------- ---DCSLGFRK------------------GK----------------------------SQIDA---INKI--------- ----EEY--KEQ----------------------GYKW-VL-DA--------DIKGFF-DNIN----------------- -HEL--LIEF-I-RQKV---------TDGW------------------VIEII---------KSWLTM------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----K-D--------------- -------------------------------------------------------------------------------- ---------------G---------E--YIP-KE-K-----------GTP-----QGGVI------------------S- PLLANIFL-----------------------------------------------------------HE----------- --------FD--KI------M-----------------V----------------------------------------- -------------------------E-RGYK------------LVR----------FADDFVV-------------MTK- -----SKRK----AK---------------------------RAYEVVKE---------------------IITE----- ----K--L----KLEL------H-P------EK----T---VITNFG------------E-------------------- ----------------GFVFLGFEF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|115343859|locus|VBIHalHal149681_1092|_extraction Retrontype reverse transcriptase [Halobacteroides halobius DSM 5150] -------------------------------------------------------------------GIDGVEVEEFRE- --NYT--KNM-----SALYRQLT-------E------DRY----EPQ--------------------------------- -----------------PVLRT-YI--SK------------GN--G-----------------EQRPL------------ ------GIPV----IK-DRIAQQAVKQI-LE-IHFEEIF-------C--------------------------------- ---DCSYGFRP------------------NR----------------------------STEDA---IKKV--------- ----EEY--KEQ----------------------GYNW-VL-DA--------DVKSYF-DTID----------------- -HEI--LMEL-I-AEEV---------SDGW------------------ILDII---------RSWLTI------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----T-E--------------- -------------------------------------------------------------------------------- ---------------Q---------G--REE-TT-E-----------GTP-----QGGVI------------------S- PLLANIYL-----------------------------------------------------------HH----------- --------FD--KK------M-----------------T----------------------------------------- -------------------------R-RGYK------------IVR----------FADDFII-------------MAK- -----SKAK----AE---------------------------RALEVTRQ---------------------IIEN----- ----E--L----NLRL------H-P------RK----T---VITNFD------------D-------------------- ----------------GFKFLEFRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|61456050|locus|VBISulAci142080_3112|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Sulfobacillus acidophilus DSM 10332] -------------------------------------------------------------------GVDHQSCDQFAE- --HLG--EEL-----DRLGQAMR-------E------HRY----QPL--------------------------------- -----------------PVRRI-WI--PK-----------PGT--R-----------------KQRPL------------ ------GVPA----IR-DRVAEEAMRRV-RE-PIGEPTF-------S--------------------------------- ---PDSYGFRP------------------GR----------------------------SAHDA---VHRI--------- ----FDH--LAH----------------------GYHW-VV-DA--------DIQDYF-GSID----------------- -QQL--LIDK-V-AERI---------SDGT------------------VLGWI---------RDMLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-A--------------- -------------------------------------------------------------------------------- ---------------G---------Q--WHA-TP-R-----------GTR-----QGSVI------------------S- PLLANIYL-----------------------------------------------------------DA----------- --------LD--QA------M-----------------A----------------------------------------- -------------------------KLPGVQ------------FIR----------YADDWCA-------------LAR- -----TKEE----AE---------------------------AALTTAQT---------------------VL-D----- ----E--L----RLTL------H-P------EK----T---RIVDVR------------E-------------------- T---------------AFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115432920|locus|VBICriEpi239080_0668|_extraction retrontype reverse transcriptase [Crinalium epipsammum PCC 9333] -------------------------------------------------------------------GVDEETTDDFNH- --NLN--SNL-----SQLRDAVA-------N------STY----QPL--------------------------------- -----------------PFKQV-FI--PK------------QK--G-----------------SWREL------------ ------KIPT----VR-DRIVQQALLNV-LA-PIMENKF-------S--------------------------------- ---PASFAYRP------------------HM----------------------------SYINA---VEQV--------- ----AHW--RDL----------------------GYHW-VM-DA--------DVSKYF-DSID----------------- -HQR--LLIV-V-RKYL---------DNPG------------------ILCLI---------KAWISA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----T-K--------------- -------------------------------------------------------------------------------- ---------------E---------G--IVR-ND-K-----------GIP-----QGAVI------------------S- PMLANIYL-----------------------------------------------------------DE----------- --------FD--KI------I-----------------S----------------------------------------- -------------------------A-SDLK------------LVR----------YADDFLV-------------LAT- -----TQER----IV---------------------------KAYSEVEQ---------------------IL-N----- ----S--F----KLTL------H-P------EK----T---QITNFE--------------------------------- R---------------GFRFLGHGF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115605079|locus|VBIRivSp77222_5259|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Rivularia sp. PCC 7116] -------------------------------------------------------------------GVDGETISSFAS- --NQT--VNV-----YQLMNSVA-------D------GSY----QPF--------------------------------- -----------------PCKQV-II--PK------------RN--G-----------------SQREL------------ ------KIPT----IR-DRIVQQALLNV-IS-PLMEEKF-------S--------------------------------- ---PVSFAYRP------------------NL----------------------------SYINA---VEKI--------- ----ADW--RDM----------------------GYVW-VL-DA--------DIVKFF-DNID----------------- -HHR--LLQQ-V-RLHI---------DHPG------------------ILCLI---------KAWISV------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------E-----T-R--------------- -------------------------------------------------------------------------------- ---------------E---------G--LIL-PQ-K-----------GIP-----QGAVI------------------S- PILANIYL-----------------------------------------------------------HE----------- --------FD--EI------I-----------------S----------------------------------------- -------------------------A-SDLE------------IVR----------YADDFLV-------------LST- -----SQER----IA---------------------------IAKSQVID---------------------LL-D----- ----S--L----GLEI------N-T------DK----T---QITSFE--------------------------------- R---------------GFRFLGHGF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|21560796|locus|VBICyaSp136448_5986|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Cyanothece sp. PCC 7424] -------------------------------------------------------------------GIDGETIEHFAL- --NLD--FNL-----TFLLNSVT-------N------SNY----IPQ--------------------------------- -----------------PLKQV-LI--PK------------SQ--E-----------------KWREL------------ ------RIPT----VR-DRIVQQALLNV-LY-PVMEERF-------S--------------------------------- ---DASFAYRP------------------NR----------------------------SYLDA---VKRA--------- ----AYW--RDL----------------------GYQW-VL-DA--------DIVEYF-DNIS----------------- -HSL--LLKE-V-RKTV---------DNSG------------------ILCLI---------KAWISA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------S-----T-D--------------- -------------------------------------------------------------------------------- ---------------K---------G--IIF-PE-K-----------GVP-----QGAVI------------------S- PMLANIYL-----------------------------------------------------------DE----------- --------FD--HR------I-----------------T----------------------------------------- -------------------------Q-SDLK------------LVR----------YADDFLV-------------LSD- -----TEDG----IM---------------------------RAYSQVVQ---------------------LL-H----- ----F--W----GLKL------H-E------EK----T---QITHFK--------------------------------- K---------------GFQFLGHGF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|68548733|ref|ZP_00588202.1_extraction -------------------------------------------------------------------GYDKQSITDYSW- --RIE--EHL-----ADLGRQLL-------T------NTY----EPQ--------------------------------- -----------------PLLKL-VM--LK------------PT--G-----------------KLRTL------------ ------LIPT----VM-ERVAQTAAAIV-LT-PLVESEL-------G--------------------------------- ---ANTFAYRP------------------GL----------------------------SRMTA---AREI--------- ----ERL--RNL----------------------GYNW-VV-DA--------DISSFF-DTVD----------------- -HPL--LFQR-F-RELC---------DDEE------------------LLTLI---------ARWLTA------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------V-----D-G--------------- -------------------------------------------------------------------------------- ---------------QN--------P--KVK-NT-I-----------GLP-----QGCPI------------------S- PMLANLYL-----------------------------------------------------------DK----------- --------FD--ER------M-----------------E----------------------------------------- -------------------------Q-EGFK------------LVR----------FADDFLI-------------LCK- -----SKPK----AE---------------------------AALQLSES---------------------AL-A----- ----E--L----KLQL------N-N------EK----T---RITTFA--------------------------------- E---------------GFKYLGYLF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|119357846|ref|YP_912490.1_extraction -------------------------------------------------------------------GWDNTSIQDYSL- --RLE--ENL-----KSLSHALL-------T------GTY----RQS--------------------------------- -----------------PLLKL-VM--LK------------PD--G-----------------KERVL------------ ------LIPG----VI-DRVAQTAASIV-LS-PIIEAEL-------G--------------------------------- ---NCTFAYRP------------------GI----------------------------SREGA---AREI--------- ----DRL--HRE----------------------GYQW-VL-DA--------DIRNFF-DNVR----------------- -HDL--LFQR-L-VELV---------DDKE------------------MISLL---------HRWLTA------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------V-----D-G--------------- -------------------------------------------------------------------------------- ---------------LN--------P--RTR-NT-M-----------GLP-----QGCPI------------------S- PALANLYL-----------------------------------------------------------DR----------- --------FD--ET------M-----------------E----------------------------------------- -------------------------Q-QGFK------------LVR----------FADDYLV-------------LCK- -----TRPK----AE---------------------------AALKLSES---------------------AL-A----- ----E--L----KLEL------H-S------DK----T---RITTFA--------------------------------- E---------------GFKYLGYLF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|37677204|ref|NP_937600.1_extraction -------------------------------------------------------------------GVDGVTIQTFAI- --HLD--TNL-----NTLLSAWN-------H------GNY----APS--------------------------------- -----------------PYRPL-TI--Q-------------PN--E-------------K---KTRQL------------ ------AIPT----VA-DRIIHTAIAQK-LV-AKFEPEF-------E--------------------------------- ---HISYGYRP------------------NR----------------------------SYTHA---IRHI--------- ----EQL--RNQ----------------------GYLY-VL-DA--------DIKGYF-DHIC----------------- -HKR--LKQI-L-QKYL---------EDNW------------------VESIM---------TLLLSQ------------ -------------------------------------------------------------------------------- ------------------QM------------------------------------P-----A----------------- -------------------------------------------------------------------------------- -------------------------Q--TLL-LG-R-----------GIP-----QGSPL------------------S- PLLANLYL-----------------------------------------------------------DG----------- --------FD--EA------L-----------------L----------------------------------------- -------------------------D-RGEQ------------IVR----------YADDFVV-------------LVT- -----HEQQ----AQ---------------------------HCLAFVTQ---------------------YL-A----- ----S--L----KLQL------N-T------EK----T---RVVSFQ--------------------------------- D---------------GFTFLGVSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115659900|locus|VBIChaMin231992_1871|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Chamaesiphon minutus PCC 6605] -------------------------------------------------------------------GIDGITKKTLAAE S-AKT--EFV-----NALRTELQ-------T------KQF----RPM--------------------------------- -----------------PVRRV-YI--PK------------SN--G-----------------KQRPL------------ ------GIPT----LK-DRTVQMLLKMV-LE-PIYESDF-------L--------------------------------- ---NCSNGFRP------------------QR----------------------------RTQDC---IARL--------- ----DSYINRRN----------------------KYYW-VI-EG--------DIKAAF-DSIH----------------- -HQI--LLKI-M-AKRI---------ADNR------------------LLKLV---------ESFLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----E-G--------------- -------------------------------------------------------------------------------- ---------------H---------L--FKH-TD-I-----------GTP-----QGGIC------------------S- PLLANIYL-----------------------------------------------------------HQ----------- --------LD--LY------W-----------------WQKYGN------------------------------------ ----------------------LDRKEKERRRTR---HQGNCALIR----------YADDWLL-------------LTN- ----GSKAE----AM---------------------------RLKEEFSI---------------------FLKE----- ----E--L----QLEL------S-L------EK----T---HITHVN--------------------------------- D---------------GIDFLGFHI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Co.ca.I1/FP929038.1/3172164..3174036/Coprococcus_extraction catus/Bacterial E/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDEITKKEYER- --NLE--QNI-----DDLVERLK-------R------KSY----KPQ--------------------------------- -----------------PSIRV-YI--PK------------SN--G-----------------KLRPL------------ ------GIAC----YE-DKIVQLALKKI-LE-AIYEPRF-------L--------------------------------- ---NCMYGFRP------------------NR----------------------------GCHNA---IKEL--------- ----YKR--LNNT---------------------KICY-IV-DA--------DIKGFF-DHMK----------------- -HEW--IIKF-L-KLYI---------KDPN------------------IIGLV---------KKYLKV------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------E--LMV-NE-E-----------GSA-----QGNII------------------S- PILANIYM-----------------------------------------------------------HN----------- --------VL--TL------W-----------------Y----------------------------------------- -------------------------K-FIITK-E---CKGDNFLIA----------YADDFVA-------------GFQ- -----CKWE----AE---------------------------NYYKLLKE---------------------RM-E----- ----K--F----GLQL------E-D------SK----S---RLLQSG------------AYI-ARAKQ----KSGECIRL Q---------------TFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Fa.pr.I2/FP929046.1/829768..831634/Faecalibacterium_extraction prausnitzii/Bacterial E/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDKVTKDEYGK- --NLD--RNI-----KDLVQRLK-------N------KFF----KPL--------------------------------- -----------------PSLRV-YI--PK------------AN--G-----------------KKRPL------------ ------GIAS----YE-DKIVQMAVKKI-LG-AIYEPRF-------L--------------------------------- ---NCMYGFRP------------------NR----------------------------GCHEA---IKEV--------- ----YQR--ISYG---------------------KISY-IV-DA--------DIKGFF-DHID----------------- -HEW--MMKF-L-EWNI---------QDKN------------------LLWLI---------RKYLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----E-Q--------------- -------------------------------------------------------------------------------- ---------------G---------K--FEP-TE-E-----------GSA-----QGSVM------------------S- PMLANIYM-----------------------------------------------------------HH----------- --------VL--TL------W-----------------F----------------------------------------- -------------------------K-LVVKK-E---MQGECFLVN----------FADDFVA-------------GFQ- -----YKSE----AE---------------------------RYYKELKE---------------------RM-E----- ----K--F----GLEL------E-S------SK----S---RLIEFG------------RFA-EQNRR----ARGEH-KP E---------------TFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|58517021|locus|VBICloBot180836_2089|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium botulinum H04402 065] -------------------------------------------------------------------GIDRVTKVEYGA- --NLE--ENI-----SGLVIRLK-------N------KSY----KPL--------------------------------- -----------------PVLRV-FI--SK------------GN--G-----------------KMRPL------------ ------GIAA----YE-DKFVQLAIKKI-LE-AIYEPRF-------L--------------------------------- ---ENMYGFRP------------------RR----------------------------GCHNA---IKAA--------- ----YDR--IYEN---------------------KINY-IV-DA--------DIKGFF-DNMS----------------- -HEW--IMKF-L-GVYI---------SDPN------------------FLWLI---------NKYLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----T-D--------------- -------------------------------------------------------------------------------- ---------------G---------T--LID-SI-S-----------GSA-----QGSII------------------S- PVIANVYM-----------------------------------------------------------HN----------- --------VL--ML------W-----------------Y----------------------------------------- -------------------------K-FIVLN-G---IKGKSFLVT----------YADDFIA-------------GFQ- -----YKWE----AE---------------------------KYYIELKR---------------------RM-A----- ----K--F----NLEL------E-D------SK----S---RLLEFG------------RFA-EGNRK----ARGEG-KP E---------------TFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Sy.wo.I1/NZ_AAJG01000003/20007..22007/Syntrophomonas_extraction wolfei/Bacterial E2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDQVTKQAYEE- --NLE--ANI-----ADLIGRMK-------R------QAY----KPQ--------------------------------- -----------------PVRRV-YI--PK-----------EGS--N-----------------KRRPL------------ ------GIPS----YE-DKLVQKGLARI-LN-TIYEQDF-------L--------------------------------- ---DCSFGFRP------------------GR----------------------------GCHDA---LKVL--------- ----NHI--IERK---------------------KVNY-IV-DA--------DIRGFF-DHVD----------------- -HEW--MMKF-L-ELRI---------ADPN------------------LLRLI---------KRFLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-A--------------- -------------------------------------------------------------------------------- ---------------G---------I--VYD-TP-K-----------GTP-----QGGIV------------------S- PILANIYL-----------------------------------------------------------HY----------- --------VL--DL------W-----------------F----------------------------------------- -------------------------E-KVVKK-R---CQGEAYLVR----------YADDFVC-------------CFQ- -----NKSD----AE---------------------------WFYANLRE---------------------RL-N----- ----K--F----NLEV------A-E------EK----T---RIIAFG------------RFA-DKESK----KQGRK-KP D---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Mo.th.I1/CP000232.1/2324936..2328581/Moorella_extraction thermoacetica/Bacterial E/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGITKEQYGD- --NLE--ANI-----QSLLERLK-------R------KAY----RPQ--------------------------------- -----------------PVRRV-YI--PK-----------PGS--D-----------------KKRPL------------ ------GIPA----YE-DKIVQLAASKI-LN-AIYEAEF-------L--------------------------------- ---DMSFGFRP------------------QR----------------------------GCHDA---LKLL--------- ----NYL--IVAR---------------------KVNY-IV-DA--------DIKGFF-DHVN----------------- -HDW--LMKF-L-GHRI---------ADPN------------------FLRFI---------RRFLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----E-N--------------- -------------------------------------------------------------------------------- ---------------G---------E--LRD-AT-E-----------GTP-----QGGIV------------------S- PILANIYL-----------------------------------------------------------HY----------- --------VL--DL------W-----------------F----------------------------------------- -------------------------E-KAVRK-H---CRGEAYMVR----------YADDFIC-------------CFQ- -----YKHE----AE---------------------------AFYRALKA---------------------RL-A----- ----K--F----SLSV------A-E------EK----T---KIIPFG------------RFA-TQWCK----RMGQN-KP D---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|21720033|locus|VBIDesAce42372_1641|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfotomaculum acetoxidans DSM 771] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------------------------------M------------------S- PILANIYL-----------------------------------------------------------HY----------- --------TL--DL------W-----------------F----------------------------------------- -------------------------E-RVVRK-Y---CRREAYIVR----------YCDDFVC-------------CFQ- -----YKID----AE---------------------------RFYLALIQ---------------------RL-K----- ----K--F----NLEI------A-E------ER----T---KIIEFG------------RFA-CTQCK----KYGKT-KP E---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Cl.be.I2_extraction A new protein sequence entered manually -------------------------------------------------------------------GVDKVTKEEYET- --NLE--NNI-----DNLLIRMK-------T------FKY----RPQ--------------------------------- -----------------PVRRV-YI--DK-----------SGS--N-----------------KKRPL------------ ------GIPA----YE-DKVVQLAINKI-LK-SIYEQDF-------I--------------------------------- ---DSSFGFRQ------------------NR----------------------------SCHDA---LKIL--------- ----NVY--LSEK---------------------NVNY-VV-DA--------DIKGFF-DNVD----------------- -HKW--LMKF-L-EHRI---------ADKN------------------LLRYI---------GRFLKT------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----E-N--------------- -------------------------------------------------------------------------------- ---------------G---------K--FYK-VY-E-----------GTP-----QGGII------------------S- PTLANIYL-----------------------------------------------------------HY----------- --------VL--DI------W-----------------F----------------------------------------- -------------------------N-NFIKK-K---CKGQAYIVR----------YADDFVC-------------CFQ- -----YEDE----AK---------------------------AFYEALKN---------------------RL-D----- ----K--F----NLQV------A-E------DK----T---KILYFG------------KNA-YYDRKFKRAKLESY-KD R---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|115615774|locus|VBIDehSp228777_1721|_extraction retrontype reverse transcriptase [Dehalobacter sp. CF] -------------------------------------------------------------------GIDGETKASYGG- --NLE--ENL-----RNLLEQLK-------E------GSY----RPT--------------------------------- -----------------PVRRK-FI--PK-----------AGS--N-----------------KLRPL------------ ------GIPV----LE-DKLVQNALVII-LE-SIYEQDF-------L--------------------------------- ---EDSYGFRP------------------GR----------------------------SQHDA---LKDL--------- ----SRK--IGTR---------------------KVGY-IV-DA--------DIRGYF-DHVD----------------- -HEW--LLKM-L-QERI---------SDSK------------------ILKLI---------KRFLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-E--------------- -------------------------------------------------------------------------------- ---------------G---------K--LSK-TE-E-----------GVP-----QGGSL------------------S- PLLGNIYL-----------------------------------------------------------HY----------- --------VL--DL------W-----------------F----------------------------------------- -------------------------N-KIITK-Q---CQGEAYLTR----------FADDTVA-------------CFQ- -----YQKD----AE---------------------------RFYEALKK---------------------RL-K----- ----K--F----NLEI------A-E------EK----T---RIIEFG------------RYA-QRDVQ----RRGGR-KP E---------------TFDFLGITH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Cl.be.I3/CP000721/3718265..3720149/Clostridium_extraction beijerinckii/Bacterial E2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDDVTKQEYSK- --ELD--NNI-----ENLIVKLR-------N------HSY----KPQ--------------------------------- -----------------AVKRV-YI--PK------------GD--G-----------------KTRPL------------ ------GIPS----YE-DKLVQMALNKI-LQ-SIYEAEF-------K--------------------------------- ---DFSYGFRP------------------KR----------------------------NCHSA---IKAL--------- ----NKV--IENG---------------------RINY-VV-DA--------DIKGFF-NNVN----------------- -HEW--MIKF-L-EVRI---------GDPN------------------IISLV---------KKFLKA------------ -------------------------------------------------------------------------------- ------------------GL------------------------------------M-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------I--IKT-TE-I-----------GTP-----QGSIV------------------S- PTLANIYL-----------------------------------------------------------HY----------- --------SL--DL------W-----------------F----------------------------------------- -------------------------E-KVIKR-N---FRGQSEITR----------YADDFVC-------------CFQ- -----YESE----AR---------------------------QFCRLLVS---------------------RL-N----- ----K--F----NLEV------E-R------TK----S---KLILFG------------RFA-EEIRK----SRGFK-NA E---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >UA.I7/FP565147.1/1619711..1621718/uncultured_archaeon_extraction /Bacterial E2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGVTVGEYAK- --ALD--ENI-----ADLVARLK-------A------KQY----KPQ--------------------------------- -----------------PVLRV-YI--PK------------PN--G-----------------EKRPL------------ ------GIPA----VE-DKIVQMALKKI-LE-AIFEQDF-------I--------------------------------- ---DTSYGFRP------------------NR----------------------------SCHDA---LTEL--------- ----DRI--IMNV---------------------PVNF-VV-DM--------DISKFF-DTVD----------------- -HKR--LMEC-L-RQRI---------VDPT------------------LLQLI---------GRFLKS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----E-E--------------- -------------------------------------------------------------------------------- ---------------G---------K--YSE-MD-Q-----------GTP-----QGGVL------------------S- PVLANVYL-----------------------------------------------------------HY----------- --------VL--DK------W-----------------F----------------------------------------- -------------------------E-NEVLP-Q---LTGFAQLIR----------YADDFVV-------------CFE- -----KETE----AR---------------------------AFGVALRR---------------------RM-G----- ----K--F----GLTI------S-E------EK----S---KIIEFG------------RCT-CTRAK----RYGR--KC E---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >UA.I6/FP565147.1/2174432..2176370/uncultured_archaeon_extraction /Bacterial E2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVTWRKYEE- --NLD--ENT-----EDLVTRLI-------A------KQY----RPQ--------------------------------- -----------------PVKRA-YI--PK------------SN--G-----------------ERRPL------------ ------GIPA----LE-DKIVQLAIKKI-LE-AIFEEDF-------C--------------------------------- ---DVSYGFRP------------------NR----------------------------SCHDA---LDMV--------- ----DMI--IMTK---------------------PVSY-VV-DM--------DIAKFF-DTVD----------------- -HEC--LMEC-L-KQRV---------VDPS------------------LLRII---------ARCLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-E--------------- -------------------------------------------------------------------------------- ---------------G---------K--YLE-TD-K-----------GTP-----QGGIL------------------S- PILANIYL-----------------------------------------------------------HY----------- --------AL--DL------W-----------------F----------------------------------------- -------------------------E-KEVKE-Q---LKGFAQLIR----------YADDFIV-------------CFQ- -----HDDE----AR---------------------------AFGKTLRE---------------------RL-A----- ----K--F----GLTI------S-E------EK----S---RIIKFG------------RYA-CQQAR----KQSK--KC A---------------TFDFLGFTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|115349385|locus|VBIThiMob160332_0442|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Thioflavicoccus mobilis 8321] -------------------------------------------------------------------GIDGISKEQYGA- --NLD--ENI-----KELSSRLR-------N------MGY----RPQ--------------------------------- -----------------PKRRT-YI--PK-----------PGS--V-----------------KGRPL------------ ------AISC----FE-DKLVELAIKRV-LE-PIYEVQF-------E--------------------------------- ---DSSYGYRP------------------GR----------------------------SQHQC---LDDL--------- ----GRT--IQQS---------------------RINT-IV-EA--------DIRSFF-NTVD----------------- -HAW--MLKF-L-GHRI---------GDPR------------------IIRLI---------GCLLKG------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------L--VQA-SE-E-----------GTP-----QGSIL------------------S- PLLSNIYL-----------------------------------------------------------HY----------- --------VL--DL------W-----------------F----------------------------------------- -------------------------S-RRVRP-Q---CRGEAYYFR----------FADDFVA-------------GFQ- -----YRQE----AE---------------------------QFQTALGE---------------------RL-G----- ----Q--F----KLRL------A-E------EK----T---RCLAFG------------RFA-RSNAQ----KQGQ--KP G---------------EFTFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ta.sp.I2/CP000923.1/1286631..1288551/Thermoanaerobacter_extraction sp./Bacterial E/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDKVTWEEYDV- --NVD--ENV-----ETLIAKMK-------R------FSY----RPQ--------------------------------- -----------------PARRV-YI--PK------------AN--G-----------------KLRPL------------ ------GIPC----YE-DKLVAAVMADI-LN-EVYENIF-------L--------------------------------- ---DTSYGFRP------------------GR----------------------------SCHDA---IKEL--------- ----NRI--IGRC---------------------KISY-VL-EA--------DIKGFF-DNVD----------------- -QKQ--LMEF-I-AHDI---------DDKN------------------FSRYI---------VRFLKS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----E-E--------------- -------------------------------------------------------------------------------- ---------------G---------K--YHE-SD-K-----------GTA-----QGSPL------------------S- PILANIYL-----------------------------------------------------------HY----------- --------TL--DV------W-----------------F----------------------------------------- -------------------------A-YLKRNGK---FRGEAYIVR----------YADDFVM-------------LFQ- -----YKSD----AD---------------------------KMYEALPK---------------------RM-A----- ----K--F----GLEL------A-M------DK----T---KILPFG------------RFA-KQNSKDG--------KT E---------------TFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >W.e.I2_extraction A new protein sequence entered manually -------------------------------------------------------------------GIDRVTVEAYGE- --NLE--EKL-----KTLVDSMK-------R------KQY----HRY--------------------------------- -----------------QVKRV-YI--PK-----------AGS--K-----------------EKRGL------------ ------GIPS----TE-DKLVQVMLKKI-LE-NIYEANF-------M--------------------------------- ---DSSYGFRP------------------GR----------------------------NCHQA---INAL--------- ----DKA--VMHK---------------------PINY-IV-EV--------DIKKFF-DNVQ----------------- -HKW--LMNC-L-RERI---------ADPN------------------LLWLI---------KRFLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------V-----E-V--------------- -------------------------------------------------------------------------------- ---------------G---------C--YKA-TD-Q-----------GTP-----QGGIV------------------S- PVLANIYL-----------------------------------------------------------HY----------- --------VL--DL------W-----------------F----------------------------------------- -------------------------E-KKFKP-K---ARGYLQLIR----------FCDDFVV-------------GCE- -----REED----AK---------------------------EFLELLKQ---------------------RL-S----- ----K--F----GLEI------A-E------NK----T---KIVKFG------------KKE-WYQAE----REKR--RT A---------------SFNFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|24026234|locus|VBIWolEnd95846_0368|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Wolbachia endosymbiont of Culex quinquefasciatus Pel] -------------------------------------------------------------------GIDWVTVEAYGE- --NLK--ERL-----EGLVDSMK-------G------KQY----QPQ--------------------------------- -----------------PVRRV-YI--PK-----------AGS--K-----------------EKRGL------------ ------GIPS----TE-DKLVQIMLKKI-LE-NIYEANF-------L--------------------------------- ---DSSYGFRP------------------GR----------------------------NCHQT---VNAL--------- ----DKA--VMYK---------------------PINY-IV-EV--------DIKKFY-DNIQ----------------- -HKW--LMRC-L-RERI---------TDPN------------------LLWLI---------KRFLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------V-----E-A--------------- -------------------------------------------------------------------------------- ---------------G---------Y--YEA-TK-Q-----------GTP-----QGGIV------------------S- PVLANIYL-----------------------------------------------------------HY----------- --------VL--DL------W-----------------L----------------------------------------- -------------------------E-KKFKP-R---SRGYIQLIR----------FCDDFVV-------------CCE- -----SKVD----AE---------------------------EFLELLKQ---------------------RL-N----- ----K--F----GLEV------S-E------NK----T---RVVKFG------------KRE-WQQ-------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ch.ph.I2/CP000492/3012641..3014550/Chlorobium_extraction phaeobacteroides/Bacterial E1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGITWKDYGE- --GLE--ENL-----ADLHRRIH-------T------GAY----RAQ--------------------------------- -----------------PSRRK-YI--PK------------AN--G-----------------QQRPL------------ ------GIAA----LE-DKIVQRAVVAI-LT-PIYEAEF-------L--------------------------------- ---GFSYGFRP------------------GR----------------------------SQHDA---LDAL--------- ----AYG--IKVK---------------------KIGW-VL-DA--------DISRFF-DTIS----------------- -HEW--MIRF-L-EHRI---------GDKR------------------IVRLI---------IKWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----E-D--------------- -------------------------------------------------------------------------------- ---------------S---------V--RIE-AE-E-----------GTP-----QGAVI------------------S- PLLANIYL-----------------------------------------------------------HY----------- --------AY--DL------W-----------------A----------------------------------------- -------------------------K-QWREK-H---CKGDMIVVR----------FADDSVA-------------GFQ- -----NKED----GE---------------------------RFLADLKE---------------------RL-A----- ----K--F----ALTL------H-P------EK----T---RLIEFG------------RYA-AKNRQ----RRGQG-RP E---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >S.ma.I1/BX664015/172056..173964/Serratia_extraction marcescens/Bacterial E1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGIRWMDYAG- --NMK--NNI-----TDLHRRLH-------Q------GSY----RAQ--------------------------------- -----------------PGRRH-YI--PK------------AD--G-----------------KQRPL------------ ------GIAS----LE-DKIVQYALVKI-LN-AVYENDF-------M--------------------------------- ---GFSYGFRP------------------GR----------------------------SQHDA---LDAL--------- ----ATG--LVRT---------------------NVNW-VL-DA--------DISQFF-DRVS----------------- -HEW--LIRF-T-EHRI---------GDRR------------------VIRLI---------RKWLTA------------ -------------------------------------------------------------------------------- ------------------GT------------------------------------S-----E-E--------------- -------------------------------------------------------------------------------- ---------------G---------Q--WRA-TE-E-----------GTP-----QGAVI------------------S- PLLANIYL-----------------------------------------------------------HY----------- --------VF--DL------W-----------------A----------------------------------------- -------------------------H-QWRRR-Y---ATGNVVMVR----------YADDIVI-------------GFD- -----KRYD----AR---------------------------RFRIAMQR---------------------RL-R----- ----E--F----GLTV------H-P------EK----T---RLMEFG------------RFA-AENRA----IRGKG-KP E---------------TFNFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|19071807|locus|VBIBurCen118154_0098|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Burkholderia cenocepacia J2315] -------------------------------------------------------------------GVDGVTWHDYEQ- --DLD--RNL-----EDLHGRLR-------R------QAY----RAL--------------------------------- -----------------PSRRR-YI--PK------------AD--G-----------------KQRPL------------ ------GIAA----LE-DKIVQRALVAV-LN-AVYEMDF-------L--------------------------------- ---GFSYGFRP------------------QR----------------------------SQHDA---LDAL--------- ----ATG--IART---------------------SVSW-IL-DA--------DISRFF-DTVD----------------- -HDW--LIRF-V-EHRI---------GDQR------------------VIRLI---------RKWLKA------------ -------------------------------------------------------------------------------- ------------------GA------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------V--IEP-TD-E-----------GTP-----QGSVI------------------S- PLLANIYL-----------------------------------------------------------HY----------- --------VF--DL------W-----------------A----------------------------------------- -------------------------N-QWRKR-H---AEGNVVIVR----------YADDVVV-------------GFD- -----KPHD----AK---------------------------RFRRAMQQ---------------------RL-E----- ----Q--F----GLSV------H-P------EK----T---RLIEFG------------RFA-ARNRA----SRGLG-KP E---------------TFNFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Fr.sp.I4/CP000820/1651830..1653736/Frankia_extraction sp./Bacterial E1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVTWTDYGQ- --DLE--ANL-----QDLHVRVQ-------S------GCY----RAT--------------------------------- -----------------PSRRA-YI--PK------------AD--G-----------------RLRPL------------ ------GIAS----LE-DKIVQRAVVEV-LG-AVYEVDF-------R--------------------------------- ---GFSYGFRP------------------GR----------------------------GPHDA---LDAL--------- ----AVG--IWRK---------------------RVNW-VL-DA--------DIRDFF-GQID----------------- -HSW--LRRF-L-EHRI---------ADKR------------------VLRLI---------DKWLAA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------V-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------E--WTA-CE-E-----------GSP-----QGASV------------------S- PLLANVYL-----------------------------------------------------------HY----------- --------VL--DL------W-----------------V----------------------------------------- -------------------------D-WWRRR-H---ARGDVIVVR----------WADDFIV-------------GFE- -----YEED----AR---------------------------RFLDELRE---------------------RF-A----- ----K--F----GLEL------H-P------DK----T---RLIEFG------------RYA-ARDRK----RRGLG-KP E---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|42682369|locus|VBIStiAur43712203747_1515|_extraction Mobile element protein [Stigmatella aurantiaca DW4/31 (Prj:54333)] -------------------------------------------------------------------GVDGVTWEQYAG- --NLE--ANV-----RDLHTRLH-------R------GAY----RAR--------------------------------- -----------------PSRRA-YI--PK------------AD--G-----------------RQRPL------------ ------GIAA----LE-DKLVQRAVVEV-LN-AVYETDF-------L--------------------------------- ---GFSYGFRP------------------GR----------------------------SQHQA---LDAL--------- ----SAG--IYLK---------------------KVNW-VL-DA--------DIRGFF-DAID----------------- -HGW--MQKF-L-EHRI---------EDTR------------------LLRLV---------QKWLAA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------K--WTQ-SK-E-----------GTP-----QGATV------------------S- PLLANLYL-----------------------------------------------------------HY----------- --------VF--DL------W-----------------S----------------------------------------- -------------------------Q-RWRKR-V---ARGEVIIVR----------YADDFVV-------------GFQ- -----HRSD----AE---------------------------RFWRELRE---------------------RL-R----- ----S--F----ALEL------H-P------EK----T---RLIEFG------------LYV-AERRR----ERDQG-RP E---------------TFNFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >A.v.I5/CP001157/2471407..2473316/Azotobacter_extraction vinelandii/Bacterial E1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGMSWREYEE- --DLH--QRV-----GKLHARLH-------R------GAY----RAT--------------------------------- -----------------PSRRV-YI--PK------------AD--G-----------------RQRPL------------ ------GIAS----LE-DKIVQQAVVTV-LN-AIYEEDF-------Q--------------------------------- ---GFSYGFRP------------------GR----------------------------SQHDA---LDAL--------- ----TVA--LKSQ---------------------KVNW-IL-DA--------DITSFF-DEID----------------- -HEW--MLMF-L-GHRI---------ADRR------------------MLGLI---------CKWLQA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------R--RLA-AT-K-----------GTP-----QGAVI------------------S- PLLANIYL-----------------------------------------------------------HY----------- --------VL--DL------W-----------------A----------------------------------------- -------------------------R-QWRQR-H---ARGEMIVVR----------YADDSVV-------------GFR- -----TQWQ----AQ---------------------------RFLVQLQE---------------------RM-A----- ----R--F----GLSL------N-A------SK----T---RLIEFG------------RFA-VQNRR----RQGLG-KP E---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.j.I2/BA000040/2069342..2071253/Bradyrhizobium_extraction japonicum/Bacterial E1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGMTWQDYEE- --DLE--PRL-----ADLHKRVQ-------R------GTY----RPQ--------------------------------- -----------------PSRRT-YI--PK------------AD--G-----------------KQRPL------------ ------AIAA----LE-DKIVQGATVIV-LN-AIYEGDF-------C--------------------------------- ---GFSYGFRP------------------GR----------------------------GPHDA---LDAL--------- ----CTA--IETR---------------------QVNW-II-DA--------DIQNFF-GAVS----------------- -QPW--LVRF-L-EHRI---------GDKR------------------IIRLI---------QKWLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------V--VTA-DD-R-----------GTG-----QGPVI------------------S- PLLGNIYL-----------------------------------------------------------HY----------- --------AL--DL------W-----------------A----------------------------------------- -------------------------K-RWRQR-E---VSGGMIIVR----------YADDVVV-------------GFE- -----REDD----AR---------------------------RFLDAMRA---------------------RL-E----- ----E--F----ELTL------H-P------AK----T---RLIEFG------------RHA-AAQRK----QRGLG-KP E---------------TFAFMGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >UB.I1/AY691909/2430..4342/uncultured_bacterium_extraction /Bacterial E1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDEMTWRKYKE- --GSP--GRI-----ADLNERVH-------T------GSY----RAK--------------------------------- -----------------PVRRS-YI--NK------------SD--G-----------------RKRPL------------ ------GVTA----LE-DKIVQQAVSTI-LN-QIYETDF-------M--------------------------------- ---GFSYGFRE------------------KR----------------------------SQHNA---LDAL--------- ----YIG--ISRR---------------------KINY-IL-DA--------DISGFF-DKIN----------------- -HDW--LLKF-L-EHRV---------ADRK------------------ILRLI---------KKWLKV------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------I-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------K--RTS-LE-V-----------GTP-----QGSVI------------------S- PVLANVYL-----------------------------------------------------------HY----------- --------AQ--DL------W-----------------A----------------------------------------- -------------------------H-QWRKR-H---ADGDVIIVR----------YADDSVV-------------GFQ- -----YRKD----AD---------------------------RFLKDLIE---------------------RM-G----- ----Q--F----GLEL------H-P------VK----T---RLIEFG------------RFA-VVNRR----KRGER-KP E---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|21698146|locus|VBIDesRet71890_0666|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfohalobium retbaense DSM 5692] -------------------------------------------------------------------GVDGLTCAEYED- --GLR--EGL-----KELHARVH-------R------GSY----RAQ--------------------------------- -----------------PSKRI-HI--PK------------PD--G-----------------HKRPI------------ ------GIAA----LE-DKIVQHAVGKV-LS-AIYEEDF-------L--------------------------------- ---GFSYGFRP------------------RR----------------------------GAHDA---LDAL--------- ----NVG--LTHR---------------------KVSW-VL-DA--------DIQGFF-DTIS----------------- -HEW--MIRF-L-EHRI---------ADPR------------------ILRLV---------RKWLRV------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------S-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------V--WSQ-TS-M-----------GTP-----QGAVI------------------S- PILGNIYL-----------------------------------------------------------HY----------- --------VL--DQ------W-----------------V----------------------------------------- -------------------------H-H-RRR-H---ARGDIIIVR----------YADDYVL-------------GFQ- -----YRHE----AE---------------------------RFLTDLKA---------------------RL-D----- ----R--F----GLSL------H-P------EK----T---RLIEFG------------RFA-TESRR----KRGQG-KP E---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|45180964|locus|VBIBurRhi170666_0331|_extraction Mobile element protein [Burkholderia rhizoxinica HKI 454] -------------------------------------------------------------------GVDGVTWQSYEV- --GLG--SNL-----RDLHRRVH-------T------GSY----RAL--------------------------------- -----------------PVLRR-YI--PK------------AD--A-----------------GLRPL------------ ------GVAA----LE-DKLVQSVMVEV-LN-AIYEEDF-------L--------------------------------- ---GFSYGFRP------------------GR----------------------------NQHDA---LDAL--------- ----AAA--IQWR---------------------PVNW-IL-DA--------DIRSFF-DTVN----------------- -RQW--LIRF-V-KHRV---------ADPR------------------VIRLI---------GKWLDA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------R--LMS-VQ-A-----------GTP-----QGSVI------------------C- PLLANIYL-----------------------------------------------------------HY----------- --------VF--DL------W-----------------I----------------------------------------- -------------------------E-RWRRQ-R---ARGTVVVSR----------YADDTVV-------------GCQ- -----HEAD----AL---------------------------RLMKELRQ---------------------RM-E----- ----E--F----DLTL------H-P------EK----T---RVLEFG------------RYA-AERRR----RKGMG-KP Q---------------TFAYLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Sr.me.I4_extraction A new protein sequence entered manually -------------------------------------------------------------------GVDGMTVAKYEE- --RLE--QNL-----HDLCDRVH-------T------GSL----PAQ--------------------------------- -----------------PVRRV-YI--PK------------AD--G-----------------GKRPL------------ ------GVPA----LE-DKIVQGAVAEV-LS-AVYEADF-------C--------------------------------- ---GFSYGFRP------------------GR----------------------------NPHMA---LDAL--------- ----HTA--IMSQ---------------------RVNW-ML-DA--------DIRSFF-DSVD----------------- -HEW--LLQM-V-AHRI---------ADPR------------------ILQLI---------KLWLRA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----E-S--------------- -------------------------------------------------------------------------------- ---------------G---------E--TYE-TD-R-----------GTP-----QGAGI------------------S- PLLANIFL-----------------------------------------------------------HY----------- --------IL--DL------W-----------------V----------------------------------------- -------------------------H-QWRRR-H---ARGRIVIVR----------YADDFVM-------------GFE- -----KKDD----AQ---------------------------EMLLALKE---------------------RL-G----- ----E--F----GLAL------H-E------GK----T---RLIEFG------------RFA-ALSRQ----RRGER-KP E---------------TFAFLGFIH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Chlorobifid|21392973|locus|VBIChlPha122104_2646|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Chlorobium phaeobacteroides DSM 266] -------------------------------------------------------------------GTDGKSWKTYEA- --QLE--ERL-----PKLHEEIH-------T------GSY----RAQ--------------------------------- -----------------PVKRV-YI--PK------------TD--G-----------------QKRPL------------ ------GITA----IE-DKLVQQAVVTV-LN-QIYETEF-------Y--------------------------------- ---GFSYGYRP------------------GR----------------------------APENA---LDAL--------- ----ATA--ILKR---------------------PINW-IL-DA--------DLQKFF-DSIP----------------- -HDK--LMAL-I-SIRV---------GDKR------------------ILRLI---------GKWLKT------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------I-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------K--RYR-QT-E-----------GTP-----QGSVI------------------S- PLLANIYL-----------------------------------------------------------HY----------- --------VV--DE------W-----------------V----------------------------------------- -------------------------E-QERRR-R---NNGEVIIIR----------YADDLVL-------------GFQ- -----YKTE----AE---------------------------RYLEALSE---------------------RV-Q----- ----T--Y----GLKL------H-P------EK----T---SLKEFG------------RYA-EERRR----KRGEE--- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|23677855|locus|VBISorCel80414_10115|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Sorangium cellulosum So ce56] ---------------------------------------------------------------------------MYGE- --ELD--ARL-----LDLQDRIL-------R------GSY----HPQ--------------------------------- -----------------PVRRV-HI--PK------------GS--G------------------TRPL------------ ------GIPA----LE-DKIVQQAVRRG-LE-LIYESMF-------L--------------------------------- ---GFSYGFRP------------------RR----------------------------STHDA---LDAL--------- ----AVA--IGKR---------------------KVNW-IV-DA--------DIRAFY-DTIA----------------- -HAW--MQRF-I-EHRI---------GDRR------------------LVRLL---------MKWLHA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------V--LHE-VD-E-----------GTP-----QGGII------------------S- PLMANIYL-----------------------------------------------------------HY----------- --------VL--DL------W-----------------A----------------------------------------- -------------------------H-AWRKR-H---ARGEVYIVR----------YADDVVM-------------GFE- -----DGRD----AR---------------------------SMRAALSK---------------------RL-A----- ----S--F----GLEL------H-P------DK----T---RVLFFG------------RYA-YEKCE----RRGLR-KP A---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|32294365|locus|VBIPseHal105694_0399|_extraction Reverse transcriptase [Pseudoalteromonas haloplanktis TAC125] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------KVNW-VL-DL--------DISKFF-DTVE----------------- -HDW--LIKF-I-EHRI---------GDKR------------------IIRLI---------RQWIKV------------ -------------------------------------------------------------------------------- ------------------GT------------------------------------V-----D-SH-------------- -------------------------------------------------------------------------------- ---------------G---------H--RQQ-ST-I-----------GTP-----QGAVI------------------S- PLLANIYL-----------------------------------------------------------HY----------- --------SF--DL------W-----------------L----------------------------------------- -------------------------N-K-QRK-Y---ARGNVTIIR----------YADDAVL-------------GFQ- -----KHQD----AI---------------------------DCQRALTQ---------------------RL-L----- ----C--F----GLKV------H-P------NK----T---KLIRFG------------RFAPTQYRE----NPSRG-KP G---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ps.tu.I1/AAOH01000003/353461..355380/Pseudoalteromonas_extraction tunicata/Bacterial E2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGITMPAYQQ- --QLV--GNI-----TRLSDALK-------H------KRF----RAN--------------------------------- -----------------DIKRV-FI--PK------------AN--G-----------------KQRPL------------ ------GLPT----VD-DKLVQQGVSQI-LQ-SIWEADF-------L--------------------------------- ---PNSYGYRP------------------NK----------------------------SAHQA---LHSL--------- ----ALN--LQFK---------------------GYGY-IV-EA--------DIKGFF-NNLD----------------- -HNW--LMKM-L-KQRI---------DDKA------------------MLSLI---------SQWLKA------------ -------------------------------------------------------------------------------- ------------------RI------------------------------------K-----S-PE-------------- -------------------------------------------------------------------------------- ---------------G---------V--FEY-PK-S-----------GTP-----QGGII------------------S- PVLANIYL-----------------------------------------------------------HY----------- --------AL--DL------W-----------------F----------------------------------------- -------------------------E-KKVKP-R---MRGRAMLIR----------YADDFVC-------------AFQ- -----YAND----AE---------------------------RFYEVLPK---------------------RL-K----- ----K--F----NLEV------A-E------EK----T---SLLRFS------------RFHPSRKRQ------------ -----------------FVFLGFAF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|115643628|locus|VBIThiNit264030_1141|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Thioalkalivibrio nitratireducens DSM 14787] -------------------------------------------------------------------------------- ------------------MERLK-------T------KRY----RTK--------------------------------- -----------------LVRRC-YI--PK------------EN--G-----------------QERAL------------ ------GIPA----LE-DKLVQLACAKL-LT-AIYEQDF-------L--------------------------------- ---PVSYGYRP------------------GR----------------------------DAKEA---VGDL--------- ----GFN--LQYG---------------------RFGH-VV-EA--------DIQGFF-DHLD----------------- -HDW--LLRM-L-ALRI---------DDRA------------------FLHLI---------RKWLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----D-TD-------------- -------------------------------------------------------------------------------- ---------------G---------Q--VLH-PD-A-----------GTP-----QGGIV------------------S- PILANVYL-----------------------------------------------------------HY----------- --------AL--DL------W-----------------F----------------------------------------- -------------------------E-RVVRP-R---CRGQALLIR----------YADDYVC-------------AFQ- -----YREE----AE---------------------------GFYRVLPK---------------------RL-A----- ----K--F----GLAV------A-P------EK----T---RILRFS------------RFHPGLPRR------------ -----------------FAFLGFEL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|190138491|locus|VBIRhiEtl298076_5694|_extraction Retrontype RNAdirected DNA polymerase [Rhizobium etli bv. mimosae str. Mim1] -------------------------------------------------------------------GIDGRTADDYEK- --DLE--ANL-----ESLRIRMM-------S------GSY----RAP--------------------------------- -----------------PVRRH-YI--PK------------AD--G-----------------SRRPL------------ ------GIPT----IE-DKVAQRAIVML-LE-PIYEEDF-------L--------------------------------- ---DCSFGFRP------------------ER----------------------------SAHDA---IRTL--------- ----RDG--IMDT---------------------GQRW-VI-DA--------DISKYF-DSID----------------- -HGH--LRSF-L-DLRI---------RDGV------------------IRRMI---------DKWLNA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-Q--------------- -------------------------------------------------------------------------------- ---------------G---------T--SSR-SV-A-----------GTP-----QGGVI------------------S- PLLANILL-----------------------------------------------------------LH----------- --------VL--DR------W-----------------F----------------------------------------- -------------------------V-EVVKP-R---LKRRCQMVR----------YADDFVM-------------SFE- -----DHLD----GR---------------------------RMLAVLGK---------------------RF-E----- ----R--Y----GLRL------H-P------DK----T---RYVDF------------------RFRRPHG--------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|42685679|locus|VBIStiAur43712203747_3158|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Stigmatella aurantiaca DW4/31 (Prj:54333)] -------------------------------------------------------------------GIDRQTAKDYEA- --NLE--VNL-----KSLLERIK-------S------GRY----KAP--------------------------------- -----------------PVRRT-YI--PK------------AD--G-----------------SQRPL------------ ------GIPT----FE-DKVAQRAIVLL-LE-PIYEQDF-------R--------------------------------- ---PFSFGFRP------------------GR----------------------------SAHQA---LREL--------- ----RSS--ILER---------------------NGRW-VL-DV--------DLRRYF-DTIE----------------- -HGK--LREV-L-ARRV---------ADGV------------------VRRMI---------DKWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----E-E--------------- -------------------------------------------------------------------------------- ---------------G---------P--LLR-LE-Q-----------GTP-----QGGVI------------------S- PLLANVYL-----------------------------------------------------------HY----------- --------VL--DE------W-----------------Y----------------------------------------- -------------------------E-REVVP-R---MKGKCSLIR----------YADDLVM-------------VFE- -----DFLD----CR---------------------------RVLEVLGK---------------------RL-A----- ----K--Y----GLTL------H-P------GK----T---RMVDF------------------RFKRPGGGQHPAT-QA T---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|61290805|locus|VBINiaKor154066_6177|_extraction Mobile element protein [Niastella koreensis GR2010] -------------------------------------------------------------------GVDEETWIDYHK- --QRE--TRI-----PQLLAAFK-------S------GNY----RAP--------------------------------- -----------------NIRRV-YI--PK------------DK--G-----------------KLRPL------------ ------GLPT----VE-DKVLQTAVTRV-LR-PVYEDIF-------Y--------------------------------- ---HSSYGFRP------------------GK----------------------------SQHQA---LEEL--------- ----TRQ--VSLE---------------------GKRY-II-DA--------DMQNYF-GSIN----------------- -HQC--LRDL-L-DLRI---------KDGV------------------IRKMI---------DKWLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------Q--LVY-PT-E-----------GTP-----QGGSI------------------S- PLISNVYL-----------------------------------------------------------HY----------- --------VL--DE------W-----------------F----------------------------------------- -------------------------Y-QQIRP-L---LKGDSFLIR----------FADDFLL-------------GFT- -----NKED----AL---------------------------RVMHVLPK---------------------RL-G----- ----K--Y----GLML------H-P------EK----T---KLIDL------------------TTKK-GG---PDQ-EK N---------------TFDFLGFCH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|22095829|locus|VBIHalHal112047_0768|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Halorhodospira halophila SL1] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------------------------------MV------------------S- PLLANVFL-----------------------------------------------------------HE----------- --------VL--DE------W-----------------F----------------------------------------- -------------------------E-TQAKP-R---LRGPAQLVR----------YADDAVL-------------LFK- -----LRDD----AE---------------------------RVLKVLPR---------------------RF-E----- ----K--Y----GLEL------H-P------EK----T---RLIGFQ------------RPP-------RNVKRPWPK-P E---------------TFDLLGFTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|23659127|locus|VBISorCel80414_0791|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Sorangium cellulosum So ce56] -------------------------------------------------------------------GVDGITKEQYGQ- --DLE--HNV-----RDLHARMK-------S------MRY----RHQ--------------------------------- -----------------PIRRV-HI--PK------------ER--G-----------------KTRPI------------ ------GISC----TE-DKIVQAAVREM-LE-VIYEPVF-------R--------------------------------- ---DVSYGFRP------------------GR----------------------------SAHDA---LRAL--------- ----NRM--LL-G---------------------GVEW-IL-EA--------DIESFF-DSID----------------- -RTK--LMEM-L-QARV---------ADKS------------------LLRLV---------GKCLHV------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-G--------------- -------------------------------------------------------------------------------- ---------------A---------E--FYA-PE-D-----------GTV-----QGSVL------------------S- PLLGNVYL-----------------------------------------------------------HH----------- --------VL--DL------W-----------------I----------------------------------------- -------------------------E-REVQP-R---LVGKATLIR----------YADDFII-------------GFE- -----REDD----AK---------------------------RVTEVLPR---------------------RF-E----- ----R--Y----GLKL------H-P------DK----T---RLLPFG------------RPD-------NG--QPGGKGP A---------------TFDFLGFTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|42464802|locus|VBIXenNem38452_2364|_extraction Ribonuclease III (EC 3.1.26.3) [Xenorhabdus nematophila ATCC 19061] -------------------------------------------------------------------GIDRMTKAAYGE- --HLD--GNI-----HNLILRIR-------R------GTY----RPK--------------------------------- -----------------AARIT-QI--PK------------ED--G-----------------SKRPL------------ ------AISC----TE-DKLVQLAVSDI-LS-RIYEPLF-------L--------------------------------- ---PCSYGFRP------------------GL----------------------------NCHAA---LKAL--------- ----QQQ--TYRN---------------------WNGA-VV-EI--------DIRKYF-NTIP----------------- -HIE--LMSL-L-RKKI---------SDRR------------------FLRLI---------EVLITA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------I-----E-G--------------- -------------------------------------------------------------------------------- ---------------K---------Q--VSE-NV-R-----------GCP-----QGSIL------------------S- PVLANIYL-----------------------------------------------------------HQ----------- --------VI--DE------W-----------------F----------------------------------------- -------------------------D-EISRS-H---IHGRAEMVR----------YADDRVF-------------TFE- -----FMSE----AE---------------------------RFYKVLPK---------------------RL-N----- ----K--Y----GLEL------H-D------DK----S---QRIPAG------------HIAALRASQ-------SGRRL P---------------TFNFLGFTC------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|35290299|locus|VBILegLon159544_1142|_extraction Ribonuclease III (EC 3.1.26.3) [Legionella longbeachae NSW150] -------------------------------------------------------------------GIDGVTKEVYGK- --KLE--DNL-----QDLLARIR-------R------HAY----TPQ--------------------------------- -----------------ASRLV-EI--PK------------ED--G-----------------STRPL------------ ------AISC----FE-DKIVQMAVTKL-LT-AIYEPLF-------L--------------------------------- ---PCSYGYRE------------------GK----------------------------NGHEA---LRAL--------- ----MKY--SNEF---------------------RKGA-TL-EI--------DLRKYF-NTIP----------------- -HGK--LLEI-L-EKKI---------TDRR------------------FLKLI---------RKLIRS------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------V-----A-N--------------- -------------------------------------------------------------------------------- ---------------G---------K--AEL-NE-L-----------GCP-----QGSII------------------S- PILSNIYL-----------------------------------------------------------HS----------- --------VV--DS------W-----------------F----------------------------------------- -------------------------D-EISKS-H---LIGKTAMVR----------FADDMVF-------------LFQ- -----RSED----AE---------------------------KFYKVLPK---------------------RL-E----- ----K--Y----GLQL------H-V------DK----S---SLLKSG------------SKEAEEADT-------RGERL Q---------------TYKFLGFTC------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|22828908|locus|VBIOriTsu129072_1468|_extraction Ribonuclease III (EC 3.1.26.3) [Orientia tsutsugamushi str. Ikeda] -------------------------------------------------------------------GIDGITKEDYGK- --KLK--ANL-----LSLLTRIR-------K------GQY----QAK--------------------------------- -----------------PARIV-KI--PK------------ED--G-----------------GKRPL------------ ------VISC----FE-DKIIESTVSKI-LN-SVFEPIF-------L--------------------------------- ---KYSYGFHP------------------KL----------------------------NAHDA---LREL--------- ----NRL--TYNF---------------------NKGA-IV-EI--------DITKCF-NTIK----------------- -HCE--LMEF-L-RKRI---------SDKK------------------FLRLV---------MKLIET------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------I-----E-N--------------- -------------------------------------------------------------------------------- ---------------D---------T--IVT-NK-E-----------GCR-----QGSIV------------------S- PILANVFL-----------------------------------------------------------HY----------- --------VI--DS------W-----------------F----------------------------------------- -------------------------A-KISEE-N---LIGQTGMVR----------YCDDMVF-------------VFE- -----SEAD----AK---------------------------RFYDVLPK---------------------RL-N----- ----K--Y----GLNI------N-E------AK----S---QMIKSG------------RDHAAN--------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|115615051|locus|VBIDehSp228777_0955|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Dehalobacter sp. CF] -------------------------------------------------------------------GVDKVTAKEFAE- --ELK--QNI-----ENLAEHLE-------K------KRY----RAK--------------------------------- -----------------LLRRV-DI--PK------------GE--G-----------------KTRPL------------ ------GIPA----IA-DKLVQSAAAKI-LE-AIYEQDF-------L--------------------------------- ---ASSYGYRP------------------KV----------------------------SAHTA---IKDL--------- ----SKE--LNYG---------------------DYSY-IV-EA--------DIKGFF-QNID----------------- -HAW--LIRM-L-EQRI---------DDKA------------------FVGLI---------KKWLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----K-QD-------------- -------------------------------------------------------------------------------- ---------------G---------E--VEH-PI-T-----------GSP-----QGGII------------------S- PILANTYL-----------------------------------------------------------HY----------- --------VL--DL------W-----------------F----------------------------------------- -------------------------E-KIVKP-N---CEGEAYLCR----------YCDDFVC-------------AFQ- -----YKGD----AD---------------------------KFYRSLPK---------------------RL-E----- ----K--F----GLEL------A-V------DK----T---QIIQFN------------RWL----RK----------QS S---------------SFEYLGFEF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ac.ma.I1/CP000840.1/228971..230873/Acaryochloris_extraction marina/Bacterial E/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVTKAEYQE- --NLE--TNL-----QNLHLKLR-------Q------MSY----RPQ--------------------------------- -----------------PVRQV-EI--PK------------ED--G-----------------SMRPL------------ ------GISC----TE-DKVVQEMTRRI-LE-AIYEPVF-------I--------------------------------- ---DTSYGFRP------------------KR----------------------------SCHDA---LRQL--------- ----NRE--VMRK---------------------PVNW-VA-DI--------DLAKFF-DTMP----------------- -HQE--ILSV-L-SIRI---------KDGN------------------LLRLI---------ARMLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------Q-----T-P--------------- -------------------------------------------------------------------------------- ---------------G---------G--VVY-DE-L-----------GSP-----QGSIV------------------S- PVIANIFL-----------------------------------------------------------DY----------- --------VL--DQ------W-----------------F----------------------------------------- -------------------------T-NVVRH-H---CRGYCAIIR----------YADDVAA-------------VFE- -----HEED----AI---------------------------RFMRVLPR---------------------RL-E----- ----K--Y----GLRL------N-T------KK----T---HLLAFG------------KRNARRCFQ-------TGQRP S---------------TFDFLGLTH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|296169325|ref|ZP_06850953.1|_extraction RNAdirected DNA polymerase %5BMycobacterium parascrofulaceum ATCC BAA614%5D -------------------------------------------------------------------GADGTAPRSVGAA E-AV---GLL-----QRLREELK-------E------RIF----RPD--------------------------------- -----------------PVREV-MI--PK------------AN--G-----------------KLRRL------------ ------GIAT----VA-DRVVQASLKLV-LE-PIFEADF-------H--------------------------------- ---PCAYGFRP------------------GR----------------------------RAQDA---IAEI--------- ----HHLASGSR----------------------AYHW-VF-EG--------DITACF-DEIS----------------- -HSA--LMGR-V-RRRV---------GDKR------------------VLALV---------KSFLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----S-KD-------------- -------------------------------------------------------------------------------- ---------------L---------G--YRD-TI-T-----------GTP-----QGGIL------------------S- PLLSNVAL-----------------------------------------------------------SV----------- --------LD--EH-FAAK-W-----------------KALG-------------------------------------- ----------------------PEWT-RAKHRRA---GVPTMKIVR----------YADDFCV-------------MVH- ----GTRAD----AE---------------------------ALWDEIAA---------------------VL-A----- ----P--M----GLRL------S-V------EK----S---RICHVD--------------------------------- E---------------GFEFLGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|260905481|ref|ZP_05913803.1|_extraction RNAdirected DNA polymerase %5BBrevibacterium linens BL2%5D -------------------------------------------------------------------GVDGVAPRSLLHG Q-AV---EVL-----TMIRRQVK-------T------GEF----RPL--------------------------------- -----------------PVRER-RI--PK------------SN--G-----------------KTRSL------------ ------GIPT----LA-DRVVQASLKLV-LE-PIFEADF-------Y--------------------------------- ---PSSYGFRP------------------RR----------------------------RAQDA---IAEI--------- ----HKFTSRPL----------------------NYEW-VF-EA--------DITACF-DEID----------------- -HTG--LIQR-L-RGRI---------TDKR------------------VLALV---------RRFLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----S-ED-------------- -------------------------------------------------------------------------------- ---------------G---------V--NRN-TH-T-----------GTP-----QGGIL------------------S- PLLANIAL-----------------------------------------------------------SG----------- --------LD--DH-FQKK-W-----------------ESLG-------------------------------------- ----------------------PSWT-RAKLRRR---GIPVMKLIR----------YADDFVV-------------LVH- ----GSVEH----VE---------------------------ALWHEVAE---------------------VL-A----- ----P--M----GLRL------S-V------EK----T---KVTHID--------------------------------- E---------------GFDFLGWRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|336177663|ref|YP_004583038.1|_extraction RNAdirected DNA polymerase %5BFrankia symbiont of Datisca glomerata%5D -------------------------------------------------------------------GVDGRTAASIVAR I-GIP--EYL-----DGLRSALK-------D------RSF----RPL--------------------------------- -----------------PVRER-MI--PK------------AG--G-----------------KLRRL------------ ------GIAT----IT-DRVVQASLKLA-LE-PIFEADF-------L--------------------------------- ---PCSYGFRP------------------MR----------------------------RAHDA---VAEI--------- ----RYLTSKPR----------------------CYEW-IV-EG--------DIKACF-DEIS----------------- -HTS--LTGR-V-RARI---------GDRR------------------VLALV---------KAFLKS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----V-ED-------------- -------------------------------------------------------------------------------- ---------------R---------L--VRP-TT-A-----------GTP-----QGSIL------------------S- PLLSNVAL-----------------------------------------------------------SV----------- --------LD--EH-VARS-P-----------------GGPG-------------------------------------- ----------------------TGKTEKAKRLRH---GLPNFKLVR----------YADDWCL-------------VIK- ----GTKAD----AE---------------------------ALREEIAG---------------------VL-S----- ----T--M----GLRL------S-R------EK----T---LITHID--------------------------------- D---------------GLDFLGWRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ca.ac.I1/CP001700/4538431_extraction 4540611/Catenulispora acidiphila/Unclassified/Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGRTVSRIEGQ --GVE--EFL-----AGLRESLK-------S------GEF----WPV--------------------------------- -----------------PVKER-MI--PK------------AN--G-----------------KLRRL------------ ------GIPT----VA-DRVVQAALKLV-LE-PIFEVDF-------E--------------------------------- ---PCSYGFRP------------------NR----------------------------RAHDA---IAEI--------- ----HHYAS--R----------------------GYEW-VL-EG--------DIEACF-DNID----------------- -HTA--LMGR-V-RERV---------GDKR------------------VLRLI---------KAFLKS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------F-----S-EG-------------- -------------------------------------------------------------------------------- ---------------R---------A--VRD-TR-T-----------GTP-----QGGIL------------------S- PLLANVAL-----------------------------------------------------------AV----------- --------LD--EH-FAQV-W-----------------QETG-------------------------------------- ----------------------RTWAARDWHRRR---GGATFKLVR----------YADDFVI-------------LAY- ----GSRQH----VE---------------------------DLTADVAQ---------------------VL-S----- ----T--V----GLRL------S-P------TK----T---AVAHID--------------------------------- E---------------GFDFLGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Rh.js.I1/CP000433/185762_extraction 187934/Rhodococcus jostii/Unclassified/Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGMTVADVEAS V-GMS--GFL-----DDLRTQLK-------D------GSF----RPL--------------------------------- -----------------PVRER-KI--PK------------PGGSG-----------------KVRKL------------ ------GIPT----VA-DRVVQAALKLV-LE-PIFEADF-------L--------------------------------- ---PVSYGFRP------------------KR----------------------------RAHDA---VAEI--------- ----QYFGT--K----------------------GYRW-VL-DA--------DIEACF-DSIE----------------- -HTA--LMGR-V-RERV---------KDKR------------------VLLLV---------KSFLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----S-ET-------------- -------------------------------------------------------------------------------- ---------------G---------H--HED-NP-T-----------GTP-----QGGIL------------------S- PLLANIAL-----------------------------------------------------------SV----------- --------LD--EH-VHGP-W-----------------QPGGAM------------------------------------ ----------------------STPTGRALRRRR---GLPNWRIVR----------YADDFVV-------------LVF- ----GSCDD----VN---------------------------DLREEIAD---------------------VI-A----- ----P--L----GLRF------S-E------SK----T---RVVHMG--------------------------------- E---------------GFDFLGFRL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|317124020|ref|YP_004098132.1|_extraction RNAdirected DNA polymerase %5BIntrasporangium calvum DSM 43043%5D -------------------------------------------------------------------GVDGVTVRQIRQR G-EVG--VFL-----AGIAASLR-------D------GTY----RPA--------------------------------- -----------------PVRRV-LI--PK------------PG--G-----------------KSRPL------------ ------GIPT----VT-DRVVQQSLRMV-LE-PIFEADF-------L--------------------------------- ---PVSYGFRP------------------KR----------------------------RAHDA---VAEI--------- ----HFYAG--R----------------------GYRW-VL-DA--------DIEGCF-DHID----------------- -HTA--LLGL-V-RERI---------KDKK------------------TVALV---------RAFLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----S-DL-------------- -------------------------------------------------------------------------------- ---------------G---------L--EAA-AG-E-----------GTP-----QGGII------------------S- PLLANIAL-----------------------------------------------------------SV----------- --------LD--EA-IMAP-W-----------------AQGGDQ------------------------------------ ----------------------STQTGRAKRRYH---GLGNWRIVR----------YADDFVI-------------MTN- ----GSRDD----VL---------------------------ALKEQAAE---------------------VL-A----- ----R--V----GLRL------S-E------SK----T---RVTHLS--------------------------------- E---------------GIDFLGFHI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|115168014|locus|VBIMycSme175119_3779|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Mycobacterium smegmatis JS623] -------------------------------------------------------------------GIDKRTARGIEAS ADGVA--GFL-----EQLREALR-------S------GTF----RPV--------------------------------- -----------------PVRRV-EI--PK------------AS--G-----------------KVRKL------------ ------GIPT----VA-DRVVQASLKLV-LE-PVFETDF-------S--------------------------------- ---DSSYGFRP------------------RR----------------------------RAQDA---IEDI--------- ----RMFAH--R----------------------GYEW-VF-EA--------DIAACF-DEID----------------- -HSA--LLQQ-V-RGRI---------GDKR------------------ILGLV---------KAFLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-TD-------------- -------------------------------------------------------------------------------- ---------------G---------D--TYD-TF-T-----------GTP-----QGGIL------------------S- PLLANIAL-----------------------------------------------------------SV----------- --------ID--DH-FDTQ-W-----------------A---AH------------------------------------ ----------------------RNGSARRSHRQH---GGATYRLVR----------YADDFVV-------------LVY- ----GEREH----AE---------------------------QLWEHMSD---------------------LL-A----- ----P--M----GLRL------A-P------DK----T---QVVHID--------------------------------- E---------------GFDFLGFRLQ------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Sr.me.I2/AE006469/1065613..1067822/Sinorhizobium_extraction meliloti/Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGMTVGRIRNR --SEH--RFL-----VDLQADLR-------S------GAY----RPS--------------------------------- -----------------PARRK-LI--PK------------AGKPG-----------------QFRPL------------ ------GIPT----IR-DRVVQGAAKIL-LE-PIFEAQF-------W--------------------------------- ---HVSYGFRP------------------GR----------------------------NTHGA---LEYI--------- ----RRAALPQK-RDEDTRRN-----------RLPYPW-VI-EG--------DIKGCF-DNIN----------------- -HHH--LLER-M-RKRI---------GDRR------------------VVRLV---------GLFLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----T-ED-------------- -------------------------------------------------------------------------------- ---------------Q---------F--LR--TD-A-----------GTP-----QGGII------------------S- PLLANIAL-----------------------------------------------------------SA----------- --------IE--ER-YER--W------TYHRKKTQARRKSNGVA------------------------------------ ----------------------AAASARDSDRIA---GRCVYLPVR----------YADDFVV-------------LVS- ----GSLEE----AM---------------------------AEKSALAD---------------------YLIK----- ----T--T----GLTL------L-P------EK----T---KVTAMT--------------------------------- E---------------GFEFLGFRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|32443874|locus|VBISpiLin97822_6935|_extraction Putative reverse transcriptase [Spirosoma linguale DSM 74] -------------------------------------------------------------------GIDGMTVGSIRQR I-GEA--PFL-----ATLQQQLR-------T------GSY----KPS--------------------------------- -----------------PCRRK-LI--PK------------AGKPG-----------------KFRPL------------ ------GIPT----IA-DRVVQSAIKQV-LE-PILEARF-------W--------------------------------- ---PVSYGFRP------------------GR----------------------------GCHGA---LEHI--------- ----RMSMRPRKVNKQDNKRH-----------EMPYQW-VI-EG--------DIQSCF-DHID----------------- -HHQ--LMDR-I-RQHS---------ADRR------------------VNQLL---------VQFLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----S-EE-------------- -------------------------------------------------------------------------------- ---------------Q---------F--LR--TD-A-----------GTP-----QGGIV------------------S- PLLANVAL-----------------------------------------------------------GL----------- --------IE--ER-YER--W------VNHQTKRRQSRQCDGIK------------------------------------ ----------------------AAMWSRSVDRQA---GRAVYFPFR----------YADDFVI-------------LVS- ----GTQEN----AQ---------------------------AERKVLQT---------------------LLQE----- ----K--M----GLTL------S-P------EK----T---KITPLT--------------------------------- E---------------GFQFLGHRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|42684501|locus|VBIStiAur43712203747_2581|_extraction Mobile element protein [Stigmatella aurantiaca DW4/31 (Prj:54333)] -------------------------------------------------------------------GVDGLTARKVVAK --GVD--TFI-----DEVRKELR-------S------GAY----RPC--------------------------------- -----------------PVRRV-LI--PK------------PGQPW-----------------KFRPL------------ ------GIPT----VR-DRVVQAAVKNI-LE-PIFEADF-------F--------------------------------- ---PSSYGFRP------------------GR----------------------------SAHAA---LEEL--------- ----RKLLLPQHANTEAGTEI-----------RLPYQW-AI-EG--------DIKGCF-DNID----------------- -HHG--LMER-V-RRRV---------GDTK------------------VNRLI---------VAFLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----A-EE-------------- -------------------------------------------------------------------------------- ---------------Q---------F--LR--SS-T-----------GTP-----QGGIL------------------S- PLLANIAL-----------------------------------------------------------AV----------- --------ID--ER-YERHVWPRRTPTLLHDTRMVQLR------------------------------------------ -----------------------AAQNRNNDRRSRRDGRLVLVPIR----------YADDFII-------------LVGA KPGPGSHERARTAAL---------------------------AEKAALAA---------------------LLKE----- ----T--L----NLEL------S-E------AK----T---AITPVT--------------------------------- S---------------PMRFLGHHV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >S.f.I1/CP001383.1/1091555..1093825/Shigella_extraction flexneri/Bacterial A/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVNKTMLQA- --RLA--VEL-----QILRDELL-------S------GHY----QPL--------------------------------- -----------------PARRV-YI--PK------------SN--G-----------------KLRPL------------ ------GIPA----LR-DRIVQRAMLMA-ME-PIWESDF-------H--------------------------------- ---TLSYGFRP------------------ER----------------------------SVHHA---IRTV--------- ----KLQLTDCGE-T-------------------RGRW-VI-EG--------DLSSYF-DTVH----------------- -HRL--LMKA-V-RRRI---------SDAR------------------FMTLL---------WKTIKA------------ -------------------------------------------------------------------------------- ------------------GH------------------------------------I-------DV-------------- -------------------------------------------------------------------------------- ---------------G---------L--FRA-AS-E-----------GVP-----QGGVI------------------S- PLLSNIML-----------------------------------------------------------NE----------- --------FD--QY-LHER-Y------LSGKARKDRWYWNNSIQ------------------------------------ ----------------------RGRSTAVRENWQ---WKPAVAYCR----------YADDFVL-------------IVK- ----GTKAQ----AE---------------------------AIREECRG---------------------VLEG----- ----S--L----KLRL------N-M------DK----T---KITHVN--------------------------------- D---------------GFIFLGHRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >E.c.I4/AB024946/48555..50824/Escherichia_extraction coli/Bacterial A/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVNKTMLQA- --RLA--VEL-----QILRDELL-------S------GHY----QPL--------------------------------- -----------------PARRV-YI--PK------------SN--G-----------------KLRPL------------ ------GIPA----LR-DRIVQRAMLMA-ME-PIWESDF-------H--------------------------------- ---TLSYGFRP------------------ER----------------------------SVHHA---IRTV--------- ----KLQLTDCGE-T-------------------RGRW-VI-EG--------DLSSYF-DTVH----------------- -HRL--LMKA-V-RRRI---------SDAR------------------FMTLL---------WKTIKA------------ -------------------------------------------------------------------------------- ------------------GH------------------------------------I-------DV-------------- -------------------------------------------------------------------------------- ---------------G---------L--FRA-AS-E-----------GVP-----QGGVI------------------S- PLLSNIML-----------------------------------------------------------NE----------- --------FD--QY-LHER-Y------LSGKARKDRWYWNNSIQ------------------------------------ ----------------------RGRSTAVRENWQ---WKPAVAYCR----------YADDFVL-------------IVK- ----GTKAQ----AE---------------------------AIREECRG---------------------VLEG----- ----S--L----KLRL------N-M------DK----T---KITHVN--------------------------------- D---------------GFIFLGHRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|20797198|locus|VBIAgrRad129173_0726|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Agrobacterium radiobacter K84] -------------------------------------------------------------------GIDGMDKQRLQV- --KLD--QHL-----DDLRTSLL-------E------ESY----RPQ--------------------------------- -----------------PVKRI-YI--PK------------SN--G-----------------KLRPL------------ ------GIPT----LT-DRIVQRAMLMA-ME-PIWESDF-------H--------------------------------- ---RLSYGFRP------------------ER----------------------------SVHHA---VRTV--------- ----RIQLQDGADTT-------------------RGRW-II-EG--------DLASYF-DTVH----------------- -HRL--LLKC-V-RRRV---------QDGR------------------FVDLL---------WRFLKA------------ -------------------------------------------------------------------------------- ------------------GH------------------------------------I-------DR-------------- -------------------------------------------------------------------------------- ---------------G---------L--FTA-SS-E-----------GVP-----QGGVL------------------S- PLLSNIML-----------------------------------------------------------HE----------- --------FD--AW-LEAK-Y------LSDKARKDRWAWNFGIK------------------------------------ ----------------------QGRPITVRESRQ---WKPAVAYCR----------YADDFVV-------------IVK- ----GTKAQ----AE---------------------------EIREECRA---------------------FLEG----- ----E--L----KLTL------N-M------EK----T---HVTHVN--------------------------------- D---------------GFVFLGHRII------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Gfid|61475875|locus|VBIVibSp220376_1845|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Vibrio sp. EJY3] -----------------------------------------------------------------------MTKHHLQG- --KLG--DYL-----RKLKLELQ-------S------GNY----QPM--------------------------------- -----------------PARRI-YI--PK------------AN--G-----------------KQCPL------------ ------GIPT----LR-DRIVQRAILMA-ME-PIWENDF-------H--------------------------------- ---SLSYGFRP------------------ER----------------------------SVHHA---IHTV--------- ----RLQLADSTD-T-------------------RGRW-VI-EG--------DLSSYF-DTVH----------------- -HRL--LIKC-V-RKRI---------SCNG------------------FLDLL---------WRFIKS------------ -------------------------------------------------------------------------------- ------------------GH------------------------------------V-------ER-------------- -------------------------------------------------------------------------------- ---------------N---------L--FCA-TQ-Q-----------GVP-----QGGVI------------------S- PFLSNIML-----------------------------------------------------------NE----------- --------FD--QY-LHQR-H------LSKKARKDRWYLNNSIK------------------------------------ ----------------------IGRRSAIENNWQ---WQPAVAYCR----------YADDFIL-------------IVQ- ----GTKQD----AE---------------------------NIRNESRQ---------------------FLEG----- ----K--L----KLTL------N-M------EK----T---HITHVN--------------------------------- D---------------GFVFLGHRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|61451050|locus|VBISulAci142080_0650|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Sulfobacillus acidophilus DSM 10332] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------QQ-Y------CGTKAKRDRWQWNDGIS------------------------------------ ----------------------RQRPIALREQRQ---WRPAVSYFR----------YADDFVV-------------IVK- ----GTRQH----AE---------------------------AVRLACRN---------------------FLED----- ----T--L----KLTL------N-M------EK----T---HITHVD--------------------------------- D---------------GFEFLGYRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|61451056|locus|VBISulAci142080_0653|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Sulfobacillus acidophilus DSM 10332] -------------------------------------------------------------------GADGMTKRRWEA- --QQA--EEL-----ERLRTELL-------T------DTY----RPH--------------------------------- -----------------PARRI-YI--PK------------PN--G-----------------KQRPL------------ ------GIPC----LR-DRVVQRAMLMA-MD-PIWESDF-------R--------------------------------- ---WMSYGFRP------------------GR----------------------------SVHHA---VRSV--------- ----KLALTDTVQGT-------------------AGRW-VI-EG--------DLASYF-DTVH----------------- -HRL--LMKA-V-KRRI---------ADRR------------------FLRVL---------WRMLKA------------ -------------------------------------------------------------------------------- ------------------GL------------------------------------I-------DH-------------- -------------------------------------------------------------------------------- ---------------G---------L--FRS-TH-E-----------GVP-----QGGVL------------------S- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------------------------K---VDPIVK---------------TDF------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|258515071|ref|YP_003191293.1|_extraction RNAdirected DNA polymerase %5BDesulfotomaculum acetoxidans DSM 771%5D -------------------------------------------------------------------GIDGITFDNIEAS --GIE--IFL-----QQIQKELI-------S------GTY----WPT--------------------------------- -----------------QNRRK-EI--PK------------GD--G-----------------KYRIL------------ ------GIPT----IR-DRVVQGALKLI-LE-PIFEADF-------Q--------------------------------- ---EGSYGYRP------------------KR----------------------------NPHQA---IDRV--------- ----AKA--VVE----------------------NKTR-VI-DL--------DLRSYF-DTVR----------------- -HDL--LLKK-V-AKRV---------NDEN------------------VMRLL---------KLILKA------------ -------------------------------------------------------------------------------- ------------------S------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------G-----------------K-R-----------GVP-----QGGVI------------------S- PLLANLYL-----------------------------------------------------------NE----------- --------VD--KM------L----------------------------------------------------------- -------------------------E-KAKEVTR-HEQYTHIEYAR----------FADDIVI-------------LID- -----AYPKWNWLEK---------------------------AVYQRLLE---------------------EL-T----- ----K--L----DVQL------N-E------EK----T---RIVNL-------------A------------------NG E---------------SFGFLGFDF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ma.sp.I2/CP000471/2464047..2465973/Magnetococcus_extraction sp./Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGVTFEAIEES --GVE--QFL-----GEVRKELV-------S------GSY----RPL--------------------------------- -----------------KNRRK-AI--PK------------GD--G-----------------KERVL------------ ------GIPS----IR-DRVVQGALKLI-LE-PIFEADF-------Q--------------------------------- ---SGSYGYRP------------------KR----------------------------MAHQA---VNRV--------- ----AIA--IAQ----------------------GKTQ-VI-DA--------DLKSYF-DTVQ----------------- -HDL--ALRK-V-SERV---------DDDQ------------------VMHLL---------KLIFKT------------ -------------------------------------------------------------------------------- ------------------S------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------G-----------------K-R-----------GVP-----QGGVI------------------S- PLISNLYL-----------------------------------------------------------NE----------- --------VD--KM------L----------------------------------------------------------- -------------------------E-RAKEVTR-KGKYTHIEYAR----------FADDLVI-------------LVD- -----GHHRWNGLAR---------------------------KVYQRLGE---------------------EL-A----- ----K--L----KVQL------N-L------EK----T---RVVDL-------------T------------------RG E---------------DFTFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|302392022|ref|YP_003827842.1|_extraction RNAdirected DNA polymerase %5BAcetohalobium arabaticum DSM 5501%5D -------------------------------------------------------------------GVDGITFEDIEGI --GVL--KYL-----KKIREELV-------N------ETY----KPQ--------------------------------- -----------------ENRKQ-EI--PK------------GN--G-----------------KVRVL------------ ------GIPT----IK-DRIVQGALKLI-LE-PIFEADF-------Q--------------------------------- ---ESSYGYRP------------------KR----------------------------TAHQA---VKKI--------- ----EKA--IVS----------------------GKRK-VI-DL--------DLSSYF-DTVK----------------- -HHI--LLAK-I-AKRV---------IDKE------------------VMHLI---------KLMLKA------------ -------------------------------------------------------------------------------- ------------------S------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------G-----------------K-E-----------GVP-----QGGVI------------------S- PLFANLYL-----------------------------------------------------------NE----------- --------VD--RM------L----------------------------------------------------------- -------------------------E-RAKEVTKSKGKYTELEYAR----------FADDIVI-------------AVS- -----SHPSMNWLLS---------------------------KVIQRLKE---------------------EL-D----- ----K--I----KVKV------N-K------EK----T---KVVNL-------------E------------------KG E---------------RISFLGFTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >W.e.I5/AM999887.1/284826..286812/Wolbachia_extraction endosymbiont/Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGVTFESIETE --GSR--KYL-----QRIRHELI-------T------KTY----SPN--------------------------------- -----------------RNRRK-EI--PK------------SG--E-----------------KFRTL------------ ------NIPC----IR-DRIVQTALKLI-LE-PIFESDF-------Q--------------------------------- ---KGSYGYRP------------------KR----------------------------NAHEA---VQKV--------- ----TEA--AIK----------------------GNTK-VI-DV--------DLKSYF-DSVR----------------- -HHI--LMEK-I-AKRI---------NDKE------------------IMRMI---------KLILKI------------ -------------------------------------------------------------------------------- ------------------G------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------G-----------------K-R-----------GMA-----QGSPL------------------S- PLLSNIYL-----------------------------------------------------------NE----------- --------VD--KM------L----------------------------------------------------------- -------------------------E-KAKEVTK-EGKYQRMEYAR----------WADDLVI-------------LIR- -----EYPKREWLER---------------------------AVYRRLEE---------------------EL-A----- ----K--L----EVRV------N-E------EK----T---KVINL-------------K------------------KG E---------------TFSFLGFDF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|19201997|locus|VBIBurPhy25146_7879|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Burkholderia phymatum STM815] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------MKGSTR-SGKYTYVEYAR----------FADDLVV-------------LID- -----AHPRNAWLLK---------------------------VVSRRLRD---------------------EF-A----- ----K--L----QVEV------N-E------EK----S---RTVDL-------------D------------------RA E---------------SFGFLGFDF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.th.I3/DQ363750/30070..32039/Bacillus_extraction thuringiensis/Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGKSFADIELE --GVI--PFL-----TGIQEELQ-------A------GIY----QPQ--------------------------------- -----------------ANRKV-EI--PK------------TN--G-----------------KMRTL------------ ------QIPC----IR-DRVVQGALKLI-LE-AIFEADF-------C--------------------------------- ---PNSYGFRP------------------KR----------------------------SPHQA---LAEV--------- ----RRS--ILR----------------------RMTI-II-DV--------DLSRYF-DTIR----------------- -HNI--LLEK-I-AKRV---------QDPQ------------------VMHLV---------KQVIKA------------ -------------------------------------------------------------------------------- ------------------T------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------G-----------------K-I-----------GVP-----QGGPF------------------S- PLAANIYL-----------------------------------------------------------NE----------- --------VD--WT------F----------------------------------------------------------- -------------------------D-TIRRKTA-DGNYEAVNYHR----------FADDIVI-------------AVS- -----GHSSKSGWAE---------------------------LALRRLWE---------------------QL-K----- ----P--L----GVEL------N-L------EK----T---QMVNV-------------L------------------KG E---------------SFGFLGFDL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|58553039|locus|VBICupNec201015_1883|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Cupriavidus necator N1] -------------------------------------------------------------------GIDDLSFEDIEAS --GRI--VFL-----AEIQADLK-------T------GRY----EPK--------------------------------- -----------------PNRRV-EI--PK------------SN--G-----------------KVRVL------------ ------QVPC----IR-DRVVQGALKLI-LE-AVFEADF-------C--------------------------------- ---PNSYGFRP------------------KR----------------------------SPHRA---LAEV--------- ----RRS--VLR----------------------RMST-VV-DV--------DLSRYF-DTIQ----------------- -HST--LLGK-I-AKRI---------QDPQ------------------VMHLV---------KQVIKA------------ -------------------------------------------------------------------------------- ------------------A------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------G-----------------K-V-----------GVP-----QGGPF------------------S- PLAANIYL-----------------------------------------------------------TE----------- --------ID--WM------L----------------------------------------------------------- -------------------------D-EIRRKTA-QGPYEAVNYHR----------FADDIVI-------------TVS- -----GHHTKRGWAE---------------------------RALLRLRE---------------------QL-V----- ----P--L----GVEL------N-T------EK----T---TVVDT-------------L------------------HG E---------------AFGFLGFDL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Sg.ce.I1/AM746676.1/9205316..9207294/Sorangium_extraction cellulosum/Unclassified/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVTFEQIETA --GRG--EFL-----AGLAAELR-------G------RTY----RPA--------------------------------- -----------------PLRRR-EI--PK------------EG--G-----------------KARVI------------ ------SIPT----IR-DRVVQAALRMI-LE-PIFEADF-------S--------------------------------- ---DSSYGARP------------------GR----------------------------SAHQA---LKEV--------- ----REG--LRR----------------------RQHR-VV-DV--------DLSRYF-DTIR----------------- -HDR--LLAK-V-ARRV---------CDDE------------------VLALI---------KQFLVR------------ -------------------------------------------------------------------------------- ------------------T------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------G-----------------E-R-----------GVP-----QGSPL------------------S- PLLANVAL-----------------------------------------------------------NE----------- --------LD--QA------L----------------------------------------------------------- -------------------------N-RGKALLT---------YVR----------YLDDMVV-------------LAP- -----DSLRGRAWAD---------------------------RALERIRE---------------------EA-E----- ----A--I----GVSL------N-T------DK----T---RVVTLT------------D------------------RD A---------------VFTFLGFDF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Mx.xa.I1/CP000113/2433780..2435766/Myxococcus_extraction_1 xanthus/Bacterial F/ORF Sequence %28a.a%29 -------------------------------------------------------------------GQDGITFEHIEER --GRA--GFL-----GAVAEELR-------T------GTY----RPR--------------------------------- -----------------PYRRR-EI--PK------------EG--G-----------------KVRVI------------ ------SIPS----IR-DRVVQGALRLV-LE-PIFEADF-------S--------------------------------- ---GSSFGARP------------------GR----------------------------SAHEA---IDTV--------- ----RQG--LRR----------------------RRHR-VV-DV--------DLKAYF-DSIR----------------- -HAP--LLER-V-ARRV---------QDGE------------------VLALV---------KQFLRS------------ -------------------------------------------------------------------------------- ------------------T------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------G-----------------D-R-----------GIP-----QGSPL------------------S- PLLANIAL-----------------------------------------------------------ND----------- --------LD--HV------L----------------------------------------------------------- -------------------------D-RGRGFLT---------YAR----------YLDDMVV-------------LAP- -----DSEKGRRWAA---------------------------RALERIRQ---------------------EA-E----- ----A--L----GVSL------N-K------EK----T---RTVTMT------------D------------------RN A---------------SFAFLGFDF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|54666312|locus|VBIDesCar168000_0691|_extraction Retrontype reverse transcriptase [Desulfotomaculum carboxydivorans CO1SRB] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------MRRQ-YI--PK-----------K-N--G-----------------KLRPL------------ ------GIPN----IE-DRIVQQAIVNV-LSPKCEEHIF-------H--------------------------------- ---KWSCGYRP------------------NL----------------------------G----------I--------- ----KRVMQ-------------------------IILW-NI-ETGYNHIYDCDIKGFF-DNIP----------------- -HKK--LMKV-L-TKYI---------ADGT------------------VLDMI---------WAWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----E-E--------------- -------------------------------------------------------------------------------- ---------------G---------K--FHP-TD-S-----------GTP-----QGGVI------------------S- PLLANLYL-----------------------------------------------------------NE----------- --------LD--WT-L-EE-H-----------------------G----------------------------------- -----------------------------------------VRFVR----------YADDFLL-------------FAK- -----SKED----IE---------------------------R-AAEVAK---------------------TTLD----- ----E--L----GLEV------S-I------EK----T---RFVDFD------------K-------------------- D---------------DFNFVGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >N.a.I2/AF079317/53812..56360/Novosphingobium_extraction aromaticivorans/ML/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGKTFEDFGP- -------DRL-----APLIASVA-------T------GAY----KPK--------------------------------- -----------------PVRRV-FI--PK------------GK--G-----------------KRRPL------------ ------GIPT----RD-DRLVQEVARQL-LE-RIYEPVF-------S--------------------------------- ---KASHGFRP------------------GR----------------------------SCHTA---LEHV--------- ----KAVWT-------------------------GVKW-LV-DV--------DVAGFF-ENID----------------- -HDI--LLKL-L-RKRI---------DDER------------------FIDLI---------RDMLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-G--------------- -------------------------------------------------------------------------------- ---------------R---------A--HTQ-TY-S-----------GTP-----QGGIV------------------S- PILANIYL-----------------------------------------------------------HE----------- --------LD--EF-M-AG-R-----------------ITAFEKG--KTRATNPEYRR-LAG----RIAKRRERLKRLEA SDN---ADQVTVKAILAEINTLSKQMRSLPSRDAMDAGFRRLRYCR----------YADDFLIG------------VIG- -----SKDD----AR---------------------------GVFAEVRT---------------------FLTE----- ----V--L----ALTV------S-E------EK----S---GIRKA-------------S-------------------- D---------------GTKFLGYEV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >N.a.I1/AF079317/43084..45661/Novosphingobium_extraction aromaticivorans/ML/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGQTFDGFSP- -------DKV-----RSIIERLA-------N------GTY----RPQ--------------------------------- -----------------PARRV-YI--PK------------AN--G-----------------QKRPL------------ ------GVPT----TE-DKLVQEVVRTI-LE-QIYEPLF-------S--------------------------------- ---RHSHGFRP------------------KR----------------------------SCHTA---LESI--------- ----RAIWT-------------------------GVKW-LI-DV--------DVVGFF-DNID----------------- -HDV--LVSL-L-EKRI---------ADRR------------------FVRLI---------RGLLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----E-D--------------- -------------------------------------------------------------------------------- ---------------W---------V--FHK-TY-S-----------GTP-----QGGVV------------------S- PMLANIYL-----------------------------------------------------------HE----------- --------LD--MF-M-QA-K-----------------MAGFDKG--KQRSPSPDARR-IRN----RLSYVRRTVDQLRA KGR---GDDPRVTSFLEEIGRLKAERLAVPASDAFDPNYRRLRYCR----------YADDFIIG------------VTG- -----SKSE----AR---------------------------QIMEEVRT---------------------YLSD----- ----H--L----KLAV------S-A------EK----S---GIHKA-------------S-------------------- D---------------GARFLGYEV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|87114490|locus|VBIEscBla78014_3566|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Shimwellia blattae DSM 4481 = NBRC 105725] -------------------------------------------------------------------GINNNTMDEMSV- -------GRI-----INLIQLIN-------S------GSY----KPR--------------------------------- -----------------PCRRT-HI--PK--------DARKPN--G-----------------KKRPL------------ ------GIPT----GD-DKLIQEVMRML-LE-EIYEPVF-------S--------------------------------- ---DWNYGFRP------------------KR----------------------------SCHSA---LKEI--------- ----RNSWK-------------------------GTKW-VC-DV--------DIKGYF-DNID----------------- -HDL--LLKF-L-SKRI---------ADNK------------------FLALL---------KKFLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------W---------R--YFG-TH-S-----------GTP-----QGGII------------------S- PILANVFL-----------------------------------------------------------HK----------- --------LD--EF-M-KN-R-----------------ISEFGKG--GRRKPNPIYKRALQN----RAN--RIKWIRQGF GASGMPADEQKIQKWRHEADELEKKLRTLSSVIMDDSEFKRMRYVR----------YADDFLIG------------VTG- -----SKNE----AK---------------------------KIMKEVVD---------------------FVET----- ----E--L----HLEI------S-K------EK----S---GIIDP-------------K-------------------- K---------------GFTFLGYEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|58846094|locus|VBIPelHal211702_2804|_extraction Retrontype reverse transcriptase [Pelagibacterium halotolerans B2] -------------------------------------------------------------------GVSGNTLDGFGE- -------ERV-----AALMHAIS-------T------GTY----KPS--------------------------------- -----------------PVRRT-YI--LK--------DPKNPA--G-----------------KKRPL------------ ------GIPT----GD-DKLVQEVVRAL-LE-VIYEPVF-------S--------------------------------- ---DRSHGFRP------------------GR----------------------------SCHTA---LNQI--------- ----VRSWK-------------------------GTKW-IC-EV--------DIKGYF-DNID----------------- -HET--LLGL-L-ARKI---------DDRA------------------FLKLI---------REFLVA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------L-----E-D--------------- -------------------------------------------------------------------------------- ---------------W---------T--YNA-TY-S-----------GTP-----QGGVV------------------S- PILANIYL-----------------------------------------------------------HE----------- --------LD--QF-M-DS-R-----------------MAAFNRG--ARRKPNPEYCR-LNN----LASIRRRKLRVHGD SHS-------KAARWRQEMQEMEAAKALLPSVDMHDEGFKQLHYVR----------YADDFLIG------------IVG- -----SKEE----AA---------------------------QIMAEVRS---------------------FVEG----- ----P--L----KLTI------S-A------EK----S---RMGAM-------------S-------------------- K---------------GTVFLGYGVQ------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >B.t.I2/AE015928/3241156..3243662/Bacteroides_extraction thetaiotaomicron/ML/ORF Sequence %28a.a%29 -------------------------------------------------------------------GADGKTIDGMSI- -------DRV-----EQLIGSLK-------N------ETY----QPN--------------------------------- -----------------PSKRT-YI--PK-----------K-N--G-----------------KKRPL------------ ------GIPS----FD-DKLVQEVVRMI-LE-AIYEGSF-------E--------------------------------- ---HTSHGFRP------------------KR----------------------------SCHTA---LIDI--------- ----QKTFT-------------------------AVKW-FI-EG--------DIKGFF-DNIN----------------- -HDV--LINI-L-RERI---------ADER------------------FLRLI---------RKFLNA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----E-D--------------- -------------------------------------------------------------------------------- ---------------W---------V--FHR-TY-S-----------GTP-----QGGII------------------S- PILANIYL-----------------------------------------------------------DK----------- --------FD--KY-I-KE-Y-----------------INRFNKG--VTRKGDARYKL-YEQ----RRYRLAKKLKNE-- ------KDVKVRKQMTAEIKRLREERNNYPARNEMDSSIKRLKYVR----------YADDFLIG------------ITG- -----NLED----CK---------------------------TVKEDIKN---------------------YLNE----- ----A--L----KLEL------S-D------EK----T---LITNA-------------Q-------------------- K---------------PAKFLGYDV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|87116544|locus|VBIAliFin145170_0639|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Alistipes finegoldii DSM 17242] -------------------------------------------------------------------GSDGKTIDGMSL- -------KRI-----ENLIDALK-------D------ESY----QPK--------------------------------- -----------------PARRT-YI--PK-----------K-N--G-----------------NMRPL------------ ------GIPS----ID-DKLVQEVLRML-LE-AIYEGSF-------E--------------------------------- ---NTSHGFRP------------------KR----------------------------SCHTA---LIQV--------- ----QKNFT-------------------------AAKW-FI-EG--------DIEGFF-DNIN----------------- -HDV--LIGI-L-KERI---------ADDR------------------FIRLM---------WKFLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------I-----E-D--------------- -------------------------------------------------------------------------------- ---------------W---------T--FHR-TY-S-----------GTP-----QGGII------------------S- PILANIYL-----------------------------------------------------------DK----------- --------LD--KY-M-KE-Y-----------------ACQFDRG--DRRAMNLEYKR-YSR----KIWWLGTKLKQT-- ------KDKDTRKELIDAIKQHQKNRMHLPSVDEMDEGYRRIKYVR----------YADDFIIG------------VIG- -----SKSD----CE---------------------------AIKEDIKN---------------------FLGE----- ----K--L----KLTL------S-E------EK----T---LITHG-------------N-------------------- R---------------KAKFLGYEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|87116554|locus|VBIAliFin145170_0644|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Alistipes finegoldii DSM 17242] -------------------------------------------------------------------GSDGRSIDEMSL- -------ARI-----ETLIASLK-------D------ESY----QPH--------------------------------- -----------------PSRRV-HI--PK-----------K-N--G-----------------KTRPL------------ ------GIPA----FE-DKLVQEVVRMI-LE-AIYEGHF-------E--------------------------------- ---TTSHGFRP------------------KR----------------------------SCHTA---LLHI--------- ----QKTFS-------------------------GAKW-FI-EG--------DIKGFF-DNID----------------- -HDV--LVGI-L-RERI---------SDDR------------------FIRLI---------RKFLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----E-D--------------- -------------------------------------------------------------------------------- ---------------W---------T--FHN-TY-S-----------GMP-----QGGIV------------------S- PILANIYL-----------------------------------------------------------DK----------- --------LD--KY-V-KE-Y-----------------IRHFDMG--TKRRPGKESND-LAN----ERKRTVRKLKKV-- ------KDGTEKAALVARLKAIEQERAAFPSGDEMDGSYRRLKYIR----------YADDFILG------------VIG- -----SKED----AL---------------------------RIKEDIKS---------------------FLSE----- ----S--L----ALEL------S-E------EK----T---LITHT-------------G-------------------- K---------------SAKFLGYEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|42711385|locus|VBIAliSha154597_1257|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Alistipes shahii WAL 8301] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------------MV-LE-AIYEGHF-------E--------------------------------- ---DTSHGFRP------------------HR----------------------------SCHTA---LNAV--------- ----QKTFT-------------------------GKKW-FI-EG--------DIKGFF-DNVN----------------- -HDI--LIDI-L-KERI---------SDER------------------FIRLI---------RKFLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------L-----E-Q--------------- -------------------------------------------------------------------------------- ---------------W---------Q--FHG-TY-S-----------GMP-----QGGII------------------S- PILANIYL-----------------------------------------------------------DK----------- --------LD--KY-M-KE-Y-----------------ASKFDKG--DRGRQQREYEV-LTY----QKRLVMRELKTA-- ------TNNVERKVLVNRLKEIDKTRSAMPCFAPMDGNFKRLKYVR----------YADDFLIG------------IIG- -----SKED----AV---------------------------KIKDDIKR---------------------FLAD----- ----R--L----ALEL------S-D------EK----T---LITHT-------------E-------------------- K---------------PAKFLGYEV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ba.fr.I1/AY515263/38446..40893/Bacteroides_extraction fragilis/ML/ORF Sequence %28a.a%29 -------------------------------------------------------------------GTDGQTISGMSI- -------KRI-----QSIIDKLR-------D------ESY----QPH--------------------------------- -----------------PAKRI-YI--PK-----------K-N--G-----------------KQRPL------------ ------GIPS----FE-DKLVQKVIQMI-LE-SIYEGSF-------E--------------------------------- ---KCSHGFRP------------------HR----------------------------NCHTA---MASI--------- ----MEGFD-------------------------GTRW-FI-EG--------DIKGFF-DNID----------------- -HDI--MITI-L-SERI---------ADER------------------FLRLI---------RKFLNA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------L-----E-K--------------- -------------------------------------------------------------------------------- ---------------W---------K--FHK-TF-S-----------GTP-----QGGII------------------S- PILANIYL-----------------------------------------------------------DQ----------- --------LD--KY-V-VE-Y-----------------ISQFNRG--KMRKRNPEYKR-IAS----RKDKRVKKLKTE-- ------TDEQKRAALRSEIVELHREMQKHPATLDMDEDFRRMRYVR----------YADDFLIG------------IIG- -----SKDD----CV---------------------------NIKADIKR---------------------FLCE----- ----K--L----KLEL------S-D------EK----T---LITHG-------------H-------------------- D---------------HAKFLGFEV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|46993147|locus|VBIOdoSpl147623_0215|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Odoribacter splanchnicus DSM 220712] -------------------------------------------------------------------GTDGKTEDEMSI- -------DRI-----NKLIESIK-------D------ETY----SPN--------------------------------- -----------------PAKRI-YI--PK-----------K-N--G-----------------KMRPL------------ ------GIPS----FE-DKLVQEAVRMV-LE-AIYEGHF-------E--------------------------------- ---WTSHGFRP------------------NR----------------------------SCHTA---LKSL--------- ----QNNFN-------------------------GAKW-FI-EG--------DIKGFF-DNID----------------- -HDV--LIEI-M-KGRI---------ADDR------------------FLRLI---------RKFLNA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----E-E--------------- -------------------------------------------------------------------------------- ---------------W---------Q--FNK-TY-S-----------GTP-----QGGII------------------S- PVLANIYL-----------------------------------------------------------DK----------- --------FD--KY-M-NE-Y-----------------ANKFNKG--TVRSRNKDICK-LNS----RVHYLKRRINEV-- ------EDVNVRTRMVEELHEKQKRILTMPSGNDMDRNFRRLRYLR----------YADDFLIG------------VIG- -----TKNE----CE---------------------------TIKADITK---------------------FMQE----- ----K--L----RLEM------S-Q------EK----T---LITNA-------------Q-------------------- D---------------SAKFLGYEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|19673908|locus|VBIStrPne132160_1355|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Streptococcus pneumoniae ATCC 700669] -------------------------------------------------------------------GVDELTIDGMSI- -------ARI-----DQLIDSLK-------D------ESY----QPH--------------------------------- -----------------PSRRT-YI--PK-----------K-N--G-----------------KLRPL------------ ------GIPS----FD-DKLLQQVIKMI-LE-AIYEGQF-------E--------------------------------- ---PSSHGFRP------------------NK----------------------------SCHTA---LTQI--------- ----QKTYT-------------------------GTKW-FI-EG--------DIKSFF-DNIN----------------- -HDV--MIHI-L-RERI---------TDER------------------FLRLI---------RKFLNA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----E-D--------------- -------------------------------------------------------------------------------- ---------------W---------K--FYK-TY-S-----------GTP-----QGGII------------------S- PILANIYL-----------------------------------------------------------DK----------- --------FD--KY-M-TD-Y-----------------VKNFCQG--KYRKRTPEYRQ-NEI----ALGKARRALECV-- ------STENQRQEVIQRIRQLEKERVLIPHSDPMDSSFKRLTYTR----------YADDFICG------------VIG- -----SKED----AH---------------------------RIKADIKD---------------------YLEA----- ----V--L----KLEL------S-V------EK----T---LITNA-------------R-------------------- D---------------KAKFLGYHL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|19729760|locus|VBIStrPyo25933_1754|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Streptococcus pyogenes MGAS10750] -------------------------------------------------------------------GVDNQTISAMSL- -------ERI-----NKIIDSLK-------D------ESY----SPT--------------------------------- -----------------PTKRV-YI--PK-----------K-N--G-----------------KLRPL------------ ------GIPS----IG-DKLVQEVCRML-LN-SIYDESF-------E--------------------------------- ---DTSHGFRD------------------NR----------------------------SCHTA---LRQI--------- ----QNRFV-------------------------RCKW-FV-EG--------DIKGFF-DNID----------------- -HNI--MIDI-L-SKRI---------DDER------------------FLRLI---------RKFLKS------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----E-Q--------------- -------------------------------------------------------------------------------- ---------------N---------Q--YHN-TY-S-----------GMP-----QGSII------------------S- PILSNIYL-----------------------------------------------------------DK----------- --------FD--KY-M-QN-Y-----------------KESFDKG--NKRKQNKEYKA-LYD----RRKRLENKLSKT-- ------TNKTEIDDIKSEIEEINKRYFNIPCLNPMDENFKRIQYVR----------YADDFIIG------------IIG- -----SKAD----AE---------------------------MVKQDIGQ---------------------FIKS----- ----E--L----NLEL------S-D------EK----T---LVTKS-------------T-------------------- D---------------RAKFLGFDI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42835086|locus|VBICloCf158569_1256|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium cf. saccharolyticum K10] -------------------------------------------------------------------GTDGKTIDGMGM- -------ARI-----NALIEKMR-------N------SSY----QPN--------------------------------- -----------------PARRT-YI--PK-----------S-N--G-----------------KMRPL------------ ------GIPS----FD-DKLIQEVVRLI-LE-SIYEPTF-------S--------------------------------- ---DHSHGFRM------------------NK----------------------------SCHTA---LKYV--------- ----QKYFT-------------------------GTKW-FV-EG--------DIKGCF-DNVD----------------- -HHV--LIAI-L-RKRI---------ADEQ------------------FIGLL---------WKFLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------W---------N--YHN-TY-S-----------GTP-----QGSII------------------S- PILANIYL-----------------------------------------------------------NE----------- --------LD--HF-M-AE-Y-----------------AEKFNCG--DRRRINPAFKK-KLDVCRGKEERLKRNISKM-- ------SEE-EKEGLLAEISELRRSLRSMPYSDQMDEGYKRVFYIR----------YADDFLIG------------VIG- -----RKAD----AE---------------------------QVKQDVGH---------------------FIRE----- ----N--L----HLEM------S-E------EK----T---LITHG-------------H-------------------- D---------------FAKFLGYEV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|18911848|locus|VBIBacCer120424_2093|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus cereus Q1] -------------------------------------------------------------------GTDKETIDGFSM- -------DWI-----ENIISSLK-------D------ESY----KPN--------------------------------- -----------------PSRRV-YI--PK-----------K-D--D-----------------KQRPL------------ ------GIPS----IK-DKIIQEVVKEI-LV-SMYEPIF-------S--------------------------------- ---KASHGFRP------------------NK----------------------------SCHSA---LNDI--------- ----KMTFG-------------------------GIKW-WI-EG--------DIKGFF-DNID----------------- -HHV--LIGI-L-RKRI---------KDEK------------------FIKLI---------WKFLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------W---------K--FNK-TF-S-----------GTP-----QGGII------------------S- PVLANIYL-----------------------------------------------------------HE----------- --------LD--AF-M-EK-Q-----------------IIKFDEG--KRRRDNPVYKK-YNTAIWYRKNKLKEKWNTL-- ------NDD-ERKELQSEISTLEKEREKHSAVDNMDASFKRLKYVR----------YADDFVVG------------VIG- -----SKED----SK---------------------------RIKEEITE---------------------FLHT----- ----S--L----KLEL------S-Q------EK----T---LITSN-------------K-------------------- N---------------LIKFLGYEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|54164737|locus|VBIBacThu155232_5952|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus thuringiensis serovar chinensis CT43] -------------------------------------------------------------------GVDQRSIDGFSM- -------KEV-----EDLISVLK-------S------KSY----QPY--------------------------------- -----------------PSRRT-YI--EK-----------K-N--G-----------------KKRPL------------ ------GIPS----FY-DKLVQEVIRMI-LE-AIYDSSF-------S--------------------------------- ---SSSHGYRK------------------GK----------------------------GCHSA---LLEI--------- ----KRTFT-------------------------GSKW-FI-EG--------DIKGFF-DNIE----------------- -HHT--LVTI-L-KRRI---------KDEA------------------FIELI---------WKFLRA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------L-----E-E--------------- -------------------------------------------------------------------------------- ---------------W---------K--FHN-TY-S-----------GAP-----QGGII------------------S- PIISYIYL-----------------------------------------------------------NE----------- --------LD--TY-M-KK-Y-----------------QDRFESG--KKRQINKEYSN-LQY----KVRKIQEKIDTAYL N-----GEVTRITELKEQQKVLKGKLLQTPYNNPMDENYRRLKYVR----------YADDFLIG------------VIG- -----SKED----AI---------------------------LIKNEIAS---------------------FLKE----- ----E--I----KLEL------S-M------EK----T---LITNAF------------K-------------------- K---------------HAKFLGFEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|18918679|locus|VBIBacCer120424_5472|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus cereus Q1] -------------------------------------------------------------------GTINNTVDGFSK- -------NRV-----SKIINNIK-------N------GNY----KPT--------------------------------- -----------------PVKRV-YI--DK-----------KGS--K-----------------KKRPL------------ ------GIPT----FD-DKLVQLVIKYI-LE-AIYEPNF-------S--------------------------------- ---ENSHGFRK------------------NR----------------------------GCHTA---LKQI--------- ----KKSGS-------------------------GTKW-FI-EG--------DIQGFF-DNID----------------- -HHI--LINL-L-RKRI---------NDET------------------LIGLI---------WKFLRA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------W---------Q--FHK-TF-S-----------GTP-----QGGIL------------------S- PLLANIYL-----------------------------------------------------------NE----------- --------LD--IY-M-EK-Y-----------------AERFGKGQPKDREVDKRYQY-LHL----KIKRGRKKADLLRE Q-----GKLNESQELIHQVNEWIKERGQRPYYNPMSDKFKSLKYVR----------YADDFIVM------------LIG- -----SKDD----AN---------------------------AIKSDIAQ---------------------FLNE----- ----E--L----KLTL------S-E------EK----T---LITHS-------------S-------------------- K---------------KAKFLGYNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|47030643|locus|VBISynGly105927_0075|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Syntrophobotulus glycolicus DSM 8271] -------------------------------------------------------------------GVDNRTIDGFKY- -------EMI-----DTLIEKLK-------T------EQY----YPK--------------------------------- -----------------PVRRT-YI--PK-----------K-N--G-----------------KTRPL------------ ------GIPC----FE-DKLLQEVIRQL-LE-SIYEPIF-------S--------------------------------- ---DNSHGFRP------------------DR----------------------------SCHTA---LCQI--------- ----KNTMR-------------------------GANW-VI-EG--------DITGCF-DNID----------------- -HTI--LLNI-L-SQKI---------EDGR------------------FIELI---------RRFLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------L-----E-F--------------- -------------------------------------------------------------------------------- ---------------K---------Q--MHR-SL-S-----------GCP-----QGGII------------------S- PILSNIYL-----------------------------------------------------------NE----------- --------FD--RY-M-DE-I-----------------INKNTKG--KKRRSNPEYQR-LRG----KRYTAIKKGNLE-- -----------------EIKRLTKEIQSIPSLDPMDSNFTRVKYVR----------YADDFVIE------------VIG- -----SKEM----AE---------------------------SIKEDVAT---------------------FLKE----- ----K--L----NLEL------N-Q------EK----T---LITNLG------------N-------------------- E---------------KANFLGYEF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|190354377|locus|VBIStrAng166616_0608|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Streptococcus anginosus C238] -------------------------------------------------------------------GVTEETIDGMSI- -------QKI-----DMIIEQLR-------Q------ETY----YWR--------------------------------- -----------------PARRE-YI--PK-----------K-N--G-----------------KHRPL------------ ------GIPV----WS-DKLLQEVIRMI-LE-AYYEPQF-------S--------------------------------- ---EHSHGFRP------------------KR----------------------------GCHTA---LQEI--------- ----Q-TWQ-------------------------GTHW-FI-EG--------DISSYF-DTID----------------- -HCV--LITM-L-SKQI---------QDGR------------------FIRLI---------KNMLEA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------L-----D-D--------------- -------------------------------------------------------------------------------- ---------------W---------K--FRK-TI-S-----------GTP-----QGGVI------------------S- PLLANIYL-----------------------------------------------------------HQ----------- --------FD--KW-VGEE-L-----------------IPQYTRG--KKQKANSAYNR-LSR----KIKFYQDKGEYK-- -----------------KAHQIIVERRNIPSVDTYDTNYRRLRYVR----------YADDFILG------------FTG- -----SKAE----AK---------------------------DIKKQIGD---------------------FLNI----- ----K--L----HLEL------S-Q------EK----T---LITHAT------------E-------------------- E---------------SAKFLGYEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >L.l.I1/U50902/2854..5345/Lactococcus_extraction lactis/ML/ORF Sequence %28a.a%29 -------------------------------------------------------------------GILDDTADGFSE- -------EKI-----KKIIQSLK-------D------GTY----YPQ--------------------------------- -----------------PVRRM-YI--AK-----------KNS--K-----------------KMRPL------------ ------GIPT----FT-DKLIQEAVRII-LE-SIYEPVF-------E--------------------------------- ---DVSHGFRP------------------QR----------------------------SCHTA---LKTI--------- ----KREFG-------------------------GARW-FV-EG--------DIKGCF-DNID----------------- -HVT--LIGL-I-NLKI---------KDMK------------------MSQLI---------YKFLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------L-----E-N--------------- -------------------------------------------------------------------------------- ---------------W---------Q--YHK-TY-S-----------GTP-----QGGIL------------------S- PLLANIYL-----------------------------------------------------------HE----------- --------LD--KF-V-LQ-L-----------------KMKFDRE--SPERITPEYRE-LHN----EIKRISHRLKKL-- -------EGEEKAKVLLEYQEKRKRLPTLPCTSQTN---KVLKYVR----------YADDFIIS------------VKG- -----SKED----CQ---------------------------WIKEQLKL---------------------FIHN----- ----K--L----KMEL------S-E------EK----T---LITHS-------------S-------------------- Q---------------PARFLGYDI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D.h.I4/AP008230.1/5193183..5195085/Desulfitobacterium_extraction hafniense/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGMQVDELLP- --FLE--NHK-----DELVKSLW-------D------GKY----RPK--------------------------------- -----------------PVRRV-EI--LK-----------E-N--G-----------------KMRKL------------ ------GIPT----VV-DRLIQQAITQV-LS-PIFEEQF-------S--------------------------------- ---DSSFGFRP------------------KR----------------------------SAHDA---LRRC--------- ----QSHING------------------------GYRY-VV-DM--------DLEKYF-DTVN----------------- -QSK--LIQI-L-SETI---------KDGR------------------VISLI---------HKFLQS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----V-D--------------- -------------------------------------------------------------------------------- ---------------G---------L--FEE-SP-E-----------GVP-----QGGPL------------------S- PLLGNIML-----------------------------------------------------------NE----------- --------CD--H------------------------------E------------------------------------ ----------------------LER--------------RGHRFVR----------YADDMMI-------------FCK- -----SKKA----AK---------------------------RTLDHILP---------------------YIEG----- ----K--L----FLKV------N-R------EK----T---KVAHVN--------------------------------- ----------------YVKFLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|43183350|locus|VBIRumTor148568_2143|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Ruminococcus torques L214] -------------------------------------------------------------------GNDGMQVDELLP- --FLR--ENQ-----DTLIRKIR-------E------GKY----KPN--------------------------------- -----------------PVRRV-EI--PK-----------ETE--G-----------------EFRKL------------ ------GVPT----VV-DRVIQQAIAQE-LS-PVYEKQF-------S--------------------------------- ---ENSFGFRP------------------KR----------------------------GAHDA---LRQC--------- ----QKNVND------------------------GYVY-VV-DM--------DLEKFF-DTVC----------------- -QSK--LIEV-L-SRTI---------KDGR------------------VISLI---------HKYLNA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------I-----A-K--------------- -------------------------------------------------------------------------------- ---------------G---------M--FER-TE-V-----------GMP-----QGGPL------------------S- PLLSNVML-----------------------------------------------------------NE----------- --------LD--K------------------------------E------------------------------------ ----------------------LES--------------RGHRFVR----------YADDCMI-------------FCK- -----SRKS----AE---------------------------RTLKNIIP---------------------FIEG----- ----K--L----FLKV------N-R------KK----T---EVSHIS--------------------------------- ----------------KVKYLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42783608|locus|VBIButBac39087_0859|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [butyrateproducing bacterium SM4/1] -----------------------------------------------------------------------MQVDELLP- --YLI--NHR-----DELVRQLR-------E------GKY----KPN--------------------------------- -----------------PVRRV-EI--PK-----------EEK--G-----------------KFRKL------------ ------RIPT----VV-DRMIQQAIAQE-LT-PIYEEQF-------S--------------------------------- ---DNSYGFRP------------------GR----------------------------SAHDA---LAKC--------- ----RKYVDE------------------------GHVY-AI-SM--------DLEAYF-DTVN----------------- -HSK--LIEV-L-SRTM---------KDGR------------------VISLI---------HRYLNA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------G--FHA-TP-E-----------GVP-----QGGPL------------------S- PLCGNVML-----------------------------------------------------------NE----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHKFVR----------YADDCII-------------LCK- -----SRKS----AE---------------------------RTLKHIIP---------------------FITE----- ----K--L----YLKI------N-L------EK----T---TVSHIS--------------------------------- ----------------KVKYLGYGF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|22668882|locus|VBINatThe92436_0159|_extraction Mobile element protein [Natranaerobius thermophilus JW/NMWNLF] -------------------------------------------------------------------GIDGMGVDELLQ- --YLK--ENG-----DHLRQRVL-------D------GKY----RPN--------------------------------- -----------------PVRRV-EI--PK-----------E-D--G-----------------KKRKL------------ ------GIPT----VV-DRVIQQAIAQV-LS-PIYEEQF-------S--------------------------------- ---DNSYGFRP------------------GR----------------------------STHDA---IKKS--------- ----QQNINE------------------------GYKY-VV-DM--------DLEKYF-DTVN----------------- -QSK--LIEV-L-SKTI---------KDGR------------------VISLI---------NKYLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-K--------------- -------------------------------------------------------------------------------- ---------------H---------T--YKD-TE-V-----------GVP-----QGGPL------------------S- PILSNIML-----------------------------------------------------------HE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGHEFVR----------YADDLLI-------------FCK- -----SRRS----AG---------------------------RTLKNILP---------------------FIEN----- ----K--L----FLKV------N-K------DK----T---VVAYVG--------------------------------- ----------------KVRFLGFGF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.t.I3/AE015928/3254698..3258524/Bacteroides_extraction thetaiotaomicron/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDKMSCEQLLS- --WLK--ANK-----DELIISLQ-------S------GTY----RPN--------------------------------- -----------------PVRRV-EI--PK-----------D-N--G-----------------KKRLL------------ ------GIPT----VV-DRLVQQAINQV-LT-LIYERQF-------S--------------------------------- ---KTSYGFRP------------------QR----------------------------GCHDA---LRKA--------- ----QKIVSE------------------------GYIY-VV-DL--------DLERFF-DTVS----------------- -HSK--LIEI-L-SRTI---------KDGR------------------VISLI---------HKYLRS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------I-----N-R--------------- -------------------------------------------------------------------------------- ---------------G---------M--FEM-ST-E-----------GTP-----QGGPL------------------S- PLLSNIML-----------------------------------------------------------NE----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHPFVR----------YADDAMI-------------FCK- -----SKRA----AK---------------------------RVRESITL---------------------FIEG----- ----K--L----FLKV------N-H------EK----T---VVSYVK--------------------------------- ----------------GVKFLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Fl.jo.I1/CP000685/4416242..4418139/Flavobacterium_extraction johnsoniae/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDKLSTEHLQE- --WLL--KHK-----ESLIESLE-------K------GKY----KPQ--------------------------------- -----------------AVRRV-EI--PK-----------E-G--G-----------------KTRSL------------ ------GIPT----VV-DRLVQQSIIQI-LT-PIYEQEF-------H--------------------------------- ---TSSHGFRP------------------KR----------------------------GCHTA---LKEV--------- ----ESHLND------------------------EYCY-VV-DL--------DLEKFF-DTVN----------------- -HSR--LIEL-L-SKKV---------KDPR------------------VISLI---------HKYLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------V-----V-V--------------- -------------------------------------------------------------------------------- ---------------N---------K--FEE-SV-L-----------GVP-----QGGPL------------------S- PLLSNIML-----------------------------------------------------------HE----------- --------LD--K------------------------------E------------------------------------ ----------------------LAR--------------RGHRFVR----------YADDCLI-------------FCK- -----SKRA----CL---------------------------RVKESITA---------------------FIES----- ----V--L----YLRV------N-K------EK----T---TVGYIR--------------------------------- ----------------GKKFLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|47240589|locus|VBISphSp165585_3447|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Sphingobacterium sp. 21] -----------------------------------------------------------------------------QD- --YLV--ENK-----DELITAIL-------R------GRY----RPN--------------------------------- -----------------PVRRV-AI--PK-----------D-N--G-----------------QQRQL------------ ------GIPT----VV-DRFIQQAIAQV-LL-PLYEPQF-------S--------------------------------- ---EHSYGFRP------------------RR----------------------------NAHHA---LKQC--------- ----RDYITA------------------------GYSY-AV-DL--------DIEKFF-DQVN----------------- -HSK--LIEV-L-SGTI---------KDGR------------------VLSLI---------HKYLNA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------Q-----V-G--------------- -------------------------------------------------------------------------------- ---------------G---------S--YER-SE-M-----------GVP-----QGGPL------------------S- PLLSNIML------------------------------------------------------------------------ -------------------------------------------------------------------------------- --------------------------------------------VR----------YADDLLL-------------MCK- -----SKRS----GQ---------------------------RVMGSLIS---------------------FIEN----- ----K--L----HLKV------N-R------DK----S---QTAPVS--------------------------------- ----------------RVKFLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >S.ag.I1/AJ292930/182..2038/Streptococcus_extraction agalactiae/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGVTIEQMDD- --YLH--QNW-----RETKKLIK-------E------RSY----KPQ--------------------------------- -----------------PVLRV-EI--PK-----------P-N--G-----------------GVRNL------------ ------GIPT----AM-DRMIQQAIVQV-LS-PLCEKHF-------S--------------------------------- ---EYSYGFRP------------------NR----------------------------SCETA---IVQL--------- ----LEYLND------------------------GYEW-IV-DI--------DLEKFF-DTVP----------------- -QDR--LMSL-V-HNII---------QDGD------------------TESLI---------RKYFHS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------V-----I-N--------------- -------------------------------------------------------------------------------- ---------------G---------Q--RHK-TL-V-----------GTP-----QGGNL------------------S- PLLSNIML-----------------------------------------------------------NE----------- --------LD--K------------------------------G------------------------------------ ----------------------LEK--------------RGLRFVR----------YADDCVI-------------TVG- -----SEAA----AK---------------------------RVMHSVSS---------------------YIEK----- ----R--L----GLKV------N-M------TK----T---KIVRPN--------------------------------- ----------------KLKYLGFGF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|61433042|locus|VBIStrMac222691_1557|_extraction Retrontype reverse transcriptase [Streptococcus macedonicus ACADC 198] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --MSNIML-----------------------------------------------------------NE----------- --------LD--K------------------------------E------------------------------------ ----------------------LES--------------RGLHFVR----------YADDCVI-------------TVG- -----SGAA----AK---------------------------RVMHSISR---------------------FIEQ----- ----G--L----GLKV------N-M------TN----L---T-------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|54587919|locus|VBIPorAsa172508_0403|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Porphyromonas asaccharolytica DSM 20707] -------------------------------------------------------------------GIDGMKVEELRD- --YMN--ANW-----TSIKQSIL-------E------RRY----KPA--------------------------------- -----------------PVRRV-EI--PK-----------P-N--G-----------------GVRKL------------ ------GIPT----VV-DRTLQQSIVQV-LT-PIFEAEF-------Q--------------------------------- ---ENSYGFRP------------------GR----------------------------SCEQA---VQKL--------- ----LEYLNE------------------------GAEW-IV-DI--------DLEKFF-DNVP----------------- -QDK--LMSY-V-GRVI---------HDPD------------------TESLI---------RKYLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-N--------------- -------------------------------------------------------------------------------- ---------------G---------L--YEA-TE-L-----------GTP-----QGGNL------------------S- PLLSNVML-----------------------------------------------------------NE----------- --------LD--K------------------------------E------------------------------------ ----------------------MVR--------------RGLRYVR----------YADDCVI-------------AVR- -----SEAS----AK---------------------------RVMHSVTQ---------------------WIER----- ----V--L----GLKV------N-A------TK----T---HVCRPS--------------------------------- ----------------KLKYLGFGF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42987381|locus|VBIEubRec155814_2884|_extraction Mobile element protein [Eubacterium rectale M104/1] -------------------------------------------------------------------GVDGMKYTELKE- --HLA--KNG-----ETIKGQLR-------T------RKY----KPQ--------------------------------- -----------------PARRV-EI--PK-----------P-D--G-----------------GVRNL------------ ------GVPT----VT-DRFIQQAIAQV-LT-PIYEEQF-------H--------------------------------- ---DHSYGFRP------------------NR----------------------------CAQQA---ILTA--------- ----LNIMND------------------------GNDW-IV-DI--------DLEKFF-DTVN----------------- -HDK--LMTL-I-GRTI---------KDGD------------------VISIV---------RKYLVS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----I-D--------------- -------------------------------------------------------------------------------- ---------------D---------E--YED-SI-V-----------GTP-----QGGNL------------------S- PLLANIML-----------------------------------------------------------NE----------- --------LD--K------------------------------E------------------------------------ ----------------------MEK--------------RGLNFVR----------YADDCII-------------MVG- -----SEMS----AN---------------------------RVMRNISR---------------------FIEE----- ----K--L----GLKV------N-M------TK----S---KVDRPS--------------------------------- ----------------GLKYLGFGF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|102075684|locus|VBIAmpXyl149409_1227|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Amphibacillus xylanus NBRC 15112] -------------------------------------------------------------------GVDGMSVEQIQG- --YLA--LNK-----DDLLKSIR-------N------RTY----KPE--------------------------------- -----------------PVLRV-EI--PK-----------P-N--G-----------------GVRLL------------ ------GIPT----VK-DRVIQQAIAQI-LT-PLFDRQF-------S--------------------------------- ---DYSYGFRP------------------RR----------------------------YAEMA---IIKG--------- ----LEYMND------------------------GYEW-IV-DI--------DLERFF-DTVN----------------- -HDR--LMNL-V-ARTV---------EDGD------------------VISLI---------RKFLVS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------Q-----I-D--------------- -------------------------------------------------------------------------------- ---------------E---------E--YKE-TI-I-----------GTP-----QGGNL------------------S- PLLSNIML-----------------------------------------------------------HE----------- --------LD--M------------------------------E------------------------------------ ----------------------LEN--------------RGLRFVR----------YADDCII-------------FAK- -----SQMA----AN---------------------------RIMRSITR---------------------FIEE----- ----K--L----GLIV------N-A------DK----S---KVTNPN------------N-------------------- T---------------DFKFLGFG-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|201988153|locus|VBIBacThu93926_0157|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus thuringiensis YBT1518] -------------------------------------------------------------------GVDGVTVDELKQ- --YLK--ENK-----DELRQRIR-------T------RKY----QPQ--------------------------------- -----------------AALRV-EI--PK-----------E-N--G-----------------KMRKL------------ ------GIPT----VV-DRVVQQAIHQI-LS-PIFEKQF-------S--------------------------------- ---EFSYGFRP------------------KR----------------------------SCEMA---IVKS--------- ----LEFLNA------------------------GYEW-IV-DI--------DLERFF-DTVH----------------- -HDK--LMRI-I-SNTI---------SDGD------------------VISLI---------RKYLVS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----V-N--------------- -------------------------------------------------------------------------------- ---------------G---------K--YEE-TS-V-----------GTP-----QGGNL------------------S- PLLSNIML-----------------------------------------------------------NE----------- --------LD--K------------------------------E------------------------------------ ----------------------LES--------------RELQFVR----------YADDALI-------------FVK- -----SEKA----AS---------------------------RVMKSIVR---------------------FIEK----- ----N--L----GLIV------N-T------EK----S---KISRPE--------------------------------- ----------------DLKFLGFG-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.h.I1/AP001507/130149..132031/Bacillus_extraction halodurans/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDEMDVKSLRL- --HLH--ENW-----TSIRNEII-------E------GSY----FPK--------------------------------- -----------------PVRRV-EI--PK-----------P-N--G-----------------GVRKL------------ ------GIPT----VM-DRFLQQAIAQI-LT-QLYDPTF-------S--------------------------------- ---ERSFGFRP------------------HR----------------------------RGHNA---VRQA--------- ----KQWMKE------------------------GYRW-VV-DI--------DLEKFF-DKVN----------------- -HDR--LMRK-L-SSRI---------QDPR------------------VLQLI---------RRYLQT------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-R--------------- -------------------------------------------------------------------------------- ---------------G---------L--VSP-NT-E-----------GTP-----QGGPL------------------S- PLLSNIVL-----------------------------------------------------------DE----------- --------LD--N------------------------------E------------------------------------ ----------------------LEK--------------RGLKFVR----------YADDCNI-------------YVR- -----SKRA----GL---------------------------RIMESVTS---------------------FIEN----- ----R--L----KLKV------N-R------EK----S---AVDRPW--------------------------------- ----------------NRKFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >O.i.I1/BA000028/2785523..2787411/Oceanobacillus_extraction iheyensis/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDEMSVKFLRR- --HLY--DNW-----DSLRENLR-------K------GTY----TPS--------------------------------- -----------------PVRRV-EI--PK-----------P-S--G-----------------GVRML------------ ------GIPT----VT-DRFIQQAIAQV-LH-TIFDPSF-------S--------------------------------- ---EHSYGFRP------------------NR----------------------------RGHDA---VRKA--------- ----RGFIKE------------------------GYRW-VI-DM--------DLEKFF-DKVN----------------- -HDK--LMGV-L-AKRI---------KDKE------------------LLRLI---------RKYLQS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-N--------------- -------------------------------------------------------------------------------- ---------------G---------I--VVS-SE-E-----------GTP-----QGGPL------------------S- PLLSNIIL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LEE--------------RGLRFVR----------YADDCNI-------------YVR- -----TKKA----GN---------------------------RVMNSITT---------------------FIEE----- ----K--L----RLKV------N-K------EK----S---AVDRPW--------------------------------- ----------------KRKFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|21969758|locus|VBIGeoSp101709_0522|_extraction Mobile element protein [Geobacillus sp. WCH70] -------------------------------------------------------------------GTDGMSVKDLRR- --HLV--EHW-----DVIRRALE-------E------GTY----EPC--------------------------------- -----------------PVRRV-EI--PK-----------P-N--G-----------------GVRLL------------ ------GIPT----VT-DRFIQQAIAQV-LT-PIFDPSF-------S--------------------------------- ---EHSYGFRP------------------GR----------------------------RGHDA---VKKA--------- ----KQYIQE------------------------GYTW-VV-DI--------DLEKFF-DRVN----------------- -HDK--LMGI-L-AKRI---------PDKI------------------LLKLI---------RKYLQA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--VME-TQ-E-----------GTP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGHKFVR----------YADDCNI-------------YVR- -----TKKA----GE---------------------------RVMKSITA---------------------FIEK----- ----K--L----RLKV------N-E------TK----S---AVDRPW--------------------------------- ----------------RRKFLGFS-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|20956876|locus|VBIAnoFla45531_2524|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Anoxybacillus flavithermus WK1] -------------------------------------------------------------------GVDMMPVQNLRT- --HIV--ENW-----QSIKEAII-------K------GTY----EPM--------------------------------- -----------------PVRRV-EI--PK-----------P-D--G-----------------GVRLL------------ ------GIPT----VT-DRLIQQAIAQV-LS-KIYDPMF-------S--------------------------------- ---EHSYGFRP------------------NR----------------------------SAHDA---VRKA--------- ----QGYIKE------------------------GYRW-VV-DI--------DLEKFF-DQVN----------------- -HDR--LMST-L-AKRI---------HDKP------------------LLKLI---------RKYLQS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-H--------------- -------------------------------------------------------------------------------- ---------------G---------V--VSS-TE-K-----------GTP-----QGGPL------------------S- PLLSNIVL-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGHQFVR----------YADDCNI-------------YVK- -----SKRA----GE---------------------------RTMASVQR---------------------FIER----- ----K--L----RLKV------N-E------KK----S---AVDRPW--------------------------------- ----------------KRKFLGFS-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|102073184|locus|VBIAmpXyl149409_0044|_extraction Mobile element protein [Amphibacillus xylanus NBRC 15112] -------------------------------------------------------------------GIDQMPTTDLRV- --YVM--ENW-----HTMREQLL-------S------GTY----QPQ--------------------------------- -----------------PVRRV-EI--PK-----------P-N--G-----------------GVRKL------------ ------GIPT----VT-DRLIQQAIAQQ-LT-LVFDPTF-------S--------------------------------- ---EFSYGFRP------------------NR----------------------------RAHTA---VKQA--------- ----RAYIEE------------------------GYRW-VV-DM--------DLEKFF-DKVH----------------- -HDR--LMAR-L-ATRI---------KDKV------------------LLHLI---------RQFLQA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-N--------------- -------------------------------------------------------------------------------- ---------------G---------L--VSP-MT-E-----------GTP-----QGGPL------------------S- PLLSNIVL-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGHKFVR----------YADDFHI-------------YVK- -----SSRA----GE---------------------------RVMESITT---------------------FIEK----- ----K--L----RLKV------N-R------EK----S---AVDRPW--------------------------------- ----------------KRKLLGFS-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >S.th.I1/AP006840/1010793..1012672/Symbiobacterium_extraction thermophilum/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVPTERLRD- --QIR--VEW-----SRIREELL-------Q------GTY----RPQ--------------------------------- -----------------PVRRV-EI--PK-----------P-G--G-----------------GKRML------------ ------GIPT----VM-DRLIQQALLQV-LT-PIFDPTF-------S--------------------------------- ---ESSYGFRP------------------GR----------------------------RGHDA---VRKA--------- ----RQYVEE------------------------GYDW-VV-DM--------DLEKFF-DRVN----------------- -HDV--LMAR-V-ARRV---------TDKR------------------VLRLI---------RRYLQA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----L-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--VVA-TE-E-----------GTP-----QGGPL------------------S- PLLANILL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHHFVR----------YADDCNI-------------YVR- -----SKRA----GE---------------------------RVYRSVRH---------------------FLQE----- ----R--L----RLKV------N-E------EK----S---AVDRPW--------------------------------- ----------------KRQFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|124876795|locus|VBIGeoSp266368_3508|_extraction Mobile element protein [Geobacillus sp. GHH01] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------MK----------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------------------------------------C- PLLANILL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGLKFYR----------YADDCNI-------------YVK- -----SLRA----GQ---------------------------RVKQSIQR---------------------FLER----- ----T--L----KLKV------N-E------EK----S---AVDRPW--------------------------------- ----------------KRAFLGFS-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|45221880|locus|VBIGeoSp94955_0342|_extraction Mobile element protein [Geobacillus sp. Y412MC52] -------------------------------------------------------------------GIDGVSTDQLRD- --YIR--AHW-----STIRAQLL-------A------GTY----RPA--------------------------------- -----------------PVRRV-EI--PK-----------P-G--G-----------------GTRQL------------ ------GIPT----VV-DRLIQQAILQE-LT-PIFDPDF-------S--------------------------------- ---PSSFGFRP------------------GR----------------------------NAHDA---VRQA--------- ----QGYIQE------------------------GYRY-VV-DM--------DLEKFF-DRVN----------------- -HDI--LMSR-V-ARKV---------KDKR------------------VLKLI---------RAYLQA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-E--------------- -------------------------------------------------------------------------------- ---------------G---------V--KVQ-TE-E-----------GTP-----QGGPL------------------S- PLLANILL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGLKFCR----------YADDCNI-------------YVK- -----SLRA----GQ---------------------------RVKQSIQR---------------------FLEK----- ----T--L----KLKV------N-E------EK----S---AVDRPW--------------------------------- ----------------KRAFLGFS-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|22104403|locus|VBIHelMod36755_0020|_extraction Mobile element protein [Heliobacterium modesticaldum Ice1] -------------------------------------------------------------------GIDGMGLESLRP- --YLK--EEW-----SRIKQELL-------E------GTY----RPQ--------------------------------- -----------------PVRRV-EI--PK-----------P-Q--G-----------------GTRKL------------ ------GIPT----VV-DRLIQQALNQI-LM-PIFDPDF-------S--------------------------------- ---TNSYGFRP------------------GK----------------------------SAHQA---VKKA--------- ----KEYIAD------------------------GYRW-VV-DM--------DLAQFF-DRVN----------------- -HDI--LMAR-V-ARKV---------KDKR------------------ILKLI---------REYLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----L-N--------------- -------------------------------------------------------------------------------- ---------------G---------I--RVK-SE-E-----------GTP-----QGGPL------------------S- PLLANIIL-----------------------------------------------------------DD----------- --------LD--K------------------------------A------------------------------------ ----------------------LES--------------RGHRFCR----------YADDCNV-------------YVR- -----SRRA----GQ---------------------------RVMEGMAK---------------------FLEG----- ----R--L----KLQV------N-W------EK----S---AVDRPW--------------------------------- ----------------NRKFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|20880100|locus|VBIAmmDeg104956_0293|_extraction Mobile element protein [Ammonifex degensii KC4] -------------------------------------------------------------------GVDGMEPEALRS- --YLK--EHW-----PRIKEELL-------A------GTY----RPM--------------------------------- -----------------PVRRV-EI--PK-----------P-G--G-----------------GVRLL------------ ------GIPT----VL-DRLIQQALLQV-LT-PIFDPGF-------S--------------------------------- ---PHSYSFRP------------------GR----------------------------SAHQA---VEQA--------- ----RRYVAQ------------------------GYRH-VV-DL--------DLAQFF-DRVN----------------- -HDL--LMAR-V-ARKV---------KDKR------------------VLKLI---------RAYLQA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-N--------------- -------------------------------------------------------------------------------- ---------------G---------C--CVR-TE-E-----------GTP-----QGGPL------------------S- PLLANIIL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHRFVR----------YADDSNV-------------YVK- -----SCRA----GQ---------------------------RVFESLKR---------------------FLEQ----- ----R--L----KLRI------N-E------EK----S---AVDYAW--------------------------------- ----------------RRGILGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|32495095|locus|VBITheIta22270_0082|_extraction Mobile element protein [Thermoanaerobacter italicus Ab9] -------------------------------------------------------------------GVDGMGVDELLP- --YLK--ENW-----ATIKQQLL-------E------GKY----KPQ--------------------------------- -----------------PVRRV-EI--PK-----------P-D--G-----------------GKRLL------------ ------GIPT----VL-DRLIQQAIAQI-LN-KVYNHTF-------S--------------------------------- ---DSSYGFRP------------------GR----------------------------SAKDA---IKAA--------- ----EAYINE------------------------GYTW-VV-DM--------DLEKFF-DRVN----------------- -HDI--IMSK-L-EKRI---------GDKR------------------VLKLI---------RRYLES------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-N--------------- -------------------------------------------------------------------------------- ---------------G---------I--KVS-TE-E-----------GTP-----QGGPL------------------S- PLLANIML-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGHKFCR----------YADDCNI-------------YVR- -----SRSA----GN---------------------------RVMKSIKK---------------------FIES----- ----K--L----KLKV------N-E------AK----S---AVDRPW--------------------------------- ----------------RRKFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >C.a.I1/AE001437/3710916..3712835/Clostridium_extraction acetobutylicum/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGMKVDELLQ- --YLK--QNG-----KTLIASIF-------N------GKY----CPK--------------------------------- -----------------AVRRV-EI--PK-----------P-D--G-----------------GIRLL------------ ------GIPT----VV-DRTIQQAISQV-LT-PIFEKTF-------S--------------------------------- ---ENSYGFRP------------------KR----------------------------SAKQA---IKKA--------- ----KEYMEE------------------------GYKW-VV-DI--------DLAKYF-DTVN----------------- -HDK--LMAL-V-ARKI---------KDKR------------------VLKLI---------RLYLQS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--VSE-TE-R-----------GCP-----QGGPL------------------S- PLLSNIML-----------------------------------------------------------TE----------- --------LD--R------------------------------E------------------------------------ ----------------------LEK--------------RGHKFCR----------YADDNNV-------------YVR- -----SKKA----GD---------------------------RVMRSITR---------------------FIEN----- ----K--L----KLKV------N-K------EK----S---AVDRPW--------------------------------- ----------------RRKFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42639654|locus|VBIHalSp157090_0576|_extraction Mobile element protein [Halanaerobium hydrogeniformans] -------------------------------------------------------------------GIDGMSVDELLP- --LLK--RNG-----SQLLKDIL-------E------GNY----KPQ--------------------------------- -----------------AVRRV-EI--PK-----------P-G--G-----------------GVRLL------------ ------GIPT----VI-DRMIQQAITQQ-LT-PIFDPGF-------S--------------------------------- ---EYSYGFRP------------------GR----------------------------NAHQA---VNKA--------- ----KEYIND------------------------GYTW-VV-DI--------DLEKYF-DTVQ----------------- -HDK--LMSL-V-ARKV---------QDKR------------------VLKLI---------RAYLNA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-D--------------- -------------------------------------------------------------------------------- ---------------G---------L--IKK-TD-E-----------GCP-----QGGPL------------------S- PLLSNIML-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RNHKFCR----------YADDSQI-------------YVK- -----SRKA----AK---------------------------RVMKSLTV---------------------FIEK----- ----K--L----KLKV------N-A------TK----S---AVGRPW--------------------------------- ----------------RRKFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|124770602|locus|VBICloSac181665_0553|_extraction Mobile element protein [Clostridium saccharoperbutylacetonicum N14(HMT)] -------------------------------------------------------------------GVDGMKTDELRE- --HIK--KHW-----ETIKVKLL-------E------SKY----NPS--------------------------------- -----------------PVRRK-EI--SK-----------P-D--G-----------------GVRLL------------ ------GIPT----VQ-DRLIQQAIAQV-LS-KIYEPLF-------S--------------------------------- ---ENSFGFRP------------------HR----------------------------GAKDA---ITKS--------- ----KQYITQ------------------------GNRW-VI-DM--------DLEKFF-DKVN----------------- -HDI--LMNK-L-EKKI---------QDKR------------------LLSLI---------RKYLKS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----I-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--SVA-SE-E-----------GTP-----QGGPL------------------S- PLLANIML-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGYKFCR----------YADDNNI-------------YVK- -----SKRA----GF---------------------------RVMKSITN---------------------IIEN----- ----N--L----KLKV------N-K------DK----S---AVDFVS--------------------------------- ----------------KRKFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Sy.fu.I1/CP000478/3922427..3924309/Syntrophobacter_extraction fumaroxidans/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVSVDALRA- --CLR--EHW-----PRIKEELL-------E------GRY----QPQ--------------------------------- -----------------PVRKV-EI--PK-----------P-G--G-------------K---GMRQL------------ ------GIPT----VM-DRLIQQALNQV-MQ-PIFDPDF-------S--------------------------------- ---ESSYGFRP------------------GR----------------------------SAHQA---VLRA--------- ----REYAAT------------------------DRRW-VV-DM--------DLEKFF-DRVN----------------- -HDI--LMAR-L-ARKI---------ADRR------------------VLQLI---------RRYLQA------------ -------------------------------------------------------------------------------- ------------------GS------------------------------------M-----V-G--------------- -------------------------------------------------------------------------------- ---------------G---------V--VSP-RT-E-----------GTP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LEQ--------------RGHAFCR----------YADDCNI-------------YVK- -----SRRA----GQ---------------------------RVLESLTR---------------------FLAN----- ----R--L----KLKV------N-V------DK----S---AVARPW--------------------------------- ----------------VRKFLGYSM------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|21697130|locus|VBIDesRet71890_0165|_extraction Mobile element protein [Desulfohalobium retbaense DSM 5692] -------------------------------------------------------------------GVDGMSVNDVWG- --YCT--LNW-----ARIKEELL-------D------GRY----EPQ--------------------------------- -----------------PVLGV-EI--PK-----------P-G--G-----------------GVRQL------------ ------GIPT----AL-DRLIQQALHQV-LS-PIFNPHF-------S--------------------------------- ---ESSYGFRP------------------GR----------------------------SAHQA---VLKA--------- ----REHAAA------------------------GKRW-VV-DM--------DLEKFF-DRVN----------------- -HDV--LMAR-V-ARKV---------KDKR------------------VLVLI---------RRYLQA------------ -------------------------------------------------------------------------------- ------------------GL------------------------------------M-----Q-G--------------- -------------------------------------------------------------------------------- ---------------G---------I--ASK-RK-E-----------GTP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHAFCR----------YADDCNI-------------YVQ- -----TKRS----GE---------------------------RAMASITR---------------------FLTE----- ----R--L----KLRV------N-A------DK----S---AVDRPW--------------------------------- ----------------KRKFLGYSM------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Pe.ca.I2/CP000142/1559685..1561618/Pelobacter_extraction carbinolicus/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGMPVGDLKT- --YLQ--EQW-----PRIKEELL-------T------GTY----QPQ--------------------------------- -----------------PVRKV-EI--PK-----------P-G--G-----------------GMRML------------ ------GIPT----VL-DRLIQQALHQE-LM-RLFEPEF-------S--------------------------------- ---EHSYGFRP------------------GR----------------------------SAHQA---VQSA--------- ----RRHVAS------------------------GRRW-AV-DI--------DLEKFF-DRVG----------------- -HDI--LMSR-V-ARKV---------KDRR------------------VLGLI---------RRYLTV------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----E-G--------------- -------------------------------------------------------------------------------- ---------------G---------I--ISP-RV-Q-----------GTP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------DE----------- --------FD--K------------------------------E------------------------------------ ----------------------LER--------------RGHAFCR----------YADDCNI-------------YVH- -----SRRA----AE---------------------------RVMTSLTR---------------------FLEQ----- ----Q--L----KLKV------N-R------VK----S---AVGRPW--------------------------------- ----------------ERTFLGYSM------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ge.ur.I2/CP000698/242469..244398/Geobacter_extraction uraniireducens/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDNMPVTALKG- --YLQ--EEW-----PRIREELL-------T------GTY----HPQ--------------------------------- -----------------PVRKV-EI--PK-----------P-G--G-----------------GTRML------------ ------GIPT----VL-DRLIQQAVHQV-LS-PLFDPGF-------S--------------------------------- ---ISSHGFRP------------------GR----------------------------SAHQA---IKAA--------- ----RKYVES------------------------GLRW-VV-DI--------DLEKFF-DRVH----------------- -HDT--LMSL-V-KRKV---------GDRL------------------VLSLI---------DSYLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----E-G--------------- -------------------------------------------------------------------------------- ---------------G---------V--TSP-RL-E-----------GTP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------DE----------- --------LD--K------------------------------K------------------------------------ ----------------------LER--------------RGHKFCR----------YADDANI-------------YVA- -----TRRS----GE---------------------------RVMASITG---------------------YLSE----- ----R--L----KLTV------N-Q------GK----S---AVDRPW--------------------------------- ----------------KRSFLSYSM------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|20468517|locus|VBIOchAnt73124_1762|_extraction Mobile element protein [Ochrobactrum anthropi ATCC 49188] -------------------------------------------------------------------GIDKMSVGDLKQ- --HLV--SHW-----PRIREDLL-------A------GRY----EPA--------------------------------- -----------------PVRGV-EI--PK-----------P-G--G-----------------GKRLL------------ ------GIPT----VL-DRLIQQAIHQV-LM-PIFDPGF-------S--------------------------------- ---NSSFGFRP------------------ER----------------------------SAHDA---ILAA--------- ----RSYVAD------------------------GYRV-VV-DL--------DLEKFF-DRVN----------------- -HDV--LMAR-V-ARKV---------YDKR------------------VLRLI---------RRYLQA------------ -------------------------------------------------------------------------------- ------------------GL------------------------------------M-----M-G--------------- -------------------------------------------------------------------------------- ---------------G---------M--TTM-RS-E-----------GTP-----QGGPL------------------S- PLLSNVLL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LEQ--------------RGHRFCR----------YADDCNI-------------YVR- -----SIRA----GQ---------------------------RVMASLTA---------------------FLGR----- ----R--L----KLKV------N-A------TK----S---AVDHPW--------------------------------- ----------------NRVFLGYTMT------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >DEfid|102051933|locus|VBIDesTol47847_2395|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfobacula toluolica Tol2] -------------------------------------------------------------------GIDNMTVDQLPG- --YLR--RHW-----PKVKGKLL-------Q------GNY----KPL--------------------------------- -----------------PVKRK-EI--PK-----------P-D--G-----------------GVRLL------------ ------GIPT----VL-DRLIQQAVSEI-LQ-QIWDPHF-------S--------------------------------- ---ESSHGFRP------------------GR----------------------------SQHDA---ILQG--------- ----KVYLLS------------------------GYTH-SV-NM--------DLSKFF-DRVN----------------- -HDR--LMSR-L-AERI---------KDKR------------------VLKLI---------RSYLTA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-D--------------- -------------------------------------------------------------------------------- ---------------G---------V--VVS-AA-E-----------GTP-----QGGPL------------------S- PVISNIVL-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGHKFVR----------YADDFVI-------------YLK- -----SKKA----AE---------------------------RVMKSVTR---------------------FITV----- ----K--L----RLKV------N-E------EK----S---KVSRPW--------------------------------- ----------------LDKFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|102053344|locus|VBIDesTol47847_3092|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfobacula toluolica Tol2] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------M--------DLSKFF-DRVN----------------- -HDR--LMSR-L-ATRI---------KDKR------------------VLKLI---------RKYLTA------------ -------------------------------------------------------------------------------- ------------------GT------------------------------------M-----I-N--------------- -------------------------------------------------------------------------------- ---------------G---------L--VVF-ST-E-----------GTP-----QGGPL------------------S- PLLSNIVL-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGLRFVR----------YADDFVI-------------YLK- -----SKKA----AQ---------------------------RVMESIKR---------------------FITL----- ----K--L----KLKV------N-E------EK----S---SVGNAW--------------------------------- ----------------RSKFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ha.ch.I2/NC_007645/98723..100647/Hahella_extraction chejuensis/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDQMPVAALKG- --HLQ--QHW-----PTLRERLL-------A------GDY----HPQ--------------------------------- -----------------PVRRV-SI--PK-----------P-Q--G-----------------GERIL------------ ------GIPT----VQ-DRLIQQALHQV-LS-PMLEPIF-------S--------------------------------- ---DHSYGFRP------------------GR----------------------------SAHQA---VRAM--------- ----QRHIND------------------------GHRW-VV-DL--------DLEQFF-DRVN----------------- -HDV--LMGL-L-ARRI---------ADRR------------------MLTLI---------RRYLQA------------ -------------------------------------------------------------------------------- ------------------GM------------------------------------L-----D-G--------------- -------------------------------------------------------------------------------- ---------------G---------L--VSP-RR-E-----------GAP-----QGGPL------------------S- PLLSNVLL-----------------------------------------------------------TE----------- --------LD--R------------------------------E------------------------------------ ----------------------LER--------------RGHRFCR----------YADDCNI-------------YVR- -----SERA----GH---------------------------RVMTSITH---------------------YLKM----- ----H--L----RLKV------N-A------EK----S---VVDRPW--------------------------------- ----------------RRSYLGYSV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|54384448|locus|VBIPaeMuc192881_5730|_extraction Mobile element protein [Paenibacillus mucilaginosus KNP414] -------------------------------------------------------------------GVDGVTVAHLQV- --YLK--THW-----EAVRAALL-------T------GTY----RPS--------------------------------- -----------------PVKRV-EI--PK-----------P-G--G-----------------GVRLL------------ ------GIPT----VM-DRFLQQALLQV-MN-PIFDAHF-------S--------------------------------- ---WHSYGFRP------------------RK----------------------------RAHDA---VKQA--------- ----QRYIQD------------------------GLRW-VV-DM--------DLEKFF-DRVN----------------- -HDM--LMAR-V-ARKV---------TDKR------------------VLKLI---------RAYLNA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----A-D--------------- -------------------------------------------------------------------------------- ---------------R---------A--LER-TD-E-----------GTP-----QGGPL------------------S- PLLANILL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LTK--------------RGLRFVR----------YADDCNI-------------FVA- -----SKRA----GE---------------------------RVMKSMTD---------------------FVEG----- ----K--L----KLKV------N-R------DK----S---AVDRPW--------------------------------- ----------------NRKLLGFS-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >So.us.I3/CP000473/9594438..9596378/Solibacter_extraction usitatus/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGMTVIGIKD- --YLK--QHW-----PAIRGQLL-------S------GTY----EPK--------------------------------- -----------------PVRRV-EI--AK-----------P-D--G-----------------GVRKL------------ ------GIPT----VL-DRFIQQAVMQV-LQ-RRWDRTF-------S--------------------------------- ---DYSYGFRP------------------GR----------------------------SAQQA---VAQA--------- ----QQYIAE------------------------GHGW-CV-DL--------DLEKFF-DRVN----------------- -HDK--LMGQ-I-AKRI---------ADKR------------------LLKLI---------RAFLNA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-N--------------- -------------------------------------------------------------------------------- ---------------G---------L--VSP-SV-E-----------GTP-----QGGPL------------------S- PLLSNLVL-----------------------------------------------------------DE----------- --------FD--R------------------------------E------------------------------------ ----------------------LER--------------RGHRFVR----------YADDCNI-------------YVR- -----SERA----GQ---------------------------RVMESITQ---------------------FITQ----- ----K--L----KLKV------N-E------TK----S---AVARPQ--------------------------------- ----------------ERKFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bu.vi.I2/CP000617/381828..383697/Burkholderia_extraction vietnamiensis/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGMTVQALPA- --FLR--EQW-----PSIRATLL-------N------GTY----KPQ--------------------------------- -----------------PVRRV-EI--PK-----------PDG--G-----------------GVRKL------------ ------GIPC----AL-DRFVQQAVLQV-LQ-RQWDPTF-------S--------------------------------- ---EASYGFRP------------------GR----------------------------SAHQA---VAKA--------- ----QSYIQS------------------------GYRW-VV-DL--------DLEKFF-DRVN----------------- -HDI--LMSR-V-ARRV---------SDRR------------------VLKLI---------RSFLTA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-H--------------- -------------------------------------------------------------------------------- ---------------G---------L--VGA-TD-E-----------GTP-----QGGPL------------------S- PLLSNLML-----------------------------------------------------------DD----------- --------LD--R------------------------------E------------------------------------ ----------------------LGR--------------RGLRFVR----------YADDCNV-------------YVR- -----SERA----GQ---------------------------RVMVGLKA---------------------FLTG----- ----K--L----KLKV------N-E------AK----S---AVARPH--------------------------------- ----------------TRKFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Br.sp.I1/CP000494/6816299..6818172/Bradyrhizobium_extraction sp./Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGMTVDDLPT- --YLK--ANW-----LTIRAQLL-------D------GTY----KPQ--------------------------------- -----------------AVRRV-EI--PK-----------A-S--G-----------------GVRLL------------ ------GIPT----VV-DRFIQQAVLQV-LQ-GEWDRTF-------S--------------------------------- ---DASYGFRP------------------GR----------------------------SAHQA---VTKA--------- ----QAYIAS------------------------RHRI-VV-DI--------DLEKFF-DRVN----------------- -HDI--LMGL-V-AKRV---------ADKR------------------LLKLI---------RGFLTA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----E-G--------------- -------------------------------------------------------------------------------- ---------------G---------L--VSP-TE-E-----------GAP-----QGGPL------------------S- PLLSNLML-----------------------------------------------------------DV----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHRFVR----------YADDCNI-------------YVR- -----SRKA----GE---------------------------RVMASIET---------------------FLER----- ----C--L----KLKV------N-R------AK----S---AVARPN--------------------------------- ----------------HRKFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|21803083|locus|VBIDicZea111179_3566|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Dickeya zeae Ech1591] -------------------------------------------------------------------GVEGMSVSELPD- --YLK--HHW-----PELKAQLL-------S------GSY----CPS--------------------------------- -----------------PVRRV-TI--PK-----------P-G--G-----------------GERLL------------ ------GIPT----VV-DRFVQQATMQV-LQ-RQWDASF-------S--------------------------------- ---DSSYGFRP------------------GR----------------------------SAHQA---VKQA--------- ----QGYIGS------------------------GHHW-VV-DL--------DLEKFF-DRVN----------------- -HDV--LMSR-V-AKRV---------SDKR------------------VLSLI---------RGFLNA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-A--------------- -------------------------------------------------------------------------------- ---------------G---------L--VSP-VT-E-----------GMP-----QGGPL------------------S- PLLSNLLL-----------------------------------------------------------DD----------- --------FD--K------------------------------E------------------------------------ ----------------------LEK--------------RGLKFAR----------YADDCNI-------------YVK- -----SERA----GN---------------------------RVMEGLTH---------------------WLSR----- ----K--L----KLKV------N-A------KK----S---AVAHPA--------------------------------- ----------------MRKFLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|180719125|locus|VBIEscCol277189_4107|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Escherichia coli HVH 121 (46877826)] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------RRT-SA--GH-----------P-D--G-----------------GGSLH------------ --------PA----GD-DAGIAGAL---------WDSSF-------S--------------------------------- ---DNSYGFRP------------------GR----------------------------SAHQA---VIQA--------- ----REHIGA------------------------GYHW-VV-DL--------DLEKFF-DRVN----------------- -HDV--LMSR-I-EKRV---------SDKR------------------VLSLI---------RRFLNA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-A--------------- -------------------------------------------------------------------------------- ---------------G---------L--VRP-VT-E-----------GTP-----QGGPL------------------S- PLLSNLLL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGLKFVR----------YADDCNV-------------YVK- -----SERA----DN---------------------------RIMAGLTH---------------------WLSH----- ----K--L----KLKV------N-A------KK----S---AVARPE--------------------------------- ----------------TRKFPGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|190303244|locus|VBISerSp8482_4195|_extraction Mobile element protein [Serratia sp. ATCC 39006] -------------------------------------------------------------------GVDNLSVGELKG- --WLK--QHW-----ASVREALL-------Q------GNY----VPQ--------------------------------- -----------------AIRQV-EI--PK-----------P-D--G-----------------GVRIL------------ ------GIPT----VV-DRLIQQAIQQH-LT-PDYEPEF-------S--------------------------------- ---DSSYGFRP------------------GR----------------------------NAGQA---VQQA--------- ----QSYMQS------------------------GRRW-VV-DL--------DLEKFF-DRVN----------------- -HDI--LMAR-L-SWKI---------KDTR------------------LLKLI---------RRYLEA------------ -------------------------------------------------------------------------------- ------------------DR------------------------------------V-----A-G--------------- -------------------------------------------------------------------------------- ---------------S---------E--ITR-RR-E-----------GMP-----QGSPL------------------S- PLLSNILL-----------------------------------------------------------TD----------- --------LD--R------------------------------E------------------------------------ ----------------------LER--------------RGHKFCR----------YADDGNI-------------YVC- -----SRQA----GE---------------------------HAMKEISH---------------------YLEN----- ----K--L----RLKV------N-A------HK----S---AVDRPW--------------------------------- ----------------KRKFLGYSV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|21262583|locus|VBICanAcc132554_3525|_extraction Mobile element protein [Candidatus Accumulibacter phosphatis clade IIA str. UW1] -------------------------------------------------------------------GVDGLTVFELKA- --WLQ--QHW-----PSVKAALL-------A------GDY----LPA--------------------------------- -----------------AIRKV-EM--PK-----------P-N--G-----------------GVRIL------------ ------GIPT----VL-DRLIQQALLQV-LQ-PEFEPEF-------S--------------------------------- ---EHSYGFRP------------------GR----------------------------NAWQA---VQRA--------- ----QGYIRE------------------------ERRW-VV-DL--------DLEKFF-DRVN----------------- -HDI--LMSR-V-ARRV---------KDER------------------VLKLI---------RRYLEA------------ -------------------------------------------------------------------------------- ------------------GM------------------------------------M-----S-E--------------- -------------------------------------------------------------------------------- ---------------G---------M--VSA-RT-E-----------GTP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------TD----------- --------LD--R------------------------------E------------------------------------ ----------------------LER--------------RGHRFCR----------YADDCNI-------------YVK- -----SKMA----GQ---------------------------HAMDAITD---------------------YLEQ----- ----K--L----KLRV------N-R------DK----S---AVARPW--------------------------------- ----------------QRKFLGYSV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|190304141|locus|VBISerSp8482_4636|_extraction Putative maturaserelated protein [Serratia sp. ATCC 39006] -------------------------------------------------------------------GVDKLTVQELKP- --WLK--QHW-----LSVKGTLI-------A------GSY----LPR--------------------------------- -----------------AIRKV-DI--PK-----------P-N--G-----------------DVRTL------------ ------GIPT----VV-DRLIQQAIAQT-LS-PYVEPSF-------S--------------------------------- ---NSSYGFRP------------------NR----------------------------NAWQA---VRQA--------- ----QQYIQS------------------------GKRW-VV-DM--------DLEKFF-DRVD----------------- -HDI--LMSR-L-ARTI---------KDKR------------------LLKLI---------RRYLEA------------ -------------------------------------------------------------------------------- ------------------DM------------------------------------V-----E-G--------------- -------------------------------------------------------------------------------- ---------------K---------E--VIK-RD-K-----------GMP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHSFCR----------YADDCNI-------------YVS- -----SQKA----GK---------------------------HAQKDISE---------------------FLMN----- ----T--L----KLQV------N-V------RK----S---AVARPW--------------------------------- ----------------ERKFLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|21533277|locus|VBICupTai42494_3259|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Cupriavidus taiwanensis] -------------------------------------------------------------------GVDALEVTALRD- --WLK--VSW-----PSVRAALL-------G------GQY----IPQ--------------------------------- -----------------SVRAV-DI--PK-----------P-S--G-----------------GVRTL------------ ------GIPT----VV-DRLIQQALLQV-LQ-PLYEPGF-------S--------------------------------- ---ESSYGFRP------------------RR----------------------------SAQQA---VLQA--------- ----QRYVQE------------------------GRRW-VV-DI--------DLEKFF-DRVN----------------- -HDI--LMSR-V-ARQV---------KDVR------------------VLKLI---------RRYLEA------------ -------------------------------------------------------------------------------- ------------------GL------------------------------------M-----R-G--------------- -------------------------------------------------------------------------------- ---------------G---------V--VEA-RR-Q-----------GTP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------TD----------- --------WD--R------------------------------E------------------------------------ ----------------------LEK--------------RGLAFCR----------YADDCNI-------------YVR- -----SQAA----GQ---------------------------RLLAGMMT---------------------FLAE----- ----R--L----NLQV------N-E------AK----S---ACARPW--------------------------------- ----------------ARKFLGYSL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|45186915|locus|VBIBurRhi170666_3219|_extraction Mobile element protein [Burkholderia rhizoxinica HKI 454] -------------------------------------------------------------------GVDGLPVEQFKD- --WLK--MHW-----PSVKAALL-------D------ARY----MPA--------------------------------- -----------------AVRAV-DI--PK-----------S-A--G-----------------GVRTL------------ ------GIPT----VL-DRLIQQALHQV-LQ-PIFEPGF-------C--------------------------------- ---ESSYGFRP------------------RR----------------------------SAQQA---VLAA--------- ----QRYVQE------------------------GRRW-VV-DI--------DLAKFF-DRVN----------------- -HDI--LMAR-V-ARQV---------KDAR------------------VLKLI---------RRYLEA------------ -------------------------------------------------------------------------------- ------------------GL------------------------------------M-----R-E--------------- -------------------------------------------------------------------------------- ---------------G---------V--APA-RR-E-----------GAP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------TD----------- --------WD--R------------------------------E------------------------------------ ----------------------LER--------------RGHAFCR----------YADDCNI-------------YVR- -----SKAA----GE---------------------------RLLTQMTT---------------------FLAK----- ----R--L----KLHI------N-E------AK----S---ACARPW--------------------------------- ----------------ERKFLGYSL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >P.a.I1/U77945/1..1919/Pseudomonas_extraction alcaligenes/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GADGMTVADLAG- --YVK--QYW-----PTLKARLL-------A------GEY----HPQ--------------------------------- -----------------AVRAV-EI--PK-----------P-Q--G-----------------GTRQL------------ ------GIPS----VV-DRLIQQALQQQ-LT-PIFDPLF-------S--------------------------------- ---DYSYGFRP------------------GR----------------------------STHQA---IEMA--------- ----RAHVTA------------------------GHRW-CV-EL--------DLEKFF-DRVN----------------- -HDI--LMAC-I-ERRI---------KDKC------------------VLRLI---------RRYLEA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----S-G--------------- -------------------------------------------------------------------------------- ---------------G---------V--VSP-RQ-E-----------GTP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------DE----------- --------LD--R------------------------------E------------------------------------ ----------------------LER--------------RGHRFVR----------YADDANI-------------YVR- -----SPRA----GE---------------------------RVLVSVER---------------------FLRE----- ----R--L----KLTV------N-R------KK----S---QVARAW--------------------------------- ----------------KCDYLGYGM------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >P.s.I1/AE016853/2381076..2382906/Pseudomonas_extraction syringae/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGLGIVETAE- --HLK--TAW-----PGIRAQLL-------A------GTY----RPD--------------------------------- -----------------PVRRV-LI--PK-----------P-G--G-----------------GERKL------------ ------GIPT----VT-DRLIQQALLQV-LQ-PLLDPDF-------S--------------------------------- ---NHSYGFRP------------------ER----------------------------SAHQA---VLAA--------- ----QQYIHS------------------------GRQI-VV-DV--------DLEQFF-DCVE----------------- -HDV--LIAR-L-GRKV---------KDRD------------------VLRLI---------RAYLNS------------ -------------------------------------------------------------------------------- ------------------GA------------------------------------L-----I-E--------------- -------------------------------------------------------------------------------- ---------------G---------M--VMT-ST-R-----------GTP-----QGGPL------------------S- PLLANVVL-----------------------------------------------------------DE----------- --------VD--K------------------------------E------------------------------------ ----------------------LER--------------RGHCFVR----------YADDANV-------------YVR- -----SPKA----GQ---------------------------RVMALLRR----------------------LYG----- ----R--L----GLRV------N-E------SK----S---AVASAF--------------------------------- ----------------GRKFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.f.I1/NZ_AAAC01000271/24723..26575/Burkholderia_extraction fungorum/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGLDIGQTAR- --HLV--TAW-----PVIREQLL-------K------GTY----RPD--------------------------------- -----------------PVRRV-TI--PK-----------P-D--G-----------------GEREL------------ ------GIPT----VT-DRLIQQALLQV-LQ-PILDPTF-------S--------------------------------- ---EHSYGFRP------------------GR----------------------------RAHDA---VLAA--------- ----QSYVQS------------------------GRRI-VV-DV--------DLEKFF-DRVN----------------- -HDI--LIDR-L-KRRI---------DDAG------------------VIRLV---------RTYLNS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----D-D--------------- -------------------------------------------------------------------------------- ---------------G---------V--VQQ-RD-Q-----------GTP-----QGGPL------------------S- PLLANVLL-----------------------------------------------------------DE----------- --------VD--K------------------------------E------------------------------------ ----------------------LER--------------RGHCFAR----------YADDANV-------------YVR- -----SRRA----GE---------------------------RVMALLRR----------------------LYG----- ----R--L----RLKV------N-E------TK----S---AVASVF--------------------------------- ----------------GRKFLGYSL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|31925696|locus|VBIAllVin64954_2919|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Allochromatium vinosum DSM 180] -------------------------------------------------------------------GVDGLDIEQTAR- --LLV--TEW-----PKIRDQLL-------R------GKY----RPS--------------------------------- -----------------PVRRV-TI--PK-----------P-D--G-----------------GEREL------------ ------GIPT----VT-DRLIQQALLQV-LQ-PRLEPTF-------S--------------------------------- ---EHSYGFRP------------------GR----------------------------RAHDA---ILAA--------- ----QGFIQS------------------------GRKI-VV-DV--------DLEKFF-DRVN----------------- -HDI--LIDR-L-QKRI---------DDAG------------------IIQLI---------RAYLNS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----N-D--------------- -------------------------------------------------------------------------------- ---------------G---------V--VLE-RY-Q-----------GTP-----QGGPL------------------S- PLLANVLL-----------------------------------------------------------DE----------- --------VD--K------------------------------V------------------------------------ ----------------------LEK--------------HGHCFAR----------YADDCNV-------------YVR- -----SRKA----GE---------------------------RVMALLRK----------------------CYG----- ----T--L----RLKV------N-E------AK----S---AVASVT--------------------------------- ----------------GRTFLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|54141352|locus|VBISinMel152503_5994|_extraction Mobile element protein [Sinorhizobium meliloti SM11] -------------------------------------------------------------------GADGLSIEATAA- --HLR--TAW-----PGIRERVL-------A------GTY----RPM--------------------------------- -----------------PVRRV-TI--PK-----------P-D--G-----------------GEREL------------ ------GIPT----VT-DRLIQQALLQV-LQ-PLLDPTF-------S--------------------------------- ---EHSHGFRP------------------GR----------------------------SAHDA---VLEA--------- ----QSYVQS------------------------GRRI-VV-DV--------DLEKFF-DRVN----------------- -HDI--LIDR-L-SKRI---------SDKR------------------VIRLI---------RAYLNS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----D-H--------------- -------------------------------------------------------------------------------- ---------------G---------V--VQE-RV-M-----------GTP-----QGGPL------------------S- PLLANVLL-----------------------------------------------------------DE----------- --------VD--K------------------------------E------------------------------------ ----------------------LER--------------RGHCFVR----------YADDCNV-------------YVG- -----SRKA----GE---------------------------RVMALLRR----------------------LYG----- ----R--L----HLTI------N-E------GK----S---AVTSVF--------------------------------- ----------------GRKFLGFSFW------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Afid|54129720|locus|VBISinMel152503_0312|_extraction Mobile element protein [Sinorhizobium meliloti SM11] -------------------------------------------------------------------GADGLSIEATAA- --HLR--TSW-----PGIRERVL-------A------RTY----RPM--------------------------------- -----------------PVRRV-TI--PK-----------P-D--G-----------------GEREL------------ ------GIPT----VT-DRLIQQALLQV-LQ-PLLDPAF-------S--------------------------------- ---EHSHGFRP------------------GR----------------------------SAHGA---VLAA--------- ----QSLVQS------------------------GRRI-VV-DV--------DLEKFF-DRVN----------------- -HDI--LIDR-L-SKRI---------SDKR------------------VIRLI---------RAYLNS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----D-H--------------- -------------------------------------------------------------------------------- ---------------G---------V--VQE-RV-M-----------GTP-----QGGPL------------------S- PLLANVLL-----------------------------------------------------------DE----------- --------VD--K------------------------------E------------------------------------ ----------------------LER--------------RGHCFVR----------YADDCNV-------------YVG- -----SRKA----AN---------------------------GSW----R----------------------FCG----- -------------------------------------------------------------------------------- ---------------------GFT-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|161737930|locus|VBIAzoVin292307_2623|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Azotobacter vinelandii CA] -------------------------------------------------------------------GVDGLDIDQTES- --HLR--QVW-----PSIRQQLL-------M------GTY----QPL--------------------------------- -----------------SVRRV-CI--PK-----------P-D--G-----------------SEREL------------ ------GIPS----VT-DRLIQQALLQV-LQ-PLIDPSF-------S--------------------------------- ---EHSHGFRP------------------GR----------------------------RAWDA---VLSA--------- ----QRYAQE------------------------GYCI-VV-DV--------DLSRFF-DRVN----------------- -HDI--LIDR-L-RRQV---------NDTG------------------VIRLV---------RAYLNA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----D-G--------------- -------------------------------------------------------------------------------- ---------------G---------V--VVE-RL-E-----------GTP-----QGGPL------------------S- PLLANVLL-----------------------------------------------------------DA----------- --------VD--K------------------------------E------------------------------------ ----------------------LER--------------RGHRFAR----------YADDCNV-------------YVR- -----SQKA----GE---------------------------RVMALLKR----------------------CYD----- ----K--L----RLKI------N-E------SK----S---AVAGVF--------------------------------- ----------------GRSFLGYCL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|23651005|locus|VBISodGlo61428_2803|_extraction Mobile element protein [Sodalis glossinidius str. 'morsitans'] -------------------------------------------------------------------GVDGLSIAQTGQ- --HLK--YAW-----PTIRQQVM-------I------GTY----RPQ--------------------------------- -----------------PVRRV-GI--PK-----------P-D--G-----------------SEREL------------ ------GIPT----VI-DRLIQQALLQV-LQ-PLIDPTF-------S--------------------------------- ---EYRYGFRP------------------GR----------------------------RGHDA---VLAS--------- ----HQYVQD------------------------GYRV-VV-DV--------DLSKFF-DRIN----------------- -HDI--LIDR-L-RKHV---------NDAG------------------VIRLV---------RAYLNA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----K-G--------------- -------------------------------------------------------------------------------- ---------------G---------V--VVE-HA-E-----------GTP-----QGCPL------------------S- LLLTNVLL-----------------------------------------------------------DE----------- --------VD--R------------------------------E------------------------------------ ----------------------LEL--------------RGHRFAR----------YADDCNV-------------YVR- -----SEKT----GE----------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bu.cp.I1/NZ_AAEH01000016/115992..117833/Burkholderia_extraction cepacia/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGKSIAETAE- --HLK--THW-----PGIREALL-------D------GSY----RPW--------------------------------- -----------------PVRRV-QI--PK-----------P-D--G-----------------GMREL------------ ------GIPT----VA-DRLIQQALLQV-PQ-PIIDPTF-------S--------------------------------- ---EHSYGFRP------------------GR----------------------------RARDA---VLMA--------- ----QRHVQD------------------------GYRM-VV-DV--------DLEKFF-DRVN----------------- -HDI--LMER-L-SRRI---------DDKA------------------VLRLI---------RLYLVA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----D-G--------------- -------------------------------------------------------------------------------- ---------------G---------V--VSE-RY-E-----------GTP-----QGGPL------------------S- PLLANVLL-----------------------------------------------------------DE----------- --------VD--R------------------------------E------------------------------------ ----------------------LER--------------RGHKFVR----------YADDCNV-------------YVR- -----SGRS----GE---------------------------RVLEGLCK----------------------LYD----- ----R--L----HLKV------N-E------AK----T---AVAPAT--------------------------------- ----------------GRKFLGYRL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|22964000|locus|VBIPolSp102244_5444|_extraction Reverse transcriptase [Polaromonas sp. JS666] -------------------------------------------------------------------GVDGLTIEETPE- --YLK--THW-----SRIRLELL-------N------GTY----RPQ--------------------------------- -----------------AVRRV-EI--PK-----------P-T--G-----------------GMREL------------ ------GIPT----VL-DRLIQQALLQV-LQ-PMIDLTF-------S--------------------------------- ---EFSYGFRP------------------GR----------------------------SAHDA---VLQA--------- ----QRYVQE------------------------GFQV-VV-DV--------DLEKFF-DRVN----------------- -HDI--LMDR-L-AKRI---------ADKA------------------VLRLI---------RQYLQA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----A-G--------------- -------------------------------------------------------------------------------- ---------------G---------V--VMD-RS-E-----------GTP-----QGGPL------------------S- PLLANVLL-----------------------------------------------------------DE----------- --------VD--L------------------------------D------------------------------------ ----------------------LQR--------------RGHRFAR----------YADDCNV-------------YVR- -----SQKA----GE---------------------------RVLLSLRK----------------------LYE----- ----K--L----HLKV------N-E------KK----T---EVGPVF--------------------------------- ----------------GRKFLGYCL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|190413778|locus|VBIVarPar264937_3261|_extraction Reverse transcriptase [Variovorax paradoxus B4] -------------------------------------------------------------------GVDGRTVQQTGE- --DLK--TQW-----PDIRRGLL-------D------GTY----RPS--------------------------------- -----------------PVRRV-GI--PK-----------L-G--G-----------------GTREL------------ ------GIPT----VV-DRLIQQALLQV-LQ-PLIDPTF-------S--------------------------------- ---EHSYGFRP------------------GR----------------------------SAHQA---VQAA--------- ----RQYVEQ------------------------GRRV-VV-DV--------DLGKFF-DRVN----------------- -HDI--LMDR-L-GKRI---------ADKA------------------VLRLI---------RHYLNA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----A-H--------------- -------------------------------------------------------------------------------- ---------------G---------V--MQM-RV-E-----------GTP-----QGGPL------------------SP PLLANVLL-----------------------------------------------------------DE----------- --------VD--R------------------------------A------------------------------------ ----------------------LER--------------RGRKFVR----------YADDCNV-------------YVK- -----SERA----GQ---------------------------RVLDGVRA----------------------CYA----- ----K--L----RLKV------N-E------TK----T---AVATAW--------------------------------- ----------------GRKFLGYCL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.sp.I1/NZ_AAOX01000004/96386..98244/Bacillus_extraction sp./Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDEKDIEATRL- --YLR--ENG-----QEIIQLIR-------E------GKY----KPQ--------------------------------- -----------------PVRRV-EI--PK-----------A-N--G-----------------GKRQL------------ ------GIPT----VT-DRVIQQAVVQR-LT-PIFERQF-------S--------------------------------- ---HFSYGFRP------------------NK----------------------------SAHQA---IEQA--------- ----RQYIEE------------------------GYNF-VV-DM--------DLEKFF-DRVQ----------------- -HDK--LMSL-I-AKTI---------SDKP------------------TLKLI---------RRFLQA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----V-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--VIT-NR-E-----------GTP-----QGGPL------------------S- PLLSNIIL-----------------------------------------------------------NE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGHKFVR----------YADDCNI-------------YVK- -----SIKA----GE---------------------------RVKQGVTE---------------------FLER----- ----K--L----KLKV------N-E------EK----S---AVGKPS--------------------------------- ----------------ARTFLGVSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|61457273|locus|VBISulAci142080_3712|_extraction Mobile element protein [Sulfobacillus acidophilus DSM 10332] -------------------------------------------------------------------GVDGVTTDEFVD- --YLW--EHW-----PTIQGQLR-------A------GTY----HPQ--------------------------------- -----------------PIRGV-EI--PK-----------P-T--G-----------------GVRML------------ ------GIPT----AI-DRFIQQAVLQV-LT-PIFDPQF-------S--------------------------------- ---DHSYGFRP------------------GR----------------------------SAHQA---VRQV--------- ----RRQAEA------------------------GAEW-VI-DL--------DLEKFF-DRIN----------------- -HDI--LMAR-V-ARRV---------QDPQ------------------VLRLI---------RRYLQA------------ -------------------------------------------------------------------------------- ------------------GL------------------------------------M-----L-H--------------- -------------------------------------------------------------------------------- ---------------G---------V--STP-RT-Q-----------GAA-----QGGPL------------------S- PLLANILL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LAR--------------RGLAYVR----------YADDAMI-------------LVH- -----SRRA----GE---------------------------RVLASVSR---------------------YLDR----- ----T--L----HLPV------N-L------TK----S---AVDRLV--------------------------------- ----------------RRTYLGFKF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|61455293|locus|VBISulAci142080_2739|_extraction Mobile element protein [Sulfobacillus acidophilus DSM 10332] -------------------------------------------------------------------GVDGVSTKALVD- --YLG--AHW-----PMIRPQLR-------D------GTY----RPH--------------------------------- -----------------AIRGV-EI--PK-----------P-T--G-----------------GVRTL------------ ------GIPT----VV-DRFIQQAVLQV-LT-PIFDPHF-------A--------------------------------- ---DFSFGFRP------------------GR----------------------------SAHQA---VRHV--------- ----RRLAED------------------------GAEW-GV-DL--------DLEQFF-DRIN----------------- -HDI--RMAR-V-ARRV---------QDLQ------------------VLRLI---------RRYLQA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------R-----V-D--------------- -------------------------------------------------------------------------------- ---------------G---------V--SAP-RT-A-----------GAA-----QGSPL------------------S- PLLANIVL-----------------------------------------------------------DD----------- --------FD--K------------------------------E------------------------------------ ----------------------LER--------------RGVAFVR----------YADDAMI-------------FVH- -----SQRA----GE---------------------------RVLTSVTR---------------------YLEH----- ----R--L----HLPV------N-T------AK----S---AVDRLT--------------------------------- ----------------RRPYLGFKF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >P.ae.I1/AY029772/3515..5441/Pseudomonas_extraction aeruginosa/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGMNIDEFPA- --WVRS-GNW-----KALKQQLV-------T------GCY----QPS--------------------------------- -----------------PVRRV-EI--AK-----------P-D--G-----------------GTRQL------------ ------GIPT----VT-DRVIQQAITQV-LT-PIFDPEF-------S--------------------------------- ---EHSFGFRP------------------GR----------------------------NGQQA---VKQV--------- ----QSIIKE------------------------GRRF-AV-DV--------DLSKFF-DRVN----------------- -HDL--LMTR-L-GDKV---------KDKR------------------LLRLI---------KRYLRA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------I-----D-N--------------- -------------------------------------------------------------------------------- ---------------Q---------F--KGE-SR-V-----------GVP-----QGGPL------------------S- PLLANIML-----------------------------------------------------------DS----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGHKFAR----------YADDFTI-------------LVK- -----SQRA----GE---------------------------RVLRSISQ---------------------YLQS----- ----R--L----KLVV------N-T------DK----S---RVVKTN--------------------------------- ----------------ESQFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|22934106|locus|VBIPhoPro109272_1767|_extraction Mobile element protein [Photobacterium profundum SS9] ------------------------------------------------------------------------------M- --FMR--------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------IPC----VI-DRVIQQAIAQV-LT-PIFDPDF-------S--------------------------------- ---NNSYGFRP------------------GR----------------------------NGQQA---VRPV--------- ----QSTIKQ------------------------RRHY-AV-DV--------DLSKFF-DRVN----------------- -HDL--LMTH-L-GYKV---------KDKR------------------LLKLI---------SRYLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------ICQSKGD-N--------------- -------------------------------------------------------------------------------- ---------------P---------L--YMK-SR-E-----------GVP-----QGGPL------------------S- PLLANIML-----------------------------------------------------------DL----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGHKFAR----------YADDFTI-------------LVK- -----SQRA----GQ---------------------------RVLLSISR---------------------YLQN----- ----R--L----KLTV------N-T------TK----S---HVVRTT--------------------------------- ----------------ESKFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Sh.ba.I2/CP000563/2137684..2139633/Shewanella_extraction baltica/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGMTIEAFPL- --WMQQ-GGW-----QRCKSLLE-------R------GEY----NPS--------------------------------- -----------------AVRRV-EI--DK-----------P-D--G-----------------GKRKL------------ ------GIPN----VI-DRVIQQAIAQI-LT-PLFDPFF-------S--------------------------------- ---ANSFGFRP------------------NR----------------------------NAKQA---VLQV--------- ----RDIIKQ------------------------KRKF-AV-DV--------DLSKFF-DRVN----------------- -HDL--LMTQ-L-RIKV---------QDKR------------------LLALI---------GKYLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------T-----V-N--------------- -------------------------------------------------------------------------------- ---------------D---------Q--FEA-SF-E-----------GVP-----QGGPL------------------S- PLLSNIML-----------------------------------------------------------DS----------- --------LD--K------------------------------E------------------------------------ ----------------------LES--------------RGHKFAR----------YADDFII-------------LVK- -----SIRA----GE---------------------------RVLKSITR---------------------YLAT----- ----K--L----KLVV------N-E------QK----S---QVVEVG--------------------------------- ----------------QSKFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >G.s.I1/AE017180/1028657..1030564/Geobacter_extraction sulfurreducens/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVTIDAFPE- --RFR--PLW-----GDIRASLA-------T------GTY----QPQ--------------------------------- -----------------PVLRV-EI--PK-----------P-T--G-----------------GTRPL------------ ------GIPT----VL-DRLIQQATAQV-LT-PIFDPEF-------S--------------------------------- ---ASSFGFRP------------------GR----------------------------SAHNA---VRQL--------- ----REYLRQ------------------------GYRI-AV-DI--------DLAKFF-DTVN----------------- -HDL--LMTM-V-GRRV---------RDKR------------------VLTLI---------GRYLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------E-----V-D--------------- -------------------------------------------------------------------------------- ---------------G---------R--LEK-TR-M-----------GVP-----QGGPL------------------S- PLLANILL-----------------------------------------------------------DH----------- --------LD--K------------------------------E------------------------------------ ----------------------LES--------------RGHKFVR----------YADDFVI-------------LVK- -----SERA----GE---------------------------RVMGSVRK---------------------YLTN----- ----K--L----KLTV------N-E------DK----S---KVARSG--------------------------------- ----------------DLSFLGFVF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|38166048|locus|VBIDesAlk70802_2461|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfurivibrio alkaliphilus AHT2] --------------------------------------------------------------------MDNMPIADFMA- --FAR--EHW-----EEIRASLL-------A------GTY----QPL--------------------------------- -----------------PVKRV-EI--PK-----------P-T--G-----------------GTRPL------------ ------GIPT----VL-DRLIQQAMAQV-LL-PIFDPDF-------S--------------------------------- ---EASYGFRP------------------GR----------------------------SAHDA---IHRV--------- ----RDYIRQ------------------------GYRV-AV-DA--------DLSKFF-DTVD----------------- -HDL--LMNR-V-GRKV---------RDQR------------------VLRLV---------GKYLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-D--------------- -------------------------------------------------------------------------------- ---------------G---------R--RRE-TR-K-----------GVP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHRFAR----------YADDFII-------------LVK- -----SRRA----GE---------------------------RVMTGITR---------------------FLES----- ----K--L----KLVV------N-Q------EK----S---KVAPTN--------------------------------- ----------------ESGFLGFIF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|124817422|locus|VBIDesSul232581_2428|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfocapsa sulfexigens DSM 10523] -------------------------------------------------------------------GIDEITVGDFPF- --TFR--ECW-----PEIRSTIL-------E------GNY----TPS--------------------------------- -----------------PVQRV-EI--PK-----------P-D--G-----------------STRPL------------ ------GIPT----VL-DRVIQQAIAQV-MS-PIFEPHF-------S--------------------------------- ---ESSCGFRP------------------GR----------------------------SAHDG---VKQI--------- ----KQYIRQ------------------------GYKV-AV-DM--------DLSKFF-DTVN----------------- -HDV--LMNR-V-SRRI---------EDKR------------------VLKLI---------GKYLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----V-N--------------- -------------------------------------------------------------------------------- ---------------G---------R--RLA-TP-L-----------GVP-----QGGPL------------------S- PLLANILL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGHHFVR----------YADDFII-------------LVK- -----SLSA----AE---------------------------RVMASVSR---------------------FLKR----- ----E--L----RLIV------N-E------KK----S---SFGKVE--------------------------------- ----------------ECSFLGFVF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|31921407|locus|VBIAllVin64954_0826|_extraction Mobile element protein [Allochromatium vinosum DSM 180] -------------------------------------------------------------------GIDGMCIEDFPE- --FAR--SSL-----PAIRQALR-------E------GTY----RPQ--------------------------------- -----------------PVRRV-TI--PK-----------P-N--G-----------------GERLL------------ ------GIPT----VM-DRVIQQAIAQV-LG-PIFDPGF-------S--------------------------------- ---DASFGFRP------------------GR----------------------------SAHGA---LRRV--------- ----QTYIGE------------------------GYRI-AV-DL--------DLAKFF-DTVQ----------------- -HDV--LMAR-V-GRKV---------RDKR------------------LLALI---------GDYLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----V-G--------------- -------------------------------------------------------------------------------- ---------------G---------T--LEA-TE-I-----------GTP-----QGGPL------------------S- PLLANILL-----------------------------------------------------------DD----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHRFVR----------YADDLLI-------------LVR- -----SHRA----GE---------------------------RVMASVSR---------------------YLTG----- ----T--L----KLVV------N-E------QK----S---RVVKTD--------------------------------- ----------------ACKFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|115641687|locus|VBIThiNit264030_2345|_extraction Mobile element protein [Thioalkalivibrio nitratireducens DSM 14787] -------------------------------------------------------------------GMDGMPIEDFPT- --FAR--RHW-----PQIRRQLA-------D------GVY----QPQ--------------------------------- -----------------PVRRV-AI--PK-----------P-K--G-----------------GERLL------------ ------GIPT----VM-DRVIQQAIAQV-LT-PIFDPDF-------S--------------------------------- ---DSSFGFRP------------------GR----------------------------SAHGA---LRQV--------- ----QGHIQA------------------------GYRI-AV-DL--------DLAKFF-DNVQ----------------- -HDV--LMAR-V-ARKV---------RDKR------------------LLALI---------GRYLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----V-G--------------- -------------------------------------------------------------------------------- ---------------K---------S--VQA-TG-I-----------GTP-----QGGPL------------------S- PLLANILL-----------------------------------------------------------DD----------- --------LD--R------------------------------E------------------------------------ ----------------------LER--------------RGHRFTR----------YADDLVI-------------LVK- -----TLRA----GD---------------------------RVKASVTR---------------------FLAR----- ----K--L----ALLV------N-E------QK----T---RVVKTN--------------------------------- ----------------DCQFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|21259697|locus|VBICanAcc132554_2105|_extraction Mobile element protein [Candidatus Accumulibacter phosphatis clade IIA str. UW1] -------------------------------------------------------------------GIDGLRIEDFPA- --YAC--EHW-----PAIRQTLS-------E------GRY----QPQ--------------------------------- -----------------AVRRV-II--PK-----------P-N--G-----------------GERAL------------ ------GIPT----VV-DRVVQQAIAQI-MT-PIFDPEF-------S--------------------------------- ---ESSYGFRP------------------RR----------------------------SAHGA---LKQV--------- ----RADLKA------------------------GYRI-AV-DL--------DLAKFF-DNVD----------------- -HDI--LMAR-V-ARKV---------SDKR------------------LLALI---------GRYLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----I-G--------------- -------------------------------------------------------------------------------- ---------------S---------T--LQP-SE-L-----------GTP-----QGGPL------------------S- PLLANILL-----------------------------------------------------------DD----------- --------LD--R------------------------------T------------------------------------ ----------------------LEG--------------RGHRFAR----------YADDLMV-------------LVK- -----SERA----GQ---------------------------RVKASLTA---------------------YLGR----- ----Q--L----KLPV------N-E------KK----S---QVAKIE--------------------------------- ----------------QCVFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >N.e.I1/AL954747/2285095..2287101/Nitrosomonas_extraction europaea/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGVTTAEWPE- --HAR--AHW-----PATREQIE-------A------GRY----RPQ--------------------------------- -----------------PVRRV-DI--PK-----------P-D--G-----------------GQRQL------------ ------GIPT----VT-DRVIQQAIAQV-LI-PIFDPGF-------S--------------------------------- ---ASSFGFRP------------------GR----------------------------NAHQA---IRQV--------- ----QAHVKA------------------------GYRW-AV-DL--------DLARFF-DNVN----------------- -HDL--LMSL-L-SRSI---------ADKR------------------LLALI---------GRYLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----V-G--------------- -------------------------------------------------------------------------------- ---------------E---------H--PQP-SE-V-----------GTP-----QGGPL------------------S- PLLANVLL-----------------------------------------------------------HQ----------- --------FD--L------------------------------E------------------------------------ ----------------------LER--------------RGHRFAR----------YADDVII-------------LVK- -----SRRA----AE---------------------------RVMQSLTY---------------------FLQS----- ----T--L----KLTV------N-L------AK----S---QVAPMS--------------------------------- ----------------ECSFLGFTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20127583|locus|VBIVibHar24526_0381|_extraction Mobile element protein [Vibrio harveyi ATCC BAA1116] -------------------------------------------------------------------GVDKLDIDATIF- --KLRQASNG-----QALRQSLL-------D------GSY----RPQ--------------------------------- -----------------PVLGV-GI--PK-----------P-S--G-----------------GVRQL------------ ------GIPT----VI-DRIVQQAITSV-LS-DIYEAKF-------S--------------------------------- ---NSSYGFRP------------------NR----------------------------SAHHA---LAAA--------- ----SRYIRE------------------------GRGY-VV-DI--------DLAKYF-DTVN----------------- -HDR--LMHR-L-SEDI---------ADKR------------------VLKLI---------RSYLQA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----R-N--------------- -------------------------------------------------------------------------------- ---------------G---------L--VEQ-RQ-R-----------GTP-----QGGPL------------------S- PLLSNIVL-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHKFCR----------YADDCQI-------------YVG- -----SEEA----AY---------------------------RVKESITE---------------------YLEQ----- ----K--L----KLTV------N-R------EK----S---AATRVT--------------------------------- ----------------ERTYLSHRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >La.re.I1/AY911856/603..2512/Lactobacillus_extraction reuteri/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDDMTVNDLLP- --YLR--ENK-----TELIASLR-------E------GKY----KPA--------------------------------- -----------------PVKRV-EI--PK-----------P-N--G-----------------GVRKL------------ ------GIPT----VV-DRMVQQAVAQI-LT-PIFERVF-------S--------------------------------- ---DNSFGFRP------------------HR----------------------------GAHDA---IAKV--------- ----VDLYNQ------------------------GYRR-VV-DL--------DLKAYF-DNVN----------------- -HDL--MIKY-L-QQYI---------DDPW------------------TLRLI---------RKFLTS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-H--------------- -------------------------------------------------------------------------------- ---------------G---------L--FAK-SE-K-----------GTP-----QGGPL------------------S- PILANIYL-----------------------------------------------------------NE----------- --------LD--K------------------------------E------------------------------------ ----------------------LTR--------------RGHHFVR----------YADDCNI-------------YVK- -----SQRA----GE---------------------------RVMRSITQ---------------------FLEK----- ----R--L----KVKV------N-P------DK----T---KVGSPL--------------------------------- ----------------RLKFLGFSL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|43088904|locus|VBILacSal150030_1121|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Lactobacillus salivarius CECT 5713] -------------------------------------------------------------------GIDEMTVDELFQ- --YLR--ENK-----EELTTSLR-------E------GSY----KPL--------------------------------- -----------------PVKRV-EI--PK-----------L-N--G-----------------GTRKL------------ ------GIPT----VI-DRMVQQAVAQV-LT-PIFEEIF-------S--------------------------------- ---ENSFGFRP------------------NR----------------------------GAQDA---IDKV--------- ----ISYYNQ------------------------GYKR-VV-DL--------DLKSYF-DNVN----------------- -HDL--MIKY-L-QQYI---------DDEW------------------TLKLI---------RKFLTS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------L--FVK-SE-K-----------GTP-----QGGPL------------------S- PLLANIYL-----------------------------------------------------------NE----------- --------LD--K------------------------------E------------------------------------ ----------------------LTK--------------RGHRFVR----------YADDCNI-------------YVK- -----SQRA----GE---------------------------RVMRSITK---------------------FLEK----- ----Q--L----KVKV------N-T------DK----T---RVGSPI--------------------------------- ----------------KLKFLGFS-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|87094488|locus|VBILacPen232155_1085|_extraction Retrontype reverse transcriptase [Lactobacillus pentosus KCA1] -------------------------------------------------------------------GVDGMTIDQLPE- --YTR--KHR-----KELLESLR-------N------GTY----RPQ--------------------------------- -----------------PVRRV-EI--PK-----------P-D--G-----------------STRKL------------ ------GVPT----VI-DRMIQQAVVQV-LS-PIYEQVF-------S--------------------------------- ---DNSYGFRP------------------GR----------------------------SAHDA---IKSV--------- ----TSLYNQ------------------------GYHY-VV-DL--------DLKAYF-DTVN----------------- -HDL--LMNF-I-QQQV---------TDPW------------------LLHLI---------RRFLTS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----N-G--------------- -------------------------------------------------------------------------------- ---------------K---------L--FQD-TT-E-----------GTP-----QGGNL------------------S- PLLANIYL-----------------------------------------------------------NE----------- --------LD--T------------------------------L------------------------------------ ----------------------LAQ--------------RGHQFVR----------YADDCNI-------------YVK- -----SKRA----GE---------------------------RVLRNVTA---------------------FLEN----- ----R--L----KLTI------N-R------HK----T---TVGSPL--------------------------------- ----------------RLKFLGFT-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|115615696|locus|VBIDehSp228777_0406|_extraction Mobile element protein [Dehalobacter sp. CF] -------------------------------------------------------------------GVDGMTVDEMLP- --WLR--KHR-----EELLQSLG-------N------GMY----RPQ--------------------------------- -----------------PVRRV-EI--PK-----------P-D--G-----------------GVRKL------------ ------GVPT----VI-DRMVQQALVQI-LQ-PIFEPLF-------S--------------------------------- ---EASYGYRP------------------GR----------------------------SAQQA---MKEA--------- ----KEYYEQ------------------------GYTR-AA-DI--------DLSKYF-DTMN----------------- -HEL--LMNI-I-RKEV---------KDKR------------------IIDLI---------KKFLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--KSK-TE-E-----------GSP-----QGGPL------------------S- PLLSNIYL-----------------------------------------------------------NE----------- --------FD--K------------------------------E------------------------------------ ----------------------MER--------------RGHKHLR----------YADDIAV-------------YTK- -----SRRA----AE---------------------------RVLESCKQ---------------------YLEK----- ----K--L----KLKV------N-S------EK----S---KAGSPL--------------------------------- ----------------KLKFLGFAL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|45159896|locus|VBIBacCel7049_2030|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus cellulosilyticus DSM 2522] -------------------------------------------------------------------GIDGMSVDELLP- --YLA--LED-----RNLILSIK-------D------GSY----RPQ--------------------------------- -----------------PVKRV-EI--KK-----------P-D--G-----------------GKRKL------------ ------GIPT----VK-DRLVQQMILQV-IE-KKIDPQF-------S--------------------------------- ---DNSYGFRP------------------NR----------------------------SAHDA---MRKA--------- ----KQYYEE------------------------GFRY-VV-DI--------DMKQYF-DTVN----------------- -QDK--LMHH-V-EQFI---------DDPT------------------VLILI---------RKFLRS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------S-----I-D--------------- -------------------------------------------------------------------------------- ---------------E---------E--IEP-SE-V-----------GTP-----QGGNL------------------S- PILGNIYL-----------------------------------------------------------HQ----------- --------LD--L------------------------------E------------------------------------ ----------------------LER--------------RGHKFIR----------YADDCNI-------------YVK- -----SRKA----GD---------------------------RVLKSITK---------------------FLEE----- ----E--L----KLTV------N-K------DK----S---EVGRPT--------------------------------- ----------------KRKFLGFC-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|38119357|locus|VBIBacSel78655_0161|_extraction Mobile element protein [Bacillus selenitireducens MLS10] -------------------------------------------------------------------GVDGMTVDQLEA- --HVR--QYA-----KPLIAKIQ-------K------GTY----QPL--------------------------------- -----------------PVKRV-EI--PK-----------E-N--G-----------------KKRKL------------ ------GIPA----VR-DRMVQQAIFQV-IE-PIIDPHF-------S--------------------------------- ---PNSYGFRP------------------GK----------------------------NAKQA---IKQA--------- ----AKYYDE------------------------GFKM-VV-DI--------DLKSYF-DTIP----------------- -HQK--LMNY-L-EQYI---------QDPI------------------ILKLI---------WKFLKS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----I-G--------------- -------------------------------------------------------------------------------- ---------------D---------N--WES-SR-N-----------GAP-----QGGNL------------------S- PILSNVYL-----------------------------------------------------------HE----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHRFVR----------YADDFCI-------------YVK- -----SRRA----AE---------------------------RVLLNTTT---------------------FLEG----- ----T--L----KLSV------N-Q------EK----S---AIGSPT--------------------------------- ----------------KRKFLGFC-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|42345729|locus|VBIGamPro61291_1517|_extraction hypothetical protein [gamma proteobacterium HdN1] -------------------------------------------------------------------GIDGMPVEDLES- --HLR--HHW-----PTLRQSLL-------D------GTY----QPK--------------------------------- -----------------PVKRV-EI--PK-----------G-D--G-----------------TKRAL------------ ------GIPT----VI-DRFVQQIIAQA-LS-ALWEPHF-------H--------------------------------- ---PSSFGFRP------------------AR----------------------------SAQQA---VKYV--------- ----QTLQRE------------------------KYEW-VV-DL--------DLKSFF-DEVN----------------- -HDR--LIAR-L-KTRV---------EDKV------------------LLRLI---------NKFLHA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------N-----A-N--------------- -------------------------------------------------------------------------------- ---------------G---------I--LLR-SE-K-----------GVP-----QGGPL------------------S- PILANIVL-----------------------------------------------------------DE----------- --------LD--W------------------------------E------------------------------------ ----------------------LEH--------------RGHKFAR----------YADDCNI-------------MVK- -----SKAA----GE---------------------------RVMKSIRR---------------------FLET----- ----T--L----RLRV------N-D------QK----S---AVDRPT--------------------------------- ----------------KRNFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|42649895|locus|VBILeaBys116579_0020|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Leadbetterella byssophila DSM 17132] -------------------------------------------------------------------GVDGMQIDNLRD- --YLN--THW-----QSLRSDIL-------S------GTY----RPQ--------------------------------- -----------------AVRKV-EI--PK-----------A-S--G-----------------GKRML------------ ------GIPT----VI-DRVIQQSISQW-LG-LKYEGDF-------H--------------------------------- ---DNSYGFRP------------------NR----------------------------NAHQA---VSKA--------- ----QEYLNL------------------------GYTW-VV-EL--------DLEQFF-DQVN----------------- -HDI--LMHL-L-SKKI---------TDHR------------------VLALI---------GKYLRC------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----D-H--------------- -------------------------------------------------------------------------------- ---------------G---------L--EQK-RT-K-----------GTP-----QGSPL------------------S- PLLSNIIL-----------------------------------------------------------NE----------- --------LD--R------------------------------E------------------------------------ ----------------------LSS--------------RGHRFVR----------YADDCSI-------------YTR- -----SNKS----AT---------------------------RIMGNITS---------------------YIES----- ----T--L----KLKV------N-R------EK----S---KVSRPS--------------------------------- ----------------QSSLLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|42167991|locus|VBIArtAri166201_0751|_extraction Mobile element protein [Arthrobacter arilaitensis Re117] -------------------------------------------------------------------GVDGLEAHELRD- --WCR--EHW-----IETRKSLD-------A------GTY----APL--------------------------------- -----------------PVRQV-MI--PK-----------P-D--G-----------------GERML------------ ------GVPS----VL-DRLIQQALAQV-LS-PIFDEGF-------A--------------------------------- ---PMSYGFRP------------------GK----------------------------SAHDA---ASMA--------- ----RKVIEQ------------------------GYRW-VV-EV--------DLDAFF-DRVN----------------- -HDV--LMSR-V-ARKV---------KDKR------------------VLKLV---------RKYLTA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----A-Q--------------- -------------------------------------------------------------------------------- ---------------G---------V--RRE-TV-E-----------GTP-----QGSPL------------------S- PLLSNIIL-----------------------------------------------------------DD----------- --------FD--Q------------------------------E------------------------------------ ----------------------FWS--------------RDHRFVR----------YADDIRI-------------FVK- -----SKRA----AE---------------------------RVLGQATK---------------------VLEQ----- ----R--L----KLKV------N-R------QK----S---VINPAS--------------------------------- ----------------VATLLGFGFY------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|61290801|locus|VBINiaKor154066_6175|_extraction Retrontype reverse transcriptase [Niastella koreensis GR2010] -------------------------------------------------------------------GIDGVEAKDFKL- --QLD--GAW-----VQVKSQLE-------N------GSY----QPQ--------------------------------- -----------------AVKRV-TI--PK-----------P-N--G-----------------GERHL------------ ------GIPT----YM-DRLIQQAISQV-LV-KQYEPTF-------S--------------------------------- ---ENSYGFRS------------------EK----------------------------NAHQA---ALKA--------- ----KEYINA------------------------GYSH-VV-DL--------DLSQFF-DRVN----------------- -HDY--LMNE-L-SRRI---------TDKR------------------VLKLI---------HKILRS------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------A-----E-G--------------- -------------------------------------------------------------------------------- ---------------A---------N--RIP-CK-Q-----------GVP-----QGGPL------------------S- PLLSNIIL-----------------------------------------------------------DK----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGLRYVR----------YADDCSI-------------YVK- -----SKRA----GD---------------------------RVMESITR---------------------YIEK----- ----E--L----KLVV------N-A------VK----S---SVTRPW--------------------------------- ----------------LMKLLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >M.sp.I1/AF339846/29388..31287/Microscilla_extraction sp./Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGMQVKELRY- --WFS--NNH-----QKLIEQLK-------E------GNY----RPM--------------------------------- -----------------TIKGQ-EI--PK-----------P-G--G-----------------GVRQL------------ ------GIPT----VQ-DRLVQQAIAQQ-LS-KRYDPTF-------S--------------------------------- ---QYSYGFRK------------------GR----------------------------NAHQA---LRQA--------- ----GAYVKE------------------------GFNY-VV-DL--------DLEKFF-DKVN----------------- -HDR--LMWL-L-GRRI---------SDKR------------------VLKLI---------GKFLRS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L-----I-G--------------- -------------------------------------------------------------------------------- ---------------G---------L--ENQ-RI-S-----------GTP-----QGSPL------------------S- PLLSNIVL-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RGHRFVR----------YADDMIL-------------LVR- -----SQEA----AE---------------------------RAYSSITS---------------------FIEN----- ----R--L----LLKV------N-K------DK----S---RICRPY--------------------------------- ----------------QLNFLGHSI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|115236219|locus|VBIEchVie187570_1054|_extraction Mobile element protein [Echinicola vietnamensis DSM 17526] -------------------------------------------------------------------GIDGMSVEELRQ- --WFS--SHY-----HEFQSQII-------T------GKY----RVE--------------------------------- -----------------SVREV-QI--PK-----------P-N--G-----------------GVRIL------------ ------GIPT----AK-DRLVQQAISQV-LS-LHYDPTF-------S--------------------------------- ---DRSYGFRP------------------DR----------------------------GAHDA---LRQA--------- ----GQEVSE------------------------GRDW-IV-DI--------DLEKFF-DTVN----------------- -HDR--LMWL-L-GTRI---------GDKT------------------LLKLI---------GKFLRA------------ -------------------------------------------------------------------------------- ------------------GM------------------------------------L-----K-D--------------- -------------------------------------------------------------------------------- ---------------G---------L--VSQ-GV-K-----------GMP-----QGSPL------------------S- PLLSNIIL-----------------------------------------------------------DE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEV--------------RGHRFVR----------YADDLIV-------------MVK- -----SEPS----AK---------------------------RVLSSLTA---------------------FIEQ----- ----R--M----LLKV------N-K------SK----S---KISRPY--------------------------------- ----------------ELNFLGHSI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|54204419|locus|VBILacJoh171132_1733|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Lactobacillus johnsonii DPC 6026] -------------------------------------------------------------------GVDKRTIYEIDD- --YFK--KHQ-----VEIKQSIR-------A------MKY----KPQ--------------------------------- -----------------AVRRV-YI--PK-----------A-N--G-----------------KKRPL------------ ------GIPT----VV-DRVIQQAISQV-LM-KIYDPEF-------S--------------------------------- ---AYSYGFRP------------------KR----------------------------SSHDA---MEQV--------- ----LEYLDE------------------------GYQW-VI-DL--------DIEKYF-DTVN----------------- -HDK--LIST-L-REQI---------NDKT------------------TLHLI---------RSFLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------L--VKP-NK-L-----------GVP-----QGGPL------------------S- PILSNIYL-----------------------------------------------------------DK----------- --------FD--K------------------------------E------------------------------------ ----------------------LEE--------------RGLHFVR----------YADDCNI-------------FVK- -----SKMS----AD---------------------------RVMKSATS---------------------WLER----- ----K--L----FLKV------N-A------TK----T---KVVRPT--------------------------------- ----------------KSNFLGFT-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >E.c.I7/AY785243/414..2383/Escherichia_extraction coli/Bacterial C/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDNMSIEEFND- --FAK--LHW-----LGIKQQLL-------N------GSY----QPL--------------------------------- -----------------PVKRV-MI--PK-----------P-D--G-----------------GERML------------ ------GIPA----VI-DRVIQQAIAQV-IS-PYFEPQF-------S--------------------------------- ---PHSYGYRP------------------HK----------------------------RASQA---VNHV--------- ----QSCVKQ------------------------GYKT-AV-DI--------DLSKFF-DEVD----------------- -HDM--LMNR-V-SRKI---------KDKA------------------LMRLL---------GKYLRA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------A---ERE-T--------------- -------------------------------------------------------------------------------- ---------------G---------L--WFE-ST-K-----------GVP-----QGGPL------------------S- PLLSNILL-----------------------------------------------------------DE----------- --------LD--K------------------------------K------------------------------------ ----------------------LTY--------------KHLKFAR----------YADDIII-------------LVK- -----TKSE----GL---------------------------IIQREITA---------------------FITK----- ----R--L----KLKV------N-E------SK----S---RVGPVS--------------------------------- ----------------GSKFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|54139251|locus|VBISinMel152503_4967|_extraction hypothetical protein [Sinorhizobium meliloti SM11] -------------------------------------------------------------------GRDGQTVDMAEA- --KAT--SII-----GRLRRELL-------N------GKY----RPG--------------------------------- -----------------DVRRV-WL--PK-----------A-G--G-----------------GRRGL------------ ------GIPN----IV-DRVVQQAVLQV-LE-PIFEPVF-------H--------------------------------- ---DSSHGFRP------------------KR----------------------------GAHTA---IAEA--------- ----SKYLKE------------------------GYQT-IV-DL--------DLASFF-DRVH----------------- -HQR--LLAR-I-AQRV---------KDQR------------------IITLI---------NLMLKA------------ -------------------------------------------------------------------------------- ------------------AV------------------------------------V----MP-D--------------- -------------------------------------------------------------------------------- ---------------G---------T--RVA-PQ-E-----------GTP-----QGGPL------------------S- PLLSNIVL-----------------------------------------------------------DE----------- --------LD--R------------------------------E------------------------------------ ----------------------LAR--------------RRLRFVR----------YADDSNI-------------FVR- -----SERA----GQ---------------------------RVMSSIRD---------------------FLER----- ----R--M----RLQV------N-E------EK----S---GMRTPN--------------------------------- ----------------EVHFLGFRFR------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|46912205|locus|VBICelAlg158510_3341|_extraction hypothetical protein [Cellulophaga algicola DSM 14237] -------------------------------------------------------------------GIDGMTVQELKA- --FID--AHR-----SKVVHQLI-------S------KSY----RSQ--------------------------------- -----------------AIKGV-AI--PK-----------A-N--G-----------------KTRLL------------ ------GVPT----VV-DRWLQQAVSQQ-LV-IHFELDF-------E--------------------------------- ---SESYGFRP------------------RK----------------------------NLQQA---VLKS--------- ----QEYIND------------------------GYQD-LV-DI--------DLKSFF-DEVQ----------------- -HYK--LLQL-I-YNKV---------KCPT------------------TLWLI---------RKWLRA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------L-----K-N--------------- -------------------------------------------------------------------------------- ---------------G---------Q--LCK-RR-K-----------GLP-----QGSPL------------------S- PLLSNIML-----------------------------------------------------------DQ----------- --------LD--K------------------------------H------------------------------------ ----------------------LKV--------------REFRFIR----------YADDFSI-------------YTK- -----SKAA----AR---------------------------AIGNEVYL---------------------FLKE----- ----K--L----DLPV------N-R------AK----S---GIRRPS--------------------------------- ----------------TFKVLGYRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|87132848|locus|VBIBelBal168934_2118|_extraction Retrontype reverse transcriptase [Belliella baltica DSM 15883] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------M-I-YQRV---------KCPT------------------TLRLI---------RKWLRA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------Q-----I-N--------------- -------------------------------------------------------------------------------- ---------------G---------K--LHR-RR-K-----------GVP-----QGSPL------------------S- PLLSNILL-----------------------------------------------------------DL----------- --------LD--K------------------------------E------------------------------------ ----------------------LER--------------RNLKYVR----------YADDFSI-------------YTK- -----SKKE----AR---------------------------KVGNEIYL---------------------FLKE----- ----K--L----RLPI------N-R------EK----S---GIRRPS--------------------------------- ----------------NFEMLGHAF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|54567245|locus|VBIKroSp190710_0128|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Krokinobacter sp. 4H375] -------------------------------------------------------------------GVDNVYIKELKS- --ILQ--IYG-----KQYVSHIE-------R------KRY----QVS--------------------------------- -----------------PILGV-EI--PK-----------S-N--G-----------------KKRLL------------ ------GIPT----VV-DRVFQQALHQV-LQ-PLFEPDF-------Q--------------------------------- ---KHSYGFRP------------------QR----------------------------NAHQA---TAES--------- ----LLNINA------------------------GSQD-IV-DI--------DLKSFF-DEVS----------------- -HCI--LLEL-I-YKKV---------QCKA------------------TMRLL---------RSFLRA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------L-----I-N--------------- -------------------------------------------------------------------------------- ---------------G---------R--LQK-RR-K-----------GVP-----QGSPL------------------S- PLLSNILL-----------------------------------------------------------NE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RGHRYVR----------YADDFSI-------------YVK- -----SKVA----AK---------------------------RVGNSIYK---------------------YLRD----- ----H--L----QLPI------N-R------VK----S---GVRRPL--------------------------------- ----------------DFQVLGFGF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|46881331|locus|VBIBacSal140776_2677|_extraction hypothetical protein [Bacteroides salanitronis DSM 18170] -------------------------------------------------------------------GVDGVSIRELRK- --VFS--EKK-----LQLIEAIK-------Q------GNY----QVQ--------------------------------- -----------------PILGI-EI--PK-----------G-N--G-----------------KTRLL------------ ------GVPT----TT-ERVLQQALAQT-IA-PLFEPEF-------S--------------------------------- ---NYSYGFRP------------------HK----------------------------NARQA---VGQS--------- ----RDYIHS------------------------GLNH-IV-DI--------DLKNFF-DEVD----------------- -HCL--LLNL-I-YQKV---------KCKA------------------TMQLI---------RKWLRA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------K-----I-N--------------- -------------------------------------------------------------------------------- ---------------G---------K--LRK-RR-K-----------GVP-----QGSPL------------------S- PLLSNILL-----------------------------------------------------------HQ----------- --------LD--K------------------------------E------------------------------------ ----------------------MTR--------------RGHKFVR----------YADDFSI-------------YCK- -----SHNQ----AK---------------------------ATRVVIEK---------------------FLKN----- ----K--L----KLTI------N-E------EK----S---GIRKPI--------------------------------- ----------------HFTILGFGF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|86663631|locus|VBIStaAur229418_1173|_extraction Mobile element protein [Staphylococcus aureus subsp. aureus HO 5096 0412] -------------------------------------------------------------------GIDGMKVSEIQG- --HFA--QYF-----PKIKQKLL-------E------GTY----KPQ--------------------------------- -----------------AVKKV-EI--PK-----------A-N--G-----------------KKRVL------------ ------GIPV----VR-DRVIQQAIKQV-IE-PSIDRTF-------S--------------------------------- ---KHSHGFRP------------------NR----------------------------STGTA---LKEC--------- ----ASYYEA------------------------GYTI-AV-DC--------DLKQCF-DNIN----------------- -HDK--LMYL-F-ERHI---------KDKA------------------VSTFI---------RRRLQV------------ -------------------------------------------------------------------------------- ------------------GA------------------------------------I----DL-S--------------- -------------------------------------------------------------------------------- ---------------G---------E--VAE-RK-I-----------GAP-----QGGVI------------------S- PLLCNIYL-----------------------------------------------------------HE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------RNHRFVR----------YADDFVI-------------FVK- -----TKRA----GE---------------------------RVMDSIKT---------------------FIHK----- ----T--L----KLEV------N-N------DK----S---KVGSPT--------------------------------- ----------------RLKFLSCL-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|45364599|locus|VBIStaPse177932_0703|_extraction Mobile element protein [Staphylococcus pseudintermedius HKU1003] -------------------------------------------------------------------GIDGMKVSELHA- --HFE--QYF-----SQITKKLL-------D------GSY----QPQ--------------------------------- -----------------AVRKV-QI--PK-----------P-N--G-----------------KMRVL------------ ------GIPV----AR-DRVIQQAIRQV-IE-PGIDRTF-------S--------------------------------- ---NHSHGFRP------------------NR----------------------------STGTA---LKQC--------- ----ATYYEE------------------------GYKI-AV-DC--------DLKQCF-DMLN----------------- -HDK--LMYL-F-ERHV---------QDKS------------------ISTFI---------RRSLQV------------ -------------------------------------------------------------------------------- ------------------AA------------------------------------I----DL-S--------------- -------------------------------------------------------------------------------- ---------------G---------E--VAE-RK-I-----------GAP-----QGGVI------------------S- PLLCNIYL-----------------------------------------------------------HE----------- --------LD--K------------------------------E------------------------------------ ----------------------LEK--------------CGHRFVR----------YADDFVI-------------FVR- -----TKRA----GE---------------------------RVMTSVTK---------------------FIEK----- ----Q--L----KLVV------N-E------EK----S---RVGAVT--------------------------------- ----------------RLKFLNCL-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|186998609|locus|VBIPseSyr250795_4170|_extraction Retrontype RNAdirected DNA polymerase [Pseudomonas syringae pv. actinidiae ICMP 19102] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------MSQQ-LT-KLWDGGF-------S--------------------------------- ---DHSYGFRP------------------GR----------------------------SNLDA---IRAA--------- ----KAFVVS------------------------GKNW-VV-DV--------DIEAFF-DEVS----------------- -HDR--LMTR-I-TRDI---------HDKR------------------LNKYL---------GSNVRA------------ -------------------------------------------------------------------------------- ------------------DM------------------------------------I-----L-N--------------- -------------------------------------------------------------------------------- ---------------G---------Q--RQK-RS-A-----------GVP-----QGGPL------------------S- PLLANLYL-----------------------------------------------------------DP----------- --------LD--K------------------------------E------------------------------------ ----------------------LEA--------------RGLSFCR----------YADDLMI-------------FVE- -----SERS----AE---------------------------RVLESIVS---------------------WIEK----- ----H--L----KLKV------N-A------SK----S---GTGRPW--------------------------------- ----------------ERAFLGYLI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >En.fm.I3/FN424376.1/17411..20180/Enterococcus_extraction faecium/Bacterial B/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGKTIKDIEK- -------LTT-----ERYLDIVK-------KRF----KFY----KPR--------------------------------- -----------------KVKRT-EI--PK-----------P-N--G-----------------KTRPL------------ ------GIPS----IW-DRVAQQCILQV-LE-PICEAKF-------N--------------------------------- ---PHSHGFRP------------------NR----------------------------SAEHA---IADC--------- ----AKKMNII-----------------------KMGY-CV-DI--------DIQGFF-DEVW----------------- -HSK--LMRQ-MWTMGI---------RDKE------------------LLTII---------RKMLKA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------V----LP-N--------------- -------------------------------------------------------------------------------- ---------------G---------T--IQF-PE-K-----------GTP-----QGGIL------------------S- PLLANINL-----------------------------------------------------------SE----------- --------FD--WW-VSEQ-W-----------------ET---R-------HMSEIKT----QYNANGTEHMGNHHRK-- ----------------------MRSHT---------KL-KEFYIVR----------YADDFKL-------------FCH- -----NRKT----AE---------------------------LLYHASIQ---------------------WLEQ----- ----R--L----HLPV------S-I------EK----S---KITNLR------------K-------------------- E---------------SSEFLGFNL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|18859935|locus|VBIBacCer118379_5432|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus cereus ATCC 10987] -------------------------------------------------------------------GIDGVTIKDVEK- -------LSQ-----EDFIKIVQ-------KRF----SNY----TPR--------------------------------- -----------------KVRRV-EI--PK-----------P-N--G-----------------KTRPL------------ ------GIPS----MW-DRIAQQCIKQV-LE-PICEAKF-------N--------------------------------- ---KHSHGFRP------------------NR----------------------------SPETA---MADA--------- ----TLRVNRS-----------------------HMQY-VV-NV--------DIQGFF-DEVN----------------- -HKK--LMRQ-LWTMGI---------RDKQ------------------LLVII---------RKMLKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------V----LP-N--------------- -------------------------------------------------------------------------------- ---------------G---------E--MQY-PN-K-----------GTP-----QGGIL------------------S- PLLANINL-----------------------------------------------------------NE----------- --------FD--WW-ITNQ-W-----------------ED---R-------LLKELSL----TIKKGGHVDKYPHYSK-- ----------------------MRKTT---------AL-KEMYIVR----------YADDFKI-------------FTA- -----TKSN----AQ---------------------------KIFKACEM---------------------WLQE----- ----R--L----KLPI------S-K------EK----S---KITNLR------------K-------------------- E---------------SSEFLGFEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|202064373|locus|VBICarSp264223_1846|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Carnobacterium sp. WN1359] -------------------------------------------------------------------GVDDITIKDIEN- -------LEQ-----TIFVEMVR-------KRF----SNY----SPR--------------------------------- -----------------KVRRV-EI--PK-----------P-N--G-----------------KTRPL------------ ------GIPS----IW-DRIAQQCILQV-IE-PICEAKF-------N--------------------------------- ---KHSYGFRP------------------NR----------------------------STEHA---IADM--------- ----LFRINQQ-----------------------KLHY-VV-DV--------DLQGFF-DEIN----------------- -HKK--LMNQ-VWTLGI---------HDKQ------------------LLVII---------RKMLSA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------V----LK-N--------------- -------------------------------------------------------------------------------- ---------------G---------S--IMH-PV-K-----------GTP-----QGGIL------------------S- PLLANISL-----------------------------------------------------------NE----------- --------FD--WW-ISNQ-W-----------------ETF--E-------TRKKYAA----AVMGNGTKNRGLTYRM-- ----------------------LRKNS---------KL-KEIYIVR----------YADDFKL-------------ITS- -----NRRD----AE---------------------------KIFIASQM---------------------WLKE----- ----R--L----GLPI------S-K------EK----S---KITNLR------------K-------------------- E---------------ESEFLGFTI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.a.I2/AE011190/30945..33835/Bacillus_extraction anthracis/Bacterial B/ORF Sequence %28a.a%29 -------------------------------------------------------------------ACDNVNIKNIEG- -------MEQ-----SYFLNEVK-------RRF----QNY----QPQ--------------------------------- -----------------KVRRK-EI--SK-----------P-N--G-----------------QTRPL------------ ------GIPA----MW-DRIIQQCILQV-ME-PICEAHF-------S--------------------------------- ---NRSYGFRP------------------NR----------------------------SAEHA---LADA--------- ----SVRVNKQ-----------------------NLTY-VV-DV--------DIKGFF-DEVN----------------- -HVK--LMRQ-LWTLGI---------RDKQ------------------LLVII---------RKILKA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------Q----MP-D--------------- -------------------------------------------------------------------------------- ---------------G---------T--TMF-PT-K-----------GTP-----QGGIL------------------S- PILANVNL-----------------------------------------------------------NE----------- --------FD--WW-ISRQ-W-----------------ETF--K-------AKKVKPR----CMRGIWCNDVVTT--Q-- ----------------------LTKTS---------KM-KPMYIVR----------YADDFKI-------------FTN- -----TRSN----AE---------------------------KIFKATQM---------------------WLEE----- ----R--L----KLSI------S-A------EK----S---KVTNLT------------K-------------------- Q---------------QSEFLGFTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >E.f.I3/AE016830/2249712..2252481/Enterococcus_extraction faecalis/Bacterial B/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDKRTIADLAK- -------LSE-----EEYVRLIR-------KQF----SNY----HPG--------------------------------- -----------------PVRRV-EI--PK-----------P-N--G-----------------KTRPL------------ ------GIPT----IV-DRIVQQCILQV-ME-PICEAKF-------S--------------------------------- ---ENSNGFRP------------------NR----------------------------SAETA---IAQC--------- ----MRLIQVQ-----------------------HLYH-VV-DL--------DIKGFF-DNIS----------------- -HTK--LIRQ-IWALGI---------RDKK------------------LLCII---------KEMLKA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------V----LP-N--------------- -------------------------------------------------------------------------------- ---------------G---------E--KTY-PA-R-----------GTP-----QGGIL------------------S- PLLANIVL-----------------------------------------------------------NE----------- --------LD--WW-IASQ-W-----------------EEM--P-------TKTKFKT----RSNAQGTEIKSHAYRA-- ----------------------LRR-S---------RL-KEMHAVR----------YADDFKI-------------FCA- -----THED----AV---------------------------RAYKATEL---------------------WLKD----- ----R--L----GLEI------S-P------DK----S---KVVNLK------------R-------------------- Q---------------YSDFLGFKL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|190355818|locus|VBIStrAng166616_1315|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Streptococcus anginosus C238] -------------------------------------------------------------------GVDGRTIKHLSR- -------LNE-----EEYISLIQ-------KQF----HWY----KPR--------------------------------- -----------------PVKRV-EI--LK-----------P-N--G-----------------KIRPL------------ ------GIPT----IV-DRIVQQCILQI-LE-PICEAKF-------H--------------------------------- ---DSSYGFRP------------------NR----------------------------STEHA---IAEC--------- ----ARLMQIQ-----------------------HLHY-VV-DI--------DIQGFF-DNVY----------------- -HAK--LIRQ-LWNLGI---------QDKK------------------LLCII---------KEMLKA------------ -------------------------------------------------------------------------------- ------------------DI------------------------------------V----MP-D--------------- -------------------------------------------------------------------------------- ---------------K---------E--VIT-PT-K-----------GTP-----QGGIL------------------S- PLLSNVVL-----------------------------------------------------------NE----------- --------LD--WW-VSSQ-W-----------------LTM--P-------THYPYKQ----RTNSQGTEIKSHTYRA-- ----------------------LRT-S---------NL-KEIYIVR----------YADDFKI-------------FCR- -----NYYD----AK---------------------------RTYQAVTK---------------------WLQD----- ----R--L----KLNV------S-E------EK----S---KITNLK------------Q-------------------- R---------------YSEFLGFKL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|19436501|locus|VBICloCel57783_2839|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium cellulolyticum H10] -------------------------------------------------------------------GTDTLNIKDIEK- -------LSV-----EKLVEMMQ-------RKL----AWY----QPK--------------------------------- -----------------PVKRV-EI--PK-----------P-N--G-----------------KTRPL------------ ------GIPT----IV-DRLVQQCILQV-LE-PICEAKF-------Y--------------------------------- ---ERSNGFRP------------------NR----------------------------SAEHA---MAQC--------- ----YRMVQKQ-----------------------NLYF-VV-DV--------DIKGFF-DNVN----------------- -HSK--LIRQ-MWAMGI---------RDKQ------------------LICII---------KQMLKA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------V----MP-D--------------- -------------------------------------------------------------------------------- ---------------G---------E--TLY-PT-K-----------GTP-----QGGIL------------------S- PLLANIVL-----------------------------------------------------------NE----------- --------LD--WW-ISSQ-W-----------------EDM--L-------THREYYV----SVNNNGSLNKSGVFRT-- ----------------------LRR-S---------AL-KEMYIVR----------YADDFKI-------------FCR- -----KRSD----AN---------------------------KIFVAVKK---------------------WLKD----- ----R--L----KLEI------S-E------EK----S---KVVNLK------------K-------------------- H---------------YSEFLGFQF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|202104716|locus|VBIEntMun281267_0501|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Enterococcus mundtii QU 25] -------------------------------------------------------------------GVDNKNIDDLKS- -------IPD-----TDFISIVQ-------TKL----SEY----KPQ--------------------------------- -----------------PVKRV-EI--PK-----------P-N--G-----------------KTRPL------------ ------GIPT----IW-DRIVQQCLLQV-LE-PIMEAKF-------H--------------------------------- ---DKNYGFRP------------------NR----------------------------SAHHA---FAQA--------- ----VRMAQVS-----------------------KLTF-VV-DI--------DIEGFF-DNVN----------------- -HSK--LIKQ-LWSLGV---------RDKW------------------LLGVI---------RAMLKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------I----HK-D--------------- -------------------------------------------------------------------------------- ---------------G---------H--IEH-PK-K-----------GTP-----QGGIL------------------S- PLLANVVL-----------------------------------------------------------NE----------- --------LD--WW-ISSQ-W-----------------ETH--P-------TRHNYDW----YHAEKEYWNKGNKYRA-- ----------------------LRG-T---------SL-KEIYIVR----------YADDFKI-------------FCR- -----KRSD----AD---------------------------KIFLATKL---------------------WLKE----- ----R--L----KLDI------S-Q------EK----S---KVVNLK------------K-------------------- Q---------------KSEFLGFTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.me.I1/AB022308/3853..6569/Bacillus_extraction megaterium/Bacterial B/ORF Sequence %28a.a%29 -------------------------------------------------------------------GDDGLTIEDINR- -------LSV-----SEVVSTIQ-------RMF----EYY----TPQ--------------------------------- -----------------AVRRV-FI--PK-----------A-N--G-----------------KTRPL------------ ------GIPT----IW-DRLFQQCILQV-LE-PICEAKF-------Y--------------------------------- ---KHSYGFRP------------------NR----------------------------NTHHA---KARF--------- ----ETLINRA-----------------------CLYH-CV-DV--------DIKGFF-DNVN----------------- -HAK--LIKQ-LWSLGI---------RDKA------------------LLSII---------SRLLKA------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------I-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------F------PK-K-----------GTP-----QGGIL------------------S- PLLSNIVL-----------------------------------------------------------NE----------- --------LD--WW-VSNQ-W-----------------ESF--E-------THKLYKS-----------NLG--RYNA-- ----------------------LKQ-S---------NL-KHCYIVR----------YADDFKI-------------LCR- -----TRSQ----AI---------------------------KMYYAVND---------------------FLHT----- ----R--L----RLEI------S-E------QK----S---KVVNLK------------K-------------------- N---------------SSEFLGFRS------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|101939315|locus|VBIBacThu242010_6066|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus thuringiensis MC28] -------------------------------------------------------------------GTDGMTINDIKM- -------LST-----DEVIEKVK-------MMF----GWY----EPQ--------------------------------- -----------------SVRRV-FI--PK-----------P-N--G-----------------NRRPL------------ ------GIPT----IW-DRLFQQCVLQI-LE-PICEAKF-------H--------------------------------- ---NHSYGFRP------------------NR----------------------------STHHA---LARM--------- ----KSLVNRKGN---------------------GFHY-CV-DI--------DIKGFF-DNVH----------------- -HGK--LLKQ-LWTIGI---------RDKK------------------LLSII---------SRLLKA------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------V-----N-E--------------- -------------------------------------------------------------------------------- ---------------G---------V------PQ-K-----------GTP-----QGGIL------------------S- PLLSNIVL-----------------------------------------------------------NE----------- --------LD--WW-VSNQ-W-----------------ETI--K-------TSHPYKG-----------NSD--KYRA-- ----------------------LKK-S---------KL-KECFLIR----------YADDAKI-------------LCR- -----DYVT----AL---------------------------KMFEATKD---------------------FLRT----- ----R--L----HLDI------S-L------EK----S---KIINLR------------K-------------------- K---------------ASHFLGFTV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|54454697|locus|VBICloBot178872_0058|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium botulinum BKT015925] -------------------------------------------------------------------GTDGSTIKDINN- -------IDI-----DEVITKIK-------TMF----DFY----TPK--------------------------------- -----------------SIRRV-EI--PK-----------A-N--G-----------------KTRPL------------ ------GIPT----IW-DRLFQQCILQV-LE-PICEAKF-------H--------------------------------- ---KHSYGFRP------------------NR----------------------------STHHA---ITRS--------- ----VYLINIT-----------------------KLYH-CV-DV--------DIKGFF-DNVN----------------- -HGK--LLKQ-LWALGV---------KDKK------------------LLKII---------SVMLKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------E-----G-I--------------- -------------------------------------------------------------------------------- ---------------G---------I------PT-K-----------GVP-----QGGIL------------------S- PLLSNIVL-----------------------------------------------------------NE----------- --------LD--WW-VSNQ-W-----------------ETF--K-------TDKDYT-----------------KYRT-- ------SKTGKIVVDHSIRNKMLKK-S---------KL-KEIYIVR----------YADDFKI-------------FCR- -----TRSQ----AK---------------------------AIDIAVGD---------------------MLKN----- ----R--L----GLEC------S-A------EK----S---KVLNLK------------K-------------------- S---------------YSEFLGFKM------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|18825078|locus|VBIBacCer120511_0128|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus cereus AH187] -------------------------------------------------------------------GVDNLTIKDIWH- -------LND-----TKIIHEVR-------KRL----NNY----QPQ--------------------------------- -----------------AVKRV-LI--PK-----------E-G--S-------------D---KKRPL------------ ------GIPT----IW-DRLVQQSILQV-LE-PICEAKF-------H--------------------------------- ---NHSYGFRP------------------NR----------------------------STHHA---LSRV--------- ----VSLINIG-----------------------HQHY-CV-DI--------DIKGFF-DNVC----------------- -HKK--LLRQ-MWTLGI---------RDKS------------------LLCVI---------SKILKS------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------E-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------I------PN-K-----------GTP-----QGGII------------------S- PLLSNIVL-----------------------------------------------------------NE----------- --------LD--WW-ISSQ-W-----------------ETY--K-------PHRISTR-----------HLGFRQYAR-- -----------------------KY-T---------NL-KCGYVVR----------YADDFKI-------------MCR- -----TYDE----AQ---------------------------RFYHATVD---------------------FLKS----- ----R--L----GLEI------N-P------KK----S---KVVNLK------------K-------------------- N---------------SSVFLGFKI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|201989473|locus|VBIBacThu93926_0768|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus thuringiensis YBT1518] -------------------------------------------------------------------GVDGLTIKDVRQ- -------LND-----FQVINQVR-------KRL----MNY----RPS--------------------------------- -----------------PVRRV-YI--PK-----------E-G--S-------------D---KKRPL------------ ------GIPT----IW-DRLVQQCILQV-LE-PICEAKF-------H--------------------------------- ---NHNYGFRP------------------NR----------------------------STHHA---LSRM--------- ----VSLINVG-----------------------KHHY-CV-DI--------DIKGFF-DNVQ----------------- -HGK--LLKQ-MWAIGI---------RDKR------------------LLSII---------SNLLKA------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------I-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------I------PS-K-----------GTP-----QGGIL------------------S- PLLSNIVL-----------------------------------------------------------NE----------- --------LD--WW-ISNQ-W-----------------ETY--K-------PHRFKDG-----------PNGFTTYAR-- -----------------------KY-T---------NL-KGGYIVR----------YADDFKI-------------MCR- -----TYEE----AQ---------------------------RFYHATVD---------------------FLKA----- ----R--L----GLEI------N-P------EK----S---KVVHLK------------K-------------------- N---------------SSDFLGFKI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|67659680|locus|VBIEntFae233823_1913|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Enterococcus faecium Aus0004] -------------------------------------------------------------------GTDGMTIDDIKQ- -------LSN-----AEIVATVR-------ESL----SNY----RPK--------------------------------- -----------------SVRRV-FI--PK-----------A-G--S-------------D---KMRPL------------ ------GIPC----IW-DRLVQQCILQV-LE-PICEPKF-------H--------------------------------- ---NHSYGFRA------------------NR----------------------------SAHHA---VSRV--------- ----TTLINLS-----------------------KYHY-CV-DV--------DIKGFF-DNVN----------------- -HGK--LLKQ-IWTLGI---------RDKR------------------LICII---------SKMLKA------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------D-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------V------PE-K-----------GTP-----QGGLL------------------S- PLLSLIVL-----------------------------------------------------------NE----------- --------LD--WW-VSSQ-W-----------------ETF--Q-------PKNRSK-------------NGWLQYAK-- -----------------------KY-T---------KL-KSGFIVR----------YADDFKI-------------MCS- -----TYGE----AQ---------------------------RFYHSTVD---------------------FLNK----- ----R--L----KLEI------S-P------EK----S---KVVNLK------------K-------------------- N---------------SSDFLGFKI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >C.d.I1/X98606/13..2658/Clostridium_extraction difficile/Bacterial B/ORF Sequence %28a.a%29 -------------------------------------------------------------------GTNKRTIIDVGE- -------ENP-----YQLVQYVQ-------NRF----NNF----QPH--------------------------------- -----------------SIRRV-EI--PK-----------P-N--G-----------------KTRPL------------ ------GIPT----IE-DRLVQQCIKQI-LE-PILEAKF-------H--------------------------------- ---KHSYGFRP------------------ER----------------------------SSHHA---IA-I--------- ----FQQWTFK-----------------------GFHY-VV-DI--------DIKGFF-DNVN----------------- -HGK--LVKQ-LWTMKI---------RDKT------------------FISIL---------SRMLKA------------ -------------------------------------------------------------------------------- ------------------EV------------------------------------K-----G-I--------------- -------------------------------------------------------------------------------- ---------------G---------K------ST-K-----------GTP-----QGGIL------------------S- PLLANVVL-----------------------------------------------------------NE----------- --------LD--WW-IDSQ-W-----------------DGF--P-------TKRKYSS-------------LLSKTQS-- ----------------------IRKYS---------NL-KEIKIVR----------YADDFKI-------------MCK- -----DYHT----AQ---------------------------KIFLATKQ---------------------WLKV----- ----R--L----DLDI------S-P------EK----S---KVTNLR------------K-------------------- N---------------YSDFLGFKL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|19462591|locus|VBICloKlu111549_0642|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium kluyveri DSM 555] -------------------------------------------------------------------GVNTNTIMDIGE- -------ENP-----DELAIYVR-------ERL----INY----KPQ--------------------------------- -----------------PVRRV-EI--PK-----------P-N--G-----------------KMRPL------------ ------GIPT----IE-DRIIQQCIKQV-LE-PICEAKF-------H--------------------------------- ---KDSYGFRP------------------NR----------------------------STHHA---IART--------- ----YSLANIN-----------------------KLTY-VV-DI--------DIKGFF-DNVN----------------- -HSK--LLKQ-MWTMGI---------QDKN------------------LLCVI---------SKMLKA------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------K-----G-V--------------- -------------------------------------------------------------------------------- ---------------G---------I------PN-K-----------GTP-----QGGIL------------------S- PLLSNIVL-----------------------------------------------------------NE----------- --------LD--WW-ISNQ-W-----------------QTL--K-------SKFPYKR-------------EIFKYQA-- ----------------------LKR-S---------KL-KEVYIVR----------YADDFKL-------------FCR- -----SYNN----AK---------------------------KIFKAVTM---------------------WLKE----- ----R--L----GLEI------N-E------EK----S---SIVNLK------------Q-------------------- K---------------YSEFLGFKF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|115343359|locus|VBIHalHal149681_0148|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Halobacteroides halobius DSM 5150] -------------------------------------------------------------------GTNNKTIKDLEE- -------KST-----EELVEYVR-------NRL----EYY----VPQ--------------------------------- -----------------SVRRV-YI--PK-----------P-D--G-----------------RKRPL------------ ------GIPT----IK-DRLIQQCIKQV-LE-PICEAKF-------H--------------------------------- ---NHSYGFRP------------------NR----------------------------STKHA---IARI--------- ----MYLINFS-----------------------KLHY-TV-DI--------DIKSFF-DNVD----------------- -HNK--LKKQ-LWSMGI---------RDKK------------------LISIL---------GNMLEA------------ -------------------------------------------------------------------------------- ------------------KI------------------------------------E-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------V------PE-K-----------GTP-----QGGII------------------S- PLLSNIVL-----------------------------------------------------------NE----------- --------MD--WW-ISNQ-W-----------------ETF--K-------TDYKYNR-------------KGDKITA-- ----------------------IKK-T---------NL-KEIYIIR----------YADDFKI-------------MCR- -----DFET----AS---------------------------KIKIATIK---------------------WLKE----- ----R--L----NLEV------S-E------KK----T---SITNLK------------K-------------------- N---------------HTEFLGIKL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|115343005|locus|VBIHalHal149681_0330|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Halobacteroides halobius DSM 5150] -------------------------------------------------------------------GTNNKTIKDLEE- -------LTT-----QKLVDYVR-------NRL----EYY----IPQ--------------------------------- -----------------SVRRV-YI--PK-----------P-D--G-----------------RKRPL------------ ------GIPT----IE-DRLIQQCIKQV-LE-PICEAKF-------H--------------------------------- ---NHSYGFRP------------------NR----------------------------STKHA---IART--------- ----MRLINQS-----------------------KLHY-VV-DV--------DIKGFF-DNVD----------------- -HAK--LKKQ-MWSMGI---------KDKK------------------LISII---------GNMLRA------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------E-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------I------PD-K-----------GTP-----QGGII------------------S- PLLSNIVL-----------------------------------------------------------NE----------- --------LD--WW-VSNQ-W-----------------ETF--E-------TDFKYNQ-------------KSNKYQA-- ----------------------LKKRS---------NL-KEVYIVR----------YADDFKI-------------MCR- -----DYEI----AS---------------------------KIKVATIQ---------------------WLKE----- ----R--L----NLDV------S-K------KK----T---KITNLK------------R-------------------- S---------------YTKFLGIKL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|19408375|locus|VBICloBot19908_0265|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium botulinum Ba4 str. 657] -------------------------------------------------------------------GTNHKTINDIAG- -------ESE-----DEIIEYVR-------KRL----NKF----YPH--------------------------------- -----------------SVKRI-YI--PK-----------N-N--G-----------------DKRPL------------ ------GIPT----IE-DRLIQRSILQV-LE-PICEAKF-------H--------------------------------- ---PHSYGFRP------------------NR----------------------------STEHA---IARA--------- ----MTLINMN-----------------------KLHY-VV-DV--------DIKGFF-DNVN----------------- -HGK--LLKQ-LWTLGI---------KDKK------------------LIKII---------SLMLKA------------ -------------------------------------------------------------------------------- ------------------QI------------------------------------K-----D-G--------------- -------------------------------------------------------------------------------- ---------------S---------M--ITN-PV-K-----------GTP-----QGGII------------------S- PLLANVVL-----------------------------------------------------------NE----------- --------LD--WW-ISSQ-W-----------------ETF--E-------TKHNYSK-LRTFKNGTTTIDKSHKYRA-- ----------------------LRN-G---------KL-KEIYIVR----------YADDFKV-------------FCK- -----NPKD----AE---------------------------KIFIAIKL---------------------WLKE----- ----R--L----DLET------S-P------EK----S---KVTNLR------------K-------------------- H---------------PTEFLGFEL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|38137486|locus|VBIBacThu148000_5492|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus thuringiensis BMB171] -------------------------------------------------------------------GSNDTTILEIAE- -------QNL-----TTFVAKVQ-------KAL----ENY----NPK--------------------------------- -----------------PIRRV-YI--PK-----------R-N--G-----------------DKRPL------------ ------GIPT----ME-DRIVQQCIKQI-LE-PICEAKF-------Y--------------------------------- ---NHSYGFRP------------------NR----------------------------NAKHA---IVRA--------- ----MSLMNIS-----------------------KFHY-VV-DI--------DIKGFF-DNVN----------------- -HGK--LLKQ-IWSLGI---------RDKS------------------LLSII---------SKILKT------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------E-----N-V--------------- -------------------------------------------------------------------------------- ---------------G---------K------ME-K-----------GTP-----QGGII------------------S- PLLSNIVL-----------------------------------------------------------NE----------- --------LD--WW-ISSQ-W-----------------ETM--I-------TRHNYES----IDKRNNTIIRSHKYTA-- ----------------------LRRTS---------NL-KEMFLVR----------YADDFKI-------------FCK- -----DFNS----AQ---------------------------KTLIAVKK---------------------WLKN----- ----R--L----GLEV------N-N------EK----S---KVTNLR------------R-------------------- N---------------YTEFLGFKL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|115616442|locus|VBIDehSp228777_1269|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Dehalobacter sp. CF] -------------------------------------------------------------------GTDGLTIKDIAG- -------MTN-----QEVITMVK-------RRL----KNF----TPQ--------------------------------- -----------------SVRRV-EI--LK-----------D-N--G-----------------QNRPL------------ ------GIPT----MS-DRLIQACIYQI-LE-PICEARF-------H--------------------------------- ---NHSYGFRP------------------TR----------------------------RTEHA---LATM--------- ----HRMINIQ-----------------------HLHF-VV-DV--------DIKGFF-DNVD----------------- -HGK--LLKQ-MWTMGI---------QDKN------------------LLCII---------SAMLKA------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------E-----G-I--------------- -------------------------------------------------------------------------------- ---------------G---------I------PN-K-----------GVP-----QGGLC------------------S- PLFSNVVL-----------------------------------------------------------NE----------- --------LD--WW-ISDQ-W-----------------ESY--E-------TSYPYKR-------------NEGKIRA-- ----------------------IRRGS---------KL-KECYIIR----------YCDDFKI-------------MCP- -----TRDV----AE---------------------------RMFVAVKL---------------------WLKE----- ----R--L----NLEI------S-S------EK----S---KITNLR------------K-------------------- K---------------SSEFLGFKI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >En.fm.I1/NZ_AAAK03000007/10877..13634/Enterococcus_extraction faecium/Bacterial B/ORF Sequence %28a.a%29 -------------------------------------------------------------------GTDGTIIKDIGK- -------LPA-----ETVVKKVR-------YIVAGSPHGY----RPK--------------------------------- -----------------PVRRK-EI--PK-----------P-N--G-----------------KTRPL------------ ------GIPC----MW-DRLIQQCIKQV-LE-PICEAKF-------S--------------------------------- ---ENSYGFRP------------------NR----------------------------SVENA---IKAT--------- ----YNRLQIS-----------------------QLHY-VI-EF--------DIKGFF-DNVN----------------- -HSK--LIKQ-IWAMGI---------RDKH------------------LIFIL---------KRILKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------K----MT-N--------------- -------------------------------------------------------------------------------- ---------------G---------T--ITY-PE-K-----------GTP-----QGGII------------------S- PLLANIVL-----------------------------------------------------------NE----------- --------LD--HW-VESQ-W-----------------QEN--P-------VTKNYVV----HINKSGSPCKSNAYKE-- ----------------------MKK-T---------KL-KEMYMVR----------YADDFRV-------------FCR- -----YKES----AE---------------------------KAKIAITQ---------------------WIEQ----- ----R--L----KLEV------S-Q------EK----T---RIVNVR------------K-------------------- R---------------YSDFLGFKI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42996817|locus|VBIEubSir135646_1742|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Eubacterium siraeum 70/3] -------------------------------------------------------------------GTDKLKISDIGK- -------LTA-----DEVTARVR-------RIVKGGKNGY----TPR--------------------------------- -----------------SVRRK-EI--PK-----------P-N--G-----------------STRPL------------ ------GIPC----IW-DRLVQQCIKQV-ME-PICEARF-------S--------------------------------- ---NNSYGFRP------------------NR----------------------------SVENA---IAAI--------- ----YRLMQRS-----------------------GLYY-VV-EF--------DIKGFF-DNVD----------------- -HSK--LIKQ-LWSLNI---------RDKE------------------LLYVI---------RRILKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------L----MP-D--------------- -------------------------------------------------------------------------------- ---------------G---------H--IEH-PA-K-----------GTP-----QGGII------------------S- PLLANVVL-----------------------------------------------------------NE----------- --------LD--HW-IESQ-W-----------------QCN--P-------VTENYSY----RENATGCPIQSHAYRA-- ----------------------MRN-T---------RL-KEMYIVR----------YADDFRI-------------LCR- -----TKEQ----AD---------------------------RTLIAVTH---------------------WLKE----- ----R--L----RLDV------S-P------EK----T---RVVDTR------------R-------------------- S---------------YSEFLGFKI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|18820991|locus|VBIBacCer84800_3811|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus cereus 03BB102] -------------------------------------------------------------------GTNGHTIKHLNK- -------IDA-----DKLIRLTQ-------KRL----ENY----MPH--------------------------------- -----------------AVRRL-FI--SK-----------P-N--G-----------------KMRPL------------ ------GIPT----IE-DRLIQQMFQQV-LE-PIVEGKF-------H--------------------------------- ---PQSYGFRP------------------KR----------------------------GTHDA---LARC--------- ----YHMVNHS-----------------------HQHF-VV-DI--------DIKGFF-DNVN----------------- -HKK--LMRQ-LWTIGI---------RDKK------------------VLSII---------KKMLKA------------ -------------------------------------------------------------------------------- ------------------EV------------------------------------T-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------I------PV-K-----------GTP-----QGGIL------------------S- PLLANVVL-----------------------------------------------------------NE----------- --------LD--WW-VSNQ-W-----------------ETK--P-------TRVPYKL-------------KRNKTDA-- ----------------------LKK-T---------RL-KPMYLVR----------YADDFKI-------------FTN- -----SYDN----AR---------------------------KIKIAVEK---------------------WLKE----- ----R--L----GLEI------S-E------EK----S---KITNLR------------K-------------------- N---------------GTDFLGIRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|18848241|locus|VBIBacCer122868_5594|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus cereus AH820] -------------------------------------------------------------------GTDDKTILDLAN- -------TNQ-----DEFIHYMR-------ELV----LNY----KPK--------------------------------- -----------------SVRRV-WI--DK-----------N-Y--S-------------K---GKRPL------------ ------GIPC----IQ-DRIVQQMFLNV-LE-PICEGKF-------Y--------------------------------- ---NHSYGFRP------------------TR----------------------------TTRHA---VARV--------- ----QTLVNIN-----------------------KYHY-TV-DI--------DIKGFF-DNVN----------------- -HSI--LLKQ-VWNIGI---------RDKR------------------VIAVI---------SKMLKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------K-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------I------PT-K-----------GVP-----QGGIL------------------S- PLLSNIVL-----------------------------------------------------------ND----------- --------LD--QW-VADQ-W-----------------ECF--E-------TRYQYSV-------------NYSKYVN-- ----------------------LRRNS---------KL-KEGFLVR----------YADDFRI-------------MTN- -----THDS----AV---------------------------KWFHAVVD---------------------FLNK----- ----R--L----KLEI------S-P------NK----S---KIINLR------------K-------------------- K---------------SSSFLGYKF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|31950635|locus|VBIBacPse80461_3982|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus pseudofirmus OF4] -------------------------------------------------------------------GTDGYTIIHLAE- -------KNK-----ESFIEEMR-------LRL----ENY----KPQ--------------------------------- -----------------RVRRV-LI--DK-----------N-Y--G-------------T---DKRPL------------ ------GIPT----IA-DRIIQQMFLQV-LE-PICEAKF-------Y--------------------------------- ---NHSYGFRP------------------LR----------------------------STRHA---IARV--------- ----QTLININ-----------------------KLHY-TV-DI--------DIKGFF-DNVN----------------- -HNL--LIKQ-LWNIGV---------KDKR------------------VLAII---------SKMLKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------Q-----K-E--------------- -------------------------------------------------------------------------------- ---------------G---------I------PK-K-----------GVP-----QGGIL------------------S- PLLSNVVL-----------------------------------------------------------ND----------- --------LD--QW-VAGQ-W-----------------ECF--N-------TKHQYSG-------------NDVKIAN-- ----------------------LKRAS---------NL-KEGYIVR----------YADDFRI-------------LAR- -----DHNT----AW---------------------------KWFHAVKG---------------------YLKD----- ----R--L----KLEI------S-N------EK----S---RVINLR------------K-------------------- K---------------SSDFLSYKI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|101938694|locus|VBIBacThu242010_5758|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus thuringiensis MC28] -------------------------------------------------------------------GIDSFAIDQYKS- -------MDK-----AEFLNLVR-------NRL----NQY----KPK--------------------------------- -----------------AVKRV-FI--PK-----------P-N--G-----------------DKRPL------------ ------GIPT----MF-DRLIQQMIKQI-LE-PICEAKF-------Y--------------------------------- ---EHSYGFRP------------------LR----------------------------GARHA---ISRV--------- ----MYLISRN-----------------------TFHY-AV-EI--------DIKGFF-DNVN----------------- -HTL--LLKQ-LWNMGI---------KDKR------------------VLKLI---------YLILKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------K-----G-V--------------- -------------------------------------------------------------------------------- ---------------G---------I------PR-K-----------GTP-----QGGIL------------------S- PLLSNVVL-----------------------------------------------------------ND----------- --------LD--QW-IARQ-W-----------------HHF--Q-------SDYDYTE-------------PGNRSRA-- ----------------------LKR-T---------KL-KQGYIVR----------YADDFKI-------------MAK- -----DFRT----AQ---------------------------KWFMATKL---------------------YLKE----- ----R--L----KLDI------S-P------GK----S---RIINLR------------K-------------------- N---------------KSEFLGYSL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|47119490|locus|VBIEntFae176554_2204|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Enterococcus faecalis 62] -------------------------------------------------------------------GTDSFTIDNYKE- -------MNQ-----AEFIHLIL-------SQL----ENY----KSK--------------------------------- -----------------SIKRV-MI--PK-----------P-N--G-----------------EKRPL------------ ------GIPC----MI-DRIIQQMFKQV-LE-PICEAKF-------Y--------------------------------- ---EHSYGFRP------------------LR----------------------------SAKHA---LGRI--------- ----MYLINIS-----------------------KMHY-AV-DI--------DIKGFF-DNVN----------------- -HRL--LIKQ-LWNIGI---------CDKR------------------VLAIL---------SKSLKS------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------Q-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------I------SS-K-----------GTI-----QGGII------------------S- PLLSNVVL-----------------------------------------------------------ND----------- --------LD--HW-VSKQ-W-----------------HTF--E-------TKYPYTK-------------GYNKFRA-- ----------------------LRD-T---------NL-KQGYIVR----------YADDFKI-------------MTN- -----DYPS----AL---------------------------KWFHAVKL---------------------YLKD----- ----R--L----KLDI------S-N------EK----S---KIVNLR------------K-------------------- R---------------KSEFLGFTI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|87137209|locus|VBIHalHal165146_0228|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Halobacillus halophilus DSM 2266] -------------------------------------------------------------------GTDGKTIDDMKE- -------LSE-----NDLVNEVR-------SKL----QNY----HPK--------------------------------- -----------------KVRRE-WI--EK-----------E-N--G-----------------KWRPL------------ ------GIPC----IL-DRVIQQCFKQV-LE-PIVESQF-------F--------------------------------- ---KHSYGFRP------------------LR----------------------------SAHHA---MARI--------- ----QFLINHS-----------------------QLHY-VV-DV--------DIKSFF-DNVN----------------- -HRL--LKKQ-LWNIGI---------QDRK------------------VLACI---------SKMITS------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------D-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------V------PD-K-----------GSP-----QGGIL------------------S- PLLSNVVL-----------------------------------------------------------ND----------- --------LD--QW-VADQ-W-----------------EVF--P-------LTKSYSS-------------DDARRRA-- ----------------------RKQ-T---------NL-KQGYLVR----------YADDFKI-------------LCR- -----DGKT----AQ---------------------------RWYHAVRL---------------------YLKE----- ----R--L----KLDI------S-P------EK----S---QIVNLR------------K-------------------- R---------------ESEFLGFTI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|190447919|locus|VBIEntSp299569_0686|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Enterococcus sp. HSIEG1] -------------------------------------------------------------------GVDGITISDIER- -------LNE-----NDFVEIIR-------ANL----SNY----RPG--------------------------------- -----------------PVRRV-YI--PK-----------K-N--G-----------------KKRPL------------ ------GIPN----LY-DRIIQQTIKQV-IE-PIVEAKF-------F--------------------------------- ---KHSYGFRP------------------LR----------------------------SVEQA---MGRM--------- ----HSVINNV-----------------------QLHY-VV-DV--------DIKGFF-DNVN----------------- -HNL--LRHQ-IWNMGI---------RDTK------------------LIAII---------SKILRA------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------V-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------T------PV-K-----------GTP-----QGGVL------------------S- PLLANIVL-----------------------------------------------------------ND----------- --------LD--QW-IASQ-W-----------------ENF--P-------SKHRYSR-------------G-KLHRA-- ----------------------LKG-T---------TL-KEGYLVR----------YADDFKL-------------LTR- -----SYSM----AK---------------------------RWYTAIRG---------------------YIEK----- ----H--L----KLEI------S-P------EK----S---GITNLR------------K-------------------- K---------------RTEFLGFEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|202109853|locus|VBIEntMun281267_2992|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Enterococcus mundtii QU 25] -------------------------------------------------------------------GTDGITIDDYKL- -------ANI-----EIFVSYIR-------SVL----SNY----KPQ--------------------------------- -----------------KVRRV-YI--PK-----------S-N--G-----------------KKRPL------------ ------GIPT----MR-DRIIQQMFLQI-LE-PICEAQF-------Y--------------------------------- ---NHSYGFRP------------------NR----------------------------STKHA---MARC--------- ----KFLTRKN-----------------------F-HY-VV-DI--------DIKGFF-DNVN----------------- -HNK--LIKQ-LYTIGI---------KDKR------------------VLAIL---------AKMLKA------------ -------------------------------------------------------------------------------- ------------------TI------------------------------------E-----G-E--------------- -------------------------------------------------------------------------------- ---------------G---------I------PK-K-----------GTP-----QGGIL------------------S- PLLSNVVL-----------------------------------------------------------NE----------- --------LD--WW-IANQ-W-----------------EFL--K-------TKENY-H-------------PAARLKS-- ----------------------LKRKT---------TL-KEMFIVR----------YADDFKI-------------FTK- -----DHQS----AI---------------------------RIYHGVKG---------------------YLSN----- ----H--L----SLDI------S-P------EK----S---KITNLR------------K-------------------- R---------------DSEFLGFSL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|22412306|locus|VBILysSph89750_0101|_extraction Mobile element protein [Lysinibacillus sphaericus C341] -------------------------------------------------------------------GTDGITIEQYKI- -------EDV-----ETFVDEIR-------ATL----KNY----KPQ--------------------------------- -----------------TVRRV-EI--PK-----------P-N--G-----------------KTRPL------------ ------GIPT----MR-DRLIQQMFKQI-LE-PICEARF-------Y--------------------------------- ---NHSYGFRP------------------NR----------------------------STHHA---MGRC--------- ----QFLANIA-----------------------LNQH-VV-DI--------DIQGFF-DNVS----------------- -HSK--LLKQ-MYSIGI---------CDKR------------------VLSVV---------SKMLKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------K-----G-I--------------- -------------------------------------------------------------------------------- ---------------G---------I------PT-K-----------GTP-----QGGIL------------------S- PLLSNIVL-----------------------------------------------------------ND----------- --------LD--WW-ISNQ-W-----------------ENM--K-------TKFNYKE-------------RKNKVLM-- ----------------------IKRTT---------TL-KEMYIVR----------YADDFKI-------------FTK- -----SHKN----AI---------------------------KLYHAVKG---------------------YLKN----- ----H--L----NLDI------S-N------EK----S---KITNLR------------K-------------------- R---------------ASEFLGFSL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|58992730|locus|VBIStrEqu204605_0781|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Streptococcus equi subsp. zooepidemicus ATCC 35246] -------------------------------------------------------------------GTDGKTIVEIQK- -------LPI-----EMVIKTIR-------NKL----NYY----QPK--------------------------------- -----------------NVRRV-EI--PK-----------D-N--G-----------------KTRPL------------ ------GIPS----IW-DRLIQQCVLQV-LE-PICEAKF-------H--------------------------------- ---ERNNGFRP------------------YR----------------------------STQNA---IAQC--------- ----YKMAQIQ-----------------------NLHF-VV-DV--------DITGFF-DNID----------------- -HSK--LIRQ-LWGLGV---------QDRK------------------LIMII---------KQMLKA------------ -------------------------------------------------------------------------------- ------------------DI------------------------------------L-----F-K--------------- -------------------------------------------------------------------------------- ---------------D---------I--VIT-PE-T-----------GTP-----QGGIL------------------S- PLLANVVL-----------------------------------------------------------NE----------- --------LD--WW-VANQ-W-----------------EMF--KI--KEGSTGYEFTK--VDNEGNILTIDRTQKWNK-- ----------------------LRAKT---------GL-KEMYITR----------YADDFKI-------------FCR- -----DYAT----AV---------------------------KVMKATNL---------------------WLAE----- ----N--L----HLQT------S-D------EK----S---GITNLR------------K-------------------- N---------------YTTFLGIKF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.c.I5/AE017195/84166..86938/Bacillus_extraction cereus/Bacterial B/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGKTIQDYLR- -------LSE-----EKLIELIR-------GRL----TNF----KAH--------------------------------- -----------------LIKRV-FI--PK-----------A-N--G-------------G---Q-RPL------------ ------GIPT----IE-DRIIQQMMKQV-LE-PVLEAQF-------F--------------------------------- ---KYSFGFRP------------------ER----------------------------TTYHA---LERV--------- ----KVLVHNT-----------------------GYHW-IV-EG--------DIRQFF-DKVN----------------- -HRI--LIKK-LWSMGI---------KDRR------------------ILCLI---------TEFLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------F----KN-I--------------- -------------------------------------------------------------------------------- ---------------I---------R------ND-N-----------GTP-----QGGIL------------------S- PLLANVYL-----------------------------------------------------------HS----------- --------FD--KW-VAKQ-F-----------------EEF--T-------TRHEYS--------------KHDHKLR-- ----------------------GLKSS---------NL-KPGYLIR----------YADDWVL-------------VTN- -----NKSH----AY---------------------------RWKTVIKN---------------------FLQK----- ----E--L----KLEL------S-E------EK----T---RITNIR------------H-------------------- K---------------PIEFLGFKY------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|202001215|locus|VBIBacThu93926_6557|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus thuringiensis YBT1518] -------------------------------------------------------------------GVDSLTINDILQ- -------ADE-----EKVIHLIT-------NTI----RDY----TPS--------------------------------- -----------------MVRRV-WI--PK-----------A-G--K-------------K---ELRPL------------ ------GIPT----IL-DRIIQQCVKQV-IE-PICEAQF-------F--------------------------------- ---PYSFGFRP------------------YR----------------------------DGHMA---IERV--------- ----GSLIHKT-----------------------KYHW-IV-EG--------DIRKFF-DKVN----------------- -HNI--LLKN-CFKIGI---------QDKR------------------VLMLI---------KAMLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M----HE-N--------------- -------------------------------------------------------------------------------- ---------------T---------K------TT-L-----------GTP-----QGGII------------------S- PILANIYL-----------------------------------------------------------HD----------- --------FD--MW-VYNQ-W-----------------QNK--K-------TRKNYAN-------------KHSRTTT-- ----------------------LKRTT---------KL-KQGYLIR----------YADDWVI-------------VTN- -----SKTN----AI---------------------------KWKKAVSH---------------------YLKD----- ----K--L----KLEL------S-E------EK----T---KITNVR------------K-------------------- K---------------NIEFLGFKL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >G.k.I1/BA000043/1312755..1315536/Geobacillus_extraction kaustophilus/Bacterial B/ORF Sequence %28a.a%29 -------------------------------------------------------------------GTDGKTISDILT- -------LNY-----DEAINFVK-------RCF----KKY----TPN--------------------------------- -----------------PIRRV-HI--PK-----------P-G--K-------------K---EKRPL------------ ------GILT----IA-DRIIQECVRMV-IE-PILEAQF-------F--------------------------------- ---QHSYGFRP------------------YR----------------------------DAKQA---IERC--------- ----VFICNRI-----------------------GYNW-VI-EG--------DIKGFF-DNVN----------------- -HTI--LIKQ-LWHMGI---------RDRR------------------MLMII---------KAMLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------I----KE-T--------------- -------------------------------------------------------------------------------- ---------------K---------I------NE-M-----------GTP-----QGGII------------------S- PLLANVYL-----------------------------------------------------------HK----------- --------LD--QW-ITRE-W-----------------EEK--K-------MRNGTTI-------------RTAKYKS-- ----------------------LRDHS---------TITKPEFYVR----------YADDWVL-------------FTN- -----SRGN----AE---------------------------KWKYRIKK---------------------YLKE----- ----N--L----KLEL------S-D------DK----T---LITNIK------------K-------------------- K---------------PMKFLGFKI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|31950695|locus|VBIBacPse80461_4012|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus pseudofirmus OF4] -------------------------------------------------------------------GTDGETIDDILQ- -------DGY-----ESVISRVR-------KCF----LAY----NPK--------------------------------- -----------------LLRRV-HI--DK-----------Q-V--S-------------K---DKRPL------------ ------GIPA----II-DRIIQECIRMI-IE-PILEAQF-------F--------------------------------- ---SHSYGFRP------------------YR----------------------------SAEHA---LSKV--------- ----TNTAYDT-----------------------NYCW-VV-EG--------DIKKFF-DNVN----------------- -HTI--LIKK-LYSMGI---------RDRR------------------VLMII---------KAMLQC------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L----GE-A--------------- -------------------------------------------------------------------------------- ---------------E---------Q------TT-V-----------GTP-----QGGII------------------S- PLLANAYL-----------------------------------------------------------DS----------- --------LD--HW-ITRE-W-----------------ENK--E-------TKHEYSR-------------LDGKYRA-- ----------------------LKNAS---------NL-KPAHFVR----------YADDWVL-------------ITN- -----SKAN----AI---------------------------KWKQRIAK---------------------HLKE----- ----Q--L----KLEL------S-E------EK----T---LITNIK------------K-------------------- K---------------AIKFVGFHF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|61450525|locus|VBISulAci142080_0388|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Sulfobacillus acidophilus DSM 10332] -------------------------------------------------------------------GSDGQVMSEILQ- -------QQY-----PDIIQRVQ-------SAL----HHY----EPQ--------------------------------- -----------------LLRRV-WI--PK-----------P-G--K-------------A---EKRPL------------ ------GIPA----MI-DRIVQEILRSI-LE-PIMEAQF-------F--------------------------------- ---EHSYGFRP------------------MR----------------------------DAHQA---LART--------- ----TNLVHDT-----------------------GYHW-IV-EG--------DIKGCF-DNIP----------------- -HGK--LLKQ-LWHMGI---------RDRR------------------ILMII---------KQMLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L----HE-A--------------- -------------------------------------------------------------------------------- ---------------P---------H------VD-Q-----------GTP-----QGGIL------------------S- PLLANVYL-----------------------------------------------------------HK----------- --------LD--QW-VTRE-W-----------------EAK--R-------TRFPYKK-------------RRIRLEA-- ----------------------LQERS---------RL-KPAYFVR----------YADDWIL-------------ITD- -----CKAH----AV---------------------------AWKQRIAQ---------------------YLDQ----- ----N--L----SLTL------S-Q------DK----T---KITNVR------------R-------------------- Q---------------SIHFLGFQF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|96574781|locus|VBIBacCer255427_4629|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus cereus FRI35] -------------------------------------------------------------------GVDGTTINDYLQ- -------MDR-----KQLINLIQ-------SQI----DNY----NPS--------------------------------- -----------------TVRRT-YI--PK-----------G-N--T-------------G---KLRPL------------ ------GIPV----IV-DRIIQEIARMA-IE-PYCEAKF-------Y--------------------------------- ---PHSYGFRP------------------YR----------------------------SSEHA---IARI--------- ----VQNIN-S-----------------------KAYI-AI-EG--------DIKGYF-DNIN----------------- -HNK--LLAI-LWEMGI---------KDKQ------------------FLFLI---------KKMLKS------------ -------------------------------------------------------------------------------- ------------------KI------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------N--IIS-SD-K-----------GTP-----QGGII------------------S- PLLANVYL-----------------------------------------------------------NN----------- --------FD--RM-VSDL-W-----------------ESH--S-------AVTTYAA-------------TRNGKTV-- ----------------------EEKNYQFLRKKSVAKH-YKTNLVR----------YADDWII-------------LTE- -----TKEY----AE---------------------------KLLTKLRK---------------------YMKH----- ----Q--L----SLEL------S-E------EK----T---VITDSR------------E-------------------- E---------------PLHFLGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|18919101|locus|VBIBacCer120424_5683|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus cereus Q1] -------------------------------------------------------------------GIDKKDVNYYLQ- -------MEA-----KQLIKLIR-------QHI----DNY----KPN--------------------------------- -----------------PVRRE-YI--NK-----------G-N--G-------------K---K-RPL------------ ------GIPT----MI-DRIIQEIARIV-LE-PIAEAKF-------F--------------------------------- ---NHSYGFRP------------------YR----------------------------SCHYA---IGRV--------- ----LNTISRS-----------------------KTYI-AI-EG--------DIKSFF-DHIN----------------- -HNK--LVEM-MWNMGI---------KDKR------------------FLIII---------KKMLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----E-D--------------- -------------------------------------------------------------------------------- ---------------K---------V--ILP-TE-I-----------GTP-----QGGII------------------S- PLLANIYL-----------------------------------------------------------NN----------- --------FD--WM-VAKE-F-----------------EEHRAR-------YTVKHAF-------------R-SGLTK-- ----------------------VGR-----------RH-KKCFLIR----------YADDWII-------------LCE- -----DTVQ----AR---------------------------ILLTKIDK---------------------YYKH----- ----I--L----KLEL------S-K------EK----T---FITDLR------------E-------------------- K---------------PARFLGFDI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|31950623|locus|VBIBacPse80461_3976|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus pseudofirmus OF4] -------------------------------------------------------------------GIDNKTIDYYLH- -------LPY-----EDLVSQVQ-------TCI----EDY----NPE--------------------------------- -----------------PVRRK-YI--PK-----------E-N--S-------------D---KLRPL------------ ------GIPT----MI-DRIIQEITRLV-IE-PIAEAKF-------Y--------------------------------- ---KFSYGFRP------------------MR----------------------------SAEHA---MAEI--------- ----LEKARKS-----------------------KTYW-VI-EG--------DIKGYF-DNIN----------------- -HNK--LITM-LWKIGI---------KDKR------------------VLSII---------KKMLKS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------V----EE-D--------------- -------------------------------------------------------------------------------- ---------------G---------E--IYP-SD-L-----------GSP-----QGGII------------------S- PLLANIYL-----------------------------------------------------------NF----------- --------FD--WM-IAEE-F-----------------DQH--H-------YINNYER-------------RDKGLRA-- ----------------------IRR-----------DH-KPVYSIR----------YADDWVV-------------LCS- -----SKKQ----AD---------------------------TLLIKIRK---------------------YLKH----- ----Q--L----SLEL------S-E------EK----T---KITNLV------------E-------------------- E---------------KASFLGFEFFV----------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|45223831|locus|VBIGeoSp94955_1285|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Geobacillus sp. Y412MC52] -------------------------------------------------------------------GIDQKIVDDYLL- -------MPT-----EKVFGMIK-------AKL----NDY----KPI--------------------------------- -----------------PVRRC-NK--PK-----------G-N--AKSSKRKGNSPNEEG---ETRPL------------ ------GISA----VT-DRIIQEMLRIV-LE-PIFEAQF-------Y--------------------------------- ---PHSYGFRP------------------YR----------------------------STEHA---LAWM--------- ----LKIINGS-----------------------KLYW-VV-KG--------DIESYF-DHIN----------------- -HKK--LLNI-MWNMGV---------RDKR------------------VLCIV---------KKMLKA------------ -------------------------------------------------------------------------------- ------------------GQ------------------------------------V-----I-Q--------------- -------------------------------------------------------------------------------- ---------------G---------K--FYP-TA-K-----------GIP-----QGGII------------------S- PLLANVYL-----------------------------------------------------------NS----------- --------FD--WM-VGQE-Y-----------------EYH--P-------NNANYRE-------------KKNALAA-- ----------------------LRNK----------GH-HPVFYIR----------YADDWVI-------------LTD- -----TKEY----AE---------------------------KIREQCKQ---------------------YLAC----- ----E--L----HLTL------S-D------EK----T---FIADIR------------E-------------------- Q---------------RVKFLGFCI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Pe_ph_I1/CP001110/398581_extraction Pe.ph.I1/CP001110/398581 400415/Pelodictyon phaeoclathratiforme/Bacterial D/Bacterial D/ORF Sequence %28a.a%29 413 bp -------------------------------------------------------------------GVDRESLQAFET- --KLK--DNL-----YKVWNRLS-------S------GSY----FPP--------------------------------- -----------------PVRGV-GI--PK--------K--S----G-----------------GVRML------------ ------GVPT----VA-DRVAQSVVKMV-LE-PILEPVF-------H--------------------------------- ---EDSYGYRP------------------GR----------------------------SAHDA---IAVV--------- ----RKRNW-------------------------EYDW-VV-EF--------DIKGLF-DNID----------------- -HEL--LMRA-L-RKHC---------QTPW------------------VFLYV---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------E-----T-PE-------------- -------------------------------------------------------------------------------- ---------------G---------E--LIE-RT-K-----------GTP-----QGGVV------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------AF--DR-WVSENL----------------------------------------------------------- ---------------------------------------PGVPFCR----------YSDDGVL-------------HCK- -----SKIQ----AE---------------------------LVKRKIGE---------------------RF-R----- ----E--C----GLEL------H-P------DK----T---QIVYCR------------D-------------------- S-NRKD--EHPVN---QFTFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D115_extraction 418 bp -------------------------------------------------------------------GVDQESIEAFEK- --NLK--GNL-----YKLWNRLS-------S------GSY----FPP--------------------------------- -----------------PVKGV-GI--PK--------K--T----G-----------------GIRML------------ ------GVPT----VA-DRVAQTVGKET-LE-PLLEPIF-------H--------------------------------- ---QDSYGYRP------------------GR----------------------------SALDA---VGVV--------- ----RERCW-------------------------KYDW-VV-EF--------DISKFF-DTMN----------------- -HEL--LMRA-V-RKHC---------QIEW------------------VLLYV---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------M-----S-PE-------------- -------------------------------------------------------------------------------- ---------------G---------D--LVE-RT-K-----------GTP-----QGGVI------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------AF--DR-WVSENL----------------------------------------------------------- ---------------------------------------PGVPFCR----------YADDGVL-------------HCK- -----SKEQ----AV---------------------------LVMKKITK---------------------RF-E----- ----A--C----GLRV------N-P------DK----T---RIVYCK------------D-------------------- D-KRKE--DHPVT---SFTFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Pr_ae_I3/CP001108/2285675__2287507/Prosthecochloris_extraction Pr.ae.I3/CP001108/2285675..2287507/Prosthecochloris aestuarii/Bacterial D/ORF Sequence %28a.a%29 413 bp -------------------------------------------------------------------GVDHETIEQFDR- --HLK--DNL-----YKIWNRMS-------S------GSY----FPP--------------------------------- -----------------PVKSV-PI--PK--------K--S----G-----------------GERVL------------ ------GIPT----VS-DRIAQTVVKLM-LE-PILDPLF-------H--------------------------------- ---KNSYGYRP------------------GR----------------------------SALDA---VAMV--------- ----RRRCW-------------------------EYDW-VV-EF--------DIKGLF-DNID----------------- -HDL--LMRA-L-RKHC---------ETPW------------------ILLYV---------KRWLKA------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------Q-----T-AT-------------- -------------------------------------------------------------------------------- ---------------G---------A--IVE-RS-S-----------GTP-----QGGVV------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------AF--DM-WVTQNL----------------------------------------------------------- ---------------------------------------RSVRFCR----------YADDGVI-------------HCK- -----SREQ----AE---------------------------LVLHKIRK---------------------RF-E----- ----Q--C----KLEL------H-P------DK----T---RIAYCQ------------D-------------------- V-NRQE--AYPNV---QFTFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >UA_I10/FP565147_1/504827__502990/uncultured_archaeon_extraction UA.I10/FP565147.1/504827..502990/uncultured_archaeon /Bacterial D/ORF Sequence %28a.a%29 414 bp -------------------------------------------------------------------GVDKKSIADFEK- --DLN--NNL-----YKIRNRMS-------S------GSY----FPP--------------------------------- -----------------PVRTV-GI--PK--------K--S----G-----------------GERLL------------ ------GIPT----VA-DRVAQTVAKMY-LE-PLVEPYF-------H--------------------------------- ---KDSYGYRP------------------GK----------------------------SAIQA---VGVT--------- ----RKRCW-------------------------RYDW-ML-EF--------DIKGLF-DNIN----------------- -HNL--LIRA-V-RKHT---------NCKW------------------MLLYI---------DRWLKA------------ -------------------------------------------------------------------------------- ------------------PF------------------------------------Q-----R-QD-------------- -------------------------------------------------------------------------------- ---------------G---------T--LVQ-RE-K-----------GTP-----QGGVI------------------S- PLLANLVL-----------------------------------------------------------HY----------- --------VF--DK-WMERNY----------------------------------------------------------- ---------------------------------------PQVPFCR----------YADDGVV-------------HCR- -----SEAE----AL---------------------------KLRKTLGA---------------------RF-G----- ----K--Y----NLEL------H-P------EK----T---KIVYCK------------D-------------------- D-DRRD--DYPNT---SFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D411_extraction 411 bp -------------------------------------------------------------------GIDEVSLEEFEA- --DLD--NNL-----YKIWNRMT-------S------GSY----FPP--------------------------------- -----------------PVKAI-EI--EK--------K--S----G-----------------GKRVL------------ ------GIPT----VG-DRVAQMVAKIY-LN-PLVDPYF-------H--------------------------------- ---KDSYGYRE------------------GK----------------------------SAIDA---LEVT--------- ----RQRCW-------------------------QYDW-VL-EF--------DIKGLF-DNID----------------- -HEL--LMRA-V-KKHV---------KIPW------------------LILYI---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------PF------------------------------------I-----Q-AN-------------- -------------------------------------------------------------------------------- ---------------G---------R--VEE-RS-K-----------GTP-----QGGVI------------------S- PVLANLFM-----------------------------------------------------------HY----------- --------AF--DK-WMERTH----------------------------------------------------------- ---------------------------------------PDKPFAR----------YADDGVI-------------HCR- -----TLEE----AR---------------------------LLLESLKE---------------------RM-E----- ----E--C----KLKL------H-P------EK----T---RIVYCK------------D-------------------- D-KRKG--EYPNT---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|21725011|locus|VBIDesAce42372_4086|_extraction Mobile element protein [Desulfotomaculum acetoxidans DSM 771] -------------------------------------------------------------------GIDDESLEAFEA- --NLK--NNL-----YKIWNRMS-------S------GSY----FPP--------------------------------- -----------------PVKAV-EI--PK--------K--T----G-----------------GKRIL------------ ------GVPT----VA-DRVAQMVAKIY-FE-PLVEPHF-------H--------------------------------- ---PDSYGYRP------------------GK----------------------------SAVDA---LAVT--------- ----RQRCW-------------------------KYDW-VL-EF--------DIKGLF-DNIN----------------- -HDL--LMKA-V-RKHT---------DNPW------------------VILYI---------QRWLKA------------ -------------------------------------------------------------------------------- ------------------PF------------------------------------Q-----M-PD-------------- -------------------------------------------------------------------------------- ---------------G---------M--LKE-RT-K-----------GTP-----QGGVI------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------AY--DV-WMARNH----------------------------------------------------------- ---------------------------------------PDKPFAR----------YADDSVA-------------HCR- -----SKKD----AE---------------------------KLHDSLKE---------------------RF-A----- ----E--C----ELEL------H-P------DK----T---RIVYCK------------D-------------------- D-DRRG--EHQET---KFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >M_a_I53/AE011185/2451_extraction M.a.I53/AE011185/2451 4325/Methanosarcina acetivorans/Bacterial D/Bacterial D/ORF Sequence %28a.a%29 415 bp -------------------------------------------------------------------GVDDENIAAFES- --DLT--NNL-----YKIWNRMS-------S------GCY----FPP--------------------------------- -----------------SVKAI-EI--PK--------K--S----G-----------------GTRIL------------ ------GIPT----VL-DRVAQMVTKIY-LE-PQLEPLF-------H--------------------------------- ---PDSYGYRP------------------GK----------------------------SAADA---LAAT--------- ----RKRCW-------------------------RYNW-LL-EF--------DIKGLF-DNIN----------------- -HDL--LMKQ-V-SMHT---------DKPW------------------IILYI---------QRWLKA------------ -------------------------------------------------------------------------------- ------------------PF------------------------------------Q-----M-AD-------------- -------------------------------------------------------------------------------- ---------------G---------T--VNE-RT-K-----------GTP-----QGGVV------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------AF--DQ-WMDSHH----------------------------------------------------------- ---------------------------------------RYNPFER----------YADDSVI-------------HCR- -----SREE----AE---------------------------RLWIELDK---------------------RL-S----- ----E--F----GLEL------H-P------SK----T---RIVYCK------------D-------------------- D-DRQG--DYPET---KFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|61457293|locus|VBISulAci142080_3722|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Sulfobacillus acidophilus DSM 10332] -------------------------------------------------------------------GVDGESLRRFEE- --DLK--NNL-----YKIWNRMS-------S------GSY----FPP--------------------------------- -----------------PVKAV-EI--PK--------K--S----G-----------------GVRIL------------ ------GVPT----VA-DRIAQMVVKLT-FE-PLVEPIF-------H--------------------------------- ---PDSYGYRP------------------GR----------------------------SAHDA---LAQT--------- ----RQRCW-------------------------RYDW-VL-EF--------DIKGLF-DNIP----------------- -HDL--LMKA-V-RQHT---------DNPW------------------GLLYI---------ERWLVA------------ -------------------------------------------------------------------------------- ------------------PL------------------------------------Q-----R-AD-------------- -------------------------------------------------------------------------------- ---------------G---------S--QEP-RT-C-----------GTP-----QGSVI------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------AF--DV-WMSRRH----------------------------------------------------------- ---------------------------------------ADKPFER----------YADDAVV-------------HCR- -----SYAL----AA---------------------------ALKEDLAR---------------------RL-A----- ----G--C----GLEW------H-P------TK----T---RIVYCQ------------D-------------------- D-DRRD--TYPET---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|161812455|locus|VBICloPas18034_4908|_extraction Retrontype RNAdirected DNA polymerase [Clostridium pasteurianum BC1] -------------------------------------------------------------------GVDKESIEDFEK- --NLK--NNL-----YKIWNRMS-------S------GTY----FPP--------------------------------- -----------------PVRAV-EI--PK--------K--N----G-----------------GIRIL------------ ------GVPT----VS-DRIAQMVVKIH-FE-PKVEPIF-------H--------------------------------- ---PDSYGYRP------------------MK----------------------------SAIDA---IAIV--------- ----RKRCW-------------------------RYNW-VL-EF--------DIKGLF-DNIN----------------- -HEL--LMKA-V-RKHT---------NCTW------------------ILLYI---------ERWLTA------------ -------------------------------------------------------------------------------- ------------------PL------------------------------------Q-----D-KD-------------- -------------------------------------------------------------------------------- ---------------G---------S--IIT-RT-S-----------GTP-----QGSVI------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------TF--DK-WMEINF----------------------------------------------------------- ---------------------------------------PSNPWAR----------YADDAVA-------------HCK- -----TKYE----AD---------------------------NLLIKLNQ---------------------RF-K----- ----Q--C----ALEL------H-P------EK----T---QIVYCK------------D-------------------- D-DRRG--NYPIT---KFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D152_extraction 414 bp -------------------------------------------------------------------GIDEQSIDEFER- --NLK--DNL-----YKVWNRMS-------S------GSY----IPP--------------------------------- -----------------AVKAV-EI--PK--------K--A----G-----------------GIRTL------------ ------GIPT----VA-DRIAQMTVKLY-FE-PLVEPFF-------H--------------------------------- ---EDSYGYRP------------------KK----------------------------SAIQA---IETT--------- ----RKRCW-------------------------KYNW-VL-EF--------DIKGLF-DNID----------------- -HEL--LMRA-V-DKHT---------DIEW------------------VKLYI---------KRWLTA------------ -------------------------------------------------------------------------------- ------------------PF------------------------------------Q-----T-KE-------------- -------------------------------------------------------------------------------- ---------------G------------IKE-RT-S-----------GTP-----QGGVI------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------AF--DK-WMAINH----------------------------------------------------------- ---------------------------------------PRNPFAR----------YADDAVI-------------HCK- -----TEEE----AK---------------------------RVLESLNQ---------------------RM-N----- ----E--C----KLEL------H-P------SK----T---KIVYCK------------D-------------------- A-DRRE--DHKNI---TFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Le_pn_I3/CP000675_2/2801059__2799175/Legionella_extraction Le.pn.I3/CP000675.2/2801059..2799175/Legionella pneumophila/Bacterial D/ORF Sequence %28a.a%29 454 bp -------------------------------------------------------------------GIDNQSIDEFSQ- --DLK--GNL-----YKLWNRMS-------S------GSY----FPP--------------------------------- -----------------AVKEV-AI--PK--------K--Q----G-----------------GVRKL------------ ------GIPT----VA-DRIAQMTVKLM-ME-PLLEPHF-------L--------------------------------- ---DDSYGYRP------------------NK----------------------------SALDA---VGVT--------- ----RKRCW-------------------------EYDW-VV-EF--------DIKGLF-DNLS----------------- -HEL--LMKA-V-KHHI---------SDRW------------------ILLYV---------ERWLTA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------Q-----D-QH-------------- -------------------------------------------------------------------------------- ---------------G---------G--CLP-RT-A-----------GTP-----QGGVI------------------S- PLLSNLFL-----------------------------------------------------------HY----------- --------AF--DH-WMTKHH----------------------------------------------------------- ---------------------------------------PDNPWCR----------YADDGLA-------------HCR- -----TEKE----AE---------------------------QMLKEIDK---------------------RF-K----- ----S--L----GLEI------H-P------DK----T---KIVYCK------------D-------------------- G-ARKG--KYKNK---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Dh_re_I1/CP001734_1/751437__753288/Desulfohalobium_extraction Dh.re.I1/CP001734.1/751437..753288/Desulfohalobium retbaense/Bacterial D/ORF Sequence %28a.a%29 441 bp -------------------------------------------------------------------GVDRQSLEDFEK- --DLK--NNL-----YKLWNRMS-------S------GSY----MPP--------------------------------- -----------------LVKGV-EI--PK--------K--S----G-----------------GTRLL------------ ------GVPA----VS-DRIAQMAARLE-FE-AQVEPHF-------L--------------------------------- ---PDSYGYRP------------------KK----------------------------SARQA---IDVT--------- ----RKRCW-------------------------DQDW-VL-EF--------DIKGLF-DNID----------------- -HDL--LMKA-V-EKHT---------DNPW------------------VRLYI---------RRWLKA------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------Q-----L-ES-------------- -------------------------------------------------------------------------------- ---------------G---------E--LVD-RD-K-----------GTP-----QGGVI------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------VF--DA-WLTKHY----------------------------------------------------------- ---------------------------------------PRVKWCR----------YADDGLV-------------HCE- -----SEAQ----AR---------------------------FLLEALRQ---------------------RF-K----- ----E--C----GLEL------H-P------EK----T---KIVYCK------------D-------------------- G-RRTG--DYPQT---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >E_c_I2/X77508/518_extraction E.c.I2/X77508/518 2408/Escherichia coli/Bacterial D/Bacterial D/ORF Sequence %28a.a%29 416 bp -------------------------------------------------------------------GIDKQSLADFDK- --RLV--DNL-----YKIWNRLS-------S------GSY----FPP--------------------------------- -----------------AVKAV-AI--PK--------K--L----G-----------------GERIL------------ ------GIPT----VS-DRIAQTVVKLA-FE-PQVEPHF-------L--------------------------------- ---ADSYGYRP------------------NK----------------------------SALDA---IGVT--------- ----RKRCW-------------------------YYDW-VL-EF--------DIKGLF-DNIP----------------- -HEL--IMKA-V-DKHN---------PARW------------------VKLYI---------QRWLTA------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------V-----M-SD-------------- -------------------------------------------------------------------------------- ---------------G---------E--VRA-RT-M-----------GTP-----QGGVI------------------S- PLLANLFM-----------------------------------------------------------HY----------- --------VF--DK-WLAKYY----------------------------------------------------------- ---------------------------------------PKVPWYR----------YADDGIL-------------HCH- -----SEAE----AT---------------------------EMREVLRK---------------------RF-S----- ----E--C----GLEM------H-P------EK----T---RVIYCK------------D-------------------- G-SRKG--DYEHT---MFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|87202399|locus|VBIMetAlc68050_2225|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Methylomicrobium alcaliphilum] -------------------------------------------------------------------GVDRQSLADFER- --NLK--DNL-----YKLWNRLS-------S------GSY----FPP--------------------------------- -----------------PVKAV-AI--PK--------K--A----G-----------------GERIL------------ ------GIPT----VS-DRIAQMVVKLE-FE-PQVEPHF-------L--------------------------------- ---PDSYGYRP------------------NK----------------------------SALDA---VGVT--------- ----RERCW-------------------------RYDW-VL-EF--------DIKGLF-DNIP----------------- -HDL--LLKA-V-YKHT---------DTAW------------------VRLYI---------ERWLTV------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------Q-----M-PN-------------- -------------------------------------------------------------------------------- ---------------G---------E--LSS-RG-K-----------GTP-----QGGVV------------------S- PVLSNLFL-----------------------------------------------------------HY----------- --------VF--DK-WLQKHY----------------------------------------------------------- ---------------------------------------SDTPWCR----------YADDGLV-------------HCR- -----SEAE----AK---------------------------HMLEALKQ---------------------RF-Q----- ----S--C----GLEL------H-P------VK----T---KIVYCK------------D-------------------- G-SRKG--RYKHT---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|108024353|locus|VBILegPne122099_3574|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Legionella pneumophila subsp. pneumophila] -------------------------------------------------------------------GVDEESLEDFAK- --DLK--NNL-----YKLWNRMS-------S------GSY----FPP--------------------------------- -----------------AVKAV-PI--PK--------K--S----G-----------------GERML------------ ------GIPT----VA-DRIAQMVVKLV-FE-PIVEPHF-------H--------------------------------- ---PDSYGYRP------------------NK----------------------------SALDA---VGIT--------- ----RQRCW-------------------------QYDW-VL-EY--------DIRGLF-DNID----------------- -HQL--LMKA-V-RKHT---------DSKW------------------VLLYI---------ERWLVT------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------Q-----L-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--LQE-KV-K-----------GVM-----QGGVI------------------S- PVLSNLFL-----------------------------------------------------------HY----------- --------VF--DS-WMVRNA----------------------------------------------------------- ---------------------------------------TKMSWCR----------YADDGLV-------------HCK- -----TKFE----AQ---------------------------QIRRRLEA---------------------RF-I----- ----E--C----GLEM------H-P------DK----T---KIVYCK------------D-------------------- S-NRRL--NYQNT---SFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|23549090|locus|VBIShePut135485_0278|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Shewanella putrefaciens CN32] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ----------------------MVVKLS-FE-PLVEPHF-------L--------------------------------- ---NDSYGYRP------------------NR----------------------------SAIDA---VGVT--------- ----RKRCW-------------------------YQDW-VL-EF--------DIKGLF-DNIS----------------- -HEL--LMKA-V-RKHT---------DCKW------------------LLLYI---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------V-----K-DN-------------- -------------------------------------------------------------------------------- ---------------N---------E--VIE-RN-M-----------GTP-----QGGVI------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------VF--DK-WMHKNH----------------------------------------------------------- ---------------------------------------PGVKWCR----------YADDGLV-------------HCN- -----SEEQ----AQ---------------------------KMRAELEK---------------------RF-K----- ----D--C----GLEM------H-P------TK----T---KIVYCK------------D-------------------- G-TRKG--QYENT---AFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ag_ra_I1/CP000629_1/1749091__1750949/Agrobacterium_extraction Ag.ra.I1/CP000629.1/1749091..1750949/Agrobacterium radiobacter/Bacterial D/ORF Sequence %28a.a%29 419 bp -------------------------------------------------------------------GIDGQTIADFEA- --DLR--NNL-----YKLWNRLA-------S------GSY----FPP--------------------------------- -----------------PVRRV-DI--PK--------S--D----G-----------------KTRPL------------ ------GIPT----VA-DRVAQMVVKRH-LE-PVVEPEF-------H--------------------------------- ---PDSYGYRP------------------GK----------------------------SALDA---ISVA--------- ----RQRCW-------------------------RYNW-VL-DL--------DIKAFF-DSIE----------------- -PDL--LMRA-V-RKHT---------DCPW------------------VLLYI---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------Q-----M-PD-------------- -------------------------------------------------------------------------------- ---------------G---------N--LVA-RE-R-----------GTP-----QGGVI------------------S- PLLASLFL-----------------------------------------------------------HY----------- --------AF--DM-WMCRNF----------------------------------------------------------- ---------------------------------------PDIPFER----------YADDAIC-------------HCR- -----SEDQ----AM---------------------------ALQNALDA---------------------RF-T----- ----D--C----GLTL------H-P------DK----T---KIVYCR------------D-------------------- E-SRRG--THPVY---KFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D177_extraction 469 bp -------------------------------------------------------------------GIDGQTLAGFDE- --NVA--DNL-----YKLWNRLA-------S------GSY----MPQ--------------------------------- -----------------AVRRV-EI--PK--------A--D----G-----------------GVRPL------------ ------GIPA----VS-DRIAQMVVKQV-LE-PVLEPLF-------H--------------------------------- ---ADSYGYRP------------------GK----------------------------SAHQA---IAQA--------- ----RTRCW-------------------------QFDW-VV-EI--------DIKGFF-DNID----------------- -HAL--LLKA-V-RHHT---------QERW------------------LVMYI---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------Q-----M-PD-------------- -------------------------------------------------------------------------------- ---------------G---------R--VQV-RN-R-----------GTP-----QGGVI------------------S- PLISNLFL-----------------------------------------------------------HY----------- --------AF--DM-WMKRQF----------------------------------------------------------- ---------------------------------------PGVPFER----------YADDVVC-------------HCH- -----SQSQ----AE---------------------------ALISKAGQ---------------------RF-A----- ----Q--C----GLEL------H-P------QK----T---RMVYCK------------D-------------------- A-DRRG--NYAET---RFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Chlorobifid|23040926|locus|VBIProAes37017_0657|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Prosthecochloris aestuarii DSM 271] -------------------------------------------------------------------GVDGQSIAEFDE- --AME--NNL-----YKLWNRLA-------S------GSY----MPP--------------------------------- -----------------PVKRV-EI--PK--------A--D----G-----------------GLRPL------------ ------GVPT----VA-DRIAQTVVKQV-LE-PEMERHF-------H--------------------------------- ---PDSYGYRP------------------GK----------------------------SAHQA---VGEA--------- ----RKRCW-------------------------RNDW-VV-DL--------DIRGFF-DAID----------------- -HEL--LMRA-L-HSHT---------QERW------------------VLLYI---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------Q-----L-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--LQK-RG-A-----------GTP-----QGGVI------------------S- PLLANLML-----------------------------------------------------------HY----------- --------TF--DA-WMQRMF----------------------------------------------------------- ---------------------------------------PHVSFER----------YADDGVC-------------HCR- -----TREQ----AE---------------------------ELMAALKQ---------------------RF-V----- ----D--C----KLEL------H-P------EK----T---RIIYCK------------D-------------------- D-DRCG--NYPVT---SFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D172_extraction 412 bp -------------------------------------------------------------------GVDGQSIEEFEQ- --DLS--GNL-----YKLWNRLA-------S------GSY----MPP--------------------------------- -----------------AVRCV-EI--PK--------A--T----G-----------------GTRPL------------ ------GIPT----VA-DRIGQMVVKDA-LE-PILEPCF-------H--------------------------------- ---HDSYGYRP------------------NK----------------------------SAHDA---LAVA--------- ----RQRCW-------------------------RAAW-VL-DV--------DIKGFF-DNID----------------- -HAL--LMKA-V-RKHI---------DCRW------------------ITLYI---------ERWLTA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------Q-----L-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--SQA-RN-K-----------GTP-----QGGVI------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------VF--DM-WMVRNF----------------------------------------------------------- ---------------------------------------PANGFER----------YADDVVI-------------HST- -----SLKQ----VT---------------------------MLRAQLTE---------------------RL-A----- ----D--C----KLEM------S-P------GK----T---KIVYCK------------D-------------------- K-RRKG--GYPEI---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|186035780|locus|VBIPhoTem255998_1848|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Photorhabdus temperata subsp. temperata M1021] -------------------------------------------------------------------GIDGQSIEEFEK- --NLA--GNL-----YKLWNRMA-------S------GSY----MPP--------------------------------- -----------------AVRRV-EI--PK--------A--T----G-----------------GTRPL------------ ------GIPT----VA-DRIAQMVVKDM-LE-PILEPQF-------H--------------------------------- ---VDSYGYRP------------------HK----------------------------SAHDA---LRVA--------- ----RQRCW-------------------------RTDW-VL-DV--------DIKGFF-DNID----------------- -HEL--LMRA-V-RRHT---------DCRW------------------VLLYI---------ERWLTA------------ -------------------------------------------------------------------------------- ------------------SV------------------------------------H-----M-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--VQI-RD-K-----------GTP-----QGGVI------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------AF--DM-WMKREF----------------------------------------------------------- ---------------------------------------PIVRFER----------YADDIVI-------------HCK- -----SHAQ----AM---------------------------MLRGKLRK---------------------RL-A----- ----E--C----KLEM------S-P------GK----T---KVVYCK------------D-------------------- R-ERTE--AYPEI---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|190234795|locus|VBISalEnt166060_0668|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Salmonella enterica subsp. enterica serovar Newport str. USMARCS3124.1] -------------------------------------------------------------------GVDGQTIETFEG- --NLS--GNL-----YKLWNRMA-------S------GSY----MPP--------------------------------- -----------------PVRRV-EI--PK--------A--T----G-----------------GTRPL------------ ------GIPT----VA-DRIAQMVVKDV-LE-PILEPHF-------H--------------------------------- ---NDSYGYRP------------------HK----------------------------SAHDA---LRAA--------- ----RHRCW-------------------------RSNW-VL-DV--------DIKGFF-DNID----------------- -HDL--LMKA-V-CKHT---------RCKW------------------TELYI---------RRWLTA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------Q-----M-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--LHA-RD-R-----------GTP-----QGGVI------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------AF--DL-WMTRQY----------------------------------------------------------- ---------------------------------------PDRPFER----------YADDIVI-------------HCN- -----SQAQ----AI---------------------------VLRNKLEH---------------------RL-A----- ----E--C----KLEL------S-Q------SK----T---KIVYCK------------D-------------------- G-KRRE--NYPDI---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115665938|locus|VBIChaMin231992_1582|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Chamaesiphon minutus PCC 6605] -------------------------------------------------------------------GVDGQTIEKFEE- --NLS--DNL-----YKLWNRMT-------S------GSY----FPS--------------------------------- -----------------PVLRV-EI--PK--------G--D----G-----------------RMRPL------------ ------GIPT----VS-DRVAQMVAKDL-LE-PELEKHF-------H--------------------------------- ---PDSYGYRP------------------GK----------------------------SALDA---VGMA--------- ----RKRCW-------------------------KSNW-VL-EL--------DIKGFF-DNID----------------- -HEL--MMRA-V-RVHT---------EEKW------------------VILYI---------ERWLKS------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------Q-----M-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--KQL-PD-K-----------GLP-----QGGVA------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------AF--DK-WMERKN----------------------------------------------------------- ---------------------------------------PDIQFER----------YADDAVC-------------HCK- -----SEAQ----AQ---------------------------KLKQDLNE---------------------RM-K----- ----E--V----GLEL------H-P------EK----T---NIVYCK------------D-------------------- D-DRRE--EYPLT---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Pa_de_I1/CP000491/19065_extraction Pa.de.I1/CP000491/19065 20924/Paracoccus denitrificans/Bacterial D/Bacterial D/ORF Sequence %28a.a%29 413 bp -------------------------------------------------------------------GVDGQTLESFGE- --RLG--PNL-----YKLWNRMS-------S------GSY----MPS--------------------------------- -----------------SVRRV-MI--PK--------A--D----G-----------------GQRPL------------ ------GIPT----VT-DRIAQEVVRLY-LE-PLVEPVF-------H--------------------------------- ---RDSYGYRP------------------ER----------------------------SAIDA---IRKA--------- ----RQRCW-------------------------RYDW-VL-DM--------DIKGFF-DTID----------------- -HEL--LLKA-V-RHHT---------DCRW------------------VLLYI---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------R-----M-ED-------------- -------------------------------------------------------------------------------- ---------------G---------S--LVP-QE-R-----------GTP-----QGGVI------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------AF--DR-WLDREN----------------------------------------------------------- ---------------------------------------PQVPFER----------YADDIIC-------------HCR- -----TEDE----AR---------------------------RLWQQVEN---------------------RL-A----- ----G--C----GLTL------H-P------QK----T---KIVYCK------------D-------------------- T-NRKG--SFPTV---AFDFLGYRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >E_c_I9/CP000946_1/2411599__2413485/Escherichia_extraction E.c.I9/CP000946.1/2411599..2413485/Escherichia coli/Bacterial D/ORF Sequence %28a.a%29 440 bp -------------------------------------------------------------------GVDNQTLKDFER- --DLK--GNL-----YKIWNRLS-------S------GSW----MPP--------------------------------- -----------------PVRAV-EI--PK--------K--D----G-----------------SKRLL------------ ------GIPT----VS-DRIAQMTVLVT-FE-PLVERYF-------L--------------------------------- ---NDSYGYRH------------------GK----------------------------SALDA---IAVT--------- ----RKRCW-------------------------QYDW-YL-EF--------DIKGLF-DNIP----------------- -HDL--LLRA-V-DKHC---------ADKW------------------VRLSI---------RRWLTA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------Q-----M-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--LKE-RN-K-----------GTP-----QGGVI------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------VF--DK-WLSLLY----------------------------------------------------------- ---------------------------------------PEIPWCR----------YADDGLI-------------HCG- -----SKQQ----AE---------------------------ELLNKLAK---------------------PF-Q----- ----E--C----GLEL------H-P------EK----T---KIVYCK------------D-------------------- S-ERQA--NHETV---QFNFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D171_extraction 419 bp -------------------------------------------------------------------GVDGQTIEQFEA- --DLK--GNL-----YKIWNRMS-------S------GSY----FPP--------------------------------- -----------------PVRAV-PI--PK--------K--T----G-----------------GQRIL------------ ------GVPT----VS-DRIAQMVVKQL-IE-PELDQIF-------L--------------------------------- ---KDSYGYRP------------------NK----------------------------SALDA---VGIT--------- ----RQRCW-------------------------KYDW-VL-EF--------DIKGLF-DNIS----------------- -HEL--LLKA-V-RKHV---------KCKW------------------ALLYI---------ERWLTA------------ -------------------------------------------------------------------------------- ------------------P-------------------------------------M-----E-QD-------------- -------------------------------------------------------------------------------- ---------------E---------Q--RIE-RD-C-----------GTP-----QGGVI------------------S- PILSNLFL-----------------------------------------------------------HY----------- --------AF--DL-WMDRTH----------------------------------------------------------- ---------------------------------------PDLPWCR----------YADDGLV-------------HCR- -----SEQE----AE---------------------------AVKAALQA---------------------RL-A----- ----E--C----QLEM------H-P------TK----T---KIVYCR------------D-------------------- S-KRRG--QHPNV---TFDFLGYCF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >RmInt1_extraction 419 bp -------------------------------------------------------------------GVDGQTLEIFEK- --DLA--ANL-----YKIWNRMS-------S------GTY----FPP--------------------------------- -----------------PVRAV-SI--PK--------K--A----G-----------------GERVL------------ ------GVPT----VS-DRIAQMVVKQM-IE-PDLDSLF-------L--------------------------------- ---PDSYGYRP------------------GK----------------------------SALDA---VGVT--------- ----RQRCW-------------------------KYDW-VL-EF--------DIKGLF-DNLP----------------- -HDL--LLKA-V-RKDV---------KCNW------------------ALLYI---------ERWLTA------------ -------------------------------------------------------------------------------- ------------------P-------------------------------------M-----E-KN-------------- -------------------------------------------------------------------------------- ---------------G---------E--VIE-RS-R-----------GTP-----QGGVV------------------S- PILANLFL-----------------------------------------------------------HY----------- --------AF--DL-WMTRTH----------------------------------------------------------- ---------------------------------------PDLPWCR----------YADDGLV-------------HCQ- -----SEQQ----AE---------------------------ALRVELSS---------------------RL-A----- ----A--C----GLQM------H-P------TK----T---KIVYCK------------D-------------------- Q-RRRE--AYPNV---TFDFLGYQF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D216_extraction 417 bp -------------------------------------------------------------------GVDGVTIEQFEK- --DLK--GNL-----YKIWNRMS-------S------GAY----FPP--------------------------------- -----------------PVRAV-SI--PK--------K--S----G-----------------GQRIL------------ ------GVPT----VA-DRVAQTVVKEI-IE-PALDAIF-------L--------------------------------- ---ADSYGYRP------------------DK----------------------------SALDA---VGVT--------- ----RERCW-------------------------KFDW-VL-EF--------DIKGLF-DNID----------------- -HTL--LMRA-V-RKHV---------ACPW------------------ALLYI---------ERWLTA------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------M-----Q-ED-------------- -------------------------------------------------------------------------------- ---------------G---------T--LIE-RT-R-----------GTP-----QGGVV------------------S- PVLANLFM-----------------------------------------------------------HY----------- --------TF--DL-WMARTF----------------------------------------------------------- ---------------------------------------PHLRWCR----------YADDGLV-------------HCR- -----SERE----AR---------------------------IVWEALAS---------------------RM-A----- ----E--C----RLEL------H-P------TK----T---KIVYCK------------D-------------------- D-RRKA--NFENV---AFDFLGYCF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|115293251|locus|VBIRhiTro150571_4429|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Rhizobium tropici CIAT 899] -------------------------------------------------------------------GVDGQTIEQFEA- --DLK--GNL-----YKIWNGMS-------S------GSY----FPP--------------------------------- -----------------PVRAV-PI--PK--------K--T----G-----------------GQRIL------------ ------GVPT----VS-DRIAQMVVKRL-IE-PELDQIF-------R--------------------------------- ---PDSYGYRP------------------GK----------------------------SALDA---VGIT--------- ----RQRCW-------------------------KYDW-VL-EF--------DIKGLN-DNLA----------------- -HDL--LLKA-V-HKHV---------KGQG------------------ALLYI---------ERWLTA------------ -------------------------------------------------------------------------------- ------------------P-------------------------------------L-----E-QD-------------- -------------------------------------------------------------------------------- ---------------G---------Q--RIG-RI-------------AVP-----RKGVW------------------S- VRFFQI-C-----------------------------------------------------------SC----------- --------TT--HL-ISNRTY----------------------------------------------------------- ---------------------------------------PDLPWCR----------YADDGLV-------------HCR- -----TEQE----AE---------------------------AVKAALQA---------------------RL-A----- ----E--C----QLEM------H-P------TK----T---KIGYCK------------D-------------------- P-KRRG--TYPNV---SFDFLGYCFR------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >B_j_I1/BA000040/2212569_extraction B.j.I1/BA000040/2212569 2214373/Bradyrhizobium japonicum/Bacterial D/Bacterial D/ORF Sequence %28a.a%29 414 bp -------------------------------------------------------------------GVDGQSLEDFAG- --DLE--NHR-----YRLWNRLV-------S------GSY----FPP--------------------------------- -----------------PVRRV-EI--PK--------A--G----G-----------------GIRPL------------ ------GIPT----VA-DRIAQMVVKRC-LE-PVLDGEF-------D--------------------------------- ---PDSYGYRP------------------GK----------------------------SAHQA---IEQA--------- ----RKRCW-------------------------QHDW-VV-DL--------DNKSFF-DTID----------------- -HEL--LMRA-V-YRHT---------KADW------------------IRLYI---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------E-----M-PD-------------- -------------------------------------------------------------------------------- ---------------G---------S--VRA-RT-T-----------GRS-----QGGVV------------------S- PILANLFL-----------------------------------------------------------HY----------- --------VF--DV-WMKGSY----------------------------------------------------------- ---------------------------------------PHIPFER----------YADDIIC-------------HCR- -----TRQE----AE---------------------------ELKSALER---------------------RF-A----- ----D--C----HLLL------H-P------EK----T---KVVYCA------------D-------------------- S-NRRR--SYPQI---HFDFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >R_e_I1/AF261712/2356_extraction R.e.I1/AF261712/2356 4192/Ralstonia eutropha/Bacterial D/Bacterial D/ORF Sequence %28a.a%29 415 bp -------------------------------------------------------------------GIDDEAIAEFEQ- --NLS--KNL-----YKLWNRMP-------S------GSY----LPP--------------------------------- -----------------PVKQV-EI--PK--------A--S----G-----------------GTRKL------------ ------GVPT----VA-DRVAQTVVKLV-IE-PGLDAIF-------H--------------------------------- ---PDSYGYRP------------------GR----------------------------SAKQA---VAIT--------- ----RERCW-------------------------RYDW-VV-EF--------DIKAAF-DQID----------------- -HGL--LMKA-V-RTHI---------REDW------------------ILLCI---------ERWRVA------------ -------------------------------------------------------------------------------- ------------------PF------------------------------------E-----T-AD-------------- -------------------------------------------------------------------------------- ---------------G---------V--RVP-RT-R-----------GTP-----QGGVS------------------S- PILMNLFT-----------------------------------------------------------HY----------- --------TF--DR-WMQRTS----------------------------------------------------------- ---------------------------------------PNCPFAR----------YADDAVV-------------HCN- -----SRRQ----AE---------------------------YVMRSIAA---------------------RL-A----- ----A--C----GLTM------H-P------EK----S---KIVYCR------------D-------------------- SRNRSE--RHLHA---SFTFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|86705594|locus|VBIPsePut3905_0289|_extraction reverse transcriptase [Pseudomonas putida ND6] -------------------------------------------------------------------GIDDETIADFER- --NLP--KNL-----YKLWNRMS-------S------GSY----FPP--------------------------------- -----------------PVKAV-EI--PK--------A--S----G-----------------GIRRL------------ ------GVPT----VS-DRIAQTVVKLL-IE-PKLDALF-------H--------------------------------- ---PDSYGYRP------------------GR----------------------------SAKQA---IAIT--------- ----RERCW-------------------------RYDW-VV-EF--------DIKAAF-DHID----------------- -HEL--LMKA-V-RTHI---------KEDW------------------ILLYI---------ERWLVA------------ -------------------------------------------------------------------------------- ------------------PF------------------------------------E-----A-AD-------------- -------------------------------------------------------------------------------- ---------------G---------V--RIQ-RE-R-----------GTP-----QGGVI------------------S- PMLMNLFM-----------------------------------------------------------HY----------- --------AF--DA-WMQRNS----------------------------------------------------------- ---------------------------------------PNCPFAR----------YADDAVV-------------HCR- -----SQRQ----AE---------------------------HVMRSIAS---------------------RL-A----- ----V--C----GLTM------H-P------EK----S---KIVYCK------------D-------------------- S-NRRA--GYPHV---SFTFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|186786189|locus|VBIPseSyr242867_5567|_extraction Retrontype RNAdirected DNA polymerase [Pseudomonas syringae pv. actinidiae ICMP 18801] -------------------------------------------------------------------GIDEQSIAQFEQ- --KLQ--RNL-----YKVWNRMS-------S------GSY----FPP--------------------------------- -----------------PVRQV-EI--PK--------Q--S----G-----------------CKRKL------------ ------GIPT----VA-DRVAQTAIKLL-IE-PSLDCLF-------H--------------------------------- ---PDSYGYRP------------------GK----------------------------SAKQA---VEIT--------- ----RRRCW-------------------------NINW-VV-EF--------DIKGAF-DHID----------------- -HEL--LLKA-V-KHHI---------KDEW------------------ILLYI---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------PF------------------------------------E-----T-AD-------------- -------------------------------------------------------------------------------- ---------------G---------V--QVP-RE-S-----------GTP-----QGGVI------------------S- PLLMNLFM-----------------------------------------------------------HY----------- --------AF--DA-WMQRTF----------------------------------------------------------- ---------------------------------------PGCPFAR----------YADDAVV-------------HCR- -----SEKQ----AC---------------------------EVMAAIKA---------------------RL-E----- ----V--C----LLTM------H-P------EK----S---KIVYCK------------D-------------------- S-NRKA--AYPTT---QFTFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Vi_vu_I1/GQ292873_1/3620__5549/Vibrio_extraction Vi.vu.I1/GQ292873.1/3620..5549/Vibrio vulnificus/Bacterial D/ORF Sequence %28a.a%29 453 bp -------------------------------------------------------------------GVDGVTIEDFEK- --DLK--NNL-----YKIWNRMS-------S------GSY----FPT--------------------------------- -----------------PVAAV-SI--PK--------K--S----G-----------------GERVL------------ ------GIPT----VS-DRVAQTVVRDK-LE-IMLEHHF-------L--------------------------------- ---DDSYGYRV------------------GK----------------------------SAHDA---IEVT--------- ----RRRCW-------------------------QYDW-VL-EF--------DIKGLF-DNIR----------------- -HDL--LMKA-V-KKHVQLAEESQSRDYQW------------------ITLYI---------ERWLVA------------ -------------------------------------------------------------------------------- ------------------PL------------------------------------Q-----K-AD-------------- -------------------------------------------------------------------------------- ---------------G---------T--QTE-RE-L-----------GTP-----QGGVV------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------VF--DK-WLEKNY----------------------------------------------------------- ---------------------------------------PDNPWCR----------YADDGLV-------------HAR- -----TKPK----AE---------------------------KLRDELAK---------------------RF-K----- ----E--C----GLEM------H-P------IK----T---KIVYCK------------D-------------------- D-IRRG--SGKHIEHKQFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D131_extraction 420 bp -------------------------------------------------------------------GMDEQSIEMYEM- --DLK--NNL-----YKLWNRMS-------S------GSY----FPK--------------------------------- -----------------PVKAV-AI--PK--------K--N----G-----------------GTRTL------------ ------GIPT----VE-DRVAQMVAKLY-FE-PNVERLF-------Y--------------------------------- ---EDSYGYRP------------------NK----------------------------SAIQA---IEAT--------- ----RKRCW-------------------------RKDW-VL-EF--------DIKGLF-DNIR----------------- -HDY--LIEM-V-KRHT---------NQEW------------------VTLYV---------QRWLIT------------ -------------------------------------------------------------------------------- ------------------PF------------------------------------Q-----M-ED-------------- -------------------------------------------------------------------------------- ---------------G---------T--LIE-RT-A-----------GTP-----QGGVI------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------TF--DD-FMVKEF----------------------------------------------------------- ---------------------------------------SSIPWAR----------YADDGIA-------------HCT- -----SLKQ----AK---------------------------YLQRRLEE---------------------RF-K----- ----L--F----GLEL------N-L------EK----T---KIAYCK------------D-------------------- D-DRQL--SYPNT---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D153_extraction 418 bp -------------------------------------------------------------------GADEQTIKEFEE- --HLN--NNL-----YKLWNRMA-------S------GSY----FPK--------------------------------- -----------------PVRAV-AI--PK--------K--N----G-----------------GIRIL------------ ------GIPT----VE-DRIAQMVAKMY-FE-PLVEPMF-------Y--------------------------------- ---NDSYGYRP------------------NK----------------------------SAIQA---VGQA--------- ----RERCF-------------------------KRDW-AL-EL--------DIKGLF-DNIK----------------- -HGY--LMYM-V-EKHT---------QIKW------------------LILYI---------KRWLTV------------ -------------------------------------------------------------------------------- ------------------PF------------------------------------I-----M-SD-------------- -------------------------------------------------------------------------------- ---------------G---------S--VAE-RR-S-----------GTP-----QGGVI------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------VF--DD-FMTKAY----------------------------------------------------------- ---------------------------------------PNIWWER----------YADDGVL-------------HCQ- -----SYKQ----AA---------------------------FIKQKLEE---------------------RF-Q----- ----Q--F----GLEL------N-K------EK----T---RIVYCK------------D-------------------- N-RRPQ--NYSCT---QFTFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D155_extraction 420 bp -------------------------------------------------------------------GVDEESIEDFAL- --NLK--DNL-----YKLWNRMS-------S------GTY----FPP--------------------------------- -----------------PVKAV-EI--AK--------S--D----G-----------------SKRLL------------ ------GIPT----VA-DRIAQAVVKDQ-LE-QLVEPKF-------H--------------------------------- ---EDSYGYRP------------------KK----------------------------SALDA---VGVA--------- ----RQRCW-------------------------QQDW-CI-DL--------DIKNFF-DSLD----------------- -HQL--MMKA-I-RFHS---------EEKW------------------IHLYV---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------PL------------------------------------Q-----L-ES-------------- -------------------------------------------------------------------------------- ---------------G---------E--LIE-RQ-S-----------GTP-----QGGVA------------------S- PLLANIFM-----------------------------------------------------------HH----------- --------AF--DN-WMRRHY----------------------------------------------------------- ---------------------------------------PEVRFER----------FADDILA-------------HCS- -----SQKQ----AK---------------------------KVLEEIKI---------------------RL-K----- ----E--C----GLEL------H-P------EK----T---KIVYCK------------D-------------------- D-DRGG--SYEYE---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|61052576|locus|VBICloCla155345_0943|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium clariflavum DSM 19732] -------------------------------------------------------------------GVDGVNFEEFEK- --DLK--NNL-----YKLWNRMS-------S------GSY----FPK--------------------------------- -----------------AVRGV-EI--PK--------K--N----G-----------------KKRLL------------ ------GIPT----IE-DRVAQMTVRMS-FE-QLVEPIF-------S--------------------------------- ---SNSYGYRP------------------NR----------------------------SAIEA---VAVT--------- ----RERCW-------------------------KTPW-VL-EF--------DIKGLF-DNID----------------- -HEL--LNRA-V-RKHT---------DSKW------------------IILYI---------ERFLKA------------ -------------------------------------------------------------------------------- ------------------AI------------------------------------K-----M-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--IQQ-RK-C-----------GTP-----QGGVI------------------S- PVLANLFM-----------------------------------------------------------HY----------- --------AF--DM-WMKREF----------------------------------------------------------- ---------------------------------------PGNQWVR----------YADDGII-------------HCK- -----TKEE----AE---------------------------YILGKLKE---------------------RM-L----- ----K--C----KLEI------H-P------EK----T---RIVYCR------------S-------------------- D-KNTE--RHEHE---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|161812148|locus|VBICloPas18034_4761|_extraction Mobile element protein [Clostridium pasteurianum BC1] -------------------------------------------------------------------GTDGVNFTKFEE- --NLK--NNL-----YKIWNRMS-------S------GCY----FPA--------------------------------- -----------------SVRGV-EI--TK--------K--D----G-----------------KTRLL------------ ------GIPT----IS-DRVAQMVVRMN-FE-PQVEPIF-------C--------------------------------- ---DDSYGYRP------------------NR----------------------------SALDA---VGTA--------- ----RERCW-------------------------EMPW-VI-DF--------DIKGLF-DNID----------------- -HEL--MMKA-V-CKHT---------DNKW------------------VIMYI---------ERFLKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------A-----M-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--VQE-RN-A-----------GTP-----QGGVI------------------S- PVLADLFM-----------------------------------------------------------HY----------- --------AF--DW-WMKQKH----------------------------------------------------------- ---------------------------------------PQNPWER----------YADDAVI-------------HCR- -----TKEE----AK---------------------------VLLVQLKE---------------------RM-T----- ----E--C----KLEV------H-P------NK----T---KIVYCR------------S-------------------- D-VYPE--HHEHE---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Al_me_I4/CP000724/658338_extraction Al.me.I4/CP000724/658338 660212/Alkaliphilus metalliredigens/Bacterial D/Bacterial D/ORF Sequence %28a.a%29 412 bp -------------------------------------------------------------------GIDEVTLQEYEN- --NLE--DNL-----YKLWNSMS-------S------GSY----FPQ--------------------------------- -----------------AVRGV-EI--PK--------K--N----G-----------------GVRVL------------ ------GVPS----ID-DRIAQNVMVSE-LN-PKVEPIF-------Y--------------------------------- ---EDSYGYRE------------------NK----------------------------SAIDA---IEVT--------- ----RKRCW-------------------------EYDW-LI-EF--------DIVGLF-DNIN----------------- -HDL--LMKA-V-KQHT---------NEKW------------------VILYI---------ERTLKV------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------V-----M-SD-------------- -------------------------------------------------------------------------------- ---------------G---------I--HVE-RT-K-----------GTP-----QGGVI------------------S- AVLANLFM-----------------------------------------------------------HY----------- --------AF--DH-WMTRKH----------------------------------------------------------- ---------------------------------------SNNPWVR----------YADDGLI-------------HSH- -----SLKE----AE---------------------------VLLLKLGE---------------------RF-K----- ----D--C----HLEI------H-P------NK----T---KIIYCK------------D-------------------- D-NRKQ--NHIHT---NFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D143_extraction 415 bp -------------------------------------------------------------------GIDKISIEKYEK- --NLK--NNL-----YKLWNRMA-------S------GTY----FPK--------------------------------- -----------------AVKAV-EI--PK--------K--N----G-----------------GIRVL------------ ------GVPT----VE-DRIAQMIVKLS-ME-KIIDPIF-------L--------------------------------- ---NDSYGYRP------------------NR----------------------------SAHDA---IKVT--------- ----RSRCW-------------------------KYDW-VL-EF--------DIKGLF-DNIN----------------- -HKL--LLKA-V-YKYA---------KYKW------------------EILYI---------KRWLAN------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------S-----N-NN-------------- -------------------------------------------------------------------------------- ---------------K---------I--TKN-TE-N-----------GTP-----QGGVI------------------S- PLLANLFL-----------------------------------------------------------HF----------- --------AF--DK-WMEKRF----------------------------------------------------------- ---------------------------------------PNNKWCR----------YADDGII-------------HCN- -----SRAE----AI---------------------------FILNCLKE---------------------RM-K----- ----E--C----KLEI------H-P------GK----T---KIIYCK------------D-------------------- S-NRKE--NNKLH---EFTFLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ma_sp_I3/CP000471/785727_extraction Ma.sp.I3/CP000471/785727 787568/Magnetococcus sp./Bacterial D/Bacterial D/ORF Sequence %28a.a%29 417 bp -------------------------------------------------------------------GLDGLTMEAFEE- --DLK--NQL-----YRLWNRMS-------S------GSY----FPP--------------------------------- -----------------PVMRV-EI--PK--------S--D----G-----------------GVRGL------------ ------GIPT----IG-DRIAQAVVKRY-LE-PLVEPKF-------H--------------------------------- ---EDSYGYRP------------------NR----------------------------SALDA---VRQA--------- ----RQRCW-------------------------RDDW-VL-DL--------DISKFF-DKLD----------------- -HAL--VMRA-V-KRFT---------DCKW------------------VLLYI---------ERWLKA------------ -------------------------------------------------------------------------------- ------------------DV------------------------------------Q-----L-QD-------------- -------------------------------------------------------------------------------- ---------------E---------T--ILH-RE-M-----------GTP-----QGGVI------------------S- PLLANIFL-----------------------------------------------------------HL----------- --------GF--DQ-WMKENY----------------------------------------------------------- ---------------------------------------PHIHFER----------YADDIVV-------------HCR- -----SLKQ----LQ---------------------------WIKKRIEQ---------------------RL-K----- ----L--C----KLSL------N-D------KK----T---RVVYCK------------D-------------------- S-RRSG--EWTCQ---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >W_e_I4/AM999887_1/177114__178961/Wolbachia_extraction W.e.I4/AM999887.1/177114..178961/Wolbachia endosymbiont/Bacterial D/ORF Sequence %28a.a%29 414 bp -------------------------------------------------------------------GVDEVSITKFEE- --NLK--DNL-----YKLWNRMS-------S------GSY----FPE--------------------------------- -----------------PVKAV-AI--PK--------D--T----G-----------------GQRIL------------ ------CVPS----VF-DRIAQTAATMY-LE-PLVEPKF-------H--------------------------------- ---EDSYGYRP------------------NK----------------------------SALDA---VYTA--------- ----RKRCW-------------------------KNDW-TV-DL--------DISGFF-DNLD----------------- -HDL--ALQA-I-KKHT---------DCKW------------------VILYV---------ERWMKA------------ -------------------------------------------------------------------------------- ------------------PI------------------------------------Q-----Q-AD-------------- -------------------------------------------------------------------------------- ---------------G---------S--RVT-RD-K-----------GVP-----QGGSI------------------S- PIISSIFM-----------------------------------------------------------HH----------- --------AF--DM-WMKQNY----------------------------------------------------------- ---------------------------------------PTVPFER----------YVDDAIV-------------HCR- -----TKRQ----AG---------------------------FMKVMIEE---------------------RL-A----- ----K--C----KLKL------H-P------EK----T---QIVYSK------------D-------------------- D-DRKE--QFPKQ---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42837854|locus|VBICloCf158569_3569|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium cf. saccharolyticum K10] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ----------------------MVARAY-VE-RAVEPMF-------C--------------------------------- ---EDSYGYRP------------------HK----------------------------SALDA---VEKT--------- ----RKRCW-------------------------KYDY-VI-EL--------DVKGLF-DNID----------------- -HEL--LMRV-V-RRHV---------KEPW------------------ICLYI---------ERWLKS------------ -------------------------------------------------------------------------------- ------------------PF------------------------------------V-----L-PD-------------- -------------------------------------------------------------------------------- ---------------G---------S--RIE-RE-S-----------GTP-----QGGVI------------------S- PVLANMFL-----------------------------------------------------------HY----------- --------VF--DM-WMKRNF----------------------------------------------------------- ---------------------------------------PQAPFER----------YADDGVV-------------HCR- -----TKEE----AL---------------------------YIKKKLVK---------------------RF-E----- ----E--C----KLEL------H-P------VK----T---RIVYCK------------D-------------------- K-DRTK--EEELA---EFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B_t_I4/AE015928/3254752_extraction B.t.I4/AE015928/3254752 3256655/Bacteroides thetaiotaomicron/Bacterial D/Bacterial D/ORF Sequence %28a.a%29 430 bp -------------------------------------------------------------------GIDKVTLEDYEK- --NLR--GNL-----YKLWNRMS-------S------GSY----FPP--------------------------------- -----------------SVKLV-EI--PK--------S--T----G-----------------GKRPL------------ ------GIPT----VS-DRVAQMAVVML-IT-PSIEPCF-------H--------------------------------- ---EDSYAYRP------------------HR----------------------------SAHDA---VGKA--------- ----RERCW-------------------------KYAW-VL-DM--------DISKFF-DTID----------------- -HEL--LLKA-L-KRHT---------QEKW------------------VLMYI---------ERWLKV------------ -------------------------------------------------------------------------------- ------------------PY------------------------------------E-----K-SD-------------- -------------------------------------------------------------------------------- ---------------G---------S--QVD-RA-L-----------GVP-----QGSVI------------------G- PVLANLFL-----------------------------------------------------------HY----------- --------TF--DK-WMEKNF----------------------------------------------------------- ---------------------------------------PRVPFER----------YADDTIC-------------HCH- -----SLKQ----AE---------------------------YMQAMIQQ---------------------RF-E----- ----C--C----RLRL------N-E------EK----T---KIVYCK------------S-------------------- S-RQKE--CYPNV---TFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D218_extraction 417 bp -------------------------------------------------------------------GVDGVGLAGFES- --DLK--GNL-----YRIWNRMS-------S------GSY----FPP--------------------------------- -----------------PVKAV-EI--SK--------EH-G----A-----------------GTRML------------ ------GVPT----IG-DRIAQTVVAAR-LE-GVVEPKF-------H--------------------------------- ---PDSYGYRP------------------RK----------------------------GSLDA---VRKC--------- ----RERCW-------------------------KYDW-VI-DL--------DVRKFF-DTVP----------------- -WDR--IIAA-V-EANT---------ALPW------------------VLLYV---------KRWLAA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------R-----M-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--LAE-RD-R-----------GTP-----QGSAV------------------S- PVLANLFM-----------------------------------------------------------HY----------- --------AF--DL-WMVREF----------------------------------------------------------- ---------------------------------------PACPFER----------YADDAVV-------------HCK- -----SLAQ----AR---------------------------FVLDRLRK---------------------RM-E----- ----Q--V----GVSL------H-P------EK----T---RIVYCK------------D-------------------- G-KRRG--SHEHT---EFTFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|58753801|locus|VBIMycCan203588_3899|_extraction reverse transcriptase [Mycobacterium canettii CIPT 140010059] -------------------------------------------------------------------GVDGVSIEAFEA- --DLG--NNL-----YKVWNRMS-------S------GSY----FPP--------------------------------- -----------------PVRAV-EI--PK--------PH-G----G-----------------GTRML------------ ------GIPT----IA-DRVAQTVVAEE-LT-SRVEVIF-------H--------------------------------- ---DDSHGYRP------------------GR----------------------------SALDA---VKAC--------- ----RQRCW-------------------------KTDW-VI-DL--------DIQKFF-DDVS----------------- -WDL--MLKA-V-AANT---------DLPW------------------VMLYV---------RRWLQA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------A-----L-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--LQR-RD-R-----------GTP-----QGSPV------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------AF--DT-WMAREF----------------------------------------------------------- ---------------------------------------PSVRFER----------YVDDAVV-------------HCV- -----TERQ----AR---------------------------QVLAALQG---------------------RM-V----- ----E--V----GLRL------H-P------DK----T---RIVYCK------------D-------------------- G-KRRG--GYEHT---SFTFLGFTFR------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|162142109|locus|VBIStrFul287543_7023|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Streptomyces fulvissimus DSM 40593] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------MVAAR-LE-RNVEPVF-------H--------------------------------- ---SDSFGYRP------------------GR----------------------------SALDA---VEKC--------- ----RERTW-------------------------KRDW-VV-DL--------DIQKFF-DSVP----------------- -WSL--IVKA-V-EAHA---------DAVW------------------VKLYV---------ERWLRA------------ -------------------------------------------------------------------------------- ------------------PL------------------------------------Q-----L-PD-------------- -------------------------------------------------------------------------------- ---------------G---------T--LQR-RD-R-----------GTP-----QGSAV------------------S- PVLANLFL-----------------------------------------------------------HY----------- --------AF--DM-WIAREF----------------------------------------------------------- ---------------------------------------PDIPFER----------YVDDAVV-------------HCV- -----SERQ----AR---------------------------RLVEAIGN---------------------RM-E----- ----E--V----GLRL------H-P------AK----T---RIVYCK------------D-------------------- A-NRRG--AYAQT---SFTFLGFTFR------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >D154_extraction 421 bp -------------------------------------------------------------------GVDGQSIDAFEK- --DLK--NNL-----YRIWNRMS-------S------GSY----FPP--------------------------------- -----------------PVRAV-EI--PK--------AH-G----G-----------------GVRVL------------ ------GVPT----VA-DRVAQTVVAMT-LE-PRMEQVF-------H--------------------------------- ---DGSYGYRV------------------GR----------------------------SALDA---VGAC--------- ----RQRCW-------------------------QRDW-VV-DL--------DIQDFF-GSCP----------------- -HDL--IVRA-V-EVNT---------DQPW------------------VVLYV---------RRWLTA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------C-----Y-PD-------------- -------------------------------------------------------------------------------- ---------------G---------S--LVT-PD-R-----------GTP-----QGSAV------------------S- PVLANVFL-----------------------------------------------------------HY----------- --------AL--DL-WLAREF----------------------------------------------------------- ---------------------------------------PGLPFER----------YVDDAVV-------------HCA- -----TRRQ----AE---------------------------QVRTAIGR---------------------RL-E----- ----E--V----GLRC------H-P------AK----T---KVVYCK------------D-------------------- S-GRRG--SHEHT---SFTFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Fr_sp_I5/CP000820_1/4042207__4044207/Frankia_extraction Fr.sp.I5/CP000820.1/4042207..4044207/Frankia sp./Bacterial D/ORF Sequence %28a.a%29 417 bp -------------------------------------------------------------------GPDGVTVEQFEA- --NVK--DRL-----YVLWNRMS-------S------GSY----FPG--------------------------------- -----------------PVGAV-EI--PK--------KGVK----G-----------------GARTL------------ ------GIPN----VV-DRVAQTVLKLA-LE-PKVEPVF-------H--------------------------------- ---RDSYGYRP------------------GR----------------------------SQRQA---LEVC--------- ----RKRCW-------------------------SHDW-VV-DL--------DVRKFF-DTVP----------------- -WEK--LLKA-V-AYHT---------DQKW------------------VLMYV---------ERCLKA------------ -------------------------------------------------------------------------------- ------------------PT------------------------------------K-----H-AD-------------- -------------------------------------------------------------------------------- ---------------G---------T--LQE-RT-M-----------GTV-----QGGPF------------------S- PLAANIYL-----------------------------------------------------------HW----------- --------GL--DA-WMAREF----------------------------------------------------------- ---------------------------------------PTVPFER----------WADDVVF-------------HCV- -----SLEQ----AR---------------------------EVRDAVVA---------------------RL-V----- ----E--V----GLEA------H-P------DK----T---RIVYCK------------D-------------------- S-NRGG--DYENT---SFTFLSYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Zu_pr_I2/CP001650_1/3589332__3591217/Zunongwangia_extraction Zu.pr.I2/CP001650.1/3589332..3591217/Zunongwangia profunda/Bacterial D/ORF Sequence %28a.a%29 402 bp -------------------------------------------------------------------GIDTVSIEQFDE- --SLS--KNL-----YKLWNRMA-------S------GSY----FPP--------------------------------- -----------------AVKEV-EI--PK--------K--D----G-----------------KVRKL------------ ------GIPT----IS-DRIGQMVVKMY-LE-PRLENVF-------N--------------------------------- ---PNSYGYRP------------------NK----------------------------SAHQA---LEQV--------- ----RKNCW-------------------------KMDW-VI-DL--------DIKGFF-DNID----------------- -HHK--MMLA-I-EKHV---------PERW------------------VRLYI---------ARWLAS------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------M-----T-KS-------------- -------------------------------------------------------------------------------- ---------------G---------N--LVS-NQGR-----------GTP-----QGGVI------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------GL--DK-WLEQND----------------------------------------------------------- ---------------------------------------NTVKFTR----------YADDVIV-------------NCK- -----SQKH----AE---------------------------QTLEAIKS---------------------RM-H----- ----Q--I----GLEL------H-P------EK----T---KIVYCR------------D-------------------- Y-RRQE--KYSNV---KFDFLGYSY------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|46905862|locus|VBICelAlg158510_0236|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Cellulophaga algicola DSM 14237] -------------------------------------------------------------------GIDHQTLSEFDS- --VRS--KEL-----YKVWNRLA-------S------GSY----FAP--------------------------------- -----------------AVKRV-NI--PK--------A--G----G-----------------KTRPL------------ ------GIPT----VS-DRIAQQVVKQY-LE-PRLESIF-------S--------------------------------- ---ENSYGYRP------------------NR----------------------------SAHSA---IEVV--------- ----RRNVL-------------------------RYSW-VI-DL--------DIQEFF-ENVD----------------- -HGL--LLKA-L-ERHV---------SEKW------------------VLLYI---------KRWLEA------------ -------------------------------------------------------------------------------- ------------------PV------------------------------------I-----L-ED-------------- -------------------------------------------------------------------------------- ---------------G---------T--VKI-STGK-----------GTP-----QGGVI------------------S- PLLSNLYM-----------------------------------------------------------HY----------- --------CV--DK-WLEQYH----------------------------------------------------------- ---------------------------------------PQVKMVR----------YADDLIV-------------HCR- -----SYEA----AV---------------------------HTLEVLKE---------------------RL-T----- ----E--C----GLTA------H-P------EK----T---KIVYCK------------K-------------------- D-GRDL--KGYPV---QFDFLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Zu_pr_I1/CP001650_1/4279634__4281497/Zunongwangia_extraction Zu.pr.I1/CP001650.1/4279634..4281497/Zunongwangia profunda/Bacterial D/ORF Sequence %28a.a%29 412 bp -------------------------------------------------------------------GVDGQSLQNFRE- --NLS--GNL-----YKIWNRMT-------S------GSY----FPP--------------------------------- -----------------VVKEV-RI--TK--------K--T----G-----------------GFRSL------------ ------GIPT----VS-DRIAQQVIKSY-LE-PKVESSF-------H--------------------------------- ---QNSYGYRP------------------RK----------------------------SAHQA---LEKT--------- ----VSRCG-------------------------YYSW-VV-DL--------DIRGFF-DNID----------------- -HTL--LMKA-V-ERYT---------KEKW------------------VLMYI---------GRWLKT------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------S-----R--E-------------- -------------------------------------------------------------------------------- ---------------G---------E--ITD-RI-K-----------GTP-----QGGVI------------------S- PLLANIFL-----------------------------------------------------------HF----------- --------AF--DK-WMQIHH----------------------------------------------------------- ---------------------------------------SNMPFER----------YCDDAII-------------HCT- -----SEKQ----AY---------------------------FIREAVSK---------------------RM-K----- ----A--C----KLEL------N-S------EK----T---HIVYCK------------N-------------------- HVHSES--HK-NT---SFDFLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D182_extraction 410 bp -------------------------------------------------------------------GIDTQSLEQFEE- --RLA--DNL-----YKIWNRMT-------S------GSY----HPK--------------------------------- -----------------AVREV-QI--PK--------K--S----G-----------------GYRGL------------ ------GIPT----VS-DRVAQQVVKSY-LE-PKVEPSF-------H--------------------------------- ---QDSYGYRP------------------NK----------------------------SAHDA---LAKT--------- ----VRNCG-------------------------YYSW-VV-DL--------DIRGFF-DNID----------------- -HEL--LMKA-V-RVYT---------DEKW------------------IIMYI---------ERWLEV------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------V-----R--E-------------- -------------------------------------------------------------------------------- ---------------G---------K--VHK-RE-K-----------GTP-----QGGVI------------------S- PLLANIFL-----------------------------------------------------------HF----------- --------VF--DK-WMEKHH----------------------------------------------------------- ---------------------------------------GNMPFER----------YCDDAII-------------HCT- -----TWNQ----AV---------------------------FIKNAVTK---------------------RM-K----- ----E--C----KLEL------N-S------EK----T---KIVYCK------------N-------------------- SIHRES--NPVPV---SFTFLGHTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >D149_extraction 416 bp -------------------------------------------------------------------GVDGMTIEAFEH- --NLA--RNL-----YKIWNRLS-------S------GCY----MPP--------------------------------- -----------------PVKRV-EI--PK--------S--D----G-----------------KTRPL------------ ------GIPT----VS-DRVAQMAVKMI-LE-PQWDPLF-------S--------------------------------- ---DSSFGYRP------------------GK----------------------------SAHDA---VAQA--------- ----KANCW-------------------------KYEW-VI-DL--------DIRGFF-DNLD----------------- -HAL--LLKA-V-DHLH---------PAPW------------------VRLCI---------VRWLKA------------ -------------------------------------------------------------------------------- ------------------EI------------------------------------I-----F-PD-------------- -------------------------------------------------------------------------------- ---------------G---------H--RHS-PE-K-----------GTP-----QGGVI------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------TQ--DK-WLEKHY----------------------------------------------------------- ---------------------------------------PNNSWER----------YADDSII-------------HCR- -----SRRE----AG---------------------------LLLSQLRE---------------------RM-K----- ----A--C----GLEL------H-P------EK----T---RIVNCH------------P-------------------- L-TRRK--NDGHY---SFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ce_ja_I1/CP000934_1/3788874__3790736/Cellvibrio_extraction Ce.ja.I1/CP000934.1/3788874..3790736/Cellvibrio japonicus/Bacterial D/ORF Sequence %28a.a%29 402 bp -------------------------------------------------------------------GADNVCIDMFEH- --NLE--NEL-----YKLWNRMS-------S------GSY----MAP--------------------------------- -----------------PVKRV-EM--AK--------A--D----G-----------------KLRPL------------ ------GIPT----VA-DRVAQMVVKMT-LE-PEWDSKF-------H--------------------------------- ---ASSFGYRP------------------RR----------------------------SAHHA---VQAA--------- ----KINCW-------------------------KYSW-VI-DL--------DIKGFF-DNLN----------------- -HDQ--LQKF-V-AQAT---------DDPW------------------CKLYI---------KRWITA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------Q-----M-PG-------------- -------------------------------------------------------------------------------- ---------------G---------E--LHK-TA-K-----------GTP-----QGGVI------------------S- PLLANLYL-----------------------------------------------------------HK----------- --------VF--DS-WMQKYF----------------------------------------------------------- ---------------------------------------PQNPFER----------YADDIVC-------------HCR- -----TEHE----AE---------------------------QLLSAISR---------------------RM-Q----- ----R--F----DLTL------H-P------EK----T---KIVYC---------------------------------- --GRRKIERTKAQ---SFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >UMB_I1/AY075117/120__2136/uncultured_marine_bacterium_extraction UMB.I1/AY075117/120..2136/uncultured_marine_bacterium /Bacterial D/ORF Sequence %28a.a%29 417 bp -------------------------------------------------------------------GVDHVSMEAIAS- --NPR--KYL-----YPLWNRLS-------S------GSY----FPP--------------------------------- -----------------PVKLV-PI--PK--------G--D----G-----------------KERML------------ ------GIPT----II-DRVAQEVIKAE-LE-VIVEPRF-------H--------------------------------- ---PSSFGYRP------------------HK----------------------------SAHEA---LEQC--------- ----AKNSW-------------------------ERWY-VV-DL--------DIKGFF-DNID----------------- -HEK--MMGI-L-RKHT---------NKKH------------------ILLYC---------DRWLKT------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------Q-----D-RV-------------- -------------------------------------------------------------------------------- ---------------G---------G--VQA-RM-K-----------GTP-----QGGVI------------------S- PLLANLYL-----------------------------------------------------------HE----------- --------AF--DQ-WISTTQ----------------------------------------------------------- ---------------------------------------PRIVFER----------YADDIVI-------------HTR- -----SMEQ----SH---------------------------FILDKLKA---------------------RL-K----- ----S--Y----SLEL------H-P------DK----T---KIVYCY------------R-------------------- T-ARFH--KEGKEIPVSFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Sh_dy_I1/CP000035/29397__31222/Shigella Sh.dy.I1/CP000035/29397..31222/Shigella dysenteriae/Bacterial D/ORF Sequence %28a.a%29 375 bp ---------------------------------------------------------------------------MFDQ- --QRD--GNL-----YKIWNRLC-------S------GTW----FPP--------------------------------- -----------------PVLEK-RI--PK--------S--N----G-----------------KERIL------------ ------GIPT----VS-DRIAQGAIKLF-ME-EKLDPIF-------H--------------------------------- ---ADSYGYRP------------------GK----------------------------SAHDA---LKQC--------- ----AIRCW-------------------------RYSW-IL-EV--------DISAFF-DHVR----------------- -HDL--VLKA-L-EHHG---------MPKW------------------VILYC---------RRWMEA------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------Q-----SCEN-------------- -------------------------------------------------------------------------------- ---------------G---------E--VIT-RT-R-----------GTP-----QGGVI------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------AF--DL-WMEREY----------------------------------------------------------- ---------------------------------------RGVPFER----------YADDIVV-------------HCS- -----RMSD----AT---------------------------RLKNRLSE---------------------RF-S----- ----E--V----GLVL------N-A------GK----T---NTAYID------------T-------------------- F-KRRNVAT-------SFTFLGYDF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|47225904|locus|VBIMarMed159599_0649|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Marinomonas mediterranea MMB1] -------------------------------------------------------------------GCDGQTMKQFDN- --NRD--RNL-----YKIWNRLC-------S------GSY----LPP--------------------------------- -----------------PVREK-RI--PK--------A--D----G-----------------SDRIL------------ ------GIPT----VS-DRIAQGAVKIY-LE-TRLDKLF-------H--------------------------------- ---NSSFGYRP------------------NR----------------------------SAHMA---LTQC--------- ----ERNCR-------------------------FNSW-VL-EV--------DIKAFF-DHVD----------------- -HDL--VVKA-L-EHHD---------MPRW------------------VVLYC---------RRWMQA------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------S-----DSSK-------------- -------------------------------------------------------------------------------- ---------------T---------D--ILTQRT-R-----------GTP-----QGGVI------------------S- PILANLFL-----------------------------------------------------------HY----------- --------AF--DR-WMAKQR----------------------------------------------------------- ---------------------------------------RYVPFER----------YADDIVC-------------HCS- -----RMSE----AV---------------------------KLKEAIQR---------------------RM-E----- ----E--V----GLSI------N-E------AK----S---NVVYID------------T-------------------- F-PRHNVKK-------VFTFLGYDF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|18678173|locus|VBIShiBoy33460_0060|_extraction Mobile element protein [Shigella boydii Sb227] ---------------------------------------------------------------------MVTLCQVFGV- --HRS--S-------YRYW------------------------------------------------------------- --------------------KN-RP--EK--------P--D----G---------RRAVL---RSQVL------------ ------ELHG----IS-HGSAGARSIAT-MA-TRR--------------------------------------------- -------GYQM------------------GRWL--------------------AGR--LMKELG---LVSC--------- ----Q-------------------------------------------------------QPT----------------- -HRY--KRGG-H-EHVA---------MPKW------------------VILYC---------RRWMEA------------ -------------------------------------------------------------------------------- ------------------PM------------------------------------Q-----SCEN-------------- -------------------------------------------------------------------------------- ---------------G---------E--LIT-RT-R-----------GTP-----QGGVI------------------S- PLLANLFL-----------------------------------------------------------HY----------- --------AF--DL-WMEREY----------------------------------------------------------- ---------------------------------------RGVPFER----------YADDIVV-------------HCS- -----RMSD----AT---------------------------RLKNRLSE---------------------RF-S----- ----E--V----GLVL------N-A------GK----T---NIAYID------------T-------------------- F-KRRNVAT-------SFTFLGYDF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|42670984|locus|VBIRhoVan113057_1971|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Rhodomicrobium vannielii ATCC 17100] -------------------------------------------------------------------GGDGVTIEIFAQ- --NAE--VEL-----EKLRAETL-------A------GIY----RPR--------------------------------- -----------------KVRHA-IV--PK------------PK--G-----------------GERKL------------ ------TIPS----VV-DRILQTATMLS-LG-QTVDHHF-------S--------------------------------- ---SASWAYRE------------------GR----------------------------GVDDA---LADL--------- ----RRLRNS------------------------GLFW-TF-DA--------DIMQYF-DRIL----------------- -HKR--LIDD-L-FIWV---------DDLR------------------IVRLI---------QLWLRS------------ -------------------------------------------------------------------------------- ------------------FS------------------------------------Y-----W----------------- -------------------------------------------------------------------------------- ---------------G-------------------R-----------GIA-----QGAPI------------------S- PLLANLFL-----------------------------------------------------------HP----------- --------MD--RL-LEL-------------------------------------------------------------- ---------------------------------------EGLASVR----------YADDFVV-------------LCR- -----SKAL----AQ---------------------------KAQLIVAS---------------------HL-A----- ----A--R----GLKL------N-M------SK----T---RILAPS------------E-------------------- ----------------AFIFLGQTVE------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >gi|148657122|ref|YP_001277327.1_extraction -------------------------------------------------------------------GLDAVTLRDFEV- --DWT--RQM-----AQLADELQ-------Q------GTY----RPL--------------------------------- -----------------PAKRV-AI--PK------------AS--G-----------------GERAI------------ ------AILA----VR-DRVAQRAVQQV-LD-PLFDPCF-------L--------------------------------- ---DCSYGCRP------------------YV----------------------------GVPDA---IARV--------- ----QRYADQ------------------------GLGW-VV-DA--------DIATCF-DSLD----------------- -QRV--LLSL-V-RQRI---------DELP------------------VLKLI---------AQWLEA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----QGEAALPGDT-PPTPLQR GEAAVRRALSWGAERLHPP--PPVGP-YAAAMWETPGGSIGEDGWAPRQPGLESH--LWTAVMLARPVIDGARQALPYLQ RIGGRRLAVAGAVAVGALALSEAAAR--LRHA-SRR-----------GVP-----QGGAL------------------S- PLLANIYL-----------------------------------------------------------HP----------- --------FD--VA-MMG-------------------------------------------------------------- ---------------------------------------QGLRLVR----------FMDDFVV-------------MCA- -----TQEE----AE---------------------------CALQFAQR---------------------QL-H----- ----I--L----RLTL------N-A------EK----T---HITAYA------------D-------------------- ----------------GIEFLGAAL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|118065097|ref|ZP_01533407.1_extraction -------------------------------------------------------------------GPDAVTLRDFEA- --DWT--RQM-----AQLADELQ-------Q------GTY----RPL--------------------------------- -----------------PAKRI-AI--PK------------AS--G-----------------GERAI------------ ------AILS----VR-DRVAQRAVQQV-LD-PLFDPCF-------L--------------------------------- ---DCSYGCRP------------------HV----------------------------GVPEA---VARV--------- ----QRYADQ------------------------GLGW-VV-DA--------DIAGYF-DAID----------------- -QRV--LLGL-V-RQRI---------DELP------------------VLKLI---------AQWLEA------------ -------------------------------------------------------------------------------- ------------------GM------------------------------------L-----PGDAALPDEA-PATPLQH GEAVLRQVMSWGAERL-PP--PPTGP-YAAAAWEMPGGSV-DDGWTVRRSGLESH--LWTAMMLARPAIDGARRALPYLQ RIGARRLAVVGAVAVGALALSEAVAR--MHTA-QSR-----------GTP-----QGGAL------------------S- PLLANIYL-----------------------------------------------------------HP----------- --------FD--VA-MTS-------------------------------------------------------------- ---------------------------------------QGFRLAR----------FVDDFVI-------------MCA- -----TQDE----AE---------------------------RALNFAQQ---------------------QL-R----- ----V--L----RLEL------N-A------EK----T---RIASYA------------N-------------------- ----------------GIEFLGASL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|76258629|ref|ZP_00766283.1_extraction -------------------------------------------------------------------GIDQITLHDFAA- --DWP--NQM-----VRLAEELR-------D------GSY----RPL--------------------------------- -----------------PPRRV-AI--AK------------AS--G-----------------GERAI------------ ------AILT----IR-DRIAQRAVQQV-LT-PLFEPLF-------L--------------------------------- ---DCSYGSRL------------------AV----------------------------GVPEA---IERV--------- ----VRYTEQ------------------------GLIW-VI-DG--------DIRAYF-DSID----------------- -HGI--LLGL-L-RQRI---------DEPA------------------ILHLI---------AQWLAV------------ -------------------------------------------------------------------------------- ------------------GS------------------------------------V-----HTET--PDETLPDSPLV- --ALLRRSGELIHEALNAP--SDPLP-TA---YDYPDLSR----PASPHSGIPTG--LFAALSLAQPAFEIARQLTPLLK RIGAQRLAVGGALAVGTVLLSELVHR--AQASHDRR-----------GTL-----QGGPL------------------S- PLLANIYL-----------------------------------------------------------HP----------- --------FD--LA-MTA-------------------------------------------------------------- ---------------------------------------HGARMVR----------FVDDFVV-------------MCP- -----DRTT----AE---------------------------HTLVLVER---------------------QL-A----- ----T--L----RLTL------N-P------QK----T---RIVAYA------------G-------------------- ----------------GIEFLGQAL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|113939199|ref|ZP_01425057.1_extraction -------------------------------------------------------------------GPDAVTILDFEA- --AWV--DHM-----QQLAMELQ-------S------QIY----RPL--------------------------------- -----------------PPRRL-FL--DK------------RD--G-----------------GKRSI------------ ------AILA----VR-DRIAQRAVLQI-LE-PEIEPTF-------L--------------------------------- ---DCSYGFRP------------------YV----------------------------GVPHA---LTRI--------- ----ERYRQQ------------------------GLQW-VA-HA--------DISDCF-GTID----------------- -HQI--LLSQ-L-HQRI---------SDRA------------------VVELI---------GQWLSV------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----EDAA----TTEASNWWDD GEDLLERLAKHGEDLLWPNQYPQAGPSYAPQMLDF-EANRTDSLRKRALQGLASNAALW-GITHSKRVISGLRSLAPLFK QVPGGSL-TWGAAGIATLALIPLSQR--LLRQ-HER-----------GTL-----QGGAI------------------S- PMLANIYL-----------------------------------------------------------DS----------- --------FD--RA-MTE-------------------------------------------------------------- ---------------------------------------RGHILVR----------FADDFVL-------------LGA- -----HQAA----VE---------------------------QALADATN---------------------VL-K----- ----R--L----RLAT------K-E------SK----T---GVQHFN------------D-------------------- ----------------GLTFLGHRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|115349142|locus|VBIThiMob160332_0325|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Thioflavicoccus mobilis 8321] -------------------------------------------------------------------WSPTVSRDDLQH- --HLM--RHL-----LACREEVL-------D------GAY----RPL--------------------------------- -----------------PLRQF-PV--RK------------PD--G-----------------RQRVL------------ ------TAQF----LR-DKLVQRALLTV-LE-PRAEALF-------H--------------------------------- ---DDSFAYRP------------------ER----------------------------NVAKA---LAKV--------- ----RERVRI------------------------GLDW-LV-DA--------DIEKFF-DSIP----------------- -HRP--LLRV-L-DGFV---------ADAK------------------AMKLI---------ERWLGQ------------ -------------------------------------------------------------------------------- ------------------GA------------------------------------H----------------------- -------------------------------------------------------------------------------- -------------------------V--RSLLATPR-----------GIA-----QGAIL------------------S- PLFCNLYL-----------------------------------------------------------HG----------- --------FD--RS-LDS-------------------------------------------------------------- ---------------------------------------AHIPFVR----------FADDFLL-------------FAP- -----TRSD----AG---------------------------RAMEHAAR---------------------RL-E----- ----R--L----DLRL------H-P------DK----T---RVVRSG------------R-------------------- ----------------EVIFLGETL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.me.I2/AF142677/34045..36400/Bacillus_extraction megaterium/CL1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGYTASKPNE- ---------R-----IKLYQQLV-------K------CNVFR-HRPK--------------------------------- -----------------PAKRT-FI--PK--------KNG-----------------------KLRPL------------ ------GIPT----MR-DRVYQNVVKNA-LE-PQWEVKF-------E--------------------------------- ---PTSYGFRP------------------KR----------------------------STHDA---ISNL--------- ----FNKLNTN-----S-----------------KKKW-VF-EG--------DFLGCF-DHLN----------------- -HNW--IMEQ-TS-------------MFPG-------------------NTLI---------KRWLNM------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------I-----E-Q--------------- -------------------------------------------------------------------------------- ---------------D---------M--LHT-TT-E-----------GTP-----QGGIV------------------S- PLLANIAL-----------------------------------------------------------CG----------- --------ME--EE-IG--------------------------------------------------------------- ------------------IVYKKTYKSNGGYKIDPK-KIG---RVL----------YADDFVI-------------VTE- -----TKEQ----AE---------------------------SMYQNLTP---------------------YL-R----- ----K--R----GITL------S-K------EK----T---RVTHIE------------D-------------------- ----------------GFDFLGFSL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|18824950|locus|VBIBacCer120511_0064|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus cereus AH187] -------------------------------------------------------------------GVDEHTALSRRE- ---------R-----NLLYEQLK-------K------LNTLQ-HRPK--------------------------------- -----------------PAKRI-YI--VK--------KNG-----------------------KLRPL------------ ------GIPT----IK-DRVYQNIVRNA-LE-PQWEARF-------E--------------------------------- ---AISYGFRP------------------KR----------------------------STHDA---IRSI--------- ----FNRINGG-----T-----------------KKKW-IF-EG--------DFQGCF-DHLN----------------- -HEW--ILKQ-TS-------------YFPG-------------------RKLL---------KRWLKM------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----E-Q--------------- -------------------------------------------------------------------------------- ---------------S---------F--FAE-TQ-E-----------GTP-----QGGII------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--ET-LG--------------------------------------------------------------- ------------------ITYKKNYKANDSYIMNPACKFT---LIR----------YADDFVV-------------LTE- -----TKEQ----AL---------------------------SVYMRLRP---------------------YL-K----- ----D--R----GLEL------S-P------EK----T---KVTHIE------------E-------------------- ----------------GFEFLGFLI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.a.I1/AE011190/6579..9109/Bacillus_extraction anthracis/CL1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGITTNTPED- ---------R-----VKLFHLLK-------G------YSVRN-IKAF--------------------------------- -----------------PVKRA-YI--PK--------KNG-----------------------KKRPL------------ ------GIPV----IK-DRIFQNMVKNA-LE-PQWECRF-------E--------------------------------- ---SMSYGFRP------------------KR----------------------------SAHDA---MANL--------- ----FLKLSRG-----T-----------------NRAW-IF-EG--------DFQGCF-DNLN----------------- -HEH--ILSC-IE-------------GFPY-------------------SNAI---------NQWLNA------------ -------------------------------------------------------------------------------- ------------------GC------------------------------------I-----D-N--------------- -------------------------------------------------------------------------------- ---------------K---------T--FYK-TE-T-----------GTP-----QGGII------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--KE-LG--------------------------------------------------------------- ------------------VRYH--FPKRDGAMLYPD-SIG---IVR----------YADDFVI-------------VCN- -----SKEE----AE---------------------------SMYAKLQP---------------------YL-D----- ----K--R----GLKL------A-E------EK----T---RVVHIT------------D-------------------- ----------------GFDFLGFNF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|19384870|locus|VBICloBot822_0094|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium botulinum A3 str. Loch Maree] -------------------------------------------------------------------GIDGFKVITEWD- ---------R-----IKLFNSLK-------D------YSIKN-IKSQ--------------------------------- -----------------PAKRT-YI--PK--------KNG-----------------------KLRPL------------ ------GIPI----IK-DRIYQNIVKNA-LE-PQWESKF-------E--------------------------------- ---SIAYGFRP------------------KR----------------------------STHDA---IEQL--------- ----YLKLRKG-----S-----------------KRQW-IF-EG--------DFKGCF-DNLN----------------- -HEY--IMEC-IN-------------DFPA-------------------KEAV---------YRWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------I-----D-N--------------- -------------------------------------------------------------------------------- ---------------N---------V--FRN-TN-E-----------GTP-----QGGII------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--EE-LG--------------------------------------------------------------- ------------------VKYQ--FTKRQGYCLRDN-SIG---IVK----------YADDFVI-------------LCK- -----TKEE----AE---------------------------TMYERLSP---------------------YL-K----- ----K--R----GLEL------A-E------DK----T---GITHIS------------K-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|18918903|locus|VBIBacCer120424_5584|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacillus cereus Q1] -------------------------------------------------------------------GIDGYISNTPQE- ---------R-----VELFNKLS-------R------YSVRN-IKVK--------------------------------- -----------------PARRT-YI--PK--------KNG-----------------------KLRPL------------ ------GIPV----IV-DRVYQNAFKNA-LE-PQWEAKF-------E--------------------------------- ---MTSYGFRP------------------KR----------------------------STHDA---MSDL--------- ----FTKLSKG-----S-----------------AKGW-IF-EG--------DFEGCF-DNLN----------------- -HDY--IMGC-IN-------------NFPN-------------------KSII---------RDWLES------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----D-N--------------- -------------------------------------------------------------------------------- ---------------D---------V--FNE-TT-K-----------GTP-----QGGII------------------S- PLLANVAL-----------------------------------------------------------HG----------- --------ME--KE-IG--------------------------------------------------------------- ------------------VRYI--HTTRQGDTLYSN-SVG---VVR----------YADDFVI-------------VCP- -----TEEE----AY---------------------------GMYDKLEP---------------------YL-N----- ----K--R----GLNL------A-K------DK----T---RVVHIS------------K-------------------- ----------------GFDFLGFNF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|161807916|locus|VBICloPas18034_2678|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium pasteurianum BC1] -------------------------------------------------------------------GVDGEIALTNTE- ---------R-----LKLFYDLS-------E------LHIEK-HNPK--------------------------------- -----------------PSRRT-YI--KK--------KNG-----------------------KLRPL------------ ------SIPT----IR-DRIYQNIIKGT-LE-PQWEARF-------E--------------------------------- ---PISYGFRP------------------KR----------------------------GCHDA---IARI--------- ----FRSCHSG-----S-----------------RKRW-IF-EG--------DFKGCF-DNLK----------------- -HDY--IMEQ-IK-------------EFPY-------------------DNLV---------DKWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--FNK-TQ-F-----------GSG-----QGNIV------------------S- PLLANIAL-----------------------------------------------------------KG----------- --------ME--DT-LG--------------------------------------------------------------- ------------------IEYK--PVKNNGKIVSYT-NVGKYTLVF----------YADDFVI-------------MCN- -----TQKD----AE---------------------------DVYELLKP---------------------YL-G----- ----K--R----GLEL------S-K------EK----T---RIVTID------------E-------------------- ----------------GFNFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115659708|locus|VBIChaMin231992_1870|_extraction reverse transcriptase [Chamaesiphon minutus PCC 6605] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------MRF-------V--------------------------------- ---PTPF------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------------------------------------S- PLLANIAL-----------------------------------------------------------NG----------- --------LE--KH-LG--------------------------------------------------------------- ------------------VKYN-----NRGE------SQGKRILVR----------YGDDFVI-------------LCE- -----SEED----AL---------------------------KAKETTEE---------------------WL-G----- ----L--R----GLEL------S-K------EK----T---KIIHIS------------E-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|115613496|locus|VBIDehSp228777_1264|_extraction Retrontype RNAdirected DNA polymerase [Dehalobacter sp. CF] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------MG--------------------------------------------------------------- ------------------INYT---LVHHGKKTSYE-NHTPYTMCI----------YADDFVI-------------LCE- -----TKQE----AE---------------------------NPYTTLGS---------------------YL-R----- ----D--R----GLTL------S-E------EK----T---KITHIS------------D-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|31973883|locus|VBICanHam112931_1217|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] -------------------------------------------------------------------GVDNQVINDHKG- ---------R-----EHLYKLLS-------Q------TTSE---KVY--------------------------------- -----------------PVKRV-YI--AK--------KNG-----------------------KKRPL------------ ------GIPT----IL-DRCRQAIVKSA-LE-PYWEAKF-------E--------------------------------- ---PVSYGFRP------------------GR----------------------------SAHDA---IQKI--------- ----FCIARAR-----G-----------------TRHW-VL-DA--------DIKGAF-DNID----------------- -HNF--LIKK-IG-------------GFPE-------------------RNMI---------KQWLQA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----E-H--------------- -------------------------------------------------------------------------------- ---------------G---------N--YIP-NV-A-----------GTP-----QGGII------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--TL-LG--------------------------------------------------------------- ------------------IQYW-KNGTPKQG--------QPYAVVR----------YADDFVV-------------FGK- -----SREE----CE---------------------------TAKIKLQI---------------------WL-A----- ----Q--R----GLAL------S-E------EK----T---SIKHLK------------E-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115603860|locus|VBIRivSp77222_4388|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Rivularia sp. PCC 7116] -------------------------------------------------------------------GVDKLLVLTPGA- ---------R-----GTLVDILT-------R------CPPW---KPL--------------------------------- -----------------PVKRV-YI--RK--------SNG-----------------------KQRPL------------ ------GIPC----VI-DRCLQAIVKNA-LE-PYWEAQF-------E--------------------------------- ---RTSYGFRP------------------GR----------------------------GVHDA---IERI--------- ----HSMSKAN-----S-----------------TKSW-VV-DA--------DIEGCF-DNIA----------------- -HSP--LLKT-IG-------------NFPA-------------------KKLI---------QQWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----D-K--------------- -------------------------------------------------------------------------------- ---------------G---------V--FND-TE-T-----------GVP-----QGGII------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--SA-LG--------------------------------------------------------------- ------------------IRYD-KHGHTI----------GNRGIVR----------YADDLVV-------------FCK- -----TQED----AA---------------------------CVVETLSH---------------------WM-K----- ----S--K----GLAL------S-K------AK----T---NIVHLS------------E-------------------- ----------------GFNFLSFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115422190|locus|VBICylSta108647_6985|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Cylindrospermum stagnale PCC 7417] -------------------------------------------------------------------GVDKLLVKTPEA- ---------R-----GFLVDSLR-------K------FIPW---KPL--------------------------------- -----------------PAKRV-YI--PK--------SNG-----------------------KKRPL------------ ------GIQT----II-DRCLQAIVKNA-LE-PFWEFHF-------E--------------------------------- ---LSSYGFRP------------------GR----------------------------STHDA---ISKI--------- ----YMIVRPN-----K-----------------KKKW-VL-DA--------DIKGCF-DNIS----------------- -RNF--LMKT-IG-------------NFPA-------------------RKLI---------DQWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----E-E--------------- -------------------------------------------------------------------------------- ---------------G---------K--FSE-TL-T-----------GIP-----QGAII------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--DA-LG--------------------------------------------------------------- ------------------VKYN-RRGEIV----------SRRAVVR----------YADDFAI-------------FCE- -----TKED----AE---------------------------QAQIDISE---------------------WL-K----- ----S--R----GLEL------S-K------EK----T---RIVHLN------------E-------------------- ----------------GFCFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115419880|locus|VBICylSta108647_6126|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Cylindrospermum stagnale PCC 7417] -------------------------------------------------------------------GIDKVVVKTTAA- ---------R-----GQLVNKLT-------D------YSPW---KSS--------------------------------- -----------------PARRI-YI--PK--------ANG-----------------------KKRPL------------ ------GIPV----IQ-DRAIQAMVKNA-LE-PEWEATF-------E--------------------------------- ---RSSYGFRP------------------GR----------------------------SPHDA---IESI--------- ----YNLARPN-----K-----------------RKKW-VV-DA--------DIQGCF-DNIS----------------- -HNF--LLEL-LT-------------GFPA-------------------RELI---------KQWLLA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----E-A--------------- -------------------------------------------------------------------------------- ---------------G---------S--WHP-TD-A-----------GTP-----QGSVV------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--SA-LG--------------------------------------------------------------- ------------------VKYN-KDGELR----------AARALVR----------YADDFVV-------------FCE- -----TQED----TK---------------------------NVIQILNY---------------------WM-Q----- ----V--R----GLTL------S-L------EK----T---KISHLT------------E-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115390656|locus|VBIDacSal132842_3607|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Dactylococcopsis salina PCC 8305] -------------------------------------------------------------------GVDGRIALTPEE- ---------R-----WELVCELQ-------V------LNDP---IAS--------------------------------- -----------------PTRRI-QI--PK--------SNG-----------------------KKRPL------------ ------GIPI----VT-DRIRQAVVKEA-LE-PHWEAMF-------E--------------------------------- ---PSSYGFRP------------------GR----------------------------SPHDA---IARV--------- ----QALTKQSPQGKPP-----------------KKQW-VV-DA--------DIKGCF-DNID----------------- -HQH--LLGV-IG-------------NFPA-------------------RKLI---------KTWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------I-----E-K--------------- -------------------------------------------------------------------------------- ---------------G---------N--FNP-TE-G-----------GTP-----QGGVI------------------S- PLLANISL-----------------------------------------------------------HG----------- --------LE--NA-LG--------------------------------------------------------------- ------------------VKWKVRKGRTKSGIYATL-TQSKRAVIR----------FADDFII-------------LCE- -----SEED----AK---------------------------LAKEEANA---------------------FI-N----- ----E--R----GLHL------S-E------EK----T---SICHLN------------D-------------------- ----------------GFKYLGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115675501|locus|VBIOscNig7962_2874|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Oscillatoria nigroviridis PCC 7112] -------------------------------------------------------------------GIDGQTATTPSE- ---------R-----VKLVKEMK-------D------YTLW---KAQ--------------------------------- -----------------PARRV-YI--PK--------ANG-----------------------KQRPL------------ ------GIPT----VK-NRIAQAVIKNA-LE-PSWEARM-------E--------------------------------- ---GSSYGFRP------------------GR----------------------------SCHDA---IEHS--------- ----WIRLNKQGN----------------------DRW-VL-DA--------DIKGAF-DNIS----------------- -HNF--ILKT-IG-------------EIPG-------------------RELI---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----E-S--------------- -------------------------------------------------------------------------------- ---------------E---------I--FHE-TK-S-----------GTP-----QGGII------------------S- PLLANIAL-----------------------------------------------------------DG----------- --------IE--QF-LSQFK------------------------------------------------------------ ---------------------------KRQGKNKSP-RAPKYGFVR----------YADDFII-------------TAE- -----TKED----IE---------------------------EIIPSVKE---------------------LL-K----- ----T--R----GLEL------N-E------DK----T---NIVHIE------------Q-------------------- ----------------GFNFLGFNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|23995189|locus|VBITriEry99848_5996|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Trichodesmium erythraeum IMS101] -------------------------------------------------------------------GRDAQTAKTSVE- ---------K-----VKLVKEML-------T------YRLW---QAK--------------------------------- -----------------PAKRV-YI--PK--------ANR-----------------------QQGPL------------ ------GIPT----VK-NRVAQAVVKNG-LE-PIWDAEF-------E--------------------------------- ---TNSYGFHP------------------GR----------------------------SCHDP---LEQF--------- ----WIRLQK-GK----------------------DTW-IL-DV--------DIKQDF-DNIT----------------- -HEY--ILKA-IG-------------EIPG-------------------RELI---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------L-----E-A--------------- -------------------------------------------------------------------------------- ---------------E---------V--FHK-TE-G-----------GTS-----SRGII------------------S- PLLANIAF-----------------------------------------------------------DG----------- --------ME--RL-LARYKT----------------------------------------------------------- -----------------VKTYQCTRPTTDEEYTKKK-KLDKYGFIR----------YADDFII-------------TAR- -----SEED----IK---------------------------AIIPTIEK---------------------WL-S----- ----E--R----GLEL------N-K------DK----T---NLVHIE------------Q-------------------- ----------------GFNFLGFNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115391721|locus|VBIDacSal132842_4122|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Dactylococcopsis salina PCC 8305] -------------------------------------------------------------------GVDKEVLNTPDE- ---------R-----VKLVNS----------------WEMP---KAN--------------------------------- -----------------PTRRV-YL--PK--------PNG-----------------------KKRPL------------ ------GIPT----VR-DRVAQAIIKNI-LE-PEWEAVF-------E--------------------------------- ---PNSYGFRC------------------GR----------------------------SCHDA---IEQC--------- ----FIKFRA-GNKGG-------------------HLW-VL-DA--------DIKGFF-DNIA----------------- -HES--ILTA-IE-------------SIPR-------------------GDLI---------EGWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------L-----D-K--------------- -------------------------------------------------------------------------------- ---------------G---------V--LNP-TV-M-----------GTP-----QGGVI------------------S- PLLANIGL-----------------------------------------------------------HG----------- --------LE--DF-IKSVN------------------------------------------------------------ ---------------------------------------PKLGVIR----------YADDFVV-------------TSK- -----DKES----LE---------------------------HILDQIKQ---------------------WM-L----- ----E--R----GLEI------S-A------EK----T---RIVSME------------E-------------------- ----------------GFDFLGFNL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115690797|locus|VBICyaApo239906_2949|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Cyanobacterium aponinum PCC 10605] -------------------------------------------------------------------GIDKEVINTPAQ- ---------R-----VKLVNE----------------WKMP---KAV--------------------------------- -----------------PTKRV-YI--PK--------PNG-----------------------KKRPL------------ ------GIPT----VR-DRVAQAIVKNS-LE-PEWEAAF-------E--------------------------------- ---PNSYGFRC------------------GR----------------------------SCHDA---IGQC--------- ----YLRLRGDSEKGGT-----------------HDKW-VL-DA--------DIKGFF-DNIA----------------- -HES--ILNM-ID-------------SHPK-------------------KELI---------KGWLKA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------I-----D-S--------------- -------------------------------------------------------------------------------- ---------------G---------V--HNL-TE-T-----------GTP-----QGGVI------------------S- PLLANIGL-----------------------------------------------------------HG----------- --------LE--KH-IKQCN------------------------------------------------------------ ---------------------------------------PKLGIIR----------YADDFVV-------------TAK- -----DKES----LE---------------------------EVLIQIKQ---------------------WL-S----- ----E--R----GLEI------S-A------EK----T---RIVHID------------N-------------------- ----------------GFNFLGFNL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|23713660|locus|VBIStrAve112782_0248|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Streptomyces avermitilis MA4680] -------------------------------------------------------------------GIDGQKALSPEK- ---------R-----GKTARQIL-------A------DPMS---HPQ--------------------------------- -----------------PVRRV-YI--PK--------ANG-----------------------KRRPL------------ ------GIPV----IR-DRVDQARFKNA-LE-PEWEARF-------E--------------------------------- ---ARSYGFRP------------------GR----------------------------GAWDA---IEMI--------- ----FNVAGRRT----A-----------------KRLW-VL-DA--------DLSAAF-DHIS----------------- -HQH--LMDS-VG-------------LFPG-------------------RRQI---------QQWLRA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------G---------R--FVS-TP-E-----------GTP-----QGGVI------------------S- PLLMNIAL-----------------------------------------------------------HG----------- --------MG--EV-IG--------------------------------------------------------------- ------------------------------ANRPWNAKTTSPTLVR----------YADDFVV-------------FCT- -----TENE----AI---------------------------KAKQDLAA---------------------WL-E----- ----P--R----GLSF------N-E------EK----T---RVVHLS------------S-------------------- ----------------GVDFLGFNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >G.v.I1/BA000045/168850..171364/Gloeobacter_extraction violaceus/CL2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGVKSLTPKA- ---------R-----LALTKNLR-------I------SE-----KAK--------------------------------- -----------------PMRRV-WI--AK--------PGTQ----------------------EKRPL------------ ------GIPT----MT-DRARQALLTLA-LE-PEWEARF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCHDA---LQAI--------- ----YNAIR-------Q-----------------QSKF-VL-DA--------DIAKCF-DRID----------------- -QQA--LLKK-MN-------------TSSA------------------IRRQI---------RAWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-G--------------- -------------------------------------------------------------------------------- ---------------S---------E--LFP-TP-T-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--ER-VKQVS------------------------------------------------------------ -------------------------------------K--MAQLIR----------YADDFVC-------------IHT- -----DQQI----VQ---------------------------SCQTVLEE---------------------WL-A----- ----G--M----GLEL------K-P------SK----T---RIAHTLLLEE---G-Q--P-------------------- ----------------GFDFLGFTV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115660493|locus|VBIChaMin231992_2004|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Chamaesiphon minutus PCC 6605] -------------------------------------------------------------------GVDGVKSLTPKQ- ---------R-----LILVDKIK-------L------GT-----KAK--------------------------------- -----------------PTRRV-WI--PK--------PGTS----------------------EERPL------------ ------GIPT----ME-DRALQAVVKMV-LE-PEWESKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCHDA---IEAI--------- ----FSSIS-------K-----------------KSKY-VL-DA--------DISKCF-DRIN----------------- -HNK--LLSK-LN-------------TFPT------------------LRKQI---------RAWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-G--------------- -------------------------------------------------------------------------------- ---------------K---------K--LFP-TN-E-----------GTP-----QGGVL------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------LE--EL-IMGLAP----------------------------------------------------------- -----------------KFDMKRPNGN-QLPVRDKL-K--SICCVR----------YADDFVI-------------LHE- -----DLKV----IN---------------------------QCKKEVEE---------------------WL-S----- ----D--I----GLEL------K-P------SK----T---RIAHCLSDLD---GEK--A-------------------- ----------------GFNFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|21539290|locus|VBICyaSp130209_0491|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Cyanothece sp. ATCC 51142] -------------------------------------------------------------------GVDGVKSLTPKQ- ---------R-----LTLVNQLK-------L------SP-----KVK--------------------------------- -----------------PTRRV-WI--PK--------SGTD----------------------EERPL------------ ------GIPT----MY-DRALQGLVKMA-LE-PEWEARF-------E--------------------------------- ---PNSYGFRI------------------GR----------------------------SCHDA---INAI--------- ----FKAIK-------C-----------------KSKF-VL-DA--------DISKCF-DRIN----------------- -HKK--LLEK-LN-------------TYPT------------------LRKQI---------RAWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-G--------------- -------------------------------------------------------------------------------- ---------------K---------E--LFP-TL-E-----------GTP-----QGGVL------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--EC-IKELTE----------------------------------------------------------- -----------------SHSMKRENGKYEKPLKHKR-Q--SVSLIR----------YADDFVI-------------LHE- -----DITF----IL---------------------------KCKDRIAK---------------------WL-N----- ----G--M----GLEL------K-P------SK----T---RLTHTLNDYE---GEK--A-------------------- ----------------GFDFLGFHI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|21575043|locus|VBICyaSp125535_1555|_extraction Mobile element protein [Cyanothece sp. PCC 8801] -------------------------------------------------------------------GVDGVKSLTPKQ- ---------R-----LNLIDKLK-------L------GT-----KVK--------------------------------- -----------------PTRRV-WI--PK--------PGTE----------------------EKRPL------------ ------GIPT----MY-DRALQGLVKLA-LE-PEWEAKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCQDA---IGAI--------- ----FLAIN-------K-----------------KAKY-VL-DA--------DIAKCF-DRID----------------- -HEQ--LLNK-LN-------------TYPT------------------LRKQI---------RAWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-G--------------- -------------------------------------------------------------------------------- ---------------K---------E--LFP-TS-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--NE-INKLAE----------------------------------------------------------- -----------------TFDMRGPDGKL-LGKRDKR-K--SVSLIR----------YADDFVI-------------LHE- -----DITI----VQ---------------------------RCKEFISE---------------------WL-K----- ----D--M----GLEL------K-P------SK----T---RLAHTLEEYN---KEK--P-------------------- ----------------GFDFLGFNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|21539742|locus|VBICyaSp130209_0717|_extraction Mobile element protein [Cyanothece sp. ATCC 51142] -------------------------------------------------------------------GVDGVKSLTPKQ- ---------R-----LLLVNKLK-------L------GT-----KVK--------------------------------- -----------------PTRRV-WI--PK--------PGRD----------------------EKRPL------------ ------GIPT----MK-DRALQGLVKMA-LE-PEWEAKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCHDA---IGAI--------- ----FSAIR-------L-----------------KPKY-VL-DA--------DIAKCF-DKID----------------- -HER--LLEK-IN-------------TYPT------------------LRKQI---------RAWLKA------------ -------------------------------------------------------------------------------- ------------------DV------------------------------------M-----D-G--------------- -------------------------------------------------------------------------------- ---------------K---------K--LFP-TS-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--SR-IKEMAK----------------------------------------------------------- -----------------DIDWRNEKGHL-ISISARR-K--SISLIR----------YADDFVI-------------IHE- -----NLTI----VQ---------------------------RCREIISE---------------------WL-I----- ----G--M----GLEI------K-P------SK----T---RLIHTLQEYE---GEK--P-------------------- ----------------GFNFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115428478|locus|VBICriEpi239080_4618|_extraction Mobile element protein [Crinalium epipsammum PCC 9333] -------------------------------------------------------------------GVDGVKSLTPVQ- ---------R-----LALVRKLA-------L------KG-----KSK--------------------------------- -----------------PTRRV-WI--DK--------PGTT----------------------EKRPL------------ ------GIPT----MY-DRALQALVKLA-LE-PEWEARF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCHDA---IGAI--------- ----FVTIN-------Q-----------------KAKY-VL-DA--------DIAKCF-DRIN----------------- -HRE--LLKK-LN-------------TFPT------------------LKRQI---------GAWLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-G--------------- -------------------------------------------------------------------------------- ---------------K---------Q--MFP-TS-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--ER-IKQFAE----------------------------------------------------------- -----------------TLPSRSGFGK-----RDKR-K--SLSLIR----------YADDFVI-------------LHE- -----DITV----VK---------------------------RCKEIISE---------------------WL-M----- ----G--M----GLEL------K-P------SK----T---RLAHTLIEYE---GQD--A-------------------- ----------------GFNFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115579300|locus|VBIMicSp236384_3645|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Microcoleus sp. PCC 7113] -------------------------------------------------------------------GVDGQRSLTPKQ- ---------R-----QNLIGQLK-------L------GT-----KVS--------------------------------- -----------------PTRRV-WI--PK--------PGKE----------------------EKRPL------------ ------GIPT----MK-DRALQALVKLA-LE-PEWEAQF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCQDA---ISAI--------- ----QTVIK-------Q-----------------KAKY-VL-DA--------DIAQCF-DRID----------------- -HEA--LLNK-LN-------------TSPT------------------IRRQI---------RAWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-N--------------- -------------------------------------------------------------------------------- ---------------M---------Q--YFD-TS-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--WR-IKEYVE----------------------------------------------------------- -----------------TCDLKRSDGKYQLPKRDKR-D--SVSIIR----------YADDFVI-------------LHN- -----DITV----VQ---------------------------GCREVISE---------------------WL-K----- ----G--M----GLEL------K-P------SK----T---RIAHTLNEHG---QEK--P-------------------- ----------------GFNFLGYYV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115580159|locus|VBIMicSp236384_0858|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Microcoleus sp. PCC 7113] -------------------------------------------------------------------GVDGKKSLTPKQ- ---------R-----LTLVKNLR-------L------TG-----KSK--------------------------------- -----------------PTRRI-WI--PK--------PGKD----------------------EKRPL------------ ------GIPT----IH-DRALQALVKLA-LE-PEWESKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCHDA---VGQI--------- ----YLSIN-------K-----------------QPKY-VL-DA--------DISQCF-DKIN----------------- -HNA--LLEK-LN-------------TFPT------------------LRRQV---------RSWLKA------------ -------------------------------------------------------------------------------- ------------------GA------------------------------------I-----D-E--------------- -------------------------------------------------------------------------------- ---------------A---------Q--LIP-TS-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--QL-TKAVSK----------------------------------------------------------- -----------------T-----------------------ACLVR----------YADDFVI-------------LDK- -----DITV----VQ---------------------------RCKKAIEE---------------------FL-K----- ----G--M----GLEL------K-P------SK----T---RISHTLHKYE---G-N--V-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115390726|locus|VBIDacSal132842_0370|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Dactylococcopsis salina PCC 8305] -------------------------------------------------------------------GVDGVKSLSPVA- ---------R-----MKLVNNLK-------L------GS-----KVK--------------------------------- -----------------PTRRV-KI--PK--------P-NG----------------------EERPL------------ ------GIPT----MY-DRALQALVKLA-LE-PEWEAVF-------E--------------------------------- ---PNSYGFRA------------------GR----------------------------SAHDA---VTAI--------- ----FDAIR-------Y-----------------KPKY-VL-DA--------DLAKCF-DRIN----------------- -HER--LLNK-IK-------------TFPT------------------FRKQI---------RAWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----E-G--------------- -------------------------------------------------------------------------------- ---------------K---------E--FSP-TS-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--NE-IKAIAH----------------------------------------------------------- -----------------TFDMKTNNG-YQVSASNKR-R--SVCVIR----------YADDFVI-------------LHE- -----SLAV----VQ---------------------------RCKEVVSN---------------------WL-A----- ----D--M----GLEL------K-P------SK----T---RIAHTLENYE---NEK--A-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|21557688|locus|VBICyaSp136448_4440|_extraction Mobile element protein [Cyanothece sp. PCC 7424] -------------------------------------------------------------------GVDGVKSLTPKQ- ---------R-----MNLVGQLK-------L------TC-----KTK--------------------------------- -----------------PTRRV-WI--PK--------PGKD----------------------EKRPF------------ ------LIPC----MS-DRALQALVKIA-LE-PEWEAKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------GCHDA---IGAI--------- ----FNQLG-------A-----------------KAKY-VL-DA--------DISKCF-DKIN----------------- -HEK--LLQK-LN-------------TFPT------------------LRRQI---------RAWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-G--------------- -------------------------------------------------------------------------------- ---------------N---------K--LFP-TE-E-----------GTP-----QGGVV------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--EI-IKSFAQ----------------------------------------------------------- -----------------NPGELRQE--FSNRGKGRE-Q--SISLIR----------YADDFVL-------------IHE- -----SLAV----VE---------------------------KGKEIIET---------------------WL-R----- ----E--L----GLTL------K-P------EK----T---QITHTLDKHQ---G-K--V-------------------- ----------------GFNFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115683636|locus|VBIOscNig7962_7871|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Oscillatoria nigroviridis PCC 7112] -------------------------------------------------------------------GIDGVKSLSPIQ- ---------R-----VKLVKRLR-------V------TG-----KSK--------------------------------- -----------------PTRRV-MI--PK--------PGSD----------------------EKRPL------------ ------GIPT----IE-DRALQALVKSA-LE-PEWEAQF-------E--------------------------------- ---PNSYGFRA------------------GR----------------------------SCHDA---IEAI--------- ----FNSIR-------L-----------------KAKY-VL-DA--------DIAKCF-DRID----------------- -HKA--LLAK-VN-------------TYPT------------------LRHQL---------KVWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------C-----A-E--------------- -------------------------------------------------------------------------------- ---------------G---------S--LFP-TD-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--NR-VKQYAE----------------------------------------------------------- -----------------TLKGKK---------RDNR-Q--ALSLIR----------YADDFVI-------------MHE- -----DLSV----VK---------------------------KCQEIIAE---------------------WL-R----- ----D--M----GLEL------K-A------SK----T---KLTHTLIKID---G-N--V-------------------- ----------------GFEFLGFHV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >N.sp.I4/AP003604/45422..47908/Nostoc_extraction sp./CL2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGRKNLSPKA- ---------R-----LILVQSMK-------L------GD-----KAS--------------------------------- -----------------PTRRV-WI--PK--------PGSS---------------------GEKRPL------------ ------SIPT----LY-DRALQSLVKLA-LE-PEWEARF-------E--------------------------------- ---PNSFGFRP------------------GR----------------------------NAHDA---MKAI--------- ----FNTIK-------F-----------------KPKY-VL-DA--------DIAKCF-DKID----------------- -HNV--LLSK-LN-------------TFPT------------------ISRQI---------RAWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------I-----D-F--------------- -------------------------------------------------------------------------------- ---------------S---------EYALHT-TS-M-----------GVP-----QGGTI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--NR-IKQVAL----------------------------------------------------------- -----------------TLPGCK---------SENR-Q--AISLIR----------FADDFVI-------------LHK- -----DLAV----IQ---------------------------RCQQIISE---------------------WL-S----- ----E--L----GLEL------K-P------SK----T---RISHTLNMYE---G-K--V-------------------- ----------------GFDFLGFTV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|21588186|locus|VBICyaSp112625_3397|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Cyanothece sp. PCC 8802] -------------------------------------------------------------------GVDGVKSLTQKQ- ---------R-----MELVENLT-------L------KG-----KAK--------------------------------- -----------------PTRRV-WI--PK--------PNG-----------------------EKRPL------------ ------GIPT----IT-DRAKQYLVKLA-LE-PQWEAKF-------E--------------------------------- ---HNSYGFRP------------------GR----------------------------SCHDA---IEAI--------- ----YIAIS-------R-----------------KAKF-VL-DA--------DIAKCF-DKIN----------------- -HEK--LLTK-LE-------------TYPE------------------IRKSI---------KGWLKS------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------R-----D-D--------------- -------------------------------------------------------------------------------- ---------------K---------E--WFP-TD-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--TI-IKDFAR----------------------------------------------------------- -----------------TWKGEK---------AKNE-Q--SISVIR----------YADDFVI-------------LHE- -----NLDI----IQ---------------------------KCKSIIEN---------------------WL-S----- ----E--I----GLEL------K-P------SK----T---RISHTLQEVE---G-K--I-------------------- ----------------GFNFLGFHI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115388988|locus|VBIDacSal132842_1986|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Dactylococcopsis salina PCC 8305] -------------------------------------------------------------------GVDGVKSLNPKQ- ---------R-----LNLAENLT-------L------TG-----KGK--------------------------------- -----------------SLRRV-YI--PK--------PGKA----------------------EKRSL------------ ------GIPV----ME-DRARQALLKLA-ME-PEWEAKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCHDA---EGAI--------- ----YVSIN-------Q-----------------KPKW-VL-DA--------DISKCF-DRIN----------------- -HDV--LLRK-LN-------------TTPT------------------IARQI---------RAWLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-R--------------- -------------------------------------------------------------------------------- ---------------G---------D--WMP-TE-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------LE--EY-IKQWAE----------------------------------------------------------- -----------------TWKGYKNENGRQMSKINRR-Q--SITLIR----------YADDFVV-------------LHR- -----DKSI----VQ---------------------------QAKTLIEH---------------------WL-H----- ----G--L----GLEL------S-E------SK----T---RICHTLYDSE---EEE--A-------------------- ----------------GFDFLGWNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115392614|locus|VBIDacSal132842_1186|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Dactylococcopsis salina PCC 8305] -------------------------------------------------------------------GIDGVKSLSPKQ- ---------R-----LSLAESLT-------L------TG-----KGE--------------------------------- -----------------SLRRV-WI--PK--------PGRK----------------------EKRGL------------ ------GIPV----ME-DRARQALLKLA-LE-PEWEAKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCHDA---EQAI--------- ----FNAIR-------Y-----------------KPKW-VL-DA--------DISKCF-DRIN----------------- -HDV--LLQK-LN-------------TIPT------------------IARQI---------RAWLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-R--------------- -------------------------------------------------------------------------------- ---------------G---------D--WMP-TN-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------LE--DY-IKQWAE----------------------------------------------------------- -----------------TWKG-----GKQANRI-------SISLIR----------YADDFVV-------------LHK- -----DKSI----IQ---------------------------QAKTLIEQ---------------------WL-H----- ----G--S----GLEI------S-E------SK----T---RICHTLYDSE---EKK--A-------------------- ----------------GFDFLGWNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115388851|locus|VBIDacSal132842_2432|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Dactylococcopsis salina PCC 8305] -------------------------------------------------------------------GIDGIKSLSPKQ- ---------R-----LNLAENLT-------L------TG-----KGK--------------------------------- -----------------SLRRV-WI--PK--------PGRK----------------------EKRGL------------ ------GIPV----ME-DRARQALLKLA-LE-PEWEAKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCHDA---GQAI--------- ----YVAIN-------Q-----------------QSKW-VL-DA--------DISKCF-DRID----------------- -HNV--LLRK-LN-------------TTST------------------IARQI---------RAWLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-R--------------- -------------------------------------------------------------------------------- ---------------G---------D--WFP-TN-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------LE--KY-IKQWAE----------------------------------------------------------- -----------------TWKGYKDMNGKSRGKKQKR-H--SISVIR----------YADDFVV-------------LHK- -----DKSI----IQ---------------------------EAKMLIEQ---------------------WL-H----- ----G--L----GLEL------S-E------SK----T---RTCHTLHDTN---ETR--A-------------------- ----------------GFDFLGWNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115547536|locus|VBIAnaSp49473_3044|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Anabaena sp. 90] -------------------------------------------------------------------GVEVVKSLGYAQ- ---------R-----LELANSLG-------T------TR-----KVK--------------------------------- -----------------PTRRI-WI--PK--------PGTD----------------------EKRPL------------ ------GIPT----MS-DRANQAFAKLA-LE-SEWEAKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------AVHDA---IEAI--------- ----YLAIH-------C-----------------KAKY-VL-DA--------DISKCF-DNID----------------- -HQK--LLSK-LN-------------TYPS------------------MKRLI---------RSWLKA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------M-----D-R--------------- -------------------------------------------------------------------------------- ---------------R---------D--LFP-TK-M-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--EV-IKEYAN----------------------------------------------------------- -----------------TLPTRRNYGR-----KQNR-N--ALSLIR----------YADDFVI-------------IHE- -----DINV----VL---------------------------GAKAVIEG---------------------FL-K----- ----D--I----GLEL------K-P------SK----T---RICHTFEEYE---GEK--P-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115439943|locus|VBIStaCya5387_0048|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Stanieria cyanosphaera PCC 7437] -------------------------------------------------------------------GVDGIKSLTPAK- ---------R-----LEMVNHLN-------I------EG-----KSK--------------------------------- -----------------PTRGV-WI--PK--------RTSDLRSPYGKRAFEVSSNDVSSANGEKRPL------------ ------GIPC----MK-NRVIQCLVKLA-LE-PEWEAKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCHDA---IGAI--------- ----FNVIR-------Y-----------------EPKY-VL-DA--------AIAKCF-DKID----------------- -HQK--LLNK-LE-------------TYPG------------------IKKQI---------KAWLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------I-----D-D--------------- -------------------------------------------------------------------------------- ---------------N------------WFP-TD-E-----------STP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--MA-IKNLAR----------------------------------------------------------- -----------------NLDLTTSTGQVITDRRVKE-K--KLHLIR----------YADDFVI-------------LHN- -----DINV----IY---------------------------KCKETIES---------------------FL-L----- ----E--M----GLEL------K-P------SK----T---KISHTLNKYQ---G-N--I-------------------- ----------------GFDFLGFNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115428732|locus|VBICriEpi239080_3135|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Crinalium epipsammum PCC 9333] -------------------------------------------------------------------GVDGVKSLSPEA- ---------R-----LKLVRELK-------L------TG-----KSK--------------------------------- -----------------PTRRV-WI--PK--------PGTD----------------------EKRPL------------ ------GIPT----MY-DRALQAVVKAT-LE-PEWEAFF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCHDA---VNQV--------- ----KKAIM-------Q-----------------KAKY-VL-DA--------DIAKCF-DRIN----------------- -HEK--LLQK-LN-------------TKGK------------------VRQQI---------KAWLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------V-----D-Q--------------- -------------------------------------------------------------------------------- ---------------G---------S--FTA-TS-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--ER-IKQ-------------------------------------------------------------- -----------------EFPRMSHSGRETWYHKKGE-EFPTPDVIR----------YADDFVI-------------FHQ- -----NKTV----VQ---------------------------RCRDIISN---------------------WL-S----- ----D--I----GLQL------K-P------EK----T---RLSHSLNPEL---SDDGIA-------------------- ----------------GFDFLGHHI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|21544066|locus|VBICyaSp130209_2854|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Cyanothece sp. ATCC 51142] -------------------------------------------------------------------GIDGIKSLRPSA- ---------R-----WKLIQELK-------F------TG-----KSK--------------------------------- -----------------PVRRV-WI--PK--------PGKS----------------------EKRPL------------ ------GIPT----IK-DRALQTLAKMA-LE-PEWEAKF-------E--------------------------------- ---PHSYGFRP------------------GR----------------------------SVHDA---VEAI--------- ----FTGIS-------K-----------------KKKF-IL-ET--------DIEKCF-DKIN----------------- -HSE--LIKK-LN-------------TYPK------------------LRRQI---------KAWLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------I-----D-E--------------- -------------------------------------------------------------------------------- ---------------N---------Q--LFP-TE-E-----------GTP-----QGGTL------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------ME--NL-LKK-------------------------------------------------------------- -----------------AFPRLGVGNRQTWFHSKGQ-EFYSPILIR----------YADDLVV-------------IHE- -----DQKV----IE---------------------------VCKELINE---------------------WL-R----- ----T--I----GLRL------K-D------SK----T---KIVHTYDEHK---RNK--P-------------------- ----------------GFDFLGFNF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >C.sp.I1/X71404/446..2898/Calothrix_extraction sp./CL2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGVKSLKPSA- ---------R-----LTLVMNMK-------L------NH-----KVK--------------------------------- -----------------ATRRV-WI--PK--------PGNV----------------------EKRPL------------ ------GIPT----MQ-DRATQSLVKLA-LE-PEWEAKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------NAHDA---REAI--------- ----FNSIR-------Y-----------------SNKW-VL-DA--------DISKCF-DKIN----------------- -HEK--LLTK-IN-------------TFPT------------------MRRQI---------KAWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------H--FSE-TT-E-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------LE--KL-VKEFAA----------------------------------------------------------- -----------------SQRGGK---------VKNQ-N--SISLIR----------YADDFVI-------------LAP- -----NKTQ----II---------------------------VLKEIVKT---------------------WL-A----- ----E--M----GLEL------N-P------NK----T---RIVSTFKSSEIFASQE--V-------------------- ----------------GFNFLGFNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Tr.e.I2/CP000393/5587083..5589603/Trichodesmium_extraction erythraeum/CL2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGIKNLPSMQ- ---------R-----FNLVDLLK-------R------HRF----KAS--------------------------------- -----------------PTRRV-WI--PK--------PGKD----------------------EKRPL------------ ------GIST----MY-DRALQALVKLG-RS-PEWEAHF-------E--------------------------------- ---PNSYGLRP------------------GR----------------------------STHDA---IAAI--------- ----YVSIN-------K-----------------KPKY-VL-DA--------DISKCF-DRIN----------------- -HDA--LLRK-IG-------------RTP-------------------YRRLI---------KQWLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------F-----D-N--------------- -------------------------------------------------------------------------------- ---------------K---------Q--FSD-TL-E-----------GTP-----QGGVI------------------S- TLLVNIAL-----------------------------------------------------------HG----------- --------ME--KC-LEKYAE----------------------------------------------------------- -----------------TLPGKK---------RDNK-Q--ALSLIR----------YADDFVI-------------LHE- -----DIKV----VM---------------------------QAKTVIQE---------------------WL-N----- ----Q--V----GLEL------K-P------EK----T---KIAHTLEEYE---GNK--P-------------------- ----------------EFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|23984237|locus|VBITriEry99848_0561|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Trichodesmium erythraeum IMS101] -------------------------------------------------------------------GIDGKKSLTLKE- ---------R-----VNLANNLV-------M------SH-----KAK--------------------------------- -----------------STCRI-WI--PK--------PGKT----------------------EKRPL------------ ------GILT----IS-ERAKQGLVKMA-IE-PEWEAKF-------E--------------------------------- ---PNIYGFRP------------------GR----------------------------SCMDA---VEGI--------- ----KAAIK-------Q-----------------KSKY-VL-NA--------DIAKCF-DNID----------------- -HEK--LLDK-IG-------------TFPK------------------VRRQI---------KAWLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------Y-----D-N--------------- -------------------------------------------------------------------------------- ---------------E---------S--WFL-MD-E-----------GIS-----QGEVI------------------S- GLWANIAL-----------------------------------------------------------HG----------- --------ME--KI-VKEFAY----------------------------------------------------------- -----------------SLSGKK---------VKNE-K--EITLVR----------YADHFVI-------------LHP- -----NLNV----VT---------------------------KAKALVEE---------------------FL-R----- ----G--M----GLEL------K-P------EK----T---RLTHTLIQVG---EEE--P-------------------- ----------------GFNFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115404203|locus|VBIOscAcu116170_2852|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Oscillatoria acuminata PCC 6304] -------------------------------------------------------------------GVDGIKNLDPSE- ---------R-----IKLAETLR-------L------DG-----KAT--------------------------------- -----------------PLLRV-EI--PN--------PGKK----------------------ETRPL------------ ------GIPT----IE-DRAKQALAKLA-LE-PEWEAKF-------E--------------------------------- ---PNSYGFRP------------------GR----------------------------SCHDA---IIAI--------- ----ELQVR-------R-----------------QSKY-IL-DA--------DLKGCF-DNID----------------- -HEA--LIGK-LN-------------TFPI------------------MENQV---------RAWLKS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------M-----K-G--------------- -------------------------------------------------------------------------------- ---------------D---------V--FYK-TE-S-----------GTP-----TGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------LE--TH-IADKFP----------------------------------------------------------- -----------------TFRTRKGQEK-----GKMK-EWSEARTIR----------YADDFVI-------------LHE- -----KLEV----IQ---------------------------EAKSETEK---------------------WL-A----- ----T--I----GLKL------N-E------NK----T---RIAHTIKEVE---GQK--P-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >C.w.I1/NZ_AADV01000039/6112..8597/Crocosphaera_extraction watsonii/CL2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGKKSLRPNQ- ---------R-----LKLVNELR-------L------KGY----KAK--------------------------------- -----------------ALRRV-WI--PK--------PGRD----------------------EKRGL------------ ------GIPT----MK-DRAMQALVKSA-LE-PYWEAQF-------E--------------------------------- ---GTSYGFRP------------------GR----------------------------SAQDA---ISRI--------- ----FLAIK-------T-----------------NAKY-VL-DA--------DIAKCF-DKIN----------------- -HDY--LLSK-VD-------------CPHN------------------IKRII---------KQWLEC------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-K--------------- -------------------------------------------------------------------------------- ---------------G---------I--FEE-TD-S-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------MI--ID-IENHFP----------------------------------------------------------- -----------------RTKRREDGSLKQGY---------KPKIIR----------YADDFVI-------------LHT- -----DYDV----IL---------------------------QCKNLVAQ---------------------WL-E----- ----K--V----GLEL------K-P------EK----T---SIRHTLKSIV-HNGKTIEP-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|21547658|locus|VBICyaSp130209_4633|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Cyanothece sp. ATCC 51142] -------------------------------------------------------------------GVDGKKIIHPQQ- ---------R-----YQLALSLN-------L------KGY----KSK--------------------------------- -----------------PLRRV-WI--SK--------PGKD----------------------EKRPL------------ ------GIPT----IT-DRAMQCLIKLC-ME-PYWEAKF-------E--------------------------------- ---GNSYGFRP------------------GR----------------------------STHDA---IEAI--------- ----FNHIR-------Y-----------------KTKY-VL-DA--------DISKCF-DKIN----------------- -HKY--LLDK-TD-------------CPY-------------------FKSII---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-K--------------- -------------------------------------------------------------------------------- ---------------G---------I--FES-TD-S-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------DG----------- --------MI--RD-IQKSFP----------------------------------------------------------- -----------------NSITRE-GKRIRGY---------QPKIIR----------YADDFVI-------------LHH- -----ELEI----IN---------------------------HTQKLVNK---------------------WL-E----- ----K--V----GLEL------K-P------SK----T---RICHTLNDIM-IDGKMEKA-------------------- ----------------GFDFLGFTI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|22636780|locus|VBIMicAer59304_5685|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Microcystis aeruginosa NIES843] -------------------------------------------------------------------GVDGMIAISPEQ- ---------R-----LNLTEEIK--------------GTL----KAK--------------------------------- -----------------PLRRV-WI--PK--------PGRD----------------------EKRPL------------ ------GIPT----IK-DRARQALIKSA-LE-PEWESKM-------E--------------------------------- ---GTSYGFRP------------------GR----------------------------SDHDA---ISRI--------- ----YITIN-------Q-----------------SSYF-VL-DA--------DIAKCF-DRIN----------------- -HDF--LLSK-IH-------------CPSS------------------LKRDI---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--FEE-TE-T-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------DG----------- --------MA--RL-IETLFP----------------------------------------------------------- -----------------KKGNGK-------N---------QAVLIR----------YADDFVV-------------ISP- -----SLEI----IE---------------------------QCKTAISE---------------------WL-K----- ----P--I----GLEL------K-P------EK----T---RVCHTLKPIE-YNGKMEEP-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|42723187|locus|VBIArtPla153080_2809|_extraction LongchainfattyacidCoA ligase (EC 6.2.1.3) [Arthrospira platensis NIES39] -------------------------------------------------------------------GVDGMVVISPKQ- ---------R-----LEMAEKIK--------------GNL----KTK--------------------------------- -----------------PLRRV-WI--PK--------PGRD----------------------EKRPL------------ ------GIPT----IQ-DRARQALVKSA-LE-PEWEGRF-------E--------------------------------- ---GTSYGFRP------------------GR----------------------------SAHDA---IGRI--------- ----YTAIN-------Q-----------------GQYY-VL-DA--------DIAKCF-DRIN----------------- -HDY--LLSK-IH-------------CPSV------------------IKRDL---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--FED-TE-A-----------GTP-----QGGVI------------------S- PILANIAL-----------------------------------------------------------DG----------- --------MA--RL-IETMYP----------------------------------------------------------- -----------------KASGGK-------V---------KATLIR----------YADDFVV-------------ISP- -----SLDI----IE---------------------------QCKTAISR---------------------WL-K----- ----P--I----GLEI------K-P------EK----T---RVCHTLNPIQ-YEGKTEEP-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|42724297|locus|VBIArtPla153080_3363|_extraction LongchainfattyacidCoA ligase (EC 6.2.1.3) [Arthrospira platensis NIES39] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------------------------GI------------ ------GIPT----IQ-DRARQALVKSA-LE-PEWESRF-------E--------------------------------- ---GTSYGFRP------------------GR----------------------------SAQDA---ISRI--------- ----YLSIN-------K-----------------GEYY-VL-DA--------DIAKCF-DRIN----------------- -HDY--LLSK-IH-------------CPSN------------------LKRDL---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------I--FED-TE-A-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------DG----------- --------ME--RL-VKGMYP----------------------------------------------------------- -----------------NKRTA--------T---------QVNLIR----------YADDFVV-------------ISK- -----DLGI----IE---------------------------QCKTAISE---------------------WL-K----- ----P--V----GLEI------K-P------EK----T---RICHTLNPIE-YNGKIEEP-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115401811|locus|VBIOscAcu116170_3674|_extraction LongchainfattyacidCoA ligase (EC 6.2.1.3) [Oscillatoria acuminata PCC 6304] -------------------------------------------------------------------GIDGVVIISPNQ- ---------R-----LGLAEEIK--------------GRL----KAK--------------------------------- -----------------PLRRV-WI--PK--------PGRD----------------------EKRPI------------ ------GIPT----IR-DRARQALVKAA-LE-PEWESKF-------E--------------------------------- ---GTSYGFRP------------------GR----------------------------SAHDA---IVRI--------- ----YAAIK-------M-----------------NSYY-VL-DA--------DIAKCF-DLIN----------------- -PEH--LLSK-IH-------------CPSR------------------LKRDL---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--FEE-TY-A-----------GTA-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------DG----------- --------MA--RL-IETKFP----------------------------------------------------------- -----------------KKNSV--------V---------QATLIR----------YADDFVV-------------ISP- -----RLEV----IE---------------------------QCQTAISE---------------------WL-K----- ----P--I----GLEI------K-P------FK----T---RVCHTLKPIQ-YDGKTEEP-------------------- ----------------GFNFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|42725237|locus|VBIArtPla153080_3831|_extraction LongchainfattyacidCoA ligase (EC 6.2.1.3) [Arthrospira platensis NIES39] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------MN----------------- -HDY--LLSK-IH-------------CPSR------------------LKRDL---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--FED-TE-T-----------GTP-----QGGVI------------------S- PILANIAL-----------------------------------------------------------LG----------- --------MG--RL-IGKMYP----------------------------------------------------------- -----------------QNTNNK-------P---------FATVIR----------YADDFVV-------------ISK- -----DLGI----IE---------------------------QCKTAISE---------------------WL-K----- ----P--V----GLEI------K-P------EK----T---RICHTLKSIE-YNGKAEEP-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|42723055|locus|VBIArtPla153080_2743|_extraction LongchainfattyacidCoA ligase (EC 6.2.1.3) [Arthrospira platensis NIES39] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------MN----------------- -HDY--LLSK-IR-------------CPSS------------------LKKDL---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--FED-TE-A-----------GTP-----QGGVI------------------S- PILANIAL-----------------------------------------------------------DG----------- --------MA--RL-VEIMYP----------------------------------------------------------- -----------------QGTNNK-------P---------FARLVI----------YADNFVV-------------ISK- -----DLRI----IE---------------------------QCKTAISE---------------------WL-K----- ----P--V----GLEI------K-P------EK----T---RVCHTLKPIQ-YEGKTEEQ-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|42724541|locus|VBIArtPla153080_3485|_extraction LongchainfattyacidCoA ligase (EC 6.2.1.3) [Arthrospira platensis NIES39] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------ME--RL-IKEMYP----------------------------------------------------------- -----------------NKGTA--------I---------QVNLMR----------SADDFVV-------------ISK- -----DLGI----IE---------------------------QCPIAISE---------------------WL-K----- ----P--V----GLEI------K-P------EK----T---RIGHTLNRIE-YDGKTQEP-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|42720039|locus|VBIArtPla153080_1249|_extraction LongchainfattyacidCoA ligase (EC 6.2.1.3) [Arthrospira platensis NIES39] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------MV--RL-IETMYH----------------------------------------------------------- -----------------KKKNV--------V---------QATVVR----------YADDFVV-------------ISH- -----SLDI----IK---------------------------QCKTAISE---------------------WL-K----- ----P--V----GLEI------K-P------EK----T---RICHTLKPIE-YDGKTEEP-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|42716490|locus|VBIArtPla153080_5826|_extraction LongchainfattyacidCoA ligase (EC 6.2.1.3) [Arthrospira platensis NIES39] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------------------------------------------------M----------------- -PLY--VLTI-LS-------------C----------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------------------------------AI------------------S- PILANIAL-----------------------------------------------------------DG----------- --------MV--RL-IETMYP----------------------------------------------------------- -----------------KKANR--------V---------QASLVR----------YADDFVV-------------ISP- -----SLDI----IE---------------------------PCKNAIYE---------------------WL-K----- ----P--V----GLQI------K-P------EK----T---RVCHTLNPIQ-YEGRTEEA-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|42717648|locus|VBIArtPla153080_6402|_extraction LongchainfattyacidCoA ligase (EC 6.2.1.3) [Arthrospira platensis NIES39] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------MN----------------- -HDY--LLSK-IH-------------CPSR------------------LKRDV---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------L-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------V--FED-TE-R-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------DG----------- --------MA--RL-IENCIQ----------------------------------------------------------- -----------------KKKGR--------K---------QATLIR----------YADDFVV-------------ISP- -----SIEI----IE---------------------------QCKTAISE---------------------WL-R----- ----N--V----GLEI------K-P------EK----T---RVCHTLNPLQ-HGEQTEEP-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115405098|locus|VBIOscAcu116170_7132|_extraction LongchainfattyacidCoA ligase (EC 6.2.1.3) [Oscillatoria acuminata PCC 6304] -------------------------------------------------------------------GIDGVKSLKPKQ- ---------R-----LELAKDLG--------------KHS----KAK--------------------------------- -----------------ALRRV-WI--PK--------PGRD----------------------EKRPL------------ ------GIPI----IR-DRAEQALVKQA-LE-PEWEARF-------E--------------------------------- ---GTSYGFRP------------------GR----------------------------SAHDA---IGRI--------- ----YASIN-------Q-----------------GSYY-VL-DA--------DITKCF-DKIN----------------- -HEY--LLSK-LD-------------CCLQ------------------HRRQI---------KQWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------V-----D-N--------------- -------------------------------------------------------------------------------- ---------------G---------I--FED-TE-S-----------GTP-----QGGVI------------------S- PLLANIAL-----------------------------------------------------------DG----------- --------MA--RL-IEELYP----------------------------------------------------------- -----------------KRKGQK-------V---------KATLIR----------YADDFVV-------------ISP- -----EIEI----IN---------------------------QCKIALEN---------------------WL-K----- ----F--V----GLEL------K-P------EK----T---KICHTLREIE-VNGEKVTP-------------------- ----------------GFDFLGFTI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|42716762|locus|VBIArtPla153080_5960|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Arthrospira platensis NIES39] -------------------------------------------------------------------GVDGVKSLSWKG- ---------E-----INLVGDLK-------L------GS-----KAK--------------------------------- -----------------PTRRV-DL--TE--------TVGD----------------------APSRN------------ ------GDQT----IF-DRAGQGLVRLA-LE-PEWEAKF-------E--------------------------------- ---SNCYGFRP------------------GR----------------------------SCHDA---IEAI--------- ----SNHLE-------S-----------------EPKW-VL-DA--------QITKCLGDRIN----------------- -SDR--LLDK-LG-------------TFPR------------------LRRQI---------RAWLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-G--------------- -------------------------------------------------------------------------------- ---------------K---------Q--LFP-TK-E-----------GTS-----QGGII------------------Q- PLLANIAL-----------------------------------------------------------HG----------- --------ME--ED-LLKMAD----------------------------------------------------------- -----------------QLSGRKPVNRN------------ELSVIR----------YADKLVI-------------LHE- -----DGAV----IH---------------------------HCQQVIRE---------------------WL-K----- ----P--W----GLEL------K-P------EP----T---RIAHTLDGED--------A-------------------- ----------------GFDFLGFHI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|42721577|locus|VBIArtPla153080_2008|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Arthrospira platensis NIES39] -------------------------------------------------------------------GVDGVKRLRNQE- ---------K-----LDLANCLK-------L------GR-----KTQ--------------------------------- -----------------GLRRV-SI--PE--------PGRD----------------------EKRAV------------ ------GILM----MM-EKAKQGLVKLA-LE-PEWEARF-------D--------------------------------- ---RNSYGFRP------------------RR----------------------------SAQDA---IAAI--------- ----FNGMK-------E-----------------DHKY-VL-DA--------HIEKCF-EGIY----------------- -HQK--LLAK-LN-------------TYPT------------------LRREI---------KAWLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------M-----D-G--------------- -------------------------------------------------------------------------------- ---------------K---------E--LFP-TE-T-----------DTP-----QGGLM-------------------- PLLANIAL-----------------------------------------------------------DG----------- --------LE--SL-LEDKFQ----------------------------------------------------------- -----------------GGVANCGNG--------------KATVVR----------YADDLVV-------------LDG- -----ELEV----IL---------------------------AAKETIEA---------------------WL-M----- ----E--M----GLKL------K-D------GN----T---RISHTFIEHE---GN---I-------------------- ----------------GWDFLGYNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >E.c.I5/AF074613/58241..60646/Escherichia_extraction coli/CL1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGQTWSSPEV- ---------K-----FLAINLLK-------R------RGY----KPQ--------------------------------- -----------------PLKRV-YI--PK--------SNGK----------------------S-RPL------------ ------GIPT----MK-DRAMQALYLLA-LE-PVAEVTA-------D--------------------------------- ---QRSFGFRT------------------GR----------------------------STADA---IAQC--------- ----FCVLAQK-----T-----------------SAEW-VL-EG--------DIRGCF-DNIS----------------- -HQW--LIDN-T----------------ST------------------DRQIL---------TKWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------R-----E-K--------------- -------------------------------------------------------------------------------- ---------------G---------Q--LFP-VN-S-----------GTP-----QGGII------------------S- PVLANIAL-----------------------------------------------------------DG----------- --------LE--AL-LASEF------------------------------------------------------------ -------------------K-KRTVKGRL--------VNPKVNYVR----------YADDFII-------------TGE- -----SKEL----LES--------------------------QVLPVVRR---------------------FM-A----- ----E--R----GLML------S-P------EK----T---KITHIE------------E-------------------- ----------------GFDFLGQNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|86738873|locus|VBIPseAer240047_2455|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Pseudomonas aeruginosa DK2] -------------------------------------------------------------------GVDGITWSTPEA- ---------K-----SQAMLSIK-------R------RGY----RPQ--------------------------------- -----------------PLKRV-YI--PK--------TNGK----------------------M-RPL------------ ------GIPT----MK-DRAMQALYLLA-LE-PVAETTA-------D--------------------------------- ---GRSFGFRP------------------ER----------------------------STADA---IEQC--------- ----FTTLSKK-----V-----------------APQW-IL-EG--------DIKGCF-DNIS----------------- -HDW--LMGH-V----------------PT------------------DREIL---------RKWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----E-D--------------- -------------------------------------------------------------------------------- ---------------R---------Q--LFP-TE-A-----------GTP-----QGGII------------------S- PTLANLVL-----------------------------------------------------------DG----------- --------LE--AK-LDAAFG----------------------------------------------------------- -------------------R-KRYANGVQ--------TRLMVNYVR----------YADDFIV-------------TGR- -----SKEL----LEQ--------------------------EVMPIIKD---------------------FM-Q----- ----E--R----GLTL------S-P------EK----T---KITHID------------D-------------------- ----------------GFDFLGQNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >A.v.I1/AY057439/1648..4444/Azotobacter_extraction vinelandii/CL1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGETWSTPES- ---------K-----WKAIFRLQ-------R------TGY----RPR--------------------------------- -----------------PLRRV-YI--PK--------ANGQ----------------------R-RPL------------ ------GIPT----ML-DRAMQALYLLA-LE-PVSETTA-------D--------------------------------- ---RNSYGFRP------------------HR----------------------------STADA---IEQL--------- ----FVNLGRK-----H-----------------SAQW-VM-EG--------DIKGCF-DNIS----------------- -HDW--LIAN-V----------------PL------------------DKAVL---------RKWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------L-----E-S--------------- -------------------------------------------------------------------------------- ---------------G---------Q--LNP-TG-A-----------GTP-----QGGII------------------S- PVLANLAL-----------------------------------------------------------DG----------- --------LE--KA-LESRFG----------------------------------------------------------- -------------------Q-RNTKASYK--------T--KVNYVR----------YADDFVI-------------TGI- -----SKEL----LVN--------------------------EVKPVVAA---------------------FM-A----- ----E--R----GLSL------A-A------EK----S---LFTHVS------------E-------------------- ----------------GFDFLGQNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|32030132|locus|VBICitRod33214_3055|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Citrobacter rodentium ICC168] -------------------------------------------------------------------GVDGKLWSTPKT- ---------K-----WEAIFDMK-------R------TGY----HPK--------------------------------- -----------------PLRRV-YI--PK--------SNGK----------------------L-RPL------------ ------GIPT----MR-DRAMQALYLLA-LE-PVSETTA-------D--------------------------------- ---RNSYGFRP------------------MR----------------------------STADA---IEQC--------- ----FVALSRG-----N-----------------SAQW-VL-EG--------DIKGCF-DNIS----------------- -HDW--LLAH-I----------------PM------------------DKQVL---------GKWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----K-S--------------- -------------------------------------------------------------------------------- ---------------G---------H--YHA-TG-A-----------GTP-----QGGII------------------S- PVLANMAL-----------------------------------------------------------DG----------- --------LE--AV-LESRFG----------------------------------------------------------- -------------------V-KNTKASYK--------T--KVNYVR----------YADDFII-------------TGI- -----SQEL----LEN--------------------------EVKPLVEA---------------------FM-A----- ----E--R----GLQL------S-P------EK----T---VITHIE------------Q-------------------- ----------------GFDFLGQNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20838927|locus|VBIAlcBor124741_0664|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Alcanivorax borkumensis SK2] -------------------------------------------------------------------GVDGECWDNPAS- ---------K-----WEAIHRLK-------R------HGY----KPR--------------------------------- -----------------PLRRV-WI--PK--------ANGK----------------------R-RPL------------ ------GIPT----MH-DRAMQALYLLA-LE-PVSETTA-------D--------------------------------- ---RNSYGFRP------------------MR----------------------------ATADA---IEQC--------- ----FVVLGRK-----S-----------------SAQW-VL-EA--------DIQGCF-DNIS----------------- -HDW--LLSH-V----------------PM------------------DKAVL---------GKWLKA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------M-----E-S--------------- -------------------------------------------------------------------------------- ---------------G---------R--THP-TH-A-----------GTP-----QGGII------------------S- PVLANMAL-----------------------------------------------------------DG----------- --------LE--EV-LEAAFG----------------------------------------------------------- -------------------Q-RNTKASYR--------T--KVNYVR----------YADDFVV-------------SGI- -----SREL----LER--------------------------EVRPIVEA---------------------FM-A----- ----E--R----GLAL------S-A------EK----T---VVTHVE------------E-------------------- ----------------GFDFLGQNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|21163595|locus|VBIBorPet31633_1067|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bordetella petrii DSM 12804] -------------------------------------------------------------------GVDRVTWSTPET- ---------K-----SEAVLSLR-------R------HGY----RPR--------------------------------- -----------------PLRRI-YI--PK--------ANGK----------------------K-RPL------------ ------GIPT----MR-DRAMQALYLLA-LE-PIAETTG-------D--------------------------------- ---KDSYGFRP------------------GR----------------------------SVADA---IRQC--------- ----HTVLAWK-----R-----------------SAEW-VL-EA--------DIEGCF-DNIS----------------- -HDW--LAEN-I----------------PM------------------DKAIL---------KSWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----E-S--------------- -------------------------------------------------------------------------------- ---------------G---------S--LFP-TE-A-----------GTP-----QGGII------------------S- PVLANMAL-----------------------------------------------------------DG----------- --------LQ--EV-LGKSFF----------------------------------------------------------- -------------------R-TRRQNKHY--------D-PKVNFVR----------YADDFIV-------------TGY- -----SREL----LEI--------------------------EVLPLVEK---------------------FL-A----- ----A--R----GLNI------S-K------AK----T---RVTHIS------------E-------------------- ----------------GFDFLGKNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|48579126|locus|VBIEscCol159162_5518|_extraction Mobile element protein [Escherichia coli UMNK88] -------------------------------------------------------------------GVDGKTWSKPGS- ---------K-----MKAIYTLK-------R------RGY----KPL--------------------------------- -----------------PLRRI-YI--PK--------SNGK----------------------K-RPL------------ ------GIPT----MK-DRAMQALYLMA-LE-PVAETTA-------D--------------------------------- ---PNSFGFRP------------------CR----------------------------STADA---IEQC--------- ----FTTLHRA-----D-----------------RAQW-IL-EA--------DIRSCF-DEIS----------------- -HEW--LIAN-I----------------PT------------------DTAIL---------KRWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------I-----D-L--------------- -------------------------------------------------------------------------------- ---------------G---------K--LYP-TS-A-----------GTP-----QGGII------------------S- PTLANMVL-----------------------------------------------------------DG----------- --------LQ--PL-LKKTFY----------------------------------------------------------- -------------------R------GGL--------NPEKINIIR----------YADDFVI-------------TGI- -----SHDT----LSE--------------------------KVLPLLEN---------------------FL-A----- ----E--R----GLTL------S-P------EK----T---RITHIS------------D-------------------- ----------------GFDFLGMNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|61234146|locus|VBILegPne178567_1092|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Legionella pneumophila subsp. pneumophila ATCC 43290] -------------------------------------------------------------------GVDGKIWSTPEA- ---------K-----SKAITQLK-------R------RGY----KPY--------------------------------- -----------------PLKRV-YI--PK--------SNNT----------------------K-RPL------------ ------GIPV----MR-DRAMQALYLLA-LE-PVSETTA-------D--------------------------------- ---WNSYGFRP------------------RR----------------------------STHDA---ISHL--------- ----FVMLARK-----G-----------------AAQW-VL-EG--------DIKGCF-DTIS----------------- -HEW--ILNN-V----------------ML------------------DKRML---------QHWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------I-----D-K--------------- -------------------------------------------------------------------------------- ---------------G---------H--LFP-TQ-E-----------GTP-----QGGII------------------S- PTLANLVL-----------------------------------------------------------DG----------- --------LE--TL-LATKFG----------------------------------------------------------- -----------------SLK-HDGHASRT--------SKYQVHFVR----------YADDFVI-------------TGK- -----SKTL----LED--------------------------EVKPLIKD---------------------FL-A----- ----Q--R----GLKL------S-E------QK----T---KVTHIT------------H-------------------- ----------------GFDFLGQNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Chlorobifid|21384938|locus|VBIChlPha121022_1466|_extraction putative reverse transcriptase [Chlorobium phaeobacteroides BS1] -------------------------------------------------------------------GVDNQIWITPKA- ---------K-----TNAVASLK-------R------RGY----KPL--------------------------------- -----------------PLRRI-NI--PK--------KNGK----------------------T-RPL------------ ------GIPT----MK-DRAMQALYLLA-LE-PVAETTA-------D--------------------------------- ---DNSYGFRP------------------WR----------------------------STADA---SARC--------- ----FTCLAQR-----N-----------------SAQW-VL-EA--------DIASCF-DAIS----------------- -HEW--LIDN-I----------------PV------------------DTAIL---------RLWLKA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------V-----L-K--------------- -------------------------------------------------------------------------------- ---------------N---------E--LFP-TE-A-----------GTP-----QGGII------------------S- PVLANMCL-----------------------------------------------------------DG----------- --------LE--KA-LAKAFP----------------------------------------------------------- -----------------QAK-KRGL---------------KMHMVR----------YADDFVI-------------TGN- -----SKEL----LEN--------------------------EVLPVVVE---------------------FL-A----- ----E--R----GLFL------S-P------EK----T---KITHIT------------E-------------------- ----------------GFDFLGWNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >B.t.I1/AE015928/2871095..2873499/Bacteroides_extraction thetaiotaomicron/CL1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDKVKWSTPNA- ---------R-----FKAIGELK-------R------RGY----KPQ--------------------------------- -----------------PLKRV-NI--KK--------SNGK----------------------L-RPL------------ ------GIPT----MK-DRAMQALYLLA-LE-PVSETTA-------D--------------------------------- ---SNSYGFRK------------------ER----------------------------STGDA---REQC--------- ----FCVLAKK-----A-----------------SPEW-IM-EG--------DIQGCF-DHIS----------------- -HEW--LLNN-I----------------PM------------------DKVML---------RKWLKC------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------V-----F-N--------------- -------------------------------------------------------------------------------- ---------------K---------E--LFP-TE-E-----------GTP-----QGGII------------------S- PTLANMTL-----------------------------------------------------------DG----------- --------LQ--TM-LAEKYH----------------------------------------------------------- -----------------KKF-VTRKTTTY--------Y-PKVHLVR----------YADDFII-------------TGR- -----NKEA----LE---------------------------EIKPLVVD---------------------FL-K----- ----E--R----GLTL------S-E------EK----T---KITHID------------D-------------------- ----------------GFDFLGYNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|87116550|locus|VBIAliFin145170_0642|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Alistipes finegoldii DSM 17242] -------------------------------------------------------------------GVDMVTWKTPDA- ---------K-----VCAITELK-------R------RGY----TPQ--------------------------------- -----------------PLRRV-HI--RK--------SNGK----------------------L-RPL------------ ------GIPT----MK-DRAMQALYLMA-LA-PVAETTA-------D--------------------------------- ---ANSYGFRK------------------ER----------------------------STADA---VQQC--------- ----FNDLART-----T-----------------SPQW-IL-EG--------DIKGCF-DHIS----------------- -HEW--LLDN-I----------------PM------------------DKVLL---------RKWLKS------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------I-----F-N--------------- -------------------------------------------------------------------------------- ---------------K---------Q--LFP-TE-E-----------GTP-----QGGII------------------S- PTLANMTL-----------------------------------------------------------DG----------- --------LE--KL-LADSFP----------------------------------------------------------- ---------------------INRSKKNY--------YTPMINLVR----------YADDFII-------------TGE- -----SKEL----LEN--------------------------HVKPLVIE---------------------FL-Q----- ----A--R----GLTL------S-E------EK----T---KITHIE------------E-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|21069184|locus|VBIBacVul85104_2201|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacteroides vulgatus ATCC 8482] -------------------------------------------------------------------GVDKITWSSPLA- ---------K-----AKAIFTLK-------R------HGY----KPQ--------------------------------- -----------------PLKRV-NI--KK--------KNGK----------------------L-RPL------------ ------GIPT----MK-DRAMQALYLMA-LD-PIAETTG-------D--------------------------------- ---SHSYGFRR------------------HR----------------------------CTHDA---IEQC--------- ----YIVLSRS-----V-----------------APEW-IL-EG--------DIKGCF-DHIS----------------- -HAW--LINN-I----------------PM------------------DKEIL---------RKWLEC------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----F-N--------------- -------------------------------------------------------------------------------- ---------------G---------E--LFP-TE-E-----------GTP-----QGGII------------------S- PTLANMAL-----------------------------------------------------------DG----------- --------LQ--DL-LEKSVK----------------------------------------------------------- -----------------KYQ-VNYKKIV-----------PKIHLVR----------YADDFIV-------------TAK- -----DKET----IEQ--------------------------VILPLVRK---------------------FL-A----- ----E--R----GLTL------S-E------EK----T---KITHIN------------E-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Th.e.I1/BA000039/27344..30566/Thermosynechococcus_extraction elongatus/CL1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGITWSTQEQ- ---------K-----TQAIKSLR-------R------RGY----KPQ--------------------------------- -----------------PLRRV-YI--PK--------ANGK----------------------Q-RPL------------ ------GIPT----MK-DRAMQALYALA-LE-PVAETTA-------D--------------------------------- ---RNSYGFRR------------------GR----------------------------CTADA---AGQC--------- ----FLALARA-----K-----------------SAEH-VL-DA--------DISGCF-DNIS----------------- -HEW--LLAN-T----------------PL------------------DKGIL---------RKWLKS------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------V-----W-K--------------- -------------------------------------------------------------------------------- ---------------Q---------Q--LFP-TH-A-----------GTP-----QGGVI------------------S- PVLANITL-----------------------------------------------------------DG----------- --------ME--EL-LAKHL------------------------------------------------------------ -------------------------RGQ------------KVNLIR----------YADDFVV-------------TGK- -----DEET----LE---------------------------KARNLIQE---------------------FL-K----- ----E--R----GLTL------S-P------EK----T---KIVHIE------------E-------------------- ----------------GFDFLGWNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|23927302|locus|VBITheElo119873_1051|_extraction Mobile element protein [Thermosynechococcus elongatus BP1] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -MLANMTL-----------------------------------------------------------DG----------- --------ME--GL-LHKYL------------------------------------------------------------ -------------------------KKY------------KVNLIR----------YADDFVV-------------TGE- -----SRET----LC---------------------------IAAAIIQK---------------------FL-K----- ----E--R----GLTL------S-P------EK----T---KIVHIE------------E-------------------- ----------------GFDFLGWNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|42338528|locus|VBIGalCap53152_0971|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Gallionella capsiferriformans ES2] -------------------------------------------------------------------GVDKVVWDTPEL- ---------K-----AEAVMSLK-------R------KGY----QPQ--------------------------------- -----------------PLRRV-FI--PK--------ANGK----------------------M-RPL------------ ------GIPT----MK-DRAMQALYLQA-LE-PVSETKA-------D--------------------------------- ---PNSYGFRP------------------MR----------------------------ASRDA---AAQC--------- ----FNSLAQK-----Y-----------------AAKW-VL-DA--------DISGCF-DNIN----------------- -HDW--LLNN-I----------------PM------------------DKVTL---------QKWLKS------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------K-----W-N--------------- -------------------------------------------------------------------------------- ---------------G---------Q--LFN-TE-A-----------GTP-----QGGII------------------S- PTLANMTL-----------------------------------------------------------DG----------- --------MA--EM-LQKRFG----------------------------------------------------------- -----------------ATG-SREAAKY------------KVNLIR----------YADDLVI-------------TGT- -----TKEV----LE---------------------------EVRELMAE---------------------FL-K----- ----V--R----GLTL------S-E------EK----T---KIVHIE------------E-------------------- ----------------GFDFLGWNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|42346603|locus|VBIGamPro61291_1949|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [gamma proteobacterium HdN1] -------------------------------------------------------------------GVDKVVWDTPEK- ---------K-----LCAMGDLK-------R------RGY----RPK--------------------------------- -----------------PLKRV-HI--PK--------ANGK----------------------L-RPL------------ ------GIPT----MK-DRAMQALYLLG-LL-PVSETTA-------D--------------------------------- ---GCSYGFRP------------------ER----------------------------SVADA---IERC--------- ----FNALGRR-----D-----------------AAAW-VL-EA--------DIKGCF-DHIS----------------- -HDW--LLGN-V----------------PM------------------DKRVL---------ATWLKC------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------M-----E-K--------------- -------------------------------------------------------------------------------- ---------------A---------V--WFA-TE-A-----------GTP-----QGGII------------------S- PTLANFAL-----------------------------------------------------------DG----------- --------LE--QL-LSKTF------------------------------------------------------------ -------------------Y-RTMRHGKM--------VHPKVHLIR----------YADDFVI-------------TGS- -----SEEL----LVN--------------------------EVKPLVER---------------------FL-A----- ----E--R----GLML------S-A------EK----T---KVTHID------------E-------------------- ----------------GFDFLGQNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >P.p.I2/Y18999/752..2957/Pseudomonas_extraction putida/CL1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGKIWATPAA- ---------K-----SSGMESMR-------H------RSY----RAL--------------------------------- -----------------PLRRI-YI--PK--------SNGQ----------------------K-RPL------------ ------GIPR----ML-CRSMQALWKLA-LE-PVSESLA-------D--------------------------------- ---PNSYGFRP------------------NR----------------------------STADA---IEYC--------- ----FITLAKR-----T-----------------SPVW-VL-EG--------DIRGCF-DNFN----------------- -HEW--MLKN-I----------------PM------------------DKTIL---------RRWLQA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------I-----D-E--------------- -------------------------------------------------------------------------------- ---------------G---------T--LFA-TQ-A-----------GTP-----QGGII------------------S- PVIANMAL-----------------------------------------------------------DG----------- --------LE--AA-VHASVG----------------------------------------------------------- -----------------PTK-RARERS-------------KINVVR----------YADDFVV-------------TGI- -----SKEI----LEH--------------------------SVLPAVRQ---------------------FM-A----- ----I--R----GLEL------S-E------EK----T---KITHIA------------E-------------------- ----------------GFDFLGQNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|19961745|locus|VBIPseStu31643_0668|_extraction Mobile element protein [Pseudomonas stutzeri A1501] -------------------------------------------------------------------GVDGKIWSTPVA- ---------K-----STGAQALQ-------H------RGY----RPQ--------------------------------- -----------------PLRRI-YI--PK--------SNGK----------------------K-RPL------------ ------GIPT----MR-DRAMQALWKLA-LE-PVAETRA-------D--------------------------------- ---PNSYGFRP------------------QR----------------------------STADA---IAHC--------- ----FNALAKR-----G-----------------SAHW-VL-EA--------DIRGCF-DNIS----------------- -HDW--LLTN-V----------------PM------------------DKVVL---------RKWLRA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----D-Q--------------- -------------------------------------------------------------------------------- ---------------G---------A--LFA-TE-A-----------GTP-----QGGII------------------S- PVLANWTL-----------------------------------------------------------DG----------- --------LE--DV-VHASVA----------------------------------------------------------- -----------------STA-RKRKPF-------------KIHVVR----------YADDFII-------------TGA- -----TKAV----LQH--------------------------QVRPAIEA---------------------FL-K----- ----E--R----GLEL------S-D------EK----T---QITHIS------------Q-------------------- ----------------GFDFLGQNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|45180960|locus|VBIBurRhi170666_0329|_extraction Mobile element protein [Burkholderia rhizoxinica HKI 454] -------------------------------------------------------------------GVDGRIWATPMS- ---------K-----LKAAQSLT-------H------RGY----QAL--------------------------------- -----------------PLRRV-YI--PK--------SNGK----------------------E-RAL------------ ------GIPT----MR-DRAMQALWLTA-LL-PIAETTA-------D--------------------------------- ---PNSYGFRP------------------KR----------------------------STADA---VEQC--------- ----FKALAKR-----N-----------------SAQW-VL-EG--------DIRGCF-DNFS----------------- -HDW--LLAN-I----------------PM------------------NKAVL---------RKWLQA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------V-----D-K--------------- -------------------------------------------------------------------------------- ---------------G---------V--LFP-TD-A-----------GTP-----QGAIA------------------S- PVLANMAL-----------------------------------------------------------DG----------- --------LE--EA-VRSVLG----------------------------------------------------------- -----------------PSK-TARQPA-------------KAHVVR----------YADDFIA-------------TGA- -----SREL----LEK--------------------------QVKPAIEA---------------------FL-S----- ----A--R----GLQL------A-S------EK----T---LVTHIA------------R-------------------- ----------------GFDLLGQNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|22088426|locus|VBIHahChe29232_3586|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Hahella chejuensis KCTC 2396] -------------------------------------------------------------------GVDGVLWSTPEA- ---------K-----WAAIGQLK-------R------RGY----RAL--------------------------------- -----------------PLRRV-RI--PK--------ANGK----------------------E-RLL------------ ------GIPT----MQ-DRAMQALYLLA-LQ-PVSETRA-------D--------------------------------- ---RDSYGFRP------------------DR----------------------------STADA---IMQC--------- ----YMLLRKK-----G-----------------SAQW-VL-EA--------DIKGCF-DHID----------------- -HQW--LIDN-V----------------PM------------------DKLML---------RKWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------V-----D-M--------------- -------------------------------------------------------------------------------- ---------------G---------R--VWK-TE-E-----------GTP-----QGGII------------------S- PTLANMAL-----------------------------------------------------------DG----------- --------IE--AL-LAQHFG----------------------------------------------------------- -----------------AK--GSKKLRQY-----------KVGLVR----------YADDFVI-------------TGS- -----SKEL----LEN--------------------------EVKPLIEK---------------------FL-A----- ----V--R----GLKL------S-V------EK----T---QVTHIN------------H-------------------- ----------------GFDFLGWTV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|19847262|locus|VBIPseAer79785_0614|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Pseudomonas aeruginosa UCBPPPA14] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------MQALYLLA-LS-PIAETTG-------D--------------------------------- ---PNSYGFRI------------------ER----------------------------STADA---MSQL--------- ----FVCLSGK-----A-----------------SAQW-IL-EA--------DIQGCF-DHIN----------------- -HDW--LLNH-V----------------PT------------------DKVIL---------RKWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------I-----H-K--------------- -------------------------------------------------------------------------------- ---------------G---------Q--LQA-TD-A-----------GTP-----QGGII------------------S- PTLANMVL-----------------------------------------------------------DG----------- --------LE--SQ-LKRH------------------------------------------------------------- -------------------L-GVTRAKKL-----------KLNVVR----------YADDFVI-------------TGV- -----SPEV----LEK--------------------------EVKPWVEQ---------------------FL-A----- ----V--R----GLQL------S-L------EK----T---RIAHID------------Q-------------------- ----------------GFDFLGWNF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|42537310|locus|VBIRalSol167236_2271|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Ralstonia solanacearum PSI07] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------------ML-DRAMQALYLLA-LE-PVSEGTS-------D--------------------------------- ---PNSYGFRI------------------NR----------------------------STADA---MSQL--------- ----FVSLSQK-----A-----------------SAQW-VL-EA--------DIKGCF-DHIS----------------- -HDW--LECN-V----------------HM------------------DKAIL---------RKWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------V-----F-Q--------------- -------------------------------------------------------------------------------- ---------------G---------Q--FQA-TE-A-----------GTP-----QGGII------------------S- PTLANVAL-----------------------------------------------------------NG----------- --------LE--NQ-LFAHLR----------------------------------------------------------- -----------------AKL-GAVKTKKL-----------KVNVVR----------YADDFVI-------------TGS- -----TPEL----LED--------------------------EIKPWVER---------------------FL-A----- ----V--R----GLSL------S-T------EK----T---RIVNIT------------E-------------------- ----------------GFDFLGWNF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|21165521|locus|VBIBorPet31633_2025|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bordetella petrii DSM 12804] -------------------------------------------------------------------GVDRVLWDSPES- ---------K-----WEAIGRLR-------Q------PGY----RPL--------------------------------- -----------------PLRRV-YI--PK--------SNGK----------------------E-RPL------------ ------GIPT----MR-DRAMQALYLLA-LE-PVSESTS-------D--------------------------------- ---PNSYGFRK------------------GR----------------------------STADA---MAQI--------- ----FVTLSGR-----A-----------------SAQW-IL-EA--------DIKGCF-DWIN----------------- -HEW--LLAN-V----------------PM------------------DRRVL---------RKWLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------I-----H-K--------------- -------------------------------------------------------------------------------- ---------------G---------Q--LQP-TT-A-----------GTP-----QGGII------------------S- PTLANVTL-----------------------------------------------------------NK----------- --------LE--TD-LAEYLG----------------------------------------------------------- -----------------TKL-GWTKAKRL-----------KVHVVR----------YADDFIV-------------TGA- -----SKDV----LDT--------------------------EVRPWIER---------------------FL-A----- ----V--R----GLQL------S-T------EK----T---RIIHID------------E-------------------- ----------------GFDFLGWNF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|54795537|locus|VBIStrPas183593_1131|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Streptococcus pasteurianus ATCC 43144] -------------------------------------------------------------------GVDKELWLTPNA- ---------K-----YQAIKKLK-------V------RGY----CPK--------------------------------- -----------------PLRRI-YI--PK--------KNGK----------------------K-RPL------------ ------SIPT----MT-DRAMQTLFKFA-LE-PIAETTA-------D--------------------------------- ---PNSYGFRP------------------KR----------------------------STQDA---IEQC--------- ----FLALSKQ-----K-----------------SAKW-VL-EG--------DIKGCF-DNIS----------------- -HEW--IMKN-I----------------PM------------------NKTIL---------GKWLKS------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------I-----E-N--------------- -------------------------------------------------------------------------------- ---------------Q---------K--LFP-TE-L-----------GSP-----QGSPI------------------S- PIISNMVL-----------------------------------------------------------DG----------- --------LE--RK-LSATF------------------------------------------------------------ -------------------R-KKKVNGKV--------YTPKINFVR----------YADDFIV-------------TGV- -----SKEL----LEN--------------------------EVKPVIIE---------------------FL-K----- ----E--R----GLEL------S-E------EK----T---LITHIT------------D-------------------- ----------------GFDFLGINI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|42436540|locus|VBIStrPne98725_0947|_extraction Mobile element protein [Streptococcus pneumoniae AP200] -------------------------------------------------------------------GVDGELWLTPQA- ---------K-----YKAIEKLN-------L------RGY----KPK--------------------------------- -----------------PLKRV-YI--PK--------KNGK----------------------K-RPL------------ ------SIPT----MT-DRAMQTLYKFA-LE-PIAETTA-------D--------------------------------- ---PNSYGFRA------------------KR----------------------------CTQDA---IEQC--------- ----FTSLNKK-----K-----------------SAKW-VL-EG--------DIKGCF-DNIS----------------- -HEW--ILNN-I----------------PM------------------NKKLL---------KLWLEC------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------I-----E-K--------------- -------------------------------------------------------------------------------- ---------------Q---------K--LFP-TE-T-----------GSP-----QGSPI------------------S- PIISNMVL-----------------------------------------------------------DG----------- --------LE--KA-IKEKY------------------------------------------------------------ -------------------H-RRTVNKKT--------YFPKVNFAR----------YADDFIV-------------TGE- -----SAEL----LEN--------------------------GVKPIIVK---------------------FL-A----- ----E--R----GLEL------S-E------EK----T---LITHIN------------D-------------------- ----------------GFDFLGVNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|48578553|locus|VBIEscCol159162_5238|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Escherichia coli UMNK88] -------------------------------------------------------------------GVDGEIWQHPES- ---------K-----WSAITRLK-------R------SGY----HPL--------------------------------- -----------------PLRRI-YI--PK--------ANGK----------------------F-RAL------------ ------GIPT----ML-DRAMQALYLMA-LE-PLSEITA-------D--------------------------------- ---HHSYGFRP------------------MR----------------------------STADA---IEQV--------- ----FNACGKK-----A-----------------SAEW-IL-EG--------DIRGCF-DNLS----------------- -HEW--LVSH-I----------------PM------------------DRMVL---------RNWLKS------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------C-----E-G--------------- -------------------------------------------------------------------------------- ---------------M---------S--FYP-TK-G-----------GTP-----QGGII------------------S- PTLMNMAL-----------------------------------------------------------DG----------- --------LQ--SL-LERRFP----------------------------------------------------------- -----------------STT-VQGRKA-------------KIHLVR----------YADDFVI-------------TGA- -----TAEL----LRN--------------------------DVMPIV-------------------------------- -------------------------------------------------------------------------------- -----------------IDFLGDAA------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|115293380|locus|VBIRhiTro150571_4588|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Rhizobium tropici CIAT 899] -----------------------------------------------------------GNQGKRTPGVDRIIWDTPDK- ---------C-----VRGLLSLK-------R------RGY----HPL--------------------------------- -----------------PLRRV-YI--PK--------ANSK----------------------KLRPL------------ ------GIPT----MK-DRAMQALHLLA-LL-PVAETTA-------D--------------------------------- ---PNSYGFRP------------------YR----------------------------ATRDA---ARQC--------- ----FIALRGR-----G-----------------TAEW-VL-DA--------DIAGCF-DEIS----------------- -KDW--LIAN-I----------------PM------------------DKVVL---------RKWLDS------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------I-----K-D--------------- -------------------------------------------------------------------------------- ---------------G---------D--WHA-TK-A-----------GTP-----QGGII------------------S- PTLANMAL-----------------------------------------------------------DG----------- --------ME--KM-LRDFYG----------------------------------------------------------- -----------------PRR-RNSLT--------------KVHLIR----------YADDFVV-------------TGA- -----SKEV----LE---------------------------EAKSMVEE---------------------FL-S----- ----E--R----GLSL------S-E------EK----T---RIVRVE------------E-------------------- ----------------GFDFLGWNVR------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Gfid|54574176|locus|VBIGlaSp182133_0425|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Glaciecola sp. 4H37+YE5] -------------------------------------------------------------------GIDGIVWINTKQ- ---------K-----WEAAKALS-------C------RNY----RSQ--------------------------------- -----------------PLRRV-YI--PK--------KNGK----------------------K-RPL------------ ------GIPT----MF-DRAMQALFLLA-YE-PVAEVTA-------D--------------------------------- ---HHSYGFRP------------------KR----------------------------SAADA---IEKC--------- ----FNVLAQK-----T-----------------SAQW-IL-EG--------DIKGCF-DNIS----------------- -HTW--LHQH-L----------------KL------------------EQKVL---------NQWLKA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------M-----D-K--------------- -------------------------------------------------------------------------------- ---------------G---------R--LFP-TT-A-----------GTP-----QGGII------------------S- PCLSNGAL-----------------------------------------------------------DG----------- --------ME--AM-LKSITK----------------------------------------------------------- -----------------PIQ--------------------KVHLIR----------YADDFVI-------------TAN- -----SKEL----LEN--------------------------TIKPAEMA---------------------FL-F----- ----E--R----GLTL------S-K------EK----T---LITSIT------------K-------------------- ----------------GFDFLGFNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|184905921|locus|VBILegPne304526_2043|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Legionella pneumophila str. 121004] -------------------------------------------------------------------GIDGVVWTTSEE- ---------K-----CEAVRNLK-------A------RGY----KAT--------------------------------- -----------------PLRRI-YI--PK--------KNGK----------------------E-RPL------------ ------SIPT----LK-DRAMQALYLLA-LE-PVGETTA-------D--------------------------------- ---LNSYGFRP------------------KR----------------------------STHDA---IYQC--------- ----YATLARK-----N-----------------CAQW-IL-EG--------DIKACF-DEID----------------- -HGW--LKSN-I----------------II------------------DQRVL---------TQWLQA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------M-----E-K--------------- -------------------------------------------------------------------------------- ---------------N---------Q--LFE-TA-R-----------GTP-----QGGPA------------------S- PLLANMVL-----------------------------------------------------------DG----------- --------LE--RE-IHSGCG----------------------------------------------------------- -----------------QGN--------------------KINYIR----------FADDFIV-------------TAN- -----SPDI----LKE--------------------------KVMPIISN---------------------FL-A----- ----Q--R----GLSL------S-Q------EK----T---KIVHIE------------E-------------------- ----------------GFDFLGFNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Sh.sp.I1/CP000446/2526748..2528903/Shewanella_extraction sp./CL1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGVIWNTDAR- ---------R-----IAAVKQLK-------R------KAY----QAK--------------------------------- -----------------PLKRI-YI--PK--------KNGK----------------------L-RPL------------ ------GIPC----MI-DRAQQALHLLA-LE-PISETVA-------D--------------------------------- ---PNSYGFRP------------------HR----------------------------STADA---IAQC--------- ----FLCLSQR-----Y-----------------SSEW-VL-EG--------DIKACF-DKIG----------------- -HQW--LIDN-I----------------AL------------------DKKML---------RQWLEC------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------M-----D-K--------------- -------------------------------------------------------------------------------- ---------------G---------L--FYR-TD-E-----------GTP-----QGGII------------------S- PTLMLLTL-----------------------------------------------------------SG----------- --------LE--QL-LKATAR----------------------------------------------------------- -----------------RKG-C------------------NVNFIG----------YADDFVV-------------TGS- -----SKEV----LVN--------------------------EIKPLIAR---------------------FL-A----- ----E--R----GLTL------S-E------EK----T---HVTHIN------------D-------------------- ----------------GFDFLGFNL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|58933841|locus|VBISheBal147952_0958|_extraction Mobile element protein [Shewanella baltica OS678] -------------------------------------------------------------------GIDGIIWNTDAR- ---------R-----MKAVNQLS-------R------KAY----IAK--------------------------------- -----------------PLKRI-YI--PK--------KNGK----------------------L-RPL------------ ------GIPC----MI-DRAQQALHLLA-LE-PVSETLA-------D--------------------------------- ---PNSYGFRP------------------NR----------------------------STADA---VDQC--------- ----FKCLAQK-----K-----------------SAQW-VL-EG--------DIKACF-DKIG----------------- -HQW--LLDN-I----------------TV------------------DKRML---------EQWLKS------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------M-----D-K--------------- -------------------------------------------------------------------------------- ---------------G---------L--FYR-TD-E-----------GTP-----QGGVI------------------S- PSLMLMTL-----------------------------------------------------------AG----------- --------LE--QH-IKSTAL----------------------------------------------------------- -----------------KKG-T------------------RANFIG----------YADDFVV-------------TCA- -----SKEV----LEN--------------------------DIKPLITD---------------------FL-A----- ----E--R----GLTL------S-E------EK----T---HITHIN------------D-------------------- ----------------GFDFLGFNH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|54183625|locus|VBISheBal163160_2541|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Shewanella baltica OS117] -------------------------------------------------------------------GIDGIIWNSDAR- ---------C-----MTAVNQLS-------R------KGY----HAK--------------------------------- -----------------PLRRI-YI--PK--------KNGK----------------------L-RPL------------ ------GIPC----MI-DRAQQALHLLA-LE-PISETVA-------D--------------------------------- ---LNSYGFRP------------------NR----------------------------SAADA---IAQC--------- ----FKCLCMK-----R-----------------SSQW-VL-EG--------DIKACF-DKIG----------------- -HQW--LIDN-I----------------QL------------------DKRML---------KQWLGC------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------V-----D-K--------------- -------------------------------------------------------------------------------- ---------------G---------L--FYK-TA-E-----------GTP-----QGGII------------------P- PTLMLLTL-----------------------------------------------------------AG----------- --------LE--QL-VKSIAC----------------------------------------------------------- -----------------KTG-N------------------SVNFIG----------YADDFII-------------TGS- -----SKEV----LVN--------------------------EIKPQLIG---------------------FL-Q----- ----E--R----GLTL------S-D------DK----T---HITHID------------D-------------------- ----------------GFDFLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|35297935|locus|VBIXenBov95754_1334|_extraction Mobile element protein [Xenorhabdus bovienii SS2004] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------------MI-DRTQQALHLLA-LD-PISETIA-------D--------------------------------- ---PNSYGFRP------------------NR----------------------------STADA---IAQC--------- ----FKCLCQK-----R-----------------SARW-VL-EE--------DLKACF-DKIG----------------- -YQW--LIEN-I----------------QI------------------DKRML---------KQWLGS------------ -------------------------------------------------------------------------------- ------------------DF------------------------------------I-----D-K--------------- -------------------------------------------------------------------------------- ---------------G---------L--FYR-TA-E-----------GTP-----QGGII------------------S- PTLMLLTL-----------------------------------------------------------AG----------- --------LE--KR-VKEVAR----------------------------------------------------------- -----------------KTD-D------------------RINSIE----------YADNFVM-------------TGA- -----SEDV----LLN--------------------------EVKPQLID---------------------FL-R----- ----E--R----GLTL------S-E------EK----T---HITHIN------------D-------------------- ----------------GFDFLGFNL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|115626437|locus|VBIFibAes90597_0767|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Fibrella aestuarina] -------------------------------------------------------------------GVDGQLWTNPPR- ---------K-----RQAIDELR-------S------RGY----RPQ--------------------------------- -----------------PLKRI-YI--PK--------RNGK----------------------Q-RPL------------ ------SIPT----MK-DRAMQALHLMA-LQ-PVSETTA-------D--------------------------------- ---PCSFGFRP------------------AR----------------------------QVADA---VERC--------- ----FGLLSRQ-----D-----------------SPQW-VL-EA--------DIEACF-DRID----------------- -HDW--LLQH-I----------------PM------------------EKTIL---------GQWLKA------------ -------------------------------------------------------------------------------- ------------------GY------------------------------------I-----E-K--------------- -------------------------------------------------------------------------------- ---------------G---------N--WWP-TT-E-----------GTP-----QGGII------------------S- PVLANMAL-----------------------------------------------------------DG----------- --------LA--KE-LAAHFA----------------------------------------------------------- -------------------K-SYKRPDRG--------FNPKVRLVR----------YADDFII-------------TGI- -----SRQQ----LEE--------------------------QVKPVVCN---------------------FL-S----- ----K--R----GLRL------S-E------SK----T---RQTAIT------------E-------------------- ----------------GFDFLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|24029979|locus|VBIWolEnd21207_0693|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Wolbachia endosymbiont of Drosophila melanogaster] -----------------------------------------------------------ENQGKNTAGVDRQIWSTCNT- ---------K-----FQGIKLLK-------Q------RGY----KPS--------------------------------- -----------------PLKRI-YI--SK--------SNGK----------------------R-RPL------------ ------GIPT----IK-DRAMQALYLFA-LE-PIAETIS-------D--------------------------------- ---RHSYGFRP------------------KR----------------------------SCADA---TVAC--------- ----HLLLASR-----N-----------------QLQW-IL-KG--------DIKWCF-DNIN----------------- -HEW--LMKH-I----------------PM------------------EKKIL---------HSWLKA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------L-----E-S--------------- -------------------------------------------------------------------------------- ---------------K---------T--LYS-TT-A-----------GTP-----QGSII------------------S- PILANLAL-----------------------------------------------------------NG----------- --------LE--KS-LESQFG----------------------------------------------------------- -----------------KLG-SKRRSKIR--------S--GVNVIR----------YADDFII-------------SGI- -----TREV----LEN--------------------------EVKPLVSS---------------------FL-Q----- ----E--R----GLIL------S-E------EK----T---KITSIT------------T-------------------- ----------------GFDFLGCNVR------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Gfid|189809342|locus|VBIAltMac287461_0611|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Alteromonas macleodii str. 'Ionian Sea U4'] -------------------------------------------------------------------GIDGETWQSATK- ---------K-----WRAISSLK-------R------SGY----KAS--------------------------------- -----------------PLRRV-FI--PK--------SNGQ----------------------R-RPL------------ ------GIPT----ML-DRGMQALYLLA-VE-PEVETNS-------D--------------------------------- ---GNSFGFRK------------------QR----------------------------SCADA---IEQC--------- ----FKVLCRK-----G-----------------AGEC-VL-DA--------DIKGCF-DNIS----------------- -HEW--MLKH-L----------------SI------------------DKPIL---------SQWLNA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------M-----E-S--------------- -------------------------------------------------------------------------------- ---------------G---------K--VYP-TL-A-----------GTP-----QGGII------------------S- PTLMNMVL-----------------------------------------------------------NQ----------- --------LQ--GT-IEEASG----------------------------------------------------------- -----------------VKR-GKHREIRS--------NVKRVSVIR----------YADDFVV-------------TAH- -----SQAF----LVD--------------------------TILPCINE---------------------FM-S----- ----Q--R----GLAL------S-P------EK----T---HVRHIS------------E-------------------- ----------------GFDFLGQNL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|115641574|locus|VBIThiNit264030_3543|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Thioalkalivibrio nitratireducens DSM 14787] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------MQRC--------- ----FLSLAKR-----S-----------------SAEW-IL-EG--------DIRACF-DAFD----------------- -HDW--LIEH-T----------------PT------------------DQGRL---------RAWLKS------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------M-----E-Q--------------- -------------------------------------------------------------------------------- ---------------R---------R--IFP-TE-R-----------GTA-----QGGII------------------S- PTVANMVL-----------------------------------------------------------DG----------- --------LE--GR-IRARFK----------------------------------------------------------- -----------------RRG--------------------KVNLIR----------FADDFVI-------------TGE- -----SRAI----LEN--------------------------DVTPLVTE---------------------FL-H----- ----E--R----GLVL------A-P------EK----T---RIVHID------------D-------------------- ----------------GFDFLGFRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Ms.b.I1/NZ_AAAR02000002/377828..379992/Methanosarcina_extraction barkeri/CL1/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGEKWLSSAS- ---------K-----MKAVLSLT-------G------KRY----KAK--------------------------------- -----------------PLKRV-FI--NK--------PGKT----------------------KKRPL------------ ------GIPT----MY-DRAIQSLYSLA-LE-PVAEIKS-------D--------------------------------- ---LRSFGFRK------------------HR----------------------------STKDA---CQQI--------- ----FLCLSKK-----T-----------------SAQW-IL-EG--------DIRGCF-DNIN----------------- -HQW--LLTN-I----------------PI------------------DKAIL---------TQFLKA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------I-----Y-K--------------- -------------------------------------------------------------------------------- ---------------R---------H--LNP-TK-A-----------GTP-----QGGII------------------S- PILANMTL-----------------------------------------------------------DG----------- --------IE--KM-LLVKYP----------------------------------------------------------- -----------------KKG--------K--------NSKKVNFIR----------YADDFIV-------------TAN- -----SKET----AG---------------------------EIKDEVVA---------------------FL-K----- ----E--R----GLEL------S-D------DK----T---FITNIN------------E-------------------- ----------------GFDFLGWNF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|161805880|locus|VBICloPas18034_1667|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium pasteurianum BC1] -------------------------------------------------------------------GVDKELWSTTAS- ---------K-----MQAVLSLT-------D------KNY----KAK--------------------------------- -----------------PLRRV-YI--EK--------KGKK----------------------AKRPL------------ ------GIPC----MY-DRAMQALYALA-LD-PVSEVTA-------D--------------------------------- ---TKSFGFRK------------------NR----------------------------CCQDA---CEYI--------- ----FTALSRE-----N-----------------CAKW-IL-EG--------DIKACF-DYIS----------------- -HEW--LIEN-I----------------PM------------------DKSVL---------KQFLKA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------V-----F-E--------------- -------------------------------------------------------------------------------- ---------------N---------E--LFP-TD-D-----------GTP-----QGGVI------------------S- PILANMAL-----------------------------------------------------------DG----------- --------MQ--KA-LSDRFH----------------------------------------------------------- -----------------TNKLGRVDNRFQ--------IANKVYLVR----------YADDFIV-------------TAA- -----TKEI----AE---------------------------EAKELIRE---------------------FL-Q----- ----T--R----GLEL------S-E------EK----T---KITHIN------------D-------------------- ----------------GFDMLGWTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|19653441|locus|VBIStrEqu35012_1915|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Streptococcus equi subsp. zooepidemicus] -------------------------------------------------------------------GIDGELWTTPAQ- ---------K-----MEALLSLT-------D------KGY----KAS--------------------------------- -----------------PLRRV-YI--DK--------KGKK----------------------KKRPL------------ ------GIPT----MY-DRAMQALYALA-LE-PIAETTA-------D--------------------------------- ---TKSFGFRK------------------GR----------------------------SCQDA---CEYI--------- ----FTALSRK-----A-----------------SPQW-IL-KG--------DIKGCF-DNIS----------------- -HDW--LLEN-I----------------PM------------------DKSIL---------KQFLKA------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------V-----F-K--------------- -------------------------------------------------------------------------------- ---------------G---------E--LFP-TE-D-----------GTP-----QGGII------------------S- SILANMAL-----------------------------------------------------------DG----------- --------LQ--QV-LSDRFH----------------------------------------------------------- -----------------TNRLGRIDFRFK--------NSHKVNLVR----------YADDFIV-------------TAA- -----TQEI----AL---------------------------EAKELIRE---------------------FL-I----- ----G--R----GLEL------S-E------EK----T---LVTHIN------------D-------------------- ----------------GFDLLGWNF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|161806116|locus|VBICloPas18034_1785|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium pasteurianum BC1] -------------------------------------------------------------------GVDKKLWSTSAS- ---------K-----IKAVLTLT-------D------KQY----RTK--------------------------------- -----------------PLKRV-YI--KK--------KGKN----------------------KKRPL------------ ------GIPT----MY-DRAMQTLYALA-LE-PVAEVTG-------D--------------------------------- ---HISFDFRK------------------GR----------------------------SAKDA---CEQT--------- ----FCVLSRK-----C-----------------SPTW-IL-EG--------DIKGCF-DNIN----------------- -HDW--LQKN-I----------------PM------------------DKRIM---------KQFLKS------------ -------------------------------------------------------------------------------- ------------------GF------------------------------------I-----Y-E--------------- -------------------------------------------------------------------------------- ---------------G---------N--LFP-TD-T-----------GSP-----QGGAI------------------S- SLYANMTL-----------------------------------------------------------DG----------- --------LE--KL-IQDKYH----------------------------------------------------------- -----------------RNSKGKIENHYR--------AKTKVNMVR----------YADDFII-------------TAN- -----TKEI----AE---------------------------ELKDIVSK---------------------FL-K----- ----N--R----GLNL------S-Q------EK----T---TITHID------------Y-------------------- ----------------GFDFLGWTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >N.sp.I2/BA000020/259212..261419/Nostoc_extraction sp./CL2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGVKSLDFNG- ---------R-----FELEITLK-------Q------SSGNW--HHQ--------------------------------- -----------------ELREI-PI--PK--------KDG-----------------------TTRML------------ ------KIPT----IA-DRCWQCLAKYA-LE-PAHEATF-------H--------------------------------- ---ARSYGFRT------------------GR----------------------------AAHDA---QQFL--------- ----FSNLS-SKAK--R-----------------ISKR-VI-EL--------DIEKCF-DRIN----------------- -HST--IMEN-LI-------------APKG------------------IKLGI---------YRCLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------N-----P-E--------------- -------------------------------------------------------------------------------- ----------------------------FP---E-Q-----------GTP-----QGGVV------------------S- PLLANIAL-----------------------------------------------------------NG----------- --------IE--SI-HRYHKD----------------------------------------------------------- ------------------NQRITNKTPES--------DIRY-PSVR----------YADDMVI-------------VLR- -----PQDD----AN---------------------------EILAKIED---------------------FL-N----- ----A--R----GMKV------S-A------KK----T---KITATT------------D-------------------- ----------------GFDFLGWHI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115683516|locus|VBIOscNig7962_8018|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Oscillatoria nigroviridis PCC 7112] -------------------------------------------------------------------GIDGRASLTFEE- ---------R-----LALSEELR-------A------KSNNW--KHQ--------------------------------- -----------------KLRSI-PI--PK--------KDG-----------------------STRLL------------ ------KIPT----IA-DRAWQCLAKYA-LE-PAHEATF-------H--------------------------------- ---ARSYGFRT------------------GR----------------------------SAHDA---QKFL--------- ----FLNLS-SKAH--G-----------------ISKR-VI-EL--------DIEKCF-DRIS----------------- -HTS--IMER-LI-------------APKG------------------IKTGI---------FRCLKS------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------N-----P-G--------------- -------------------------------------------------------------------------------- ----------------------------FP---E-Q-----------GTP-----QGGVV------------------S- PLLANIAL-----------------------------------------------------------NG----------- --------IE--EI-HR--------------------------------------------------------------- -------------------------------------------SVR----------YADDMVI-------------ILK- -----PKDD----AK---------------------------AILDKVSE---------------------FL-A----- ----A--R----GMKV------S-E------KK----T---KLTATT------------D-------------------- ----------------GFDFLGWHF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115430450|locus|VBICriEpi239080_1694|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Crinalium epipsammum PCC 9333] -------------------------------------------------------------------GIDGKASLNHEE- ---------R-----FALSEELR-------T------RSSKW--KHQ--------------------------------- -----------------KLREI-PI--PK--------KDG-----------------------TTRLL------------ ------KVPT----IG-DRAWQCLVKLA-LE-PAHEATF-------H--------------------------------- ---AKSYGFRT------------------GR----------------------------AAHDA---QKYL--------- ----FDHLR-STSH--G-----------------IEKR-VI-EL--------DIEKCF-DRIA----------------- -HKS--IMER-LI-------------APSG------------------IKLGI---------YRCLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------N-----P-E--------------- -------------------------------------------------------------------------------- ----------------------------FP---E-Q-----------GTP-----QGGVV------------------S- PLLANIAL-----------------------------------------------------------NG----------- --------IE--DI-HQ--------------------------------------------------------------- -------------------------------------------SVR----------YADDMVF-------------ILK- -----PKDD----AV---------------------------AILEQISQ---------------------FL-A----- ----E--R----GMKI------S-E------KK----T---KLTATT------------D-------------------- ----------------GFDFLGWHF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115337801|locus|VBIAnaCyl106394_6267|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Anabaena cylindrica PCC 7122] -------------------------------------------------------------------GIDGKTALTFEQ- ---------R-----FQLSEKLR-------T------EANNW--KHQ--------------------------------- -----------------GLREI-PI--PK--------KDG-----------------------KTRIL------------ ------KVPT----IA-DRAYQCLVKYA-LE-PAHEATF-------H--------------------------------- ---ARSYGFRT------------------GR----------------------------SAQDA---QKYL--------- ----YTNLN-SSVN--G-----------------IEKR-VI-EL--------DIEKCF-DRIN----------------- -HTA--IMDR-LI-------------APYS------------------IRLGI---------FRCLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------N-----P-E--------------- -------------------------------------------------------------------------------- ----------------------------FP---E-Q-----------GTP-----QGGVV------------------S- PLLANIAL-----------------------------------------------------------NG----------- --------IE--SI-HRYHIQ----------------------------------------------------------- ------------------GLRITNKTKGY--------KIVE-PSVR----------YADDMII-------------ILR- -----PEDD----AK---------------------------EILDKISR---------------------FL-A----- ----E--R----GMKV------S-E------KK----T---KLTATT------------D-------------------- ----------------GFDFLGWHF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115514952|locus|VBICalSp227687_3172|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Calothrix sp. PCC 6303] -------------------------------------------------------------------GIDGKKSLTFRE- ---------R-----FELSELLK-------A------SCNNW--KHQ--------------------------------- -----------------GLREI-PI--PK--------KDG-----------------------TTRML------------ ------KIPT----MA-DRAWQCLAKYA-LE-PAHEATF-------H--------------------------------- ---ARSYGFRS------------------GR----------------------------SAHDA---QTVL--------- ----LTHLR-SNNN--G-----------------INKR-VI-EL--------DIEKCF-DRIS----------------- -HTS--IMEN-LI-------------APKG------------------VKLGI---------FRCLKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------N-----P-E--------------- -------------------------------------------------------------------------------- ----------------------------FP---E-Q-----------GTP-----QGGVV------------------S- PLLANIAL-----------------------------------------------------------NG----------- --------IE--SI-HRYHRN----------------------------------------------------------- ------------------GSKITNKTAGK--------DITE-PSIR----------YADDMVI-------------IIR- -----PQDD----AQ---------------------------KILADIDS---------------------FL-A----- ----A--R----GMKV------S-E------KK----T---KITAAT------------D-------------------- ----------------GFDFLGWHF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >C.w.I6/NZ_AADV02000041/1584..4153/Crocosphaera_extraction watsonii/CL2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GIDGIKSLNFKQ- ---------R-----FALAERL--------L------KAHDW--KHS--------------------------------- -----------------KLREI-PI--PK--------KDG-----------------------TTRML------------ ------KVPT----MA-DRAWQCLVKYA-LE-PAHEALF-------H--------------------------------- ---ARSYGFRP------------------GR----------------------------STHDA---QKIL--------- ----FLNLK-SDSN--G-----------------LNKR-IL-EL--------DIEKCF-DRIN----------------- -HTS--IMER-VI-------------APQT------------------IKTGI---------WRCLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------N-----P-E--------------- -------------------------------------------------------------------------------- ----------------------------FP---E-Q-----------GTP-----QGGVV------------------S- PLLANVAL-----------------------------------------------------------DG----------- --------IE--DI-HY--------------------------------------------------------------- -------------------------------------------SIR----------YADDMVV-------------ILK- -----PKDD----AD---------------------------KILKDIQE---------------------FL-A----- ----A--R----GLKV------S-E------KK----T---KLVRAT------------E-------------------- ----------------GFDFLGWHF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115603115|locus|VBIRivSp77222_2588|_extraction reverse transcriptase homolog [Rivularia sp. PCC 7116] -------------------------------------------------------------------GIDGKKSLTFEE- ---------R-----FALEELLK-------A------KSSKW--KHQ--------------------------------- -----------------KLRAI-PI--PK--------KDGT----------------------TTRLL------------ ------KIPT----LA-DRCWQCLAKYA-LE-PAHEATF-------H--------------------------------- ---KHSYGFRT------------------GR----------------------------SAHDA---QKQV--------- ----FQNLK-SSSN--G-----------------INKR-IL-EL--------DIEKCF-DRIN----------------- -HSS--IISN-LI-------------APNR------------------LKLGI---------FRCLKV------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------N-----P-D--------------- -------------------------------------------------------------------------------- ----------------------------FP---E-Q-----------GTC-----QGGVV------------------S- PLLANIAL-----------------------------------------------------------NG----------- --------IE--EL-HKYHTN----------------------------------------------------------- -----------------KGRKIKATTPEK--------DINT-ACVR----------YADDMVF-------------FLR- -----PEDD----EK---------------------------EILDNISQ---------------------FL-A----- ----K--R----GLKV------S-E------KK----T---KLTAST------------F-------------------- ----------------GFDFLGWHF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115549836|locus|VBIAnaSp49473_5321|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Anabaena sp. 90] -------------------------------------------------------------------GVDGKASLTYKE- ---------R-----VELDKLLM-------E------QVNTW--THS--------------------------------- -----------------KLREI-PI--PK--------KDG-----------------------TKRIL------------ ------KVPT----IK-DRAWQCIIKYT-IE-PAHEAIF-------H--------------------------------- ---ERSYGFRP------------------GR----------------------------STHDA---QKYL--------- ----FDNLR-SQSH--G-----------------KDKI-IL-EM--------DIEKCF-DRIS----------------- -HNH--LMSQ-II-------------APQS------------------VKLGV---------WKCLKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------N-----P-E--------------- -------------------------------------------------------------------------------- ----------------------------FP---E-Q-----------GTP-----QGGVC------------------S- PLLANIAL-----------------------------------------------------------HG----------- --------IE--AI-HK--------------------------------------------------------------- -------------------------------------------SVR----------YADDMVF-------------IFK- -----KGDD----QA---------------------------KVFDEITE---------------------FL-R----- ----I--R----GLNI------K-T------AK----T---RFVPAT------------T-------------------- ----------------GFNFLGWKF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|22782216|locus|VBINosSp37423_6520|_extraction reverse transcriptase homolog [Nostoc sp. PCC 7120] -------------------------------------------------------------------GCGESRTPRF--- ---------------------------------------------NR--------------------------------- -----------------EVRRI--I--PP--------IDS---------------------------------------- --------------------NQCLAKYA-LE-PAHEATF-------H--------------------------------- ---EHSYGFRP------------------GR----------------------------STHDA---QSQI--------- ----ANYLA-SSKG--G-----------------INKR-IL-EL--------DIEKCF-DRIN----------------- -HST--IMSN-LI-------------APQG------------------LKQGI---------FRALKA------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------N-----P-E--------------- -------------------------------------------------------------------------------- ----------------------------FP---E-Q-----------GTP-----QGGVV------------------S- PLLANIAL-----------------------------------------------------------NG----------- --------IE--DL-HQYHDC----------------------------------------------------------- -----------------NYKKITPSTPER--------NIKK-ACVR----------YADDMVF-------------FLR- -----PEDD----AE---------------------------EILEKISQ---------------------FL-A----- ----Q--R----GLKI------S-E------KK----T---KLTAST------------D-------------------- ----------------GFDFLGWNF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|22782374|locus|VBINosSp37423_6599|_extraction reverse transcriptase homolog [Nostoc sp. PCC 7120] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------M------------ ------EIGS----IK-D-------------------------------------------------------------- ------YG------------------------------------------------------------------------ -------------------------------------------------------------------------------- ------------K-------------SPSP------------------KRTGL--------------------------- -------------------------------------------------------------------------------- --------------------------------------------------------------P-E--------------- -------------------------------------------------------------------------------- ----------------------------FP---E-Q-----------GTP-----QGGVV------------------S- PILANIAL-----------------------------------------------------------NG----------- --------IE--SI-HR--------------------------------------------------------------- ------------------------SKAKG--------QIIE-PSVR----------YADDMVI-------------ILK- -----PKDN----AI---------------------------EILERISE---------------------FL-R----- ----K--R----GMQV------S-Q------KK----T---KITAAT------------D-------------------- ----------------GFDFLGWHF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >N.sp.I1/BA000019/6209592..6207287/Nostoc_extraction sp./CL2/ORF Sequence %28a.a%29 -------------------------------------------------------------------GVDGKKALEPSQ- ---------R-----LALYEVLV-------K------NWKQW--KHQ--------------------------------- -----------------PLKRV-YI--PK--------ADG-----------------------TRRGL------------ ------GIPT----IS-DRAYQCLIKYA-LE-PAAEAMF-------N--------------------------------- ---ARSYGFRP------------------GR----------------------------SCHDV---QKLL--------- ----FSNLNGGQAN--G-----------------LSKR-IL-EL--------DIERCF-DKID----------------- -HKF--LMQS-VQ-------------LPKA------------------AKQGI---------FWAIKA------------ -------------------------------------------------------------------------------- ------------------GV------------------------------------R-----G-E--------------- -------------------------------------------------------------------------------- ----------------------------FPS-SE-S-----------GTP-----QGGVI------------------S- PLLANIVL-----------------------------------------------------------HG----------- --------LE--NV-GH--------------------------------------------------------------- ------------------ELRYKVRSGGR--------QIDTIKGFR----------YADDVVF-------------LLK- -----PEDN----PE---------------------------ALRQNIDT---------------------FL-E----- ----A--R----GLKV------K-E------AK----T---KIVHST------------D-------------------- ----------------SFDFLGWNF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|354961371|dbj|BAL14050.1|_extraction hypothetical protein BJ6T_88080 %5BBradyrhizobium japonicum USDA 6%5D -------------------------------------------------------------------GADGITFAQIETE GRE----RWL-----ENVRQELT-------A------GDY----RPQ--------------------------------- -----------------PLLRV-WI--PK--------SN------G-----------------GRRPL------------ ------SIPT----VK-DRTVMTAAMLV-IG-AIFEADL-------L--------------------------------- ---ENQYGFRP------------------KV----------------------------DAKMA---VRRV--------- ----FWHIRD------------------------HRRSEIV-DA--------DLRDYF-TSIP----------------- -HAP--LMKC-L-TRRI---------ADGR------------------LLSMI---------KGWLTV------------ -------------------------------------------------------------------------------- ------------------AV------------------------------------I-----EKD--------------- -------------------------------------------------------------------------------- ---------------GRRITRTAEAR------TKKR-----------GTP-----QGSPL------------------S- PLLANLYF-----------------------------------------------------------RR----------- --------FL--LA-WRHGHQ----------------------------------------------------------- -------------------------------------DQLDAHIVN----------YADDFVI-------------CCR- -----PGSS---------------------------------ETAMARMQ---------------------TLMN----- ----R--L----GLEV------N-D------TK----T---RLARVP------------E-------------------- ----------------SVTFLGYTI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|149195205|ref|ZP_01872295.1_extraction --------------------------------------------------------IFSPQNIKIYLKTDKIDID----- ---------V-----KKLSDELL-------R------GKY----IPS--------------------------------- -----------------PLQSF-EL--KK--------SNN-----------------------KTREI------------ ------KILT----DK-DKLVQKVLYES-IN-EFFDKQF-------S--------------------------------- ---NRSYGYRI------------------GK----------------------------STIKA---IKRC--------- ----KDFIKR------------------------KYFY-VF-KS--------DIKNFF-ENIN----------------- -HNK--LISL-L-DRNI---------EDKR------------------IIRLI---------VQFIKS------------ -------------------------------------------------------------------------------- ------------------GI------------------------------------L----------------------- -------------------------------------------------------------------------------- -------------------------KKEYFS-HE-I-----------GVH-----QGDIL------------------S- PLLSNIYL-----------------------------------------------------------NE----------- --------FD--KF-LES-------------------------------------------------------------- ---------------------------------------KNIEFVR----------YADDFVI-------------FMK- -----KNNK---------------------------------EIPEILNI---------------------FL-K----- ----N--I----DLEI------S-E------EK----S---YFSDIY------------K-------------------- ----------------GFSFLGCFF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|115351524|locus|VBIThiMob160332_3694|_extraction hypothetical protein [Thioflavicoccus mobilis 8321] -------------------------------------------------------------------GADRQDRKIFEK- ----QQEQHF-----RAIHRKVL-------Q------DRW----IFS--------------------------------- -----------------PFLEK-SI--PK--------IA------G-----------------GERII------------ ------SLAT----IR-DTLVQRRLYEY-LY-PIVDPLL-------S--------------------------------- ---DACCAYRR------------------GS----------------------------GAHNA---IKQI--------- ----RNALDA------------------------GYVH-VL-DA--------DIKSFF-DRVD----------------- -HET--LLAI-ARDLPI----------DER------------------ALRLL---------QIYLTT------------ -------------------------------------------------------------------------------- ------------------PR------------------------------------V-----TSE--------------- -------------------------------------------------------------------------------- ---------------DRRKADSAKPRTKYSREPRTL-----------GIP-----QGGVI------------------S- GMLANLLL-----------------------------------------------------------AS----------- --------FD--KE-MQQ-------------------------------------------------------------- ---------------------------------------GEDILVR----------YADDFLV-------------CCQ- -----TEQA----AN---------------------------DADVRAAH---------------------ALDA----- ----L--G----GLEL------H-P------DK----T---AVRDAT------------D-------------------- ----------------GVDFVGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|17232506|ref|NP_489054.1_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------I------------ ------SAAP----YR-DRVVHHALYNI-IV-PIFERTF-------I--------------------------------- ---SDSYANRV------------------GF----------------------------GSHRA---LRRF----TQFSR SSH----------------------------------Y-VF-QA--------DIRKYF-PSID----------------- -HKI--LKHL-IYRKI----------KCPD------------------TLWLI---------NTIIDN------------ -------------------------------------------------------------------------------- ------------------SN------------FQEPVINYFPGDNLLAPLE-R---R----------------------- -------------------------------------------------------------------------------- -----------------------------------C-----------GLP-----IGNLT------------------S- QFFANVYL-----------------------------------------------------------NN----------- --------FD--HF-VKEQLQ----------------------------------------------------------- -------------------------------------AFK---YIR----------YVDDFAL-------------FSD- -----DKAF----LE---------------------------TARVIIEE---------------------YLT------ ----S--L----RLKI------H-P------IK----S---QLFE----------------------------------- -----TK--------HGANFVGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|119509703|ref|ZP_01628848.1_extraction_extraction -----------------------------------------------------------------KGKRFRDNVLEFNY- --NLE--TEL-----IRLQKELT-------D------KTY----QPG--------------------------------- -----------------AYRTF-HL--ID---------P-K-----------------------SRLI------------ ------SAAP----YR-DRVVHHALCNI-IV-PIFEKTF-------I--------------------------------- ---GDSYANRL------------------GF----------------------------GTHRA---LRKF----THFAR NSR----------------------------------Y-VL-QC--------DIRKYF-PSID----------------- -HIV--LKEL-IRRKI----------KCPD------------------TLWLI---------DTIIDN------------ -------------------------------------------------------------------------------- ------------------SN------------EQETVIDYFPGDDLLSPVI-Q---R----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GLP-----IGNLT------------------S- QFFSNIYL-----------------------------------------------------------NG----------- --------FD--HF-VKEQLK----------------------------------------------------------- -------------------------------------ISK---YVR----------YVDDFAL-------------FSD- -----ERQL----LA---------------------------DARLAIEE---------------------YLT------ ----T--L----RLKI------H-P------IK----S---QLFE----------------------------------- -----TQ--------IGATFLGFRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|126660098|ref|ZP_01731218.1_extraction_extraction -----------------------------------------------------------------KGKRYRDNVLDFNY- --NLA--TEL-----FKIQQELT-------N------KTY----QPG--------------------------------- -----------------QYRTF-HL--RD---------P-K-----------------------SRLI------------ ------SAAP----YR-DRVVHHALCNI-IV-PIFEKTF-------I--------------------------------- ---CDSYANRE------------------GY----------------------------GTHRA---LRRF----TDFLR THT----------------------------------C-IL-QC--------DIKKYF-PSID----------------- -HQI--LKQL-IRRKI----------KCQD------------------TLWLI---------DKIIDN------------ -------------------------------------------------------------------------------- ------------------SN------------PQEPTIQYFPNDTLLTPIE-R---K----------------------- -------------------------------------------------------------------------------- -----------------------------------Q-----------GLP-----IGNLT------------------S- QFFANIYL-----------------------------------------------------------NP----------- --------LD--HF-IKEQLK----------------------------------------------------------- -------------------------------------CKA---YVR----------YVDDFAL-------------FSS- -----DPDY----LK---------------------------DCRQQIEN---------------------FLI------ ----S--L----RLKI------H-P------VK----S---QLFS----------------------------------- -----TN--------IGATFLGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|119511289|ref|ZP_01630404.1_extraction_extraction -----------------------------------------------------------------KGKRFRENILKFNY- --NLE--AEL-----AKIKTQLE-------S------KTY----QPG--------------------------------- -----------------RYKTF-EI--CE---------P-K-----------------------HRLI------------ ------SAAP----YR-DRVVHHALCNI-IV-PIFEPTF-------I--------------------------------- ---TDSYANRL------------------GF----------------------------GTHRA---LRRF----TTFAR SHR----------------------------------Y-VL-QC--------DIKKYF-PSIN----------------- -HEI--LKSL-LHRKL----------KCQD------------------TLWLA---------ETIINS------------ -------------------------------------------------------------------------------- ------------------SN------------PQESVIDYFPGDDLLSPLQ-G---R----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GLP-----IGNLT------------------S- QFFANVYL-----------------------------------------------------------NN----------- --------LD--HF-VKEQIK----------------------------------------------------------- -------------------------------------AQN---YVR----------YVDDFAL-------------FSD- -----DYGF----LA---------------------------AAKLAIEE---------------------HLI------ ----N--L----RLKL------H-P------VK----S---QLFE----------------------------------- -----TR--------HGASFLGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115549820|locus|VBIAnaSp49473_5313|_extraction FIG00874608: hypothetical protein [Anabaena sp. 90] -------------------------------------------------------------------------------- ---------L-----SKLQQELI-------N------KTY----QPG--------------------------------- -----------------EYRTF-YI--KE---------P-K-----------------------TRMI------------ ------SAAP----YR-DRVVHHALCNI-IV-PLIEPTF-------I--------------------------------- ---GDSYANRV------------------GF----------------------------GSHRA---LRRF----TKFAR SNR----------------------------------Y-IL-QC--------DIQKYF-PSID----------------- -HEI--LKTL-LHNKL----------KCPD------------------TLWLI---------DSLIDN------------ -------------------------------------------------------------------------------- ------------------SN------------EQFPVVEYFPGDELLTPLE-R---Q----------------------- -------------------------------------------------------------------------------- -----------------------------------R-----------GLP-----IGNLT------------------S- QFFANVYL-----------------------------------------------------------NY----------- --------LD--HF-IKDKLK----------------------------------------------------------- -------------------------------------VRK---YLR----------YVDDFAL-------------FSN- -----DREF----LT---------------------------DARYAIEE---------------------YLT------ ----E--L----RLKI------H-P------IK----S---QLFE----------------------------------- -----TK--------YGTNFLGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115507212|locus|VBICalSp15318_6404|_extraction hypothetical protein [Calothrix sp. PCC 7507] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------------DL-LQ-PL---------------------------------------------- -----KYAKR---------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------D------------------ALLLI---------DTIIDN------------ -------------------------------------------------------------------------------- ------------------SN------------AQENIVDFFPGDDLLTPIQ-R---R----------------------- -------------------------------------------------------------------------------- -----------------------------------R-----------GLP-----IGNLT------------------S- QFFANIYL-----------------------------------------------------------NS----------- --------FD--HF-VKEKLK----------------------------------------------------------- -------------------------------------AQK---YIR----------YVDDFAL-------------FAD- -----DHNF----LA---------------------------NARLAIEA---------------------YLA------ ----T--L----RLKI------H-P------VK----S---QLFE----------------------------------- -----TK--------HGANFLGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|126659397|ref|ZP_01730532.1_extraction_extraction -----------------------------------------------------------------LGKRFRDSVLEFND- --NLE--GNL-----LKLQKELK-------S------HTY----QPG--------------------------------- -----------------EYSTF-RI--YD---------P-K-----------------------PRLI------------ ------SAAP----YR-DRVVHHALCNV-IV-PLIEKSF-------I--------------------------------- ---PDSYANRE------------------GY----------------------------GSHRA---LKRF----IGFDR SSK----------------------------------Y-IL-QC--------DIKKYF-PSID----------------- -HEI--LKQQ-IRHYL----------KCSK------------------TLWLI---------DIIIDN------------ -------------------------------------------------------------------------------- ------------------SN------------EQEPVNDVFPGDDLLTLTE-R---R----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GLA-----IGNLT------------------S- QFFCNLYL-----------------------------------------------------------NK----------- --------FD--HF-VKEELK----------------------------------------------------------- -------------------------------------AKK---YVR----------YVDDFAL-------------FSD- -----NQDF----LI---------------------------TSKQLIED---------------------YLT------ ----T--L----RLKL------H-P------VK----T---QLFE----------------------------------- -----TK--------YGANFLGFRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115462114|locus|VBILepSp112363_4293|_extraction FIG00874608: hypothetical protein [Leptolyngbya sp. PCC 7376] -------------------------------------------------------------------------------- ---------L-----LKLRHELI-------T------KSY----RPG--------------------------------- -----------------GYRTF-RI--FD---------P-K-----------------------PRLI------------ ------SAAP----YR-DRVIHHALCNV-IV-PLLDKSL-------I--------------------------------- ---DTTYANRT------------------GY----------------------------GTHRA---LKQF----ITLAR SSR----------------------------------Y-IL-QC--------DISKYF-PSID----------------- -HQI--LKAQ-LHHKI----------KCKD------------------TLWLI---------DLIIDY------------ -------------------------------------------------------------------------------- ------------------SN------------PQEPVHHYFPGDTLLTPLE-R---R----------------------- -------------------------------------------------------------------------------- -----------------------------------H-----------GLP-----IGNLT------------------S- QFFANFYL-----------------------------------------------------------NG----------- --------FD--HF-VKEQLH----------------------------------------------------------- -------------------------------------ARK---YLR----------YVDDYAL-------------FSN- -----DYGF----LK---------------------------DAKVAIRD---------------------YLE------ ----G--L----RLRM------H-P------IK----S---QLFE----------------------------------- -----TK--------YGANFVRFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|113474819|ref|YP_720880.1_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------I--IE---------R-K-----------------------PRII------------ ------SAAP----YR-DRIIHHALCNI-II-PIFEKTF-------I--------------------------------- ---YDTYANRI------------------NF----------------------------GTHKA---LSRF----TKFSC SSC----------------------------------Y-VL-QC--------DIVKYF-PSID----------------- -HQI--LKEI-IRRQI----------KCQD------------------TLWLI---------EKIIDG------------ -------------------------------------------------------------------------------- ------------------SN------------QQIPVLTKFPGDDLLSSIN-R---R----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GLP-----LGNLT------------------S- QFFANLYL-----------------------------------------------------------NN----------- --------FD--HF-IKEELK----------------------------------------------------------- -------------------------------------VKK---YLR----------YVDDFAL-------------FSD- -----DKKF----LV---------------------------IARQKIET---------------------YLA------ ----N--L----CLKI------H-P------IK----S---QLFQ----------------------------------- -----TK--------KGANFVGFLV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|68551127|ref|ZP_00590553.1_extraction_extraction -----------------------------------------------------------------KGKRENQSVLHFFT- --FLE--ENL-----WQILSELR-------T------KTW----QPG--------------------------------- -----------------SYKTF-SI--YK---------P-K-----------------------PRMI------------ ------SAAP----FK-DRVVHHALITI-VG-PLLERSF-------I--------------------------------- ---FDTYANRT------------------AK----------------------------GTHKA---IERY----QHYLK KYA----------------------------------Y-VL-KC--------DIRKYF-PSID----------------- -HEI--LKSL-LRRKI----------ACAD------------------TLWLI---------DTIIDN------------ -------------------------------------------------------------------------------- ------------------SN------------IQAEHFHYFPGDTLFTPHE-R---R----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GLP-----IGNLT------------------S- QFFANYYL-----------------------------------------------------------SF----------- --------LD--HY-VKEVLR----------------------------------------------------------- -------------------------------------CKG---YVR----------YVDDYVL-------------FSD- -----SKDE----LW---------------------------EWKKAIEE---------------------FLQ------ ----Q--F----RLTL------N-S------GR----T---ELYP----------------------------------- -----AT--------EGKCFLGQKV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|119356297|ref|YP_910941.1_extraction_extraction -----------------------------------------------------------------KGKRETRSVLLFYA- --QLE--DNL-----WQIIAELK-------S------KTW----QPG--------------------------------- -----------------SFKSF-SI--YK---------P-K-----------------------PRLI------------ ------SAAP----FR-DRVVHHALINV-VG-PLLEQTF-------I--------------------------------- ---HDTYANRM------------------GK----------------------------GTHKA---IRRY----QHFLC RFD----------------------------------Y-AL-KC--------DIKKYF-PSVD----------------- -HEI--LKTS-LRRRV----------ACND------------------TLWLI---------DTIIDN------------ -------------------------------------------------------------------------------- ------------------SN------------SQDDHLHYFQGDDLFTPVE-R---R----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GLP-----IGNLT------------------S- QFFANYYL-----------------------------------------------------------NF----------- --------LD--HF-VKERLH----------------------------------------------------------- -------------------------------------CKG---YVR----------YVDDFVL-------------FSD- -----SRAE----LW---------------------------RWKEEIER---------------------YLE------ ----E--F----RLVL------N-A------QR----T---ELFP----------------------------------- -----TT--------EGRCFLGQKV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|42673013|locus|VBIRhoVan113057_2968|_extraction hypothetical protein [Rhodomicrobium vannielii ATCC 17100] ----------------------------------------------------------------------KPGAAAFMA- --NLE--REI-----LRLERELR-------D------GSY----RPG--------------------------------- -----------------RYVEI-LV--KD---------P-K-----------------------ERLI------------ ------SAAP----FR-DRVVHHALCAV-VC-PLFEAGF-------T--------------------------------- ---DHTFANRT------------------GK----------------------------GTHKA---IRLY----ERYRD NHS----------------------------------Y-VL-RA--------DIFRYF-PAID----------------- -HEI--LKAE-FRRKI----------ACER------------------TLWLM---------DLIVDC------------ -------------------------------------------------------------------------------- ------------------SN------------SQEPVELHFPGDDLFTPYT-R---R----------------------- -------------------------------------------------------------------------------- -----------------------------------R-----------GLP-----IGNLT------------------S- QFFANLYL-----------------------------------------------------------NR----------- --------FD--HW-VIEKLG----------------------------------------------------------- -------------------------------------A-P---YVR----------YVDDFAL-------------FHD- -----DPGI----LA---------------------------TWREKIER---------------------CLE------ ----G--R----RLKL------H-P------RK----T---LILP----------------------------------- -----VA--------EPSPFLGFELH------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Bfid|21260775|locus|VBICanAcc132554_2632|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Candidatus Accumulibacter phosphatis clade IIA str. UW1] -------------------------------------------------------------------RLAQSGHACFEF- --ALA--DRL-----LELKRELE-------T------GQY----RPG--------------------------------- -----------------GYLNF-FI--HE---------P-K-----------------------RRKI------------ ------SAAP----FR-DRVVHHPLCNV-IE-PRFERLF-------I--------------------------------- ---ADSYANRR------------------GK----------------------------GTHRA---IDRL----QHFAQ RHR----------------------------------Y-VL-RA--------DIVKHF-PSID----------------- -HQV--LHAI-LARVV----------PEAD------------------LMALI---------DRIIAS------------ -------------------------------------------------------------------------------- ------------------GA---------GVLDEEYATVYFPGDDLLAAC--R---P----------------------- -------------------------------------------------------------------------------- -----------------------------------R-----------GLP-----IGNLT------------------S- QFWSNCYL-----------------------------------------------------------HP----------- --------FD--QF-VTRELR----------------------------------------------------------- -------------------------------------WAA---YLR----------YVDDFAL-------------FSD- -----SKRE----LW---------------------------AWKRAIVE---------------------RLA------ ----R--L----RLTI------H-E------GP----A---QVVP----------------------------------- -----VE--------NGIPWLGFVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|146277368|ref|YP_001167527.1_extraction_extraction -----------------------------------------------------------------KGRHRQRDVIAFEA- --DLE--PNL-----FAIQESLI-------Q------KTY----RTG--------------------------------- -----------------PYHRF-FV--YE---------P-K-----------------------KREI------------ ------ASLP----LK-DRVVQHALVSV-IE-PIFEARF-------I--------------------------------- ---DQSFACRV------------------GK----------------------------GAHKG---ADTV----QRYMR EVLR----------------E-----------QG-QVF-AL-KA--------DISKYF-PSVC----------------- -HDA--LRRI-IRRRI----------ACPD------------------TLWLI---------DSILES------------ -------------------------------------------------------------------------------- ------------------SA-----------EP---------------GAL-T---P----------------------- -------------------------------------------------------------------------------- -----------------------------------R-----------GIP-----IGNLT------------------S- QMFANIYL-----------------------------------------------------------HE----------- --------LD--HF-VKHTLR----------------------------------------------------------- -------------------------------------ERR---YVR----------YMDDFAV-------------IHH- -----DKAH----LH---------------------------EVRRACED---------------------FLWA----- ----E--L----GLRT------N-A-------K----T---QVFP----------------------------------- --IGEPG--------RALDFLGYRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|134299090|ref|YP_001112586.1_extraction_extraction -----------------------------------------------------------------KGKRYIGEVLEFTA- --NLE--ENL-----ISIQNDLI-------N------QTY----QTG--------------------------------- -----------------RYREF-YV--YD---------P-K-----------------------LRLV------------ ------AALP----FR-DRVMHHAVCNI-IE-PIFEKVF-------I--------------------------------- ---YDSYACRV------------------NK----------------------------GTHAG---ANRV----TSYLR KAQR----------------H-----------WP-RVY-CL-KG--------DVKQYF-PSIN----------------- -HGI--LKRI-LHRKI----------SCPK------------------TRWLL---------NEIIDS------------ -------------------------------------------------------------------------------- ------------------SA-----------SS---------------DDL-N---P----------------------- -------------------------------------------------------------------------------- -----------------------------------S-----------GIP-----IGNLT------------------S- QLFANIYL-----------------------------------------------------------NE----------- --------LD--HF-IKEDLC----------------------------------------------------------- -------------------------------------ARY---YVR----------YMDDFII-------------LGD- -----NKRQ----LW---------------------------AVLDEIKG---------------------FLDF----- ----K--L----NLQL------N-G-------K----T---GVFP----------------------------------- --VNQ-----------GIDFLGYRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|27311204|ref|NP_758929.1_extraction_extraction -----------------------------------------------------------------KGKTKANATLVFFN- --NLE--ENI-----IDTKRADV--------------GRV----QNV--------------------------------- -----------------PLSPF-YY--SS---------R-K-----------------------RRLI------------ ------SAPH----FK-DRVVHRAIYNV-IE-PLFDKTY-------I--------------------------------- ---YDSYACRR------------------GE-------------------------------RA---PTKALTGLQYFIK KVES----------------K-----------HG-KAY-AL-KA--------DISRYF-SSID----------------- -HQV--LKSI-LEAKI----------QCQR------------------TLDLL---------FYIIDN------------ -------------------------------------------------------------------------------- ------------------S-------------P---------------CES-M---G----------------------- -------------------------------------------------------------------------------- -----------------------------------V-----------GIP-----LGNLT------------------S- QIFANVYL-----------------------------------------------------------HE----------- --------LD--RY-AKHALG----------------------------------------------------------- -------------------------------------AKH---YIR----------YMDDFAI-------------IHH- -----DKAV----LH---------------------------QWRKDIEE---------------------FLHL----- ----Y--L----RLKT------N-S-------K----T---QVFPYQ--------------------------------- RVMAGA---------------WIRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|120599269|ref|YP_963843.1_extraction_extraction -----------------------------------------------------------------KGKTKSNSTLVFFN- --NLE--ENI-----IQIQNELI-------W------GMY----LSS--------------------------------- -----------------PYHHF-YV--FE---------P-K-----------------------RRLI------------ ------SAPS----FR-DRVVHRAIYNV-IE-PILDRQY-------I--------------------------------- ---YDSYACRR------------------GK----------------------------GTHRG---ADRA----QLFIR RVEK----------------T-----------HS-KAY-AL-KA--------DISRYF-SSID----------------- -HHI--LKSL-VSAKI----------QCER------------------TKCLL---------FYIIDS------------ -------------------------------------------------------------------------------- ------------------S-------------P---------------SDA-H---G----------------------- -------------------------------------------------------------------------------- -----------------------------------V-----------GIP-----LGNLT------------------S- QVFANLYL-----------------------------------------------------------NE----------- --------LD--RF-AKHTLK----------------------------------------------------------- -------------------------------------AKN---YVR----------YMDDFVI-------------IHH- -----DKQQ----LH---------------------------QWRVMIER---------------------FINC----- ----Q--L----RLKT------N-S-------K----T---QVFP----------------------------------- -VAASAG--------RSLDFLGYRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|150390313|ref|YP_001320362.1_extraction_extraction -----------------------------------------------------------------KDKRFRDEILKFSA- --NLE--ENL-----IQIQNELI-------W------KAY----KVG--------------------------------- -----------------RYREF-YV--HE---------P-K-----------------------KRLI------------ ------MALP----FK-DRVVQWAIYRV-LN-PLFEKTY-------T--------------------------------- ---EHSYACRI------------------GR----------------------------GTHQA---AKKL----QYWLR QIDR----------------K-----------PQ-KYY-YL-KM--------DISKYF-YRVD----------------- -HSI--ALKI-LRKKI----------KDKD------------------VLWLM---------EEIIQS------------ -------------------------------------------------------------------------------- ------------------ED-----------MAFGLPLGMEPGDCPKYMRL-H---D----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GMP-----IGNLT------------------S- QLLANIYL-----------------------------------------------------------NE----------- --------LD--QF-CKHKLQ----------------------------------------------------------- -------------------------------------IKY---FIR----------YMDDFIV-------------LHH- -----DKKY----LH---------------------------RLKVEIEN---------------------FLNS----- ----E--L----ELHL------N-R-------K----T---CIRP----------------------------------- -----TP--------VGIEFVGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|149736339|gb|EDM52225.1_extraction_extraction -----------------------------------------------------------------KGKRYRDDVLIFNR- --NYE--EQL-----INIQNHLI-------Y------ETY----EVG--------------------------------- -----------------KYHTF-YV--YE---------P-K-----------------------KRLI------------ ------MSLP----FK-DRIVQWAIYRQ-LF-PLYEKTF-------I--------------------------------- ---FDSYACRK------------------GK----------------------------GTHKA---ADRL----QYWLR QTER----------------K-----------PE-RYY-YL-KM--------DISKYF-YRVD----------------- -HDI--LLKI-LARRI----------KDQR------------------LLNLL---------EKIINC------------ -------------------------------------------------------------------------------- ------------------ES-----------MNFGLPPGKEPDEVAVSDRL-S---N----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GMP-----IGNLT------------------S- QMFANIYL-----------------------------------------------------------NE----------- --------VD--QY-AKHELG----------------------------------------------------------- -------------------------------------LHY---YIR----------YMDDIII-------------LHH- -----DKKY----LA---------------------------EVKELLRA---------------------FLSD----- ----E--L----RLDL------N-N-------K----T---TIRP----------------------------------- -----CS--------MGVDFVGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|21725483|locus|VBIDesAce42372_4322|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfotomaculum acetoxidans DSM 771] ------------------------------------------------------------------RKRYRNEVLKYTA- --NLG--ENL-----IQAEEELI-------S------KSY----RVS--------------------------------- -----------------PYRKS-FV--YE---------P-K-----------------------KRLV------------ ------MALP----FG-DRIVQWSVYRT-LN-PLLNKRY-------I--------------------------------- ---SHSYACRT------------------GY----------------------------GSHRA---VKQL----QYWLR YLER----------------R-----------HG-RIY-VL-KA--------DMTKYF-YRVD----------------- -HDI--IMNI-LERII----------GDYD------------------LIWLL---------EEIVRC------------ -------------------------------------------------------------------------------- ------------------EH-----------TWFGLPLDAEGFECELTGEV----------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GIP-----IGNLT------------------S- QMIANLYL-----------------------------------------------------------NE----------- --------LD--QY-AKHNLQ----------------------------------------------------------- -------------------------------------IKY---YMR----------YMDDVLI-------------LHN- -----DKKY----LW---------------------------HIKEEIEE---------------------FLDR----- ----N--L----RLKL------N-N-------K----T---CVRT----------------------------------- -----NT--------QGIDWIGYRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|149833092|gb|EDM88174.1_extraction_extraction -----------------------------------------------------------------KGRRYNKDVLRVQH- --DIW--NVI-----EQIQQDVR-------S------GKY----TID--------------------------------- -----------------KYYIF-YV--YE---------P-K-----------------------KRMI------------ ------MSIT----FY-HRIVQWAIYRV-IN-PLLVKGY-------I--------------------------------- ---KDTYGCIP------------------GR----------------------------GSLAA---MQRL----RYWIK SVEH----------------K-----------PG-TWY-YL-KL--------DISKYF-YRIS----------------- -HEV--LKEI-LARKI----------KDQQ------------------LLQVL---------YNIIDC------------ -------------------------------------------------------------------------------- ------------------QY-----------TPFGLPPGKGPGEVPLEERL-Y---D----------------------- -------------------------------------------------------------------------------- -----------------------------------V-----------GMP-----VGNLL------------------S- QVFANIYL-----------------------------------------------------------DA----------- --------LD--QF-CKRTLC----------------------------------------------------------- -------------------------------------IHF---YVR----------YMDDIII-------------LSD- -----SKEQ----LH---------------------------MWKDEIQK---------------------FVET----- ----T--L----RLSL------N-Q-------K----T---CIRP----------------------------------- -----IS--------QGIEFVGYRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|178964775|locus|VBIDesAlk310819_0341|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfovibrio alkalitolerans DSM 16529] ---------------------------------------------------------------------YRREVAEFSV- --NLE--ENL-----INIHNHLV-------W------GSW----EPG--------------------------------- -----------------RPRSF-TV--FE---------P-K-----------------------RRDI------------ ------QAPP----FA-DRIVHHALVRV-VE-PLFERRF-------I--------------------------------- ---YHSYACRT------------------GK----------------------------GAQRA---VWAL----QRMLR TAHR----------------N-----------WQ-TPY-VV-KA--------DIKSYF-ASIR----------------- -HDV--LFTA-IERVV----------SCKD------------------TLDLW---------KRITAG------------ -------------------------------------------------------------------------------- ------------------Y-----------------------------GHD-----G----------------------- -------------------------------------------------------------------------------- -----------------------------------V-----------GLP-----VGALT------------------S- QLAANVML-----------------------------------------------------------DQ----------- --------LD--HA-MTDGAG----------------------------------------------------------- -------------------------------------VGR---YVR----------YMDDFII-------------VAP- -----DKAA----AW---------------------------RALNAAAD---------------------TVA------ ----G--L----GLAL------N-P-------K----T---KIIP----------------------------------- -----AK--------CGVDFCGYRT------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|23317494|locus|VBIRhoCen1465_1100|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Rhodospirillum centenum SW] ----------------------------------------------------------------------SDEVLEFGF- --GLE--EKL-----FDLQGQMV-------N------GVW----RPG--------------------------------- -----------------RPREF-MV--RD---------P-K-----------------------PRLI------------ ------SAPP----FA-DRVVHHAVVRV-IE-PVLERRF-------I--------------------------------- ---FDSYACRK------------------GR----------------------------GVHTA---VDRL----QRHLR EASC----------------E-----------GG-KVW-VL-KA--------DISKYF-ASIN----------------- -HGR--LMAI-LGRSI----------SDKK------------------VLWLC---------RTNLKG------------ -------------------------------------------------------------------------------- ------------------Y-----------------------------GFD-E---G----------------------- -------------------------------------------------------------------------------- -----------------------------------V-----------GIP-----VGALT------------------S- QLFANIYL-----------------------------------------------------------DQ----------- --------LD--HW-IKDELG----------------------------------------------------------- -------------------------------------IKR---YVR----------YMDDFVI-------------VGH- -----SKAD----LW---------------------------ALYDAIAD---------------------FLAT----- ----K--L----ALRL------N-R-------K----T---TVLP----------------------------------- -----AS--------GGIDFCGYRTW------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >gi|151580925|gb|EDN44663.1_extraction_extraction -----------------------------------------------------------------RGRSGKVPVARGFA- --ELE--KTV-----VTLRDELL-------A------GTW----QPG--------------------------------- -----------------RYYYF-TI--TD---------P-K-----------------------EREV------------ ------AAAP----FR-DRVVHHALVRV-LE-PIFEPRF-------I--------------------------------- ---ADSFACRP------------------GK----------------------------GTHAA---LARA----REFTR RHR----------------------------------Y-CL-KC--------DIKKYF-PNID----------------- -HAL--LLRE-VGRAV----------DDAR------------------VLELI---------GRILAS------------ -------------------------------------------------------------------------------- ------------------HA-------------DGAAQEWRAGAGLFDVEQ-R---P----------------------- -------------------------------------------------------------------------------- -----------------------------------R-----------GLP-----IGNLT------------------S- QFLANVHL-----------------------------------------------------------HP----------- --------LD--LF-VKQTLR----------------------------------------------------------- -------------------------------------VKG---YVR----------YVDDFLL-------------FGD- -----DRAA----LK---------------------------AHGQRVRE---------------------FVR------ ----T--L----RLRV------H-P------DK----F---RLSR----------------------------------- -----TE--------QGVDFVGFVA------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|68553139|ref|ZP_00592520.1_extraction_extraction -----------------------------------------------------------------RGKKSQLRVAHFLF- --HQE--KEC-----LRLQTELK-------Q------GIW----QPS--------------------------------- -----------------GFRVF-EI--RE---------P-K-----------------------PRRI------------ ------SAAD----FQ-DRVVQHALCNI-LG-PLCERRL-------I--------------------------------- ---FDTWACRR------------------GK----------------------------GSHLA---MKRA----QAFSR RFP----------------------------------Y-FL-KC--------DIRRYF-DSVD----------------- -HTI--LKRL-LWRLI----------KDKP------------------VLNLL---------DRIIDH------------ -------------------------------------------------------------------------------- ------------------P---------------------------LPGAL-P---G----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GLP-----IGNLT------------------S- QHFANLYL-----------------------------------------------------------GE----------- --------LD--HQ-LKDRMG----------------------------------------------------------- -------------------------------------VKA---YLR----------YMDDMLI-------------FAD- -----DKSR----LH---------------------------ELVTGIED---------------------FVKQ----- ----H--L----QLSL------R-P------SA----T---LVAP----------------------------------- -----VS--------EGVPFLGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|78189651|ref|YP_379989.1_extraction_extraction -----------------------------------------------------------------KGKQAKRYVCAYRK- --QLQ--ENL-----QLLRHQIL-------S------GAI----QTG--------------------------------- -----------------KYHAF-TI--YD---------P-K-----------------------ERVI------------ ------CATP----FS-QRVLHHAIMNV-CH-PFFEKHQ-------I--------------------------------- ---AGSFASRK------------------GK----------------------------GTYAA---LDKA----REYNC CYR----------------------------------W-FL-KL--------DVRKYF-DSIN----------------- -HTV--LQKQ-LTRLF----------KDKT------------------LLLIF---------EQIIDS------------ -------------------------------------------------------------------------------- ------------------YS-----------------------------TA-D---H----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GVP-----IGNLT------------------S- QYFANHYL-----------------------------------------------------------SV----------- --------AD--HY-AKEGLR----------------------------------------------------------- -------------------------------------VPA---YVR----------YMDDMVL-------------WHN- -----EKEE----LL---------------------------AMGYMFQT---------------------FIAK----- ----E--L----LLEL------K-P------------------------------------------------------- FCLNATH--------KGLPFLGYLL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|68551181|ref|ZP_00590605.1_extraction_extraction -----------------------------------------------------------------KGKVSKHYVGAYKK- --QLP--QNL-----QRLQQQLF-------S------GEV----ETG--------------------------------- -----------------GYHTF-TI--YD---------P-K-----------------------KRLI------------ ------CATP----FS-QRVLHHALMNV-CH-ASFEKQQ-------I--------------------------------- ---VTSFASRP------------------GK----------------------------GTYAA---LDKA----REYHR HFR----------------------------------W-FL-KL--------DVRKYF-ESID----------------- -HSI--LKQQ-LYRMF----------KDKN------------------VLLMF---------DNIIDS------------ -------------------------------------------------------------------------------- ------------------YA-----------------------------TE-A---G----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------SVP-----IGNLT------------------S- QYFANHYL-----------------------------------------------------------LV----------- --------AD--YY-VKQRLC----------------------------------------------------------- -------------------------------------IPA---YVR----------YMDDMVL-------------WHH- -----DKEA----LL---------------------------EAGYRLQD---------------------YLAR----- ----E--L----RLQL------K-P------------------------------------------------------- FCLNESR--------KGLPFLGYLL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|42527768|ref|NP_972866.1_extraction_extraction -----------------------------------------------------------------RGKRLKPDVLLYEK- --NLY--TNL-----KTLQNYLI-------N------QTV----LLG--------------------------------- -----------------SYRFF-KI--YD---------P-K-----------------------ERII------------ ------CAAP----FN-ERVLHHAIINI-TE-SVFEKFQ-------I--------------------------------- ---YDSYACRK------------------NK----------------------------GTQAA---LLRA----LYFSR RFK----------------------------------Y-FL-KL--------DMKKYF-DSIP----------------- -HSK--LSLL-LTCKF----------KDKA------------------LLHLF---------NKLIAS------------ -------------------------------------------------------------------------------- ------------------YS-----------------------------VT-E---G----------------------- -------------------------------------------------------------------------------- -----------------------------------W-----------GVP-----IGNLT------------------S- QYFANFYL-----------------------------------------------------------SF----------- --------FD--HY-AKEKMN----------------------------------------------------------- -------------------------------------VRG---YIR----------YMDDVLL-------------FSD- -----NLKD----IK---------------------------LIQKKAKN---------------------FLSC----- ----E--L----DLTL------K-E------------------------------------------------------- EIIGMVK--------NGIPFLGFLV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42983704|locus|VBIEubRec131464_1270|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Eubacterium rectale DSM 17629] ------------------------------------------------------------------CKRYTNEVLAFSM- --VKE--EEL-----LRATEEIQ-------N------LTY----RQG--------------------------------- -----------------EYKIF-KV--FE---------P-K-----------------------ERLI------------ ------MALP----FY-DRVVQHMICNA-IQ-PVFENGF-------Y--------------------------------- ---YHSYACRS------------------GK----------------------------GMHAA---SDTL----YQWMY ETEV----------------K-----------QGLRMY-AF-KG--------DISKYF-ASIP----------------- -HDK--LKDE-NRRYI----------GDKK------------------ALMLM---------DDIIDH------------ -------------------------------------------------------------------------------- ------------------NG----------------------------ILP-D---G----------------------- -------------------------------------------------------------------------------- -----------------------------------V-----------GIP-----VGNLT------------------S- QLFANVYG-----------------------------------------------------------NK----------- --------LD--KF-CKHVLH----------------------------------------------------------- -------------------------------------IPY---FVR----------YMDDFII-------------LSD- -----DLEQ----LK---------------------------EWVKRIEE---------------------FLEN----- ----E--M----FLHI------N-P-------K----S---TILY----------------------------------- -----AG--------NGIDFCGYIH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42986635|locus|VBIEubRec131464_2830|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Eubacterium rectale DSM 17629] ------------------------------------------------------------------GKGWYAEVKCIEK- --DLD--HYL-----KRLQENLI-------E------HRY----HTS--------------------------------- -----------------EYEIF-TK--KE---------SNK-----------------------EREI------------ ------YKLP----FYPDRICQWAILQV-IE-PYLLNSM-------T--------------------------------- ---KDTYSAIP------------------NR----------------------------GIQPI---INQL----RGYKK KIKKDGKVVAEKWIPSILVSD-----------PEATKY-CL-KL--------DVRKYY-PSIV----------------- -HDV--LKAK-YRELF----------KDEE------------------LIWLM---------DEIIDS------------ -------------------------------------------------------------------------------- ------------------ISTCPATEENIEILQRLGVAVNIIIDDNGREFV-D---G----------------------- -------------------------------------------------------------------------------- -----------------------------------V-----------GIP-----IGNYV------------------S- QYDGNFNL-----------------------------------------------------------SV----------- --------VD--HW-LKEVKG----------------------------------------------------------- -------------------------------------VKY---YFR----------YMDDMVI-------------FGS- -----SKEE----LH---------------------------KLKRELDE---------------------FMAV----- ----N--L----KQVL------K-H-------N----W---QVFP----------------------------------- -----TK-------VRGVDFVGYRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|42706064|locus|VBIAliSha154597_2655|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Alistipes shahii WAL 8301] ---------------------------------------------------------------------RTYGVIEHDK- --KRE--VNL-----LKLRETLL-------N------GTF----HTS--------------------------------- -----------------KYDVF-TI--YE---------P-K-----------------------EREI------------ ------YRLP----YFPDRILHHAIMNV-LE-PIWVSTF-------T--------------------------------- ---ADTYSCIK------------------NR----------------------------GIH--------------AAAK KVKQ------------ALRED-----------PEGTTF-CL-KL--------DIRKFY-PSIN----------------- -HDV--LKSI-LRRKL----------KDKR------------------LLRLL---------DEIIDS------------ -------------------------------------------------------------------------------- ------------------AD------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GVP-----IGNYL------------------S- QYFANLYL-----------------------------------------------------------TY----------- --------FD--HW-IKEQKR----------------------------------------------------------- -------------------------------------VKH---YFR----------YADDIVI-------------LAS- -----DKSY----LH---------------------------SLMGEIRA---------------------YLG------ ----D--L----KLEV------K-G-------N----W---QVFP----------------------------------- -----VA-------ARGIDFVGYVF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|87116280|locus|VBIAliFin145170_0509|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Alistipes finegoldii DSM 17242] --------------------------------------------------------------------------MLHDK- --NRE--ANI-----LALHETLK-------N------HTF----KNS--------------------------------- -----------------EYSTF-TI--YE---------P-K-----------------------ERII------------ ------FRLP----YYPDRILHHAIMNI-LE-PIWVSVF-------T--------------------------------- ---KDTYSCIK------------------GR----------------------------GIH--------------GAMR NVKR------------AIK-D-----------RENARY-CL-KI--------DIRKFY-PSID----------------- -HDV--LKTI-IRRKI----------KCKD------------------TLALL---------DTIIDS------------ -------------------------------------------------------------------------------- ------------------TD------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GVP-----IGNYL------------------S- QYFANLML-----------------------------------------------------------AY----------- --------FD--HW-IKEEKR----------------------------------------------------------- -------------------------------------VRN---YFR----------YADDMVF-------------LAS- -----TKEE----LH---------------------------ILLADIKK---------------------YLA------ ----A--L----KLTL------K-G-------N----E---QIFP----------------------------------- -----IAENRADKHGRGLDFVGFVF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|41179367|ref|NP_958675.1_extraction_extraction -----------------------------------------------------------------HGKRRTWGYLEFKE- --YDL--ANL-----LALQAELK-------A------GNY----ERG--------------------------------- -----------------PYREF-LV--YE---------P-K-----------------------PRLI------------ ------SALE----FK-DRLVQHALCNI-VA-PIFEAGL-------L--------------------------------- ---PYTYACRP------------------DK----------------------------GTHAG---VCHV----QAELR RTR--------------------------------ATH-FL-KS--------DFSKFF-PSID----------------- -RAA--LYAM-IDKKI----------HCAA------------------TRRLL---------RVVLPD------------ -------------------------------------------------------------------------------- ------------------EG------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------V-----------GIP-----IGSLT------------------S- QLFANVYG-----------------------------------------------------------GA----------- --------VD--RL-LHDELK----------------------------------------------------------- -------------------------------------QRH---WAR----------YMDDIVV-------------LGD- -----DPEE----LR---------------------------AVFYRLRD---------------------FASE----- ----R--L----GLKI------S-H------------W---QVAP----------------------------------- -----VS--------RGINFLGYRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|83310593|ref|YP_420857.1_extraction_extraction -----------------------------------------------------------------RHKRNTASALDFEM- --VLE--NNL-----MDLLAELQ-------A------GTW----MPG--------------------------------- -----------------PATVF-AI--TR---------P-R-----------------------PREV------------ ------WAAQ----FR-DRIVHHLVYRA-IN-PLFEPAF-------I--------------------------------- ---ADSCACIK------------------GR----------------------------GTLYG---AERL----HRHLR SATE----------------N-----------WSKPAF-YL-KA--------DIANFF-GSIR----------------- -HAD--LFAM-LARRI----------KDPT------------------MLELC---------RKLVFQ------------ -------------------------------------------------------------------------------- ------------------DV-----RRDAIVKDGAGTLALVPPHKSLFQAL-P---G----------------------- -------------------------------------------------------------------------------- -----------------------------------I-----------GLP-----IGNLS------------------S- QFFANVYL-----------------------------------------------------------DG----------- --------LD--QM-IKRRLG----------------------------------------------------------- -------------------------------------MRH---YVR----------YVDDMVL-------------IHP- -----ESKA----LL---------------------------SAAEAIRD---------------------HLSG----- -------I----GLKL------A-E------HK----T---FVAP----------------------------------- -----VT--------KGVDFVGHVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|83309559|ref|YP_419823.1_extraction_extraction -----------------------------------------------------------------RHKRTTRSALAFEV- --ALE--ANL-----MQLLTELR-------A------GTW----WPA--------------------------------- -----------------PATVF-AI--TR---------P-K-----------------------PREV------------ ------WAAQ----FR-DRIVHHLVYRA-IN-PLFEPAF-------I--------------------------------- ---ADSCACIK------------------GR----------------------------GTLYA---ADRL----ERHLR SVTE----------------G-----------WSKPAY-YL-KA--------DIANFF-GSIR----------------- -HDV--LFAM-LARRI----------ADPT------------------MLELC---------RRLVFQ------------ -------------------------------------------------------------------------------- ------------------DV-----RQGAIVQDAAGTLARVPSHKSLFQTP-A---G----------------------- -------------------------------------------------------------------------------- -----------------------------------I-----------GLP-----IGNLS------------------S- QFFANVYL-----------------------------------------------------------DP----------- --------VD--QM-VKRRLK----------------------------------------------------------- -------------------------------------LR----YVR----------YVDDMVI-------------VHQ- -----DPKV----LL---------------------------AAADAIRA---------------------HLSG----- -------L----GLHL------A-E------SK----T---FVAP----------------------------------- -----VE--------KGVDFVGHVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|121528340|ref|ZP_01660954.1_extraction_extraction -----------------------------------------------------------------RTKRNSRNALEFEQ- --DLE--RNL-----GRLYGELR-------D------GSY----RPG--------------------------------- -----------------RSICF-VV--TR---------P-K-----------------------PREV------------ ------WAAD----FR-DRVVHHFLYNH-IG-ARFEDAF-------I--------------------------------- ---AGSCACIK------------------GR----------------------------GTLYA---AEFL----ESGIR SITR----------------N-----------WSRQAY-YL-KC--------DLSNFF-VAID----------------- -KTI--LLDL-LLAKV----------REPF------------------WAWLT---------ELVLMH------------ -------------------------------------------------------------------------------- ------------------DP-----RTDFEFRGDPKLLEKVPSHKRLMEQP-S---H----------------------- -------------------------------------------------------------------------------- -----------------------------------R-----------GLP-----IGNLS------------------S- QFFANVYL-----------------------------------------------------------DV----------- --------LD--QR-AKHQLK----------------------------------------------------------- -------------------------------------AKH---YVR----------YVDDFLF-------------LHE- -----SPAR----LN---------------------------EILADVTA---------------------FLPA----- ----R--L----GVQI------N-P------RK----T---ILQQ----------------------------------- -----ID--------RGIDFVGHVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|19110484|locus|VBIBurGlu130723_4311|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Burkholderia glumae BGR1] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------MHHLLYNR-TG-LRFERSF-------I--------------------------------- ---ADSCACIK------------------GR----------------------------GTLYA---ARRL----ESKVR SITQ----------------N-----------WSRPAF-YL-KC--------DLANFF-VSID----------------- -KPI--LLKL-LLAKI----------PEPF------------------WRTLT---------ERVLMH------------ -------------------------------------------------------------------------------- ------------------DP-----RTDFEFHGDPQMLALVPPHKRLLEQA-G---H----------------------- -------------------------------------------------------------------------------- -----------------------------------L-----------GLP-----IGNLS------------------S- QFFANVYL-----------------------------------------------------------DV----------- --------LD--QH-AKQALG----------------------------------------------------------- -------------------------------------ARY---YIR----------YVDDFLF-------------LHE- -----SPAR----LN---------------------------EILVDVTA---------------------FLPA----- ----C--L----GVRI------N-P------RK----T---ILQP----------------------------------- -----ID--------RGVDFVGQMI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|162042828|locus|VBIRalSol168141_2001|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Ralstonia solanacearum CMR15] ---------------------------------------------------------------------NSASAMGFEI- --NLE--GNL-----RRLCDDLV-------S------DSY----KPG--------------------------------- -----------------RSKCF-VI--TR---------P-K-----------------------YREV------------ ------WAAE----FR-DRIVHHLLYNR-IG-PRFERSF-------I--------------------------------- ---ADSCACIK------------------GR----------------------------GTLYA---AQRL----EAKVR SITQ----------------N-----------WARPAH-YL-KC--------DLANFF-VSID----------------- -KRV--LLDL-LLAKI----------PEPF------------------WRELT---------ELVLMH------------ -------------------------------------------------------------------------------- ------------------DP-----RGDFAYLGDPWMMDKVPPHKRLMEQP-S---H----------------------- -------------------------------------------------------------------------------- -----------------------------------L-----------GLP-----IGNLS------------------S- QFFANVYL-----------------------------------------------------------NE----------- --------LD--QF-VKHELR----------------------------------------------------------- -------------------------------------CRH---YIR----------YVDDFVL-------------LHE- -----SPQW----LN---------------------------DAHDAIES---------------------FLPG----- ----R--L----GARL------N-P------RK----T---ILQS----------------------------------- -----ID--------RGVDFVGQVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|186514316|locus|VBIPseSp174409_5710|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Pseudomonas sp. CF161] ----------------------------------------------------------------------SDSALAFEI- --DLE--QNL-----IQLHDDLV-------T------GTY----RPG--------------------------------- -----------------RSICF-VV--TR---------P-K-----------------------AREV------------ ------WAAA----FR-DRIVHHLLYNH-IG-PRIERSF-------I--------------------------------- ---ADSCACIK------------------GR----------------------------GTLYA---AKRL----ESKIR SASE----------------N-----------WSRPVF-YL-KL--------DLANFF-VAID----------------- -KEV--LRKQ-LAARI----------TEPW------------------WLALA---------EQILMH------------ -------------------------------------------------------------------------------- ------------------DP-----REDYEVRSPAHLFNRVPQHKRLTAQP-S---H----------------------- -------------------------------------------------------------------------------- -----------------------------------L-----------GLP-----IGNLS------------------S- QFFANVYL-----------------------------------------------------------NA----------- --------LD--QF-AKHQLK----------------------------------------------------------- -------------------------------------ARH---YIR----------YVDDFVF-------------LHE- -----SPQQ----LN---------------------------EWLAQVEA---------------------FLPS----- -------L----GARL------N-P------TK----T---ILQP----------------------------------- -----VE--------RGVDFVGHVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|58533098|locus|VBIColFun187779_3500|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Collimonas fungivorans Ter331] ---------------------------------------------------------------------NTPNALAFEQ- --DLE--RNL-----TRLYAELV-------D------GSY----KPG--------------------------------- -----------------QSICF-VV--TR---------P-K-----------------------PREV------------ ------WAAD----FR-DRVVHHLLYNR-IS-PRFYAAF-------I--------------------------------- ---KDTCACIP------------------GR----------------------------GTMYA---AQRL----EAKIR SATE----------------N-----------WSKPVW-YL-KC--------DLANFF-VSID----------------- -KNV--LHKQ-IAVRV----------TEPW------------------WMRLA---------ETILFH------------ -------------------------------------------------------------------------------- ------------------DP-----RQNYQLRGASALIELVPPHKRLTNQP-A---H----------------------- -------------------------------------------------------------------------------- -----------------------------------L-----------GLP-----IGNLS------------------S- QFFANIYL-----------------------------------------------------------DA----------- --------LD--QH-VKHQVR----------------------------------------------------------- -------------------------------------ARH---YIR----------YVDDFIL-------------LHE- -----SPQW----LN---------------------------AALADINA---------------------FLPD----- ----V--L----HTNL------N-P------TK----T---ILQP----------------------------------- -----VD--------RGVDFVGHVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|58850249|locus|VBIPseSp173302_0942|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Pseudogulbenkiania sp. NH8B] ---------------------------------------------------------------------NSHNALAFEQ- --NLE--RNL-----CQLYDELA-------S------GTY----SPG--------------------------------- -----------------RSICF-VV--TR---------P-K-----------------------PREV------------ ------WAAD----FR-DRIVHHLLYNH-IA-PRFHASF-------I--------------------------------- ---TDSCACIP------------------GR----------------------------GTLYA---AERL----EAKVR RITQ----------------N-----------WSRPAF-YL-KC--------DLANFF-VSID----------------- -KRV--LRDQ-LAAKI----------DEAW------------------WLALA---------EQILFH------------ -------------------------------------------------------------------------------- ------------------DP-----RTDYELHSSPALVDLVPRHKRLAEQP-A---H----------------------- -------------------------------------------------------------------------------- -----------------------------------L-----------GLP-----IGNLS------------------S- QFFANVYL-----------------------------------------------------------NA----------- --------LD--QY-AKHQLR----------------------------------------------------------- -------------------------------------ARH---YIR----------YVDDFIL-------------LHE- -----SAQW----LS---------------------------ATHDQIEA---------------------WLPA----- ----R--L----HVRL------N-P------AK----T---ILQP----------------------------------- -----VS--------RGIDFVGQVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|35398478|locus|VBISidLit69165_0197|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Sideroxydans lithotrophicus ES1] ---------------------------------------------------------------------NSASALAFET- --NLE--RNL-----CRLNDELR-------N------GTY----QPG--------------------------------- -----------------KSICF-VI--TR---------P-K-----------------------PREV------------ ------WAAE----FR-DRIVHHLLYNR-IS-PRFYAGF-------I--------------------------------- ---ADSCACIP------------------GR----------------------------GTLYG---AQRL----EAKIR SITQ----------------N-----------WSKPAH-YL-KL--------DLANFF-VSID----------------- -KHV--VRGL-LAKRI-----------DGW------------------WMELA---------ELVLFH------------ -------------------------------------------------------------------------------- ------------------DP-----RQDFELRGDPYLLRRVPPHKRLTSQP-G---H----------------------- -------------------------------------------------------------------------------- -----------------------------------I-----------GLP-----IGNLS------------------S- QFFANVLL-----------------------------------------------------------DA----------- --------LD--QH-IKHDLR----------------------------------------------------------- -------------------------------------CRH---YVR----------YVDDMVL-------------LHE- -----SPMW----LC---------------------------AARSDIET---------------------WLPL----- ----H--L----GLRL------N-P------VK----T---ILQP----------------------------------- -----VD--------RGVDFVGQVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|45146078|locus|VBIAliDen149934_3700|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Alicycliphilus denitrificans BC] ---------------------------------------------------------------------NSASALEFEM- --RLE--RNL-----CDLYDELV-------S------GAY----QPG--------------------------------- -----------------RSICF-PI--TR---------P-K-----------------------PREV------------ ------WAAS----FR-DRIVHWLLYSH-IA-PRFHAAF-------V--------------------------------- ---ADSCACIP------------------GR----------------------------GTLYG---AQRL----ERHVR SCTR----------------N-----------WARPAH-YL-KC--------DLANFF-VSID----------------- -KHV--LRER-IAARV----------HEPW------------------WMALA---------DTILFH------------ -------------------------------------------------------------------------------- ------------------DP-----RQDVEVRGCASDLRRVPPHKSLFNAP-D---D----------------------- -------------------------------------------------------------------------------- -----------------------------------T-----------GLP-----IGNLS------------------S- QFFANVLL-----------------------------------------------------------DA----------- --------LD--QR-VKHRLR----------------------------------------------------------- -------------------------------------APY---YIR----------YVDDFVL-------------LHP- -----SRTW----LT---------------------------AAHHDIET---------------------WLPE----- ----Q--L----RLQL------N-P------RK----T---IRQP----------------------------------- -----VD--------RGLDFVGQVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|148359926|ref|YP_001251133.1_extraction_extraction -----------------------------------------------------------------KNKRNTMNALRFET- --DYE--SNL-----IALRDELN-------S------STW----HPG--------------------------------- -----------------RSIAF-VI--DK---------P-V-----------------------KREI------------ ------FAAD----FR-DRVVHHWLINQ-LN-PLFEKTF-------I--------------------------------- ---YDSYASRK------------------GR----------------------------GAHLG---IARA----AQFIR KCSL----------------N-----------YQRDCY-VL-KL--------DIMSFF-ICIN----------------- -RRI--LWEG-LRCFI----------ERHYNQS-----------DKKLILEVA---------RKIVEN------------ -------------------------------------------------------------------------------- ------------------EP-----TSNCFIKGKRRDWQDFPKDKSLFYAK-P---H----------------------- -------------------------------------------------------------------------------- -----------------------------------C-----------GLP-----IGNLT------------------S- QVFANFYL-----------------------------------------------------------NP----------- --------FD--HY-IKHNLG----------------------------------------------------------- -------------------------------------VRF---YGR----------YVDDFIL-------------VHE- -----DKMF----LK---------------------------SLIPQMEQ---------------------FLQE----- ----E--L----ELEI------H-P------RK----R---YLQH----------------------------------- -----YR--------KGIPFLGVIL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|87117093|locus|VBIAliFin145170_0906|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Alistipes finegoldii DSM 17242] ---------------------------------------------------------------------NTVNQLRYEL- --NME--HEL-----LELYDDIV-------S------RRY----NPS--------------------------------- -----------------RSICF-LL--TE--------LE-I-----------------------KREI------------ ------FGAN----FR-DRIVHRLIHDQ-IA-PLFERTF-------I--------------------------------- ---ADSYSCRK------------------GR----------------------------GTLYA---IRRL----DHHIR SCSR----------------N-----------YSRPCW-VL-KL--------DVQGYF-FSID----------------- -RKI--LYAM-LRSYL----------ERHWTAYCAAQPAGRYMLDSELLFYLL---------ERVIFH------------ -------------------------------------------------------------------------------- ------------------DS-----TQNCIVRGSRKVWADFPPSKSLFRAA-P---D----------------------- -------------------------------------------------------------------------------- -----------------------------------C-----------GLP-----IGNLT------------------S- QLFSNIYM-----------------------------------------------------------DR----------- --------FD--QW-MKRELK----------------------------------------------------------- -------------------------------------VRH---YGR----------YVDDFFI-------------VHE- -----DRAY----LK---------------------------SLIPVIRD---------------------FLRE----- ----E--L----HLTL------H-P------NK----I---HLQR----------------------------------- -----AD--------RGVLFVGGYV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|87072650|locus|VBIOrnRhi164984_0536|_extraction blr1915; probable reverse transcriptase/maturase family protein [Ornithobacterium rhinotracheale DSM 15997] ---------------------------------------------------------------------NKRSQLCFEW- --QYE--QEI-----DKLYNEIV-------H------YAY----EPS--------------------------------- -----------------APNVF-VT--TH---------P-V-----------------------VREI------------ ------FAPQ----FR-DRVVHHLIYNY-IY-QYLDKKF-------I--------------------------------- ---YDSYSCRL------------------QK----------------------------GTSFG---VKRA----AQFMR RVSE----------------N-----------YTKDAY-VL-KL--------DIRGYF-MQMN----------------- -RNI--LHEK-INQMLDYEALKITNTERVY------------------LQYLI---------RKVVMH------------ -------------------------------------------------------------------------------- ------------------DA-----ARRPRLCASPQEWKKVPCAKSLFHAP-P---N----------------------- -------------------------------------------------------------------------------- -----------------------------------C-----------GLP-----IGSLT------------------S- QLFSNVYL-----------------------------------------------------------NE----------- --------LD--HF-VKSELK----------------------------------------------------------- -------------------------------------IKY---YGR----------YVDDMLF-------------MHR- -----SKDY----LQ---------------------------QVIDSVAC---------------------ELHK----- -------I----GLSL------H-P------QK----I---KLKH----------------------------------- -----YK--------YGVEFLGRYL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|118747050|ref|ZP_01594931.1_extraction_extraction -----------------------------------------------------------------RGKKPSANQLLFEA- --RWL--DNL-----HQLYSSLR-------A------GCW----QPA--------------------------------- -----------------PTVCF-TV--TH---------P-K-----------------------TREI------------ ------HAPA----FA-DRIVHHLLVDR-LQ-RLYEPVF-------V--------------------------------- ---YDSYANRT------------------AK----------------------------GSHAA---VDRL----QQMIR R-------------------------------RNGQGW-YL-QL--------DIHNYF-NSIH----------------- -RPT--LYAL-LCRRLDLALQKGKLADSQRLA----------------LRSLC---------HKLL-------------- -------------------------------------------------------------------------------- -------------------A-----RKSREIERPGAAPSSVPPHKRLRNAR-P---Q----------------------- -------------------------------------------------------------------------------- -----------------------------------C-----------GLP-----VGNLT------------------S- QFFANVYL-----------------------------------------------------------NE----------- --------LD--QF-IKHQLK----------------------------------------------------------- -------------------------------------VRN---YLR----------YVDDFVL-------------LAD- -----SKEQ----LR---------------------------TWQAEIAA---------------------FLET----- ----R--L----QLRL------K-D------A-----V---VLAP----------------------------------- -----LH--------HGVDFLGYRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|90580666|ref|ZP_01236470.1_extraction_extraction -----------------------------------------------------------------KSEQKKTCLLEAGR- --ICH--T---------LASEIL-------T------CQY----QPE--------------------------------- -----------------PYHHF-AI--TE---------P-K-----------------------LREI------------ ------YAPA----FK-DRIVQMWIVSQ-LE-KAISHLM-------I--------------------------------- ---DDTYATQL------------------NK----------------------------GTFAA---INKA----QKLMR KPHY--------------------------------RV-AM-QL--------DIYSYF-NHIN----------------- -KAR--LKDR-LVALI----------ETPPQSIGQFHPVKKHI-----LLYLI---------EQILQQ------------ -------------------------------------------------------------------------------- ------------------DA----ARENNLQTNNHRLLAQIPPHKRLSFSQ-Q---G----------------------- -------------------------------------------------------------------------------- -----------------------------------V-----------GLP-----IGSVT------------------S- QMFGNFYL-----------------------------------------------------------ND----------- --------LD--HF-CKHTLK----------------------------------------------------------- -------------------------------------IKG---YIR----------FMDDLIV-------------LSD- -----NIAQ----LK---------------------------IWKNHIDA---------------------FLKQ----- ----Q--L----LLTL------H-P------HK----T---KIKV----------------------------------- -----IE--------HGVDYLGYTV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|126090247|ref|YP_001041702.1_extraction_extraction -----------------------------------------------------------------SSQQKSANALKFPK- --LCH--E---------LSTEIL-------T------NTY----QPY--------------------------------- -----------------SYHHF-AI--TE---------P-K-----------------------LREI------------ ------YAPA----FR-DRIAQMWIALQ-MA-PIMENQF-------I--------------------------------- ---DDTYANRK------------------GK----------------------------GTLAA---IAKV----QKLMR QPRH--------------------------------TW-GL-QL--------DIYSYF-NSIN----------------- -KKE--LLTQ-LYNLI----------YN-----SNLSALRQYC-----LANLI---------EKIIEQ------------ -------------------------------------------------------------------------------- ------------------DA----TKLQNERTGDQYLLNQIPLHKKLQFNNTQ---Q----------------------- -------------------------------------------------------------------------------- -----------------------------------V-----------GLP-----IGSVT------------------S- QLFGNFYL-----------------------------------------------------------NN----------- --------LD--HE-IKHTLK----------------------------------------------------------- -------------------------------------VKG---YVR----------YMDDLFI-------------LSD- -----SPEK----LQ---------------------------QWKEHIEW---------------------YLTS----- ----H--L----QLKL------H-P------TK----V---HLAP----------------------------------- -----IE--------EGFDYLGFRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|35402002|locus|VBISidLit69165_1918|_extraction hypothetical protein [Sideroxydans lithotrophicus ES1] ---------------------------------------------------------------------GRLRIQRFGE- --DPL--RHL-----ITIQKQLR-------E------RRY----QFG--------------------------------- -----------------PYKTF-TV--RE---------K-K-----------------------FRDV------------ ------VDAP----MK-DRIVHWMLYQY-LL-PIWQPRF-------I--------------------------------- ---HDTFGNLP------------------GR----------------------------GTHAA---LRRL----AQFAR SER--------------------------------AEW-VL-QL--------DISKYF-YSVN----------------- -HAL--LKER-VLRHI----------GDHE------------------LRALI---------INLIDS------------ -------------------------------------------------------------------------------- ------------------FR-------------TDGSYDHLFAESTLYRQT-P---A----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GMP-----IGNLS------------------S- QLFANIFL-----------------------------------------------------------ND----------- --------FD--HW-VKETLR----------------------------------------------------------- -------------------------------------VKR---YVR----------YVDDMAI-------------LGE- -----SREE----LQ---------------------------EVCEQITS---------------------NLAS----- ----E-------GLTI------H-P------HK----I---RIAP----------------------------------- -----TR--------AGVPFLGSIV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|71065017|ref|YP_263744.1_extraction_extraction -----------------------------------------------------------------KSKGGRRGCFQFEK- --SLG--REL-----NELQEELA-------N------NTY----KPR--------------------------------- -----------------PYFKF-IV--YE---------P-K-----------------------KREI------------ ------YAPA----FR-DCVVQYAIYLR-VM-PIFDKTF-------I--------------------------------- ---DQSFACRT------------------GL----------------------------GTHKA---AEYA----QDALR RAGP-------------------------------NTY-TL-QL--------DIKKFF-YSID----------------- -RPT--LRKL-LERKI----------KDKR------------------LVDLM---------MLFADY------------ -------------------------------------------------------------------------------- ------------------P---------------------------------E---P----------------------- -------------------------------------------------------------------------------- -----------------------------------K-----------GIP-----IGNLL------------------S- QMFALIYM-----------------------------------------------------------NP----------- --------VD--HY-ATRVLK----------------------------------------------------------- ------------------------------------PAAG---YCR----------YVDDFLL-------------FGL- -----TRAQ----AL---------------------------TYRKLLTD---------------------FVEQ----- ----K--L----KLTL------S---------R----S---TIAN----------------------------------- -----TK--------RGANFCGYRT------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|22394887|locus|VBILepCho83238_2278|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Leptothrix cholodnii SP6] ---------------------------------------------------------------------ARFACHAFER- --RLG--AQL-----SDLAQSLE-------A------GTY----APR--------------------------------- -----------------PYNTF-MV--HE---------P-K-----------------------PREI------------ ------SAPA----FR-DRVVQHAVYNV-IQ-PIFDRTF-------I--------------------------------- ---DQSFACRP------------------GA----------------------------GTHAA---ADYV----QHGMQ ISRP-------------------------------DSY-TL-HL--------DVRRFY-YSID----------------- -RGI--LRAL-VERKL----------KDRR------------------LVDLM---------MAFAEM------------ -------------------------------------------------------------------------------- ------------------P---------------------------------G---P----------------------- -------------------------------------------------------------------------------- -----------------------------------V-----------GLP-----IGNLL------------------S- QLHALIYL-----------------------------------------------------------NP----------- --------LD--HY-IKRELG----------------------------------------------------------- -------------------------------------VRL---YCR----------YVDDLLL-------------LDL- -----SRDE----AI---------------------------AHRDAIEH---------------------YLAD----- ----H--L----RLQL------S---------K----A---TMAP----------------------------------- -----TR--------RGVNFVGYRT------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|23962770|locus|VBIThiSp19295_1623|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Thioalkalivibrio sulfidophilus HLEbGr7] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------------------------M-------I--------------------------------- ---DDSYACRP------------------GK----------------------------GTHVG---ATRV----EQWLR GMT-----------------A-----------AGGAVW-VV-KM--------DVSKYF-ASIR----------------- -HDL--AKAV-VRDKI----------SCPA------------------TLQLI---------DAIIDS------------ -------------------------------------------------------------------------------- ------------------TA---------------------------DPADPD---P----------------------- -------------------------------------------------------------------------------- -----------------------------------V-----------GIP-----VGNLL------------------S- QWIANLVG-----------------------------------------------------------NR----------- --------ID--QW-AKRELR----------------------------------------------------------- -------------------------------------LKR---YAR----------YMDDMVV-------------LVR- -----TKQE----AL---------------------------TIRDQFDD---------------------KLA------ ----S--M----GMRF------S---------K----A---SVLP----------------------------------- -----AS--------RGVNFLGYRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|91778076|ref|YP_553284.1_extraction -------------------------------------------------------------------GIDRINRSTFER- --NLN--AEI-----ALIRRKAG-------N------RSY----HFS--------------------------------- -----------------QFKEK-LI--SK--------GANR----------------------FPRVI------------ ------SIAT----FR-DRITLRAMCDI-LK-ARFE-------------------------------------------- ---------------------------------------------------------------------------GLLEL KIPQIVIH----DLKREVAS------------GKYNYF-I--KL--------DVQNFY-PSIS----------------- -HAI--LRET-VKKRI----------KSAN------------------VISLL---------DSALET------------ -------------------------------------------------------------------------------- ------------------PT---------------VAVA--DKGRRAEPV------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGLSI------------------S- NILAEIFL-----------------------------------------------------------HR----------- --------FD--IW-MEKKLG----------------------------------------------------------- -------------------------------------VK----YFR----------YVDDVFA-------------LC-- -----TSDP----AP---------------------------FFSEMSRT---------------------L-EG----- ----D--F----SLKV------H-DIS-LPGSK----S---VFGP----------------------------------- -----IG--------EEFSFLGYLF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|118697871|ref|ZP_01555949.1_extraction -------------------------------------------------------------------GIDRINRPTFEK- --VLR--SEI-----ALIRRKSA-------N------STY----RFS--------------------------------- -----------------QYREK-LV--SK--------GSGR----------------------PPRVI------------ ------SIAT----FR-DRITLRALCDF-LR-ARFQ-------------------------------------------- ---------------------------------------------------------------------------DQLNL QIPQVVIH----RLKEEIAT------------GKYSSF-I--KL--------DVRDFY-PSIP----------------- -HAL--LRAQ-VTTRL----------KSHK------------------AVALL---------ERAIET------------ -------------------------------------------------------------------------------- ------------------PT---------------VSVA--DKSKAAVAV------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGLAI------------------S- NLLAEIFL-----------------------------------------------------------HP----------- --------FD--TA-VASKVN----------------------------------------------------------- -------------------------------------VA----YFR----------YVDDVFV-------------LT-- -----SEPP----AP---------------------------LFEELRSA---------------------L-ES----- ----D--F----GLSV------H-PVG-TDGSK----S---TFGL----------------------------------- -----VT--------NEFTFLGYLF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|153215166|ref|ZP_01949864.1_extraction -------------------------------------------------------------------GIDNMNQYAFRK- --QLD--PQV-----EIISRKMI-------E------GSY----AFS--------------------------------- -----------------KYKLK-LL--TK--------GRNK----------------------APREI------------ ------SIPT----VR-DRIALRAMCNF-LQ-DRFA-------------------------------------------- ---------------------------------------------------------------------------ESVKF TLPQDVIK----DVKAHALS------------GDFDSY-I--KL--------DVSNFY-PSIR----------------- -HKN--LRSQ-LRKRI----------RQDH------------------ILDMI---------FSAVTA------------ -------------------------------------------------------------------------------- ------------------PS---------------VLVS--RKDDRPSEV------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGLAV------------------S- NILAAIYL-----------------------------------------------------------QN----------- --------ID--KF-LSELPN----------------------------------------------------------- -------------------------------------VK----CYR----------YVDDVLV-------------LC-- -----SADQ----AH---------------------------DLAQIIIK---------------------K-FR----- ----N--I----GLKI------H-PVK-VP-EK----S---KIDS----------------------------------- -----LT--------NGFDYLGYQF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|18338818|locus|VBIEscCol51957_5061|_extraction FIG00639787: hypothetical protein [Escherichia coli IAI39] -------------------------------------------------------------------GIDRIRPSKLDL- --TIK--NEI-----TFIFEKVN-------C------GNY----KFT--------------------------------- -----------------AYKEK-LI--SK--------GANS----------------------IPRQI------------ ------SIPT----AR-DRITLRALCEC-LT-EIYP-------------------------------------------- ---------------------------------------------------------------------------NS-RL KLPHTVID----LLKNALNS------------DLYAEY-A--KI--------DLKSFY-PSIE----------------- -HKL--IFNA-IKNKI----------RKKE------------------IRQLI---------TSSLIV------------ -------------------------------------------------------------------------------- ------------------PT---------------VSGSTGSKGVPNNTR------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGLAI------------------S- NILAEISL-----------------------------------------------------------SD----------- --------FD--NE-INKMHD----------------------------------------------------------- -------------------------------------IW----YMR----------YVDDILI-------------LT-- -----QKDQ----AT---------------------------KIASHIID---------------------K-LQ----- ----S--L----NLNP------H-PLN-EENSK----S---KVGS----------------------------------- -----LD--------ESFNFLGYHI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|186391424|locus|VBIPseAer311403_5632|_extraction FIG00639787: hypothetical protein [Pseudomonas aeruginosa WC55] -------------------------------------------------------------------GLDRTRPAKLDA- --KLN--GEL-----ELIIQKVH-------Q------GKY----RFT--------------------------------- -----------------AYKEK-LI--LK--------GATS----------------------LPRQI------------ ------SIPT----AR-DRIVLRALCEC-LA-EVYP-------------------------------------------- ---------------------------------------------------------------------------TA-KL SLPQGVIE----DLKSALAS------------GVYSEY-A--KI--------DLKHFY-PSIP----------------- -HSL--IDAA-VRKKI----------RKPE------------------LKQLI---------ASAITT------------ -------------------------------------------------------------------------------- ------------------PT---------------VSESKGRKDAPNTTI------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGLAV------------------S- NLLAEIAL-----------------------------------------------------------QD----------- --------ID--LA-FKSRPD----------------------------------------------------------- -------------------------------------IW----YKR----------YVDDILI-------------LT-- -----PKDQ----SE---------------------------AVANELIN---------------------G-LK----- ----K--M----GLQP------H-EFG-PE-SK----S---KFAP----------------------------------- -----LT--------EPFSFLGYQV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|187071036|locus|VBIPseSyr250467_3903|_extraction FIG00639787: hypothetical protein [Pseudomonas syringae pv. theae ICMP 3923] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -MLAEIAL-----------------------------------------------------------QD----------- --------ID--NL-FDARTG----------------------------------------------------------- -------------------------------------IW----YKR----------YVDDILI-------------LG-- -----PAGV----AR---------------------------STAHELIA---------------------E-LK----- ----A--L----KLNP------H-DFE-TG-SK----S---KIES----------------------------------- -----LT--------DPFSFLGYQI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|23100091|locus|VBIRhiEtl120572_3276|_extraction hypothetical protein [Rhizobium etli CIAT 652] -------------------------------------------------------------------GKDGIDPGAFQK- --NID--TEL-----ALIRKRVY-------A------QSY----RFT--------------------------------- -----------------HFKQR-LI--SK--------GAGK----------------------PPREI------------ ------AIAG----VR-DRVTLRAVTNV-LM-DVFH-------------------------------------------- ---------------------------------------------------------------------------DA-KL SPAHFIIK----DLLDFVRPL-----------GDEYVF-L--QL--------DVQDFY-PSLD----------------- -HQL--LLKR-IRTRT----------RYKY------------------FTSLV---------DAAVKT------------ -------------------------------------------------------------------------------- ------------------PT---------------------GDGKTANDV------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGLSV------------------S- NILSSIYM-----------------------------------------------------------MQ----------- --------ID--E---TARAR----------------------------------------------------------- -------------------------------------FH----YYR----------YVDDILV-------------IC-- -----KASD----AT---------------------------RHFRWLKG---------------------R-LS----- ----K--A----HLTC------H-PL--VEGSK----S---KIVP----------------------------------- -----LS--------IGIDYLGYHIT------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >gi|71737296|ref|YP_277064.1_extraction -------------------------------------------------------------------GIDRINGFQFSC- --RSA--SEL-----EVVSAKCL-------D------STF----RFA--------------------------------- -----------------PFLEQ-LK--LK--------GRGK----------------------EPRVI------------ ------SIPI----VR-DRIVLHQLQKY-LA-LIFP-------------------------------------------- ---------------------------------------------------------------------------EQVPR NVASSYVRQVALDVS----TM-----------DAASAW-VC-ST--------DIQKFY-DSID----------------- -QDR--TVML-VGRKI----------KIPQ------------------VLNLI---------AHALRT------------ -------------------------------------------------------------------------------- ------------------PT---------------VPKNTPRNKHSVYKQEV---------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGLAI------------------S- NILAAIYM-----------------------------------------------------------AD----------- --------VD--RA-MRSVPG----------------------------------------------------------- -------------------------------------LH----YYR----------YVDDVLM-------------YG-- -----EESL----VT---------------------------TSFNSLKR---------------------R-LM----- ----R--R----GLHL------H-GLS-S--DK----T---KFGP----------------------------------- -----LD--------QDFSYLGYVF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|126356707|ref|ZP_01713711.1_extraction -------------------------------------------------------------------GIDRINGFQYAA- --RSE--IEL-----EVVSKKCR-------D------GSF----RFS--------------------------------- -----------------PFLEK-LK--LK--------GRGK----------------------APRVI------------ ------GIPT----VR-DRVVLNQLHRY-LA-IIFP-------------------------------------------- ---------------------------------------------------------------------------HTVPR NVASTYVRAVADDMR----TC-----------DPKLTW-VC-CT--------DIERFY-DSIN----------------- -QQR--LIRI-LKRKI----------KCNE------------------AIDLI---------AHALQT------------ -------------------------------------------------------------------------------- ------------------PT---------------VPKNTGRLQYRIFKQIH---------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGLAI------------------S- NLLAAIYM-----------------------------------------------------------AE----------- --------VD--QD-MISVPG----------------------------------------------------------- -------------------------------------IK----YYR----------YVDDVLI-------------YG-- -----EMGP----VT---------------------------AAYESLRR---------------------R-LK----- ----L--R----GLSL------H-GRT-S--DK----T---HIGP----------------------------------- -----ID--------QRFSYLGYEF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115520115|locus|VBICalSp227687_6109|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Calothrix sp. PCC 6303] -------------------------------------------------------------------GIDRLSGIQFRK- --QSK--SQF-----KVIKKKCL-------N------GTY----RFS--------------------------------- -----------------PYLEL-LR--SK--------GRDK----------------------PPRLL------------ ------AIPT----VR-DRIVLYALKEI-LF-QIFP-------------------------------------------- ---------------------------------------------------------------------------DCVPR KLANTYIY----DIKKFVSSR-----------SPSEVS-IL-RA--------DVENFY-GSIN----------------- -REK--LFIK-LKKRI----------KSYK------------------LLSLI---------KKAIET------------ -------------------------------------------------------------------------------- ------------------PI---------------VPNNYSRKDRKEYVKESK---G----------------------- -------------------------------------------------------------------------------- -----------------------------------I-----------GIP-----QGLSI------------------S- NILAHIYL-----------------------------------------------------------YD----------- --------FD--HL-IK---N----------------------------------------------------------- -------------------------------------HNCVNVYYR----------YVDDILI-------------FS-- -----EKDK----ID---------------------------KIEILFKS---------------------E-LK----- ----N--I----DLNC------N-----F--NK----T---YKKS----------------------------------- -----GQ--------EEFEYLGYRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|22613911|locus|VBIMetMob89187_1849|_extraction hypothetical protein [Methylotenera mobilis JLW8] -------------------------------------------------------------------GIDRLNGSQFEK- --QSV--NHL-----KTINKKCL-------S------GTY----KFS--------------------------------- -----------------PYAEV-LQ--LK--------GRGK----------------------SPRLI------------ ------GIPT----LR-DRLVLNQLKDV-LA-HIFP-------------------------------------------- ---------------------------------------------------------------------------DCVPK TRANTLIHKISREIKQLNYQA-----------NDEEVM-VF-GC--------DIKGFY-DEID----------------- -RNI--LLEI-LKKRI----------KSVK------------------VIKLI---------LSAISN------------ -------------------------------------------------------------------------------- ------------------PI---------------VPRNYRNKNLDDFTTEK---------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGLAI------------------S- NILAAIYL-----------------------------------------------------------SE----------- --------FD--TE-IRKVSN----------------------------------------------------------- -------------------------------------H-----YFR----------YVDDILV-------------IH-- -----KSND----AE---------------------------NVKSIIES---------------------K-LN----- ----L--L----GLQI------H-PLG-S--GK----T---HFSK----------------------------------- -----LS--------EEFGYLGYLF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|21773733|locus|VBIDesVul86729_1889|_extraction hypothetical protein [Desulfovibrio vulgaris str. 'Miyazaki F'] -------------------------------------------------------------------GLDRITPFAFDK- --NSN--QFI-----TLISNKAL-------K------NTY----KFT--------------------------------- -----------------PYLER-LK--LK--------GRDS----------------------FPRVI------------ ------SVPT----IR-DKLTLTALNKY-LQ-TIFP-------------------------------------------- ---------------------------------------------------------------------------EHVNR RVPNQRIR----DLHNLLSTI-----------DTSSVY-IA-RA--------DISDFF-GKID----------------- -RSL--LLSK-IQHHV----------DV-R------------------ALSLI---------EKTLQN------------ -------------------------------------------------------------------------------- ------------------PT---------------VPYNYQRMEISKYVNTQ---------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGLPL------------------S- SILAEFYI-----------------------------------------------------------ST----------- --------ID--SE-LRPLTL----------------------------------------------------------- -------------------------------------L-----YDR----------YVDDIIA-------------IS-- -----NKN------D---------------------------TFEKSITH---------------------A-LD----- ----T--L----GLTL------N-----I--RK----T---KFKI----------------------------------- -----ID--------ATFNYLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|21758211|locus|VBIDesSal121003_1367|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfovibrio salexigens DSM 2638] -------------------------------------------------------------------GIDRVSVVDFDK- --RKD--EYF-----NNIHVKCL-------N------GTY----KFS--------------------------------- -----------------PYLQK-LI--LK--------GPES----------------------FPRKI------------ ------SIPT----VR-DKIVLSILNSI-IQ-EIFP-------------------------------------------- ---------------------------------------------------------------------------HCVNR ELPNVKVR----NLKNVIATC-----------N-CDYE-FH-RV--------DIKSYY-DNID----------------- -IEQ--LFGI-LNGNI----------DDEL------------------LLLLL---------HRAVIN------------ -------------------------------------------------------------------------------- ------------------PT---------------VPQNYSRSEKSKYANKDK--------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGLPI------------------S- NVLAEIYL-----------------------------------------------------------LD----------- --------FD--EY-MNDKCK----------------------------------------------------------- -------------------------------------F-----YDR----------YVDDIVA-------------LA-- -----L--P----IV---------------------------DFEKSCED---------------------K-FK----- ----I--K----NLPL------N-----T--DK----T---KFCC----------------------------------- -----EV--------SEVNYLGYAI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|21876762|locus|VBIExiSp39724_0818|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Exiguobacterium sp. AT1b] -------------------------------------------------------------------GIDKLQKKKFDL- --IKN--DQI-----QVIVNKVR-------S------GTY----KFT--------------------------------- -----------------FYKEN-LI--IK--------NRDS----------------------LPRMI------------ ------SIPT----LR-DRIVMKVLHEI-LR-DTF--------------------------------------------- -------------------------------------------------------------------------------- ---KIELKLVQSVIKKLTEES-----------TKYDSY-I--KI--------DISNFF-GTLD----------------- -QDL--LMKK-IKKRV----------RKKE------------------ILCLI---------EDSIKT------------ -------------------------------------------------------------------------------- ------------------PT---------------V-NSYHRKENEIIGNFK---------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGIPI------------------S- NVLAEIYL-----------------------------------------------------------KD----------- --------LD--YL-YLQRQD----------------------------------------------------------- -------------------------------------LA----YFR----------YVDDILI-------------LC-- -----NKSD----VF---------------------------DIESEIKK---------------------YILE----- ----E--Y----NLNI------N-----F--QK----S---TSGS----------------------------------- -----LY--------DGINYLGYTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|32431149|locus|VBISpiLin97822_0631|_extraction hypothetical protein [Spirosoma linguale DSM 74] -------------------------------------------------------------------GIDRVSQEVFNT- --RKN--EEI-----SLILSKVL-------N------GSY----KFS--------------------------------- -----------------PFLEE-LR--IR--------SRDR----------------------LPRLI------------ ------SIPT----IR-DRLVLATLKIA-LH-EYFT-------------------------------------------- ---------------------------------------------------------------------------ESVRK QVASIYINEVNKTL-----SL-----------NTFTYY-I--KI--------DIKSFF-DSLS----------------- -HTR--LFTI-LNS-I----------LPET------------------LYLLV---------KKAILN------------ -------------------------------------------------------------------------------- ------------------PT---------------VPKDCLSVEVEKYIPKA---------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGISI------------------S- NILAEIYL-----------------------------------------------------------RD----------- --------FD--IL-YGMRED----------------------------------------------------------- -------------------------------------IK----YFR----------YVDDILI-------------LS-- -----NN-------L---------------------------DLYSEVLH---------------------K-LE----- ----L--L----FLEP------N-----G--NK----C---SQGN----------------------------------- -----IC--------DGFHYLGFHI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|23360423|locus|VBIRosDen86677_1106|_extraction reverse transcriptase [Roseobacter denitrificans OCh 114] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------MP-----QGLSI------------------S- NILSMLYL-----------------------------------------------------------EG----------- --------FD--HY-FAQKYR----------------------------------------------------------- -------------------------------------------YFR----------YVDDILL-------------VV-- -----GEEQ----AV---------------------------GIHDEIAR---------------------HMRT----- ----E--L----RLKT------H-DLDPNQPDK----T---AITP----------------------------------- -----VK--------IGTEYLGYHIS------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Gfid|189292173|locus|VBITerTur232417_0522|_extraction reverse transcriptase family protein [Teredinibacter turnerae T8415] -------------------------------------------------------------------KRHRKSVLRFEK- --DLD--NNL-----NRLSQDIV-------D------KKL----PYG--------------------------------- -----------------RFRSF-TI--YD---------P-K-----------------------KRLI------------ ------HAAC----FD-DRVFHHAFMNY-AA-PVLEKAM-------S--------------------------------- ---PTSYTCRH------------------GF----------------------------GVHQA---IGKA----QKCLR AFP----------------------------------Y-VI-KI--------DIEGYF-PAIP----------------- -HVK--LQCA-LTRKF----------KGSD------------------ALGQI---------HRIVNS------------ -------------------------------------------------------------------------------- ------------------HS------------------------------SHP---G----------------------- -------------------------------------------------------------------------------- -----------------------------------H-----------GLP-----IGSLT------------------S- QYFAIYYL-----------------------------------------------------------DS----------- --------LD--RF-MENHAK----------------------------------------------------------- -------------------------------------VMA---QVR----------YMDDILW-------------WCV- -----SKQH----AL---------------------------ETLVEIEN---------------------EL-A----- ----A--L----GLRI------K-S-------N----V---QIQK----------------------------------- -----SS--------LGVTFCGYRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|21262415|locus|VBICanAcc132554_3443|_extraction hypothetical protein [Candidatus Accumulibacter phosphatis clade IIA str. UW1] ---------------------------------------------------------------------QRPAVARFLA- --DLD--GQL-----DHLAACIL-------N------GQA----PQG--------------------------------- -----------------QYTSF-TI--HD---------P-K-----------------------RRLI------------ ------HAAC----FA-DRVLQHAILNL-AE-PRFEAML-------V--------------------------------- ---DSTYACRP------------------GK----------------------------GVHAA---ARQV----QRNLQ RFA----------------------------------W-RV-QV--------DVDSYF-PSID----------------- -HAC--LKAL-LATRF----------KGAG------------------FLALL---------GRIIDT------------ -------------------------------------------------------------------------------- -------------------A------------------------------GDA---G----------------------- -------------------------------------------------------------------------------- -----------------------------------R-----------GLP-----IGSLT------------------S- QHFANAYL-----------------------------------------------------------DT----------- --------AD--RR-LLEDRR----------------------------------------------------------- -------------------------------------VRA---HVR----------YMDDILW-------------WCD- -----SRAD----AL---------------------------ATLAELND---------------------FLRR----- ----E--R----GLQL------K-P-------K----V---SIAP----------------------------------- -----SR--------TVVAWCGFRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|43173932|locus|VBIRumSp156992_1069|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Ruminococcus sp. SR1/5] ------------------------------------------------------------------GKRENTERLRFYD- --NRE--GNL-----EEISTLLR-------A------GKV----PKV--------------------------------- -----------------EYHSF-YV--YV---------P-K-----------------------VRKV------------ ------IFID----YW-SKVVQRAIYDV-LN-PKICRTF-------I--------------------------------- ---EHTYACVK------------------GR----------------------------GQLAA---MEQL----YTWMR ETR-----------------T-----------SGTEWY-YY-KF--------DVAKFF-YRID----------------- -HEI--LMDI-CRKKI----------DDPR------------------TVDLL---------GYYINN------------ -------------------------------------------------------------------------------- ------------------DA---------------VPFGMPLDANQLTITEEQMLYD----------------------- -------------------------------------------------------------------------------- -----------------------------------L-----------GIP-----IGGGL------------------S- HMLGNMYL-----------------------------------------------------------DP----------- --------LD--QF-CKRVLG----------------------------------------------------------- -------------------------------------IKR---YIR----------YMDDIII-------------LDN- -----DKER----LK---------------------------GYGRRMTQ---------------------FLEE----- ----R--L----HLNF------N-N-------K----T---ALRP----------------------------------- -----VR--------VGCEFVGFVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|118066942|ref|ZP_01535200.1_extraction ------------------------------------------------------------------VGRKGRGEERLAQ- -------------------------------------------------------------------------------- -----------------------WS--AQ--------------------------------------------------- -----------------DALVLKWVALR-IEGLLPTHAR-------C--------------------------------- ---AHLKGHGG------------------GR----------------------ESV--RQAARA---LASG--------- ----------------------------------EFNF-VY-RT--------DIRGYY-RHIR----------------- -KEQ--LLSQ-I-QSYV---------ADPV------------------LHDLL---------RQYLYY------------ -------------------------------------------------------------------------------- ------------------SV------------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------EDGGEFHTPKQ-----------GIC-----RSCPL------------------S- PLFGASLL-----------------------------------------------------------YH----------- --------VD--AH-FSAQEG----------------------------------------------------------- -----------------------------------------IFYSR----------YMDDFLL-------------LTR- -----TRWP----LR---------------------------RAVKKLHQ---------------------FF-N----- ----L--G----GFET------H-P------DK----T---QLGRIE--------------------------------- ---------------QGFDWLGVEF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|151571856|gb|EDN37510.1_extraction_extraction --------------------------------------------------------------------------KQICK- --------------------------------------------SDN--------------------------------- -----------------TKTIV-WD--TE--------------------------------------------------- -----------------DSIVLKYLAEN-IKANNIFSKY-------V--------------------------------- ---VSNKASGG------------------IP----------------------KAI--RVINEY---IKT---------- -----------------------------------HKY-IY-RS--------DIKSFY-NSIN----------------- -HKI--LLSK-L-HKYT---------AIN-------------------EYKLI---------ARHLDR------------ -------------------------------------------------------------------------------- ------------------IQ------------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------WQDGEYVEIKQ-----------GIS-----KSSPL------------------S- PVLGCIYL-----------------------------------------------------------DE----------- --------LD--KA-MQKLD------------------------------------------------------------ -----------------------------------------IKYLR----------YADDWII-------------LAK- -----TKHK----LR---------------------------KAVKICKQ---------------------IL-S----- ----K--L----KLEE------H-P------DK----TDYRNFNNPN--------------------------------- ---------------KTFNFLGIEF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|119476243|ref|ZP_01616594.1_extraction_extraction ---------------------------------------------------------------------------VVTK- --------------------------------------------ADG--------------------------------- -----------------ETLHL-WS--SQ--------------------------------------------------- -----------------DALVLKMLAMA-LPEALSLSSL-------C--------------------------------- ---THIKGHGG------------------LK----------------------ATV--SALHAA---LPD---------- -----------------------------------YRY-VM-KT--------DVKRYY-ESID----------------- -HTI--LLKQ-L-DKDI---------TDPF------------------IWRLL---------VQFVKR------------ -------------------------------------------------------------------------------- ------------------TV------------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------ERGGTFKSIHC-----------GIS-----RGCPL------------------S- PIIAAYYL-----------------------------------------------------------KA----------- --------LD--KQ-MEGNTR----------------------------------------------------------- -----------------------------------------YFYRR----------YMDDVIV-------------LAK- -----TRWH----LR---------------------------KAVRTVNQ---------------------HF-N----- ----Q--L----KVEQ------A-P------DK----T---FIGRIE--------------------------------- ---------------KEFDFLGYRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|123441511|ref|YP_001005497.1_extraction_extraction -----------------------------------------------------LLNKDDPRKFRIGRNSDEYIIN-LKQC ADRINEDAYL-----FSGCH-QS-------KINGKNI--FIL-K-EF--------------------------------- -----------------VDLI--SI--RK--------ANDN----------------------IKRIF------------ ------SIKQ----ADRNDIVTRVKIML-SE------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------PIPFYIY-KL--------DIKSFY-ESID----------------- -RGF--VLKK-IYSNSL---------VSYR------------------TKRVI---------SKFFET------------ -------------------------------------------------------------------------------- ------------------D------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------EIKNIN-----------GLP-----RGVGL------------------S- ATLSELFL-----------------------------------------------------------EG----------- --------FD--KI-IKINSN----------------------------------------------------------- ----------------------------------------IYYYSR----------YVDDVII-------------FSH- ---KKIDEF----IA---------------------------FFEKLLP------------------------GS----- -------------LSL------N-K------DK----CREIII---HNEKSKNSS---Q--------------------- ----------------SFSYLGYNF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|189841748|locus|VBIAltMac287510_4038|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Alteromonas macleodii str. 'Ionian Sea UM4b'] ------------------------------------------------------------KKFQPGTEYNNFVADTYEKI EGKLTFDDYK-----FSQFV-KK-------SFKGKDG--WAF-A-SA--------------------------------- -----------------ADEL--VA--KK--------LNDN----------------------IRRLF------------ ------KVRP----SDRHAIVKQTIALA-KD------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------SQPITIA-RL--------DIKDFY-ESLD----------------- -RTS--IVKF-ITEEWL---------LSHQ------------------NRMVL---------KQWDKQ------------ -------------------------------------------------------------------------------- ------------------L------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------ASQGIQ-----------GLP-----RGMSL------------------S- STLSEVRI-----------------------------------------------------------RH----------- --------FD--KQ-MKLDSK----------------------------------------------------------- ----------------------------------------VYFYAR----------YVDDIIV-------------FYS- ---GEQAQL----ES---------------------------VMKERLK------------------------AS----- ----A--K----ELTL------N-T------DK----SFYTVL----NDPTNGAS---A--------------------- ----------------DIDYLGYKL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|53718225|ref|YP_107211.1_extraction_extraction -----------------------------------------------------QLLALKAELVVLKEDKSSAI-ILMGDI SQKVLQPSFK-----IDLSQ-KT-------GPKGKPV--FCI-D-AD--------------------------------- -----------------PETFF-VI--KQ--------LQHN----------------------IHRIY------------ ------SIKQ----ANRHDLVCQLRDML-GS------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------KFPFELV-RT--------DISSFY-ESIN----------------- -RKH--LVEK-LDRDQL---------LSPA------------------SKKYI---------KQALDS------------ -------------------------------------------------------------------------------- ------------------Y------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------GTI--------SG--A-----------GIP-----RGVGI------------------S- AYLAELYL-----------------------------------------------------------RP----------- --------VD--KA-IRAIPG----------------------------------------------------------- ----------------------------------------LVLYCR----------FVDDIVA-------------VFA- --RPPIGKS----LG---------------------------SFKDRIIA---------------------I-FG----- ----D--S----GLAH------N-S------AK----TSEFKL--ADTSTKK---------------------------- -----------------FEYLGYRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|87348281|locus|VBIMetSp147316_1516|_extraction hypothetical protein [Methylophaga sp. JAM7] ------------AGLYFPDLEPHTLAVRNKVQEIRAYRSRESS--IKAEDFKQNLDALKAELVHLKATKSAAIDEKMDDI SLKVLQPSFK-----IELSQ-KT-------GPKGKLV--YCI-D-SK--------------------------------- -----------------PETFF-VI--KQ--------LQRN----------------------IYRIY------------ ------GVKQ----ANRHDLVCQVRDTV-GT------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------KFPFEMV-RT--------DISTFY-ESID----------------- -RKR--LIKK-LDKDQL---------LSPS------------------SKKFI---------NQVLDS------------ -------------------------------------------------------------------------------- ------------------Y------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------GVL--------SGTST-----------GIP-----RGVGI------------------S- AYLAELYL-----------------------------------------------------------RP----------- --------VD--KA-IRAIPG----------------------------------------------------------- ----------------------------------------LILYCR----------FVDDVVA-------------IFA- --RPPTGTD----LG---------------------------SYKDHVIK---------------------V-FA----- ----E--N----GLTH------N-L------DK----TYEFDL--KRQEPKK---------------------------- -----------------FEYLGYRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|87263720|locus|VBIFleLit174749_0250|_extraction hypothetical protein [Flexibacter litoralis DSM 6794] -----------------------------------------------------LLDRLNSEKDELKKNREKALEQILIEI AERTDVEDYE-----LKIEK-GQ-------IKWGSQL--YEI-E-HN--------------------------------- -----------------PENYF-VA--KQ--------LQRN----------------------IFKTF------------ ------KVKQ----ANRKIIIDQLRLLL-DD------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------GFPKIII-RT--------DIKKFY-ESIP----------------- -HKE--LLAK-IEENSL---------LSYP------------------SKKMI---------RRVLNQ------------ -------------------------------------------------------------------------------- ------------------YW------------------------------------------------------------ -------------------------------------------------------------------------------- --------------------GILIADGVKTNSDERV-----------GIP-----RGIGF------------------S- AYLAELYL-----------------------------------------------------------RS----------- --------FD--KK-IKSLSN----------------------------------------------------------- ----------------------------------------VTYYCR----------YVDDIVI-------------IITP KHRNETKTV----L----------------------------TYQNEVKN---------------------ILLS----- ----S--T----KLKI------N-T------SK----TKVIDLTPSNQERKKSIT---Y--------------------- ----------------NLTYLGYKF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|45429733|locus|VBIVibFur169925_2671|_extraction hypothetical protein [Vibrio furnissii NCTC 11218] ------------EREYFEKAYAERLKIRVLKKFKYRILSKFKKGVITKEFYEKRVDLIRKLLEVRKGRYNSFVNDEIESI CNKVNRKRYN-----LPLSKLPD-------QISGKDV--FTI-G-KS--------------------------------- -----------------VESVF-VT--RH--------LHNI----------------------IRSTY------------ ------NIKQ----SHRDLIVSRVKSIC-LD------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------KSPKYIV-KA--------DVKDFY-ESIE----------------- -HEL--IIKK-LHASTK---------LSVL------------------PKRIL---------TQLLRS------------ -------------------------------------------------------------------------------- ------------------Y------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------AKL--------VGCDK-----------GLP-----RGVGL------------------S- AYLSEVYM-----------------------------------------------------------KD----------- --------LD--EE-LVNNPD----------------------------------------------------------- ----------------------------------------LTFYSR----------YVDDLIL-------------IYS- ---PDVIKD----KS---------------------------HYLKVVDE---------------------A-VG----- ----R--K----KLKL------N--------NK----TKEIDL--TVDKAQE---------------------------- -----------------FEYLGYRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|42313451|locus|VBIDicDad25310_0565|_extraction hypothetical protein [Dickeya dadantii 3937] ------------EVEYFPKAYNIRKKISRLKKFINWLSSKNSQ--ISSLNLEARKDKANKIIEIRKKQYDTEVNEQLSSI SKTVSVKGYN-----LPLTLLPQ-------QVKNKDV--YSI-G-NG--------------------------------- -----------------VEELF-VS--MQ--------IKNI----------------------LNSLF------------ ------NIKV----NNRDLIISRLSALT-KE------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------ISPKYII-RA--------DVEKFY-ESIN----------------- -HKD--LLEI-LHSSPK---------LSVP------------------PRRVI---------TQLIRK------------ -------------------------------------------------------------------------------- ------------------Y------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------QSL--------TGSDK-----------GLP-----RGVGI------------------S- AYLSELYM-----------------------------------------------------------TI----------- --------ID--NK-IKSLLD----------------------------------------------------------- ----------------------------------------ITYYER----------FVDDFIV-------------VFC- ---PSKEKN----TG---------------------------SYLQQIAS---------------------I-IN----- ----E--R----GLTL------N--------NK----TTEIDL--FTQTNKN---------------------------- -----------------FEYLGYKF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|151560523|gb|ABS14021.1_extraction_extraction --------------------------------------------------------------VFRKQDYRSYDIKNIEEV KVRLLQAENE-----SSLKFNIA-------HTKGRAL--Y-F-P-QD--------------------------------- -----------------YETEI-II--RK--------ANTN----------------------IKKIL------------ ------GINP----VSRNDIIRHLKEIL-RE------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------GVPYVIG-RY--------DIKRFY-DNIK----------------- -------ISA-LDESLS---------TTYD------------------TRRLV---------SGFLSS------------ -------------------------------------------------------------------------------- ------------------H------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------EALYSS-----------GLP-----TGISL------------------S- ATLSELYL-----------------------------------------------------------RN----------- --------FD--RG-IKALPW----------------------------------------------------------- ----------------------------------------VRYFAR----------YVDDIII-------------IAE- --------------R---------------------------TTAQLMES---------------------ALIS----- ----GLPD----GLAL------NGK------DK----RYFRKL--ERDFGGAGPE---A--------------------- ----------------DFDYLGYRF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|50119996|ref|YP_049163.1_extraction_extraction --------------------------------------------------------------RIKQSDFYKCKNLVDEKV FDQVIEESYQLAHGLTAPVISKT-------ISKGKEV--YYV-D--R--------------------------------- -----------------LSYKL-IL--RK--------LQGN----------------------IRNKI------------ ------ETDK----LQRNEIVRNLVSYL-QE------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------GVKSKVI-CI--------DLKSFY-ESID----------------- -IDS--LLSE-TAKIGS---------LSYH------------------SKKLI---------EVVLDE------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------HRSI-GGK-----------GVP-----RGLEL------------------S- SLLADLYL-----------------------------------------------------------QE----------- --------FD--EW-IKRIDG----------------------------------------------------------- ----------------------------------------VFLYKR----------FVDDILI-------------MTD- -----HKVD----EQ---------------------------SILTSIKN---------------------KLPA----- ----N--------LCI------N-N------LK----TQIIE-IKKRTSPNDVEGKLAA--------------------- ----------------TIDYLGYKI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|32374434|locus|VBISalEnt87589_2944|_extraction hypothetical protein [Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633] -------------------------------------------------------------YYFSKGNSEKLESLINDAV L--IANENFR-----SGVSVKKL-------NIKGRCV--YSA-S--N--------------------------------- -----------------LKEKL-IL--RH--------CNSN----------------------LKCLE------------ ------SLLP----KQRNKIIDELKLYL-RE------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------GTQFRVY-RL--------DIKSFF-ESIQ----------------- -LPQ--LFKY-MHDESR---------LSRH------------------TKNLL---------EWYLKA------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------CERIHATQ-----------GLP-----RGLEI------------------S- PMLSELYL-----------------------------------------------------------SE----------- --------FD--RN-INRHPE----------------------------------------------------------- ----------------------------------------VFYYSR----------FVDDMVI-------------ISS- -----GNED----QK---------------------------TFMKQVVD---------------------FLPN----- ----G--------LKL------N-K------NK----LNISPLIPKRSKGDNNNDKLLH--------------------- ----------------KFDFLGYSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|87389803|locus|VBIPseFlu200955_4838|_extraction hypothetical protein [Pseudomonas fluorescens Q8r196] ------------------------------------------------------------DSPNAKTDYEPIASEILKKI NSGTVFHN----------AIKSF-------PLNNKTVLTYTT-EDAK--------------------------------- -----------------IASKL-LI--RN--------LKLN----------------------ARIKQ------------ ------P--------NRSEIISALISSL-QD------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------GTPYRAH-RY--------DIKSFY-ESIN----------------- -RTA--VLAM-LENESL---------CSSK------------------TLNLI---------KVLFRS------------ -------------------------------------------------------------------------------- ------------------L------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------------------DAENI---K-----------GLP-----RGLDI------------------S- AYLSEIYL-----------------------------------------------------------RH----------- --------FD--TN-LRRLEH----------------------------------------------------------- ----------------------------------------VNFYAR----------FVDDIIL-------------LTS- -----NDRA----DE---------------------------T-ANHVKA---------------------FLSD----- ----D--------LTLHDDGKRS-D------IK----VSKSE-IPTEEK--NLKEKF-Q--------------------- ----------------VLNYLGYEF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|23621411|locus|VBISinMed134228_4356|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Sinorhizobium medicae WSM419] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----MTR-----------------------------------------------------------TH----------- --------PD--LP-W---------------------------------------------------------------- --------------------------------------------CR----------YADDGLV-------------HCR- -----TEQE----AQ---------------------------ALKAALQA---------------------RL-A----- ----E--C----GLQM------H-P------IK----T---QIVYCK--------------------------------- --DNRRRKRYP---TVKFDFLGYQFR------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >Gfid|58366981|locus|VBIAciCal182605_3025|_extraction Retrontype reverse transcriptase [Acidithiobacillus caldus SM1] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -MLANIYL-----------------------------------------------------------HH----------- --------VL--DQ-WFTQVV----------------------------------------------------------- ----------------------------------QGRLKGRSLLVR----------YADDAVL-------------AFD- -----DFRD----GQ---------------------------RVLAVLGK---------------------RM-G----- ----R--Y----ALKL------H-P------QK----T---GFIDFR--------------------------------- --FKRPRGRHPLATGTSFSFLGFTQ------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|89895550|ref|YP_519037.1_extraction_extraction -------------------------------------------------------------------------------- FPVPFEFEAIRYQWEQSMRSWLR-------S------QDILQ-WTPR--------------------------------- -----------------PYRRC-LT--PK--------HRYG--------------------------------------- -----------------FRVATQ------LD-PLETIVF-------T--------------------------------- ---SLVY----------------------------------------------------EIGKD---IESARIPKEEKIA FSHRFAAKPDGRMYDSEYSWDLFQDHCGELVESNDYRY-VV-IA--------DIADFY-PRIY----------------- -FHP--LENA-LSECTRKKN------HIKA------------------ITSMI---------KNW--------------- -------------------------------------------------------------------------------- ------------------NF------------------------------------------------------------ -------------------------------------------------------------------------------- --------------------------------SVSY-----------GIP-----VGSAA------------------S- RLLAELVI-----------------------------------------------------------DD----------- --------VD--RG-LLS-------------------------------------------------------------- ---------------------------------------EGVKHCR----------YVDDYRI-------------FCK- -----NERE----AH---------------------------EHLALLAN---------------------TLFE----- ----N--H----GL------------------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|21240921|ref|NP_640503.1_extraction_extraction -------------------------------------------------------------------------------- FPKAFEFEALWHQWPQ-VKQELR-------S------KNISKMLVPN--------------------------------- -----------------PYNST-IQ---K--------ARGG--------------------------------------- -----------------YRVVHQ------LE-PMEAIAY-------T--------------------------------- ---AMAY----------------------------------------------------EVGGS---IEAMRAPATHRIA CSYRIIA--NGNFFVGGSGWGDFKAKSQEL--ANENKF-AA-IT--------DISDFY-NQIY----------------- -LHR--LQNA-IEASGPAH---------KS------------------LASDI---------EDFLAL------------ -------------------------------------------------------------------------------- ------------------NN------------------------------------------------------------ -------------------------------------------------------------------------------- --------------------------------KASQ-----------GVP-----VGPAA------------------S- IVMAEAVL-----------------------------------------------------------ID----------- --------ID--SF-ISQ-------------------------------------------------------------- ---------------------------------------CGVLHTR----------YVDDIRI-------------FSN- -----SAAK----LA---------------------------ETLEKLTL---------------------YLYE----- ----N--H----RL------------------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|31925946|locus|VBIAllVin64954_3044|_extraction hypothetical protein [Allochromatium vinosum DSM 180] ----------------------------------------------------------------------------THLF FDNHAECKKILLDIHANFDNYMA--------------------QSPP--------------------------------- -----------------STLDT-LT--QV--------GYTG--------------------------------------- -----------------FRWATQ------IE-PFWNAYY-------L--------------------------------- ---ALVV----------------------------------------------------SIAEE---IESSRVPIEKESV FSYRYEWDQNTAKIFKPSTWGDYKKKSLEI--SHDYQY-VV-VT--------DIADFY-PRIY----------------- -HHR--IDNA-LRRLPKAGE------TPKK------------------IMDLL---------FSF--------------- -------------------------------------------------------------------------------- ------------------SK------------------------------------------------------------ -------------------------------------------------------------------------------- --------------------------------NVSY-----------GLP-----VGGPA------------------S- RILAELAL-----------------------------------------------------------VT----------- --------TD--LQ-LSR-------------------------------------------------------------- ---------------------------------------RNIKFCR----------YADDYSI-------------FCN- -----SKSD----AY---------------------------KVLVLLSE---------------------KLH------ ----N--E----GLSL------Q-K------KK----T---KIITTEEFRETSRMLDPADKTNPLAEEEQKLLNISL--- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|43055415|locus|VBIHalPra106773_1068|_extraction hypothetical protein [Halanaerobium praevalens DSM 2228] --------------------------------------------------------------------------LYWNDL Y------NFG-----LDLEKNINTLIHLLKE------DLY----KPN--------------------------------- -----------------KSYKI-YF--PK--------STG-----------------------LVRPI------------ ------SVLN----FI-DLLVYQAIVNL-LA-EK---FY-------K--------------------------------- ---SF-HSYYN--------------------------------------------------HFV---FGNIYNKTNNENK IFFYKHWKQQWKKYENLTKQHYND----------GYEY-IT-EF--------DLASFY-DTVD----------------- -H-Y--LVKN-LLKEKNI---------DKK------------------IIDIL---------SSQLLS------------ -------------------------------------------------------------------------------- ------------------WN------------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------KNQTQNQGFKH-----------GIP-----QGPLG------------------S- GFIAEICL-----------------------------------------------------------HN----------- --------ID--KK-MRKLVL----------------------------------------------------------- -------------------------------------ENRNIRYMR----------YVDDIRI-------------FTK- -----DKYL----AQ----------------------------------------------------------------- -------K----NIVYLDLLARE-LGFIPQANK----TKVKKVDNIHLYIKTNKDFSRVASEYKEKGELSPSNNRKYTKR LIKNLKSGNY------DKTLIKFGLFKV---------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|42441956|locus|VBISulAut92361_1181|_extraction Mobile element protein [Sulfurimonas autotrophica DSM 16294] ----------------------------------------------------------------------WSENQLFNPF YTRKKAKSIA-----KSVIKNIN-------N------KTY----SPN--------------------------------- -----------------KPHIK-KI--PK--------ESG-----------------------GFRKV------------ ------TIYQ----IH-DAAVSKLFYNS-LL-YKNKHRF-------S--------------------------------- ---SFSYAYRN------------------DR----------------------------NVHFA---IQDIWVDISENSR TF-------------------------------------IA-EF--------DFSDFF-GSIS----------------- -HDY--LKKQ-YDKNGFIISD-----EEKF------------------IID-------------KFLD------------ -------------------------------------------------------------------------------- ------------------VD------------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------KND-------K-----------GIP-----QGTSI------------------S- LFLANMVC-----------------------------------------------------------WN----------- --------LD--KN------L----------------------------------------------------------- -------------------------------------EKEGLKFAR----------YADDTVI-------------WSL- -----DYEK----IC----------------------------------------------------------------- -------N----SFTIITDFSTG-AGV-----K----INAKKSKGISLLTS---------------------------KE LPAELSNKTY------NIDFLGYSI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|58526013|locus|VBIColFun187779_0008|_extraction hypothetical protein [Collimonas fungivorans Ter331] -----------------------------------------------------------------------NIDKKFNPF YVRRRAKSIA-----LSIALKIQ-------N------GEY----APN--------------------------------- -----------------DPHIK-EI--DK--------PGG-----------------------GKRQL------------ ------TIYQ----IP-DAAVSQLFYTR-LL-AKNRHRF-------S--------------------------------- ---SFSYAYRN------------------DR----------------------------NVHFA---IQDISVDIEQDAR TF-------------------------------------IA-EF--------DFSDFF-GSIN----------------- -HQS--LFLQ-FSKNGFFISQ-----EEIS------------------VIK-------------AFL------------- -------------------------------------------------------------------------------- -------------------A------------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------KRE-------K-----------GIP-----QGTSI------------------S- LFLANLVC-----------------------------------------------------------WK----------- --------LD--NS------L----------------------------------------------------------- -------------------------------------EKAGLKFAR----------YADDTVI-------------WSP- -----DYAA----IC----------------------------------------------------------------- -------N----ALEIINAFSIE-AGV-----S----INAKKSDGISLLAR---------------------------DG SRSEIASKP-------DFNFLGYAI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|52548386|gb|AAU82235.1_extraction_extraction ------------------------------------------------------------------------IDHPYEIV LIETDLYNWL-----DSLREKIA-------A------NEY----NPS--------------------------------- -----------------RTKII-DI--PK--------PNW-----------------------HLRPG------------ ------NILT----IE-DNVIYSALLLD-GI-EKVKDNIGWSSK-TK--------------------------------- ---RFSSILKE-----------------------------------------------------------------NQNG SKWFEFELDMWMKFRDESLKLIEQ----------GYTH-VL-FA--------DISAFF-ENID----------------- -IQR--LMYD-LESFGI----------PTD------------------NRELL---------SKCLNR------------ -------------------------------------------------------------------------------- ------------------WA------------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------EPK------GR-----------GIP-----QAFRP------------------S- NILAEVYS-----------------------------------------------------------NM----------- --------ID--KR------L----------------------------------------------------------- -------------------------------------SYEGITYLR----------YVDDVRI-------------FCE- -----TKMD----AV----------------------------------------------------------------- -------K----SLHLLTLTPHH-LGDM---------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|148254613|ref|YP_001239198.1_extraction_extraction ------------------------------------------------------------------------IESPLEIE LVDADQATWL-----SQLSEKIA-------A------G-Y----RPH--------------------------------- -----------------SAVIA-DI--PK--------GNG-----------------------AVRPA------------ ------ALLN----LE-DRVVYAAAVGA-LL-SAINLGLGWSQG-KV--------------------------------- ---DFSYRLSE-----------------------------------------------------------------SVRR VEWFTNRFNGWSAFRKVSVERIDN----------GAAH-VV-LT--------DITGFY-ENID----------------- -LTV--LFSD-LRTLGA----------DSD------------------VIQLL---------QLCLNR------------ -------------------------------------------------------------------------------- ------------------WA------------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------VVP------NR-----------GVP-----QGLSA------------------S- DVLAKVYL-----------------------------------------------------------NP----------- --------ID--QA------M----------------------------------------------------------- -------------------------------------ADMQVDFIR----------YVDDIRI-------------FCS- -----DVPA----CK----------------------------------------------------------------- -------K----ALM----------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|176420657|locus|VBIAciBau289782_2220|_extraction retrontype reverse transcriptase [Acinetobacter baumannii ABHKU310] ----------------------------------------------------------------KKYHTNYISIKEFEK- DIKVNSENIF-----NSIKNN-----------------NY----KFQ--------------------------------- -----------------SLYPI-VIENKK--------ELNK----------------------KPRLI------------ ------CIPT----VR-DRLIQMILIYY-IS-IHLKNELAVL----K--------------------------------- -----SQDFSV------------------SG-V--------------------GIL--KARQKA---K-DL--------- --------------------------------RNTKPY-VL-KT--------DISSFF-DNID----------------- -RAK--LLNE-I--------------KDVM------------------PPDIL---------Y-LFQS------------ -------------------------------------------------------------------------------- ------------------II------------------------------------------------------------ -------------------------------------------------------------------------------- -----YCDPSIPYEYNKDYKKLIYSKLR-------K-----------GVR-----QGMPI------------------S- PLLASFYL-----------------------------------------------------------ND----------- --------FD--EW-LIKKKY----------------------------------------------------------- ------------------------------------------KHVR----------YADDLIF-------------FLD- -----SEKQ----CK---------------------------EVYREVSQ---------------------EL------- -------L----KLNLTLPTLEE-N------TK----T---QIISPK--------------------------------- ---------------ETVNFLGLDL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|189765406|locus|VBIAcePas307039_2786|_extraction hypothetical protein [Acetobacter pasteurianus 386B] -------------------------------------------------------------------GVDGITVPLFD-- ---AQKNNYL-----KDIRANLL--------------HGY----TFS--------------------------------- -----------------KLRGV-AV--PK--------TDVT----------------------KYRLI------------ ------CVPT----LA-DRIVQRALLVE-ME-KKGK-QLGVL----N--------------------------------- ---DVSHGFIA------------------GG-A--------------------RTV--SSAQKS---VCRI--------- --------------------------------RSEKPW-IL-KA--------DISKFF-DRID----------------- -RDS--LFDT-F--------------SKKF------------------RFSTL---------HPLIKG------------ -------------------------------------------------------------------------------- ------------------AI------------------------------------------------------------ -------------------------------------------------------------------------------- -----QSEVEVKGE-------RVELAIRQNGIIKGK-----------GLR-----QGMPI------------------S- PFMSNFAL-----------------------------------------------------------SG----------- --------FD--K--IISSKY----------------------------------------------------------- ------------------------------------------SIVR----------YADDLIV-------------LGK- -----NEDE----CK---------------------------RAKEDIEC---------------------EL------- -------Y----KIGQTL----N-E------DK----T---YIRAPN--------------------------------- ---------------ESVEFLGLELR------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >gi|82702063|ref|YP_411629.1_extraction -------------------------------------------------------------------------------- -------------KSPTIRTEIE-------QFSLSLEKNLRRIADQL--------------------------------- -----------------REKRY-VFSQSY--------GVAVKK---------------------KNNP------------ ------SKK--------RPIVISPIPNR-IV---------------Q--------------------------------- ---RALLDVVQ-------------------EIP--------------------SVR--AKLDSG---FNFGGIAEIG--- ---------VPQAILKAYKTALE-----------KPYF--I-RT--------DICAFF-DNIP----------------- -RSQ--ALEI-I--------------TS----AS----------KDDDFNTLL---------TQATTT------------ -------------------------------------------------------------------------------- ------------------EL------------------------------------S----------------------- -------------------------------------------------------------------------------- ----------------------------NLITLGRD--KELFPLEGKGVA-----QGSCL------------------S- PVLCNLLL-----------------------------------------------------------DD----------- --------FD--KK-MNAR------------------------------------------------------------- ----------------------------------------GIVCIR----------YIDDFIL-------------FAP- -----SESK----AF---------------------------KAFASASA---------------------FL-E----- ----K--L----NLSV------Y-D------PR----H---SPDKAEHGVSNKGFEFLGCSV------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|71735515|ref|YP_277063.1_extraction -------------------------------------------------------------------------------- -------------SSDEIKRDAE-------EFESRLPDSLVEIQRSL--------------------------------- -----------------SKQTF-TFLQQT--------GVAQKK---------------------PG-------------- ------GKA--------RPLVLAPIPNR-VV---------------Q--------------------------------- ---RALLDVLQ------------------RRVR--------------------FVR--KVLDTP---TSYGGIPTKR--- ---------VAMAISDARDAMRN-----------GARF-HI-RS--------DIPAFF-TKIN----------------- -KDR--VQDL-L--------------RS----HI----------NCDATLKLL---------DLAITT------------ -------------------------------------------------------------------------------- ------------------DL------------------------------------A----------------------- -------------------------------------------------------------------------------- ----------------------------NIDDLRRQGLNEIFPIGIEGVA-----QGSPL------------------S- PLLANIYL-----------------------------------------------------------AD----------- --------FD--VA-MNAD------------------------------------------------------------- ----------------------------------------GITCLR----------YIDDFLL-------------LGE- -----SLSN----VD---------------------------RAFNRALK---------------------TL-D----- ----K--I----GLSA------Y-D------PR----V---DKVKASRGSTDKGFDFLGCNV------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|22599913|locus|VBIMetSil55537_2322|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Methylocella silvestris BL2] -------------------------------------------------------------------------------- ---------------IETRREVE-------AFEANSQSNLKRIADQL--------------------------------- -----------------LHRKF-IFPAAK--------GVPIQK--------------------AKGKR------------ ------GDI--------RPLVVAKVEAR-IV---------------Q--------------------------------- ---RAIHDVLI-------------------EVP--------------------SIR--RYVRTP---YSFGGVRKEKDDS VSA------VPAAIDAAMAAIGD-----------GFSY-YI-RS--------DITAFF-TKIP----------------- -KSA--VAAL-V--------------SD----AVG---------HQSEFMDLF---------RRAIHV------------ -------------------------------------------------------------------------------- ------------------EL------------------------------------E----------------------- -------------------------------------------------------------------------------- ----------------------------NMARLART--VNAFPIYDIGVA-----QGNSL------------------S- PLLGNILL-----------------------------------------------------------YD----------- --------FD--QQ-MNGNP------------------------------------------------------------ ----------------------------------------DAVCLR----------YIDDFII-------------FAK- -----TQQL----AE---------------------------NMFQKAIH---------------------IL-A----- ----S--H----GMSV-------------------------AKHKTVKGLVRDKFEFLGIEFA----------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|89098464|ref|ZP_01171347.1_extraction_extraction -------------------------------------------------------------------------------- ---------FSTNSLAEKFSNLD-------FSTFTKKE------RGK--------------------------------- -----------------WYK-T-SNI--S--------IPKFAH--------------------SRRIL------------ ------NVP--------APFPQMRLSQL-L-------------------------------------------------- ---VKNTEELN------------------EYYS-----------------------------QS---KLSLTRPIVKEES D--------RAVERKYHFSKIIE--RRIESIN--DKKY-IL-KT--------DISRYF-PTIY----------------- -THS--IPWA-L--------------HT-KEVAKQ-TR------GD-SL--L----------GNTIDE------------ -------------------------------------------------------------------------------- ------------------YV------------------------------------R----------------------- -------------------------------------------------------------------------------- ----------------------------NIQDGQTM-----------GLP-----VGPDT------------------S- LIISEVIG-----------------------------------------------------------TA----------- --------ID--IK-LQEAHP----------------------------------------------------------- ---------------------------------N-------IIGSR----------YTDDFEF-------------YFK- -----TQSE----AE---------------------------KVLNTIQE---------------------IV-R----- ----H--F----ELDI------N-P------VK----T---EIISSPNLLEPIWLSNLK--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|87123886|locus|VBIAeqSub156184_1010|_extraction hypothetical protein [Aequorivita sublithincola DSM 14238] -------------------------------------------------------------------------------- -------------GIIKTQWNTI-------SSPFKRPE------RLK--------------------------------- -----------------YSE-S-KWVVFS--------IPKVQL--------------------SRRII------------ ------NIP--------NPLHQSKLSST-I-------------------------------------------------- ---SDRWVEID------------------EIFK-----------------------------KS---FITSSSPVEDPKK N--------RALIPKHDFGAFKR--RRLNESF--DNLY-EV-KT--------DVSRFY-GTIY----------------- -THS--IPWL-V--------------HT-KPIAKE-NR------DDMTM--L----------GNALDR------------ -------------------------------------------------------------------------------- ------------------DL------------------------------------R----------------------- -------------------------------------------------------------------------------- ----------------------------CLNSGQTM-----------GIP-----IGPDT------------------S- LVIAEIIT-----------------------------------------------------------CL----------- --------ID--IQ-IQTKLK----------------------------------------------------------- ---------------------------------N-------VKSFR----------FIDDYYL-------------YCD- -----NYAD----AE---------------------------KAFKFIQS---------------------LL-T----- ----E--Y----QLDI------N-E------EK----T---KISKVPFPFDSKWSIELGSFQF----------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|89094970|ref|ZP_01167900.1_extraction_extraction -------------------------------------------------------------------------------- ---------FITDNFAELLTS-S-------TIPVTGDFAKNITKKAK--------------------------------- -----------------IPK-A-KLCVYT--------HARGGL--------------------LRRKL------------ ------SIC--------NPVLYYLLSRE-I-------------------------------------------------- ---ESNWTSIV------------------SVAG-----------------------------GS---SLAATAPEQK-KT G--------RAIDGKHSQGDRAS--LAIHTRI--GRRF-VL-TT--------DISRFY-HSIY----------------- -THS--IPWA-L--------------HT-KPVAK--------ASRALTL--L----------GNKLDF------------ -------------------------------------------------------------------------------- ------------------LV------------------------------------R----------------------- -------------------------------------------------------------------------------- ----------------------------QGQDGQTV-----------GIP-----IGPDT------------------S- LVLAELIM-----------------------------------------------------------HQ----------- --------CD--HA-LITKLP----------------------------------------------------------- ---------------------------------L-------IKGHR----------FIDDYEL-------------SFS- -----TRTE----AE---------------------------EAFHFLET---------------------CL-S----- ----D--Y----ELAL------N-P------KK----T---KVSELPLPLESDWSRELK--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|39995659|ref|NP_951610.1_extraction_extraction -------------------------------------------------------------------------------- ---------FTTSSYADALNGLG-------TVP-----------DSN--------------------------------- -----------------FKQ-T-KCIRFS--------HSKYAS--------------------LRRDL------------ ------SIP--------NPYPFYELAAL-L-------------------------------------------------- ---SAHWGEVE------------------TQWN-----------------------------NS---PYTRSAPVPS-MT ---------RAVEGNTNFNALPN--ARAEVRS--SGRF-LL-NA--------DVSKCY-HTIY----------------- -THS--LPWA-L--------------HT-KPVAKN-NQKWPPAGGKPPL--V----------GNGLDL------------ -------------------------------------------------------------------------------- ------------------WS------------------------------------R----------------------- -------------------------------------------------------------------------------- ----------------------------NLQDAQTI-----------GLP-----VGPDT------------------S- FVLAEALL-----------------------------------------------------------SA----------- --------VD--QL-VASKIA----------------------------------------------------------- ---------------------------------G-------ARGFR----------FVDDYEF-------------VCD- -----TLEQ----AE---------------------------QVLSVLQW---------------------AL-A----- ----E--F----ELSL------N-P------KK----T---TIKELPTALDTTWVNAFC--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|115525547|ref|YP_782458.1_extraction_extraction -------------------------------------------------------------------------------- ---------FQTHSLADCHATLE-------SDWQNVLLKQSKRERNN--------------------------------- -----------------HPRPS-HPILFD--------MARKGH--------------------ARRTL------------ ------AIP--------NPINQTRLVEE-I-------------------------------------------------- ---SKHWTQLT------------------DIIS-----------------------------KS---SLSLTKCEVS-NS G--------RAI-PLPPLSVLAE--KRIVLYA--ARGA-IL-QT--------DILSFY-HSVY----------------- -SHA--IPWA-I--------------HG-KAFAKS-NR------TDPSL--L----------GNRLDA------------ -------------------------------------------------------------------------------- ------------------LV------------------------------------R----------------------- -------------------------------------------------------------------------------- ----------------------------SCQDGQTI-----------GLP-----VGPDT------------------S- RIISEVLL-----------------------------------------------------------CA----------- --------VE--AR-IPSKVG----------------------------------------------------------- ---------------------------------SR-----ITGGYR----------YIDDFFL-------------CFD- -----SLAE----AE---------------------------VGLAALRE---------------------SC-L----- ----H--F----DLRL------N-P------TK----T---HTIHALDFNEETWATEIA--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|149114909|ref|ZP_01841658.1_extraction_extraction -------------------------------------------------------------------------------- --------------IDSRQFTPE-------VCEALALL---VDSASR--------------------------------- -----------------RKNGY-DLVEYK--------ATRYNN--------------------VPRTL------------ ------ALV--------HPKAYAFLVKH-I-------------------------------------------------- ---CDNWDEI----------------------------------------------------KF---IQSGDNSIIKPEL RQ-D-----GRIMVMNYEAPIIKKRRYNEASF--SKKF-CV-KA--------DIANCF-NSIY----------------- -THA--IPWA-A--------------VG-VEAAKN-NR------ENELW-------------YNKLDM------------ -------------------------------------------------------------------------------- ------------------FQ------------------------------------R----------------------- -------------------------------------------------------------------------------- ----------------------------KSKRGETQ-----------GIP-----IGPAT------------------S- SFVVELIL-----------------------------------------------------------QK----------- --------VD--QK-LSAC------------------------------------------------------------- ----------------------------------------GFNFER----------YVDDYQF-------------YCD- -----SYEQ----AQ---------------------------QIILVLGQ---------------------EL-S----- ----V--F----KLTL------N-L------NK----T---YIVALPSSSEDDWVLELL--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|127512398|ref|YP_001093595.1_extraction_extraction -------------------------------------------------------------------------------- --------------FTTSSLTPE-------VC--LRLLSGGIDN-KR--------------------------------- -----------------KNTGF-APVEYK--------STRYNN--------------------IPRVL------------ ------SLV--------HPVAHAQLCKH-I-------------------------------------------------- ---YENWQDI----------------------------------------------------EY---ISANTISRVRPEF YEVD-----KRIVVMNYDDPIVKDRKFNDITF--GKKY-IV-NA--------DIASCF-DSIY----------------- -SHS--IPWA-I--------------KG-YEFAKN-NR------GENEW-------------FNQFDK------------ -------------------------------------------------------------------------------- ------------------FL------------------------------------R----------------------- -------------------------------------------------------------------------------- ----------------------------TTKRCETN-----------GMP-----IGPAT------------------S- SICLEIIL-----------------------------------------------------------AR----------- --------VD--SR-LHDL------------------------------------------------------------- ----------------------------------------GFEHER----------YVDDFCC-------------YCS- -----TKEQ----AD---------------------------EFIIKLSE---------------------LL-A----- ----E--Y----RLSL------N-L------RK----T---TIIELPIASTDDWIVELN--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|91202364|emb|CAJ75424.1_extraction_extraction -------------------------------------------------------------------------------- --------------FSTKQFTKK-------IAQKLSHL-------RS--------------------------------- -----------------RKDGY-DQISYK--------ITRYNN--------------------IPRVL------------ ------SIP--------HPKPYADIVFC-L-------------------------------------------------- ---SENWDNL----------------------------------------------------AY---ICNNEVSLIRPRQ HK-D-----GRIIIMNYEGSHEKIERSLKKSF--GHKF-CI-ET--------DITNCF-PSIY----------------- -SHA--IPWA-L--------------IG-LKEAKSRKR------YKNEW-------------FNKIDA------------ -------------------------------------------------------------------------------- ------------------RQ------------------------------------R----------------------- -------------------------------------------------------------------------------- ----------------------------MLKRNETQ-----------GVP-----IGPAS------------------S- NIITEIIL-----------------------------------------------------------AK----------- --------VD--EV-MSK-------------------------------------------------------------- ----------------------------------------DFNYIR----------FIDDYTC-------------YCK- -----KYED----AE---------------------------EFARRLSQ---------------------EL-S----- ----K--Y----NLTL------N-L------KK----T---HIHQLPKPTNDDWIIDLK--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|145632406|ref|ZP_01788141.1_extraction_extraction -------------------------------------------------------------------------------- ----FSFTNLLQKISEELSDKSL-------LNFTHNNANQRPDIANN--------------------------------- -----------------EMVNH-LIYANK--------DGKLSW----------------------RPL------------ ------QII--------HPLVYVDLVHK-IT---------------K--------------------------------- ---KDNWEKLK------------------KRFK--------------------EYQ--NN--PQ---IECLSIPVKS-NS QLKD-----KAKQILKWWESVEQ--ESICLSL--EYEY-VF-DT--------DVADCY-SSIY----------------- -THS--IAWA-I--------------EG-KDIAKK-NH------R-LTL--L----------GNSIDK------------ -------------------------------------------------------------------------------- ------------------KI------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------NCQYGQTN-----------GIP-----QGSIL------------------M- DFIAEMIL-----------------------------------------------------------GY----------- --------ID--IL-LSEELK----------------------------------------------------------- ---------------------------------NKGIS--EYKILR----------YRDDYKI-------------FVK- -----NHSD----GE---------------------------NILKYLSE---------------------IM-M----- ----S--F----GLKL------N-S------SK----T------------------------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20432649|locus|VBIErwTas9546_3451|_extraction conserved domain protein [Erwinia tasmaniensis Et1/99] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------IHTNK--------DGKLSW----------------------RPL------------ ------QLI--------HPVLYFYLVRE-LT---------------E--------------------------------- ---AKNWEKIK------------------ERFS--------------------KFS--SN--QK---ITCLSIPVKS-SE NKKD-----KAAQVNSWWQKFEL--QSIELAI--DYDY-LF-ET--------DIADCY-GSIY----------------- -THS--IAWA-V--------------ET-KEVAKI-NR------S-IAL--L----------GNFIDK------------ -------------------------------------------------------------------------------- ------------------TI------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------KMQFNQTN-----------GIP-----QGSVL------------------M- DFIAEITL-----------------------------------------------------------GY----------- --------ID--EI-VGEKLS----------------------------------------------------------- ---------------------------------EILIS--DYKILR----------YRDDYRI-------------FVN- -----NQSD----GE---------------------------LILKTLAD---------------------VM-S----- ----D--M----GFRL------N-S------SK----T---KSSSDIITSSVKSDKNSWMASKG---------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115478982|locus|VBISynSp231034_3139|_extraction hypothetical protein [Synechococcus sp. PCC 7502] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------------------NK--------DGKFAW----------------------RPF------------ ------QII--------HPAIYVSLVNE-IT---------------S--------------------------------- ---ENNWREIV------------------AAFK--------------------RFS--FN--NQ---IKCLSIPIES-EN LLSD-----KAENISNWWHSVEQ--NSIELSL--KYEY-IM-HT--------DISDCY-GSIY----------------- -THS--IVWA-L--------------HT-KKIAKE-QR------RDRSL--I----------GNIIDS------------ -------------------------------------------------------------------------------- ------------------HI------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------DMSFAQTN-----------GIP-----QGSGL------------------M- DLIAEIVL-----------------------------------------------------------GF----------- --------AD--LE-LSEKL------------------------------------------------------------ ---------------------------------ESYPHLIDYQILR----------YRDDYRV-------------FTN- -----NPQE----SD---------------------------LIVKSLTE---------------------IL-G----- ----N--L----GLKL------N-S------QK----T---LSSN----------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|32316913|locus|VBIRhoMar93821_1425|_extraction hypothetical protein [Rhodothermus marinus DSM 4252] -------------------------------------------------------------------------------- ---------------------------------LDNLCEQGIRPAHL--------------------------------- -----------------EGVNY-LLLSNK--------DGRHAW----------------------RPF------------ ------EVI--------HPVLYVDLVNA-IT---------------H--------------------------------- ---PENWKFIQ------------------ERFL--------------------QFK--ND--SV---VDCVSIPVES-TS RSKD-----KEEQILVWWQEIEQ--RSIELAL--EFEV-LA-HT--------DINDCY-GQIY----------------- -THS--IAWA-L--------------HS-KEKAKK-ER------KDLSL--I----------GNRIDK------------ -------------------------------------------------------------------------------- ------------------CI------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------NMRYGQTN-----------GIP-----QGSIL------------------M- DFIAEMVL-----------------------------------------------------------GY----------- --------AD--LC-LSEKLA----------------------------------------------------------- ---------------------------------KTTLQDKNYKILR----------YRDDYRI-------------FVR- -----SKYD----AE---------------------------EILKILTE---------------------VL-S----- ----N--L----GLKL------N-S------EK----T---VISDSIIQSSIKEDKLAWLF------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|118587264|ref|ZP_01544691.1_extraction_extraction -------------------------------------------------------------------------------- ----FSFDTVLTTASQSLEDAS-----------LASMTAKGKSLSNV--------------------------------- -----------------ADVNY-KMLISK--------DGQFDW----------------------RPF------------ ------QII--------HPVTYVDLANC-IT---------------E--------------------------------- ---ESNWKKII------------------KRFE--------------------EFE--AN--PR---IRCISIPVES-LT SQKD-----TAATILNWWENLEQ--ASIEYSL--DYAY-CI-KT--------DITNCY-GSIY----------------- -THS--ISWA-L--------------HG-KSWSKQ-HR------KPSNG--V----------GNRIDN------------ -------------------------------------------------------------------------------- ------------------KI------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------HLQFGQTN-----------GIP-----QGSTL------------------F- DFVAEMVL-----------------------------------------------------------GY----------- --------SD--LM-LSNRLN----------------------------------------------------------- ---------------------------------EKKIS--NYQIIR----------FRDDYRI-------------FSN- -----SKSD----CE---------------------------QIAKELSD---------------------VL-A----- ----D--L----NMHF------N-S------KK----T------------------------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|150378854|ref|ZP_01918051.1_extraction_extraction -------------------------------------------------------------------------------- ----ISFAPVLEKVDSELNGRPF-------SQF-----KNGKSPSDC--------------------------------- -----------------DDVNY-NLITNK--------DGKLSW----------------------RPY------------ ------ELI--------HPIIYVALLDV-VC---------------K--------------------------------- ---EENWAFIV------------------KRFK--------------------EFE--S---SI---IECCSLPVLS-QD TDKD-----QAAQVKNWWQAVEQ--KSLMYSL--ECTN-VL-HT--------DVTDCY-GSIY----------------- -THS--IVWA-L--------------HG-RQVAKE-KK------KDKSL--L----------GNAIDF------------ -------------------------------------------------------------------------------- ------------------HI------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------SSRAGQTN-----------GIP-----QGSAL------------------M- DLIAEMVL-----------------------------------------------------------GY----------- --------VD--EL-ISEELN----------------------------------------------------------- ---------------------------------TFT----DFKILR----------YRDDYRI-------------FSN- -----SNDK----SE---------------------------EILKIISD---------------------KL-R----- ----V--V----GMKL------G-V------AK----T------------------------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|110642862|ref|YP_670592.1_extraction_extraction -------------------------------------------------------------------------------- ------------------------------------------DHFSS--------------------------------- -----------------VVDNY-ILQTNK--------DGHYCW----------------------RPF------------ ------ELI--------HPAIYVHLVHK-IT---------------E--------------------------------- ---DESWQLLL------------------ERFG--------------------EFQ--SN--TK---IVCASLPRESEED GVSD-----KAKAVSGWWRDVEQ--ESINKSL--QFKY-LF-ST--------DIANFY-PSIY----------------- -THS--IPWA-I--------------YT-KEDAKA-AR------GAGRN--L----------GDQIDY------------ -------------------------------------------------------------------------------- ------------------AL------------------------------------R----------------------- -------------------------------------------------------------------------------- ----------------------------QMRWGQTN-----------GIP-----QGSAL------------------M- DFIAEIVL-----------------------------------------------------------GY----------- --------AD--EL-LGQKLE----------------------------------------------------------- ---------------------------------SQNIN--DYHIIR----------YRDDYRI-------------FTN- -----SKED----AE---------------------------AIARHLTV---------------------IL-Q----- ----G--L----GLQL------N-A------SK----T------------------------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20121338|locus|VBIVibFis37164_1384|_extraction hypothetical protein [Vibrio fischeri MJ11] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------LQTNK--------DGHYSW----------------------RPF------------ ------QLI--------HPALYVHLVSK-IT---------------E--------------------------------- ---EDNWLLIQ------------------ERFS--------------------LFQ--AN--DK---MLNAGIPREAGED SASD-----TAEAVKGWWNDVEQ--QSLIKAL--DFKY-LF-AT--------DISNFY-PSIY----------------- -THS--VSWA-I--------------HT-KPVAKE-QR------NRNRMSLI----------GVAIDK------------ -------------------------------------------------------------------------------- ------------------TL------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------HMQNGQTN-----------GIP-----QGSVL------------------M- DFIAELIL-----------------------------------------------------------GY----------- --------AD--SE-LTLKLD----------------------------------------------------------- ---------------------------------TLGVE--DYYIIR----------YRDDYRV-------------FTN- -----SKED----SE---------------------------TIARQITL---------------------TL-Q----- ----D--L----GLQL------N-A------SK----T---SISEDIIISSIKEDKRVALSIFGKSQ------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|67871822|locus|VBIStaAur224518_0062|_extraction hypothetical protein [Staphylococcus aureus subsp. aureus MSHR1132] -------------------------------------------------------------------------------- -------------------------------------YVKMNNLKEQ--------------------------------- -----------------DDINC-KIYANK--------DGNFDW----------------------RPL------------ ------EII--------NPYLYIYLVRY-LT---------------R--------------------------------- ---TDVWYQLV------------------NCFN--------------------NNK--V---ER---ITVASIPVES-QI STKD-----KKEQIYNWWDEVEQ--ESIVYAL--DYKK-II-HL--------DISNCY-GSIY----------------- -THV--ISWA-I--------------HG-KNHAKN-QK------RNKNL--L----------GNKIDY------------ -------------------------------------------------------------------------------- ------------------LI------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------LMQNNQTN-----------GIP-----QGSIL------------------M- DFVAEIIL-----------------------------------------------------------TY----------- --------AD--KL-LENKIG----------------------------------------------------------- ---------------------------------DYDI---DYKIIR----------YRDDYRI-------------FAN- -----DSET----IN---------------------------IVTKALNV---------------------VL-M----- ----T--L----NFKL------N-S------KK----T---VYSEDIIISSIKEDKISYQDII----------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|84619238|emb|CAJ43157.1_extraction_extraction -------------------------------------------------------------------------------- ----VDFSSLLEEINSA----------------GKINFQPDSKSLMG--------------------------------- -----------------KNINY-EVLVSK--------DGLYSW----------------------RRI------------ ------TLI--------NPLYYVYFCKL-IT---------------S--------------------------------- ---PSNWKAIR------------------NKFR--------------------EFE--SN--DL---FLCSSTPVS--KK NTSN-----VAASVLNWWEDFEQ--KSLSLAL--EYEF-MF-ST--------DISNFY-PSIY----------------- -THS--FEWV-F--------------IS-KEEAKK-KE------NNNNP-------------GRLIDT------------ -------------------------------------------------------------------------------- ------------------HI------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------MMMSNQTN-----------GIP-----LGSTL------------------M- DTFAELIL-----------------------------------------------------------GE----------- --------ID--LQ-LRKKTE----------------------------------------------------------- ---------------------------------EQKIT--DYKVVR----------YRDDYRI-------------FSS- -----SKDD----LD---------------------------KISKCLVE---------------------VL-G----- ----E--F----GLDL------N-S------RK----T------------------------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|84619229|emb|CAJ43154.1_extraction_extraction -------------------------------------------------------------------------------- ----INFTTLLNDINSS----------------KKIKIEPTAKELMG--------------------------------- -----------------KDINY-EVLVSK--------DGLYSW----------------------RRI------------ ------TLI--------NPLYYVYFCRK-IT---------------A--------------------------------- ---PATWEIIT------------------EKFK--------------------SFE--SN--DL---FTCSSIPVR--KD NSSN-----IAASVMNWWEDFEQ--KSLALAL--EYEF-MF-ST--------DISNFY-PSIY----------------- -THS--FEWV-F--------------IS-KEEAKK-KK------SKNNP-------------GGLIDS------------ -------------------------------------------------------------------------------- ------------------HI------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------MMMNNQTN-----------GIP-----LGSTL------------------M- DTFAELIL-----------------------------------------------------------GQ----------- --------ID--IE-LRKKTN----------------------------------------------------------- ---------------------------------ELKII--NYKVVR----------YRDDYRI-------------FSN- -----SKDD----LD---------------------------IISKCLVN---------------------VL-G----- ----D--F----GLDL------N-S------KK----T------------------------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|133728112|gb|EBA29810.1_extraction_extraction -------------------------------------------------------------------------------- ----FNLGYLLESAESKLGKKQL----------KKEMYFENKVYSNF--------------------------------- -----------------PNVNF-LIQTNK---------TISTY----------------------RPI------------ ------TLL--------HPYIYVDYVNF-LT---------------E--------------------------------- ---IKIWEELK------------------ERFQ--------------------NLQ--EEVKEK---IICSSLPFDI-ET SKEDDS---KKEMALNFWKSIEQ--ETIKYSL--GYNY-LL-KL--------DISNFY-GSIY----------------- -THT--LCWA-F--------------HG-ENYSKE-AK------NSKNLQNLT---------GDKCDR------------ -------------------------------------------------------------------------------- ------------------KF------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------WMNYGETV-----------GIP-----QGNVI------------------S- DLMSELLL-----------------------------------------------------------AY----------- --------ID--SE-LVKKID----------------------------------------------------------- ---------------------------------DEI----DYKIIR----------YRDDYRI-------------FTK- -----RLED----ST---------------------------LVKRELVV---------------------LL-Q----- ----R--F----KLNL------G-E------SK----T------------------------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|16329609|ref|NP_440337.1_extraction_extraction -------------------------------------------------------------------------------- -------------FTVSKEIKLQ-------DTPYNINI------NDL--------------------------------- -----------------KKRQV-AFVSFP--------KSTLTY----------------------RNF------------ ------SVQ--------HPWNYHDIIFY-L-------------------------------------------------- ---HQNWDNIL------------------SHI----------------------FH--SE--NK---VAAYSFPIPVSKK DFEDLSPLRAGRMIYEWLEMAEE--DLILDGQ--KFNI-LA-KT--------DITNFY-PSIY----------------- -THG--IGWA-I--------------HG-REEALE-DK------EFRLF-------------GNKIDR------------ -------------------------------------------------------------------------------- ------------------LF------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------YSNDGRTN-----------GIP-----IGSAL------------------S- DLIAETIL-----------------------------------------------------------AD----------- --------ID--RK-FSQESK----------------------------------------------------------- ---------------------------------HI-----EYAAVR----------FKDDYRI-------------LCN- -----SKEN----AK---------------------------KLLDILSH---------------------QL-S----- ----Q--Y----NLSL------N-E------SK----T---SFLNLPDGLYREHNRAYF--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|124120168|gb|EAY38911.1_extraction_extraction -------------------------------------------------------------------------------- -------------FKV-HDFKLQ-------RKPYNKSIQNNKKSTDL--------------------------------- -----------------ARRQL-APISYP--------KSLLTN----------------------RVF------------ ------AIQ--------DPRNYHDIVFY-L-------------------------------------------------- ---HDEWETVL------------------DRL----------------------YP--KN--TK---IFSYSMPIPVDKK NQGQMGELRSGRMIYEWIRMAES--DLVIDAT--SYKY-LA-KT--------DITNFY-SSVY----------------- -THS--LAWA-L--------------AGNRETAFE-DK------QCSNF-------------GNKIDK------------ -------------------------------------------------------------------------------- ------------------LL------------------------------------Q----------------------- -------------------------------------------------------------------------------- ----------------------------YANDARTN-----------GIP-----VGSAL------------------S- DLVAEILL-----------------------------------------------------------AW----------- --------ID--EK-VSKELT----------------------------------------------------------- ---------------------------------SL-----DFLAVR----------FKDDYRV-------------LCN- -----SEED----AK---------------------------KVLSTISN---------------------EL-S----- ----K--I----NLTL------N-E------NK----T---QVFIVPDGLYRPHDREYF--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|16331604|ref|NP_442332.1_extraction_extraction -------------------------------------------------------------------------------- ---------FPKVLEIDAIYNMADEFVEQVENKNLDLDSF----KPG--------------------------------- -----------------SCRRF-IV--PK--------DELAYRQ-------------------ATQLD------------ ------PQDS----IILSALIYQYGQEI-EN--------RRLTK-EK--------------------------------- ---VFSYRFKP------------------DFQT--------------------GFY--DNQISW---NQFW--------- ----------------------------------QSAY-AL-SNGFDTVLYCDIADFY-NQIY----------------- -HHT--VENQ-LFASGF---------PNQA------------------KKWII---------SLIEST------------ -------------------------------------------------------------------------------- ------------------TA------------------------------------------------------------ -------------------------------------------------------------------------------- --------------------------------KVSR-----------GIP--VGPHALHL-------------------- --IAESTM-----------------------------------------------------------IP----------- --------ID--NC-LVS-KG----------------------------------------------------------- ----------------------------------------IN-FIR----------FADDIIV-------------FCK- -----SRNH----AK---------------------------QLAYSIAS---------------------TL-D----- ----K--Q----QRL----------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115471181|locus|VBISynSp54615_0753|_extraction hypothetical protein [Synechococcus sp. PCC 6312] -------------------------------------------------------------------------------- -------------------------FIALVEGK--QLSEF----EPK--------------------------------- -----------------PCRRF-IV--PK--------DEISYRQ-------------------ATQLY------------ ------PQDS----ILLSALVHQFGQGI-ED--------RRLDS-SQ--------------------------------- ---VFSYRFSP------------------SIDE--------------------GLY--KAKTAW---NEFW--------- ----------------------------------SSAY-NK-SQNSSTILYCDIADFY-NQIY----------------- -HHS--VENQ-LIESGF---------PNQA------------------VKWIV---------SLLEST------------ -------------------------------------------------------------------------------- ------------------TA------------------------------------------------------------ -------------------------------------------------------------------------------- --------------------------------GVSR-----------GVP--IGPHAIHL-------------------- --IAEATL-----------------------------------------------------------IP----------- --------ID--NS-MKT-NG----------------------------------------------------------- ----------------------------------------LN-FLR----------YADDIIV-------------FCD- -----SDKE----SK---------------------------SALSLIAS---------------------VL-D----- ----K--Q----QRLM------L--------QR----H---KTRFYKPEDFKVLCANM---------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|110679730|ref|YP_682737.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------AASRAEQRPVALS------------------------------------------ ---------------------------------------LSSRT-------------------DA--------------- -----------------SGVVHRFAADI-ETDLIMRATYRRLAK-QY--------------------------------- ---NIQLPNRE------------------VIIS--------------------GIL--EAVCE----------------- ----------------------------------GSPY-AV-TR-------CDIRSFY-ENID----------------- -AEP--IVKK-VIADTR---------TDAS------------------LRAVI---------EWIYDA------------ -------------------------------------------------------------------------------- ------------------GG------------------------------------------------------------ -------------------------------------------------------------------------------- --------------------------------G--------------AVPSNVAPRGLAI------------------S- TVLAELAL-----------------------------------------------------------SD----------- --------FD--KA-LKKLPG----------------------------------------------------------- ----------------------------------------VHRYFR----------FADDMVI-------------F--- ------HLP----QY---------------------------DILAEIND---------------------LL-D----- ----T--L----GLEI------N--------EK----T---TVTHFRSDKPSGAGANTKA-------------------- ----------------NFDFLGYEF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|21683491|locus|VBIDesAut25181_2015|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfobacterium autotrophicum HRM2] -------------------------------------------------------------------------INSLKHL AHRLGFAPEV---------------------------LQKAASRAEK--------------------------------- -----------------SYKFD-KI--PK--------KSG------------------KG----FREI------------ ------SKPN----AL-LKNIQKAIHKL-LTEIEISD------------------------------------------- ---NAHCGIKK------------------R-----------------------SNV--TNAMNH---CNK---------- ----------------------------------E--W-VY-SM--------DFKNFF-PNIS----------------- -HHQ--VYGL-FR------YELKCSP---D------------------VTSIL---------TRLCTV------------ -------------------------------------------------------------------------------- ------------------KGG----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------VP-----QGGSM------------------S- MDIANLVS-----------------------------------------------------------RK----------- --------LD--TR-LEGLCK----------------------------------------------------------- ----------------------I------------HN----LSYTR----------HCDDLNF----------------- -----SGKR----ILD--------------------------TFRAKV-E---------------------IIIK----- ----E--S----GFPL------N-P------DK----E---TLIPHH-HPQ----------------------------- ------------------SVVGLRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|115376572|ref|ZP_01463804.1_extraction_extraction -----------------------------------------RAKANGLPEGLDSVEALAKALGISVSRLR-----WF--- ------SFHR---------------------------EVDT----GT--------------------------------- -----------------HYQTW-EI--PK--------RDG-------------------G----KRTL------------ ------TAPK----RE-LKAVQRWVLAN-V----VER--------LP-VH------------------------------ ---GAAHGFVA------------------G--R--------------------SIL--TNALAH---QGA---------- ----------------------------------D--V-VV-KV--------DMKDFF-PSVT----------------- -WPR--VKGL-L-------RKGGLPE---N------------------LATLL---------ALLSTE------------ -------------------------------------------------------------------------------- ------------------APREVVRFRG---------ETLYV-------AKGPR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------ALP-----QGAPT------------------S- PALTNALC-----------------------------------------------------------LR----------- --------LD--KR-LSALSK----------------------------------------------------------- ----------------------R------------LG----FTYTR----------YADDLTF----------------- -----SWR-----RAKKSRQKEDAPVA---------------LLLARV-K---------------------GVLE----- ----A--E----GFTL------H-P------DK----T---RVQRKG-SRQ----------------------------- ------------------RVTGLVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|108758657|ref|YP_628423.1_extraction_extraction -----------------------------------------RARANGLTE-LDSAEALAKALGLSVSKLR-----WF--- ------AFHR---------------------------EVDT----AT--------------------------------- -----------------HYVSW-TI--PK--------RDG-------------------S----KRTI------------ ------TSPK----PE-LKAAQRWVLSN-V----VER--------LP-VH------------------------------ ---GAAHGFVA------------------G--R--------------------SIL--TNALAH---QGA---------- ----------------------------------D--V-VV-KV--------DLKDFF-PSVT----------------- -WRR--VKGL-L-------RKGGLPE---G------------------TSTLL---------SLLSTE------------ -------------------------------------------------------------------------------- ------------------APREAVQFRG---------KLLHV-------AKGPR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------ALP-----QGAPT------------------S- PGITNALC-----------------------------------------------------------LK----------- --------LD--KR-LSALAK----------------------------------------------------------- ----------------------R------------LG----FTYTR----------YADDLTF----------------- -----SWT-----KAKQPKPRR--PVA---------------VLLSRV-Q---------------------EVVE----- ----A--E----GFRV------H-P------DK----T---RVARKG-TRQ----------------------------- ------------------RVTGLVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|32192806|locus|VBIHalOch22000_1548|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Haliangium ochraceum DSM 14365] ---------------------------------------------------LADTTALAEALELPIPRLR-----WL--- ------VYHR---------------------------EVDR----HT--------------------------------- -----------------HYHRW-TV--PK--------RSG-------------------G----ERLI------------ ------SAPK----PE-LKRAQRWIARN-I----TEH--------LP-VH------------------------------ ---GAAHGFLP------------------G--R--------------------STA--TNAAVH---AGA---------- ----------------------------------R--V-II-KF--------DIRDFY-PSVT----------------- -LPR--VKGV-F-------RKAGYGE---Q------------------VATVM---------ALLCTE------------ -------------------------------------------------------------------------------- ------------------PPREEMVLRD---------KKYYV-------AVGPR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------SLP-----QGAPT------------------S- PSITNALA-----------------------------------------------------------LG----------- --------LD--SR-LAGLAR----------------------------------------------------------- ----------------------S------------LA----CRYTR----------YADDLTF----------------- -----SWH-----GDSEA------PIQ---------------RLKNAV-A---------------------RIVH----- ----G--E----GFRV------H-E------GK----T---RIMRAG-GRQ----------------------------- ------------------KVTGLVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|17530191|gb|AAL40743.1_extraction_extraction -----------------------------------------LIKREGVPA-IASAEELARAMGIALKELR-----FL--- ------AYNR---------------------------KVSR----VT--------------------------------- -----------------HYRRF-LL--PK--------KTG-------------------G----LRLI------------ ------SAPM----PR-LKRAQAWALEH-I----FNK--------LS-FE------------------------------ ---PAAHGFVA------------------G--R--------------------SIV--SNARPH---VGA---------- ----------------------------------D--V-VV-NL--------DLKDFF-PTVS----------------- -FPR--VKGA-L-------RHLGYSE---S------------------VATAL---------ALVCTE------------ -------------------------------------------------------------------------------- ------------------PEVDEVGLDG---------TTWYV-------ARGER-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------FLP-----QGSPC------------------S- PAITNLLC-----------------------------------------------------------RR----------- --------LD--RR-LHGLAQ----------------------------------------------------------- ----------------------A------------LG----FVYTR----------YADDLTF----------------- -----SGR-----G-EAAESK---RVG---------------KLLRGA-A---------------------DIVA----- ----H--E----GFVV------H-P------DK----T---RVMRRG-RRQ----------------------------- ------------------EVTGVVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|22088430|locus|VBIHahChe29232_3588|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Hahella chejuensis KCTC 2396] ------------------------------------------------------------------NELR-----FL--- ------AYSK---------------------------EVSK----LS--------------------------------- -----------------HYQQF-AI--AK--------KTG-------------------G----VREI------------ ------SAPM----PR-MKRAQYWVLDN-I----LAP--------LT-LH------------------------------ ---EAAHGFVV------------------E--R--------------------SIV--SNAQPH---VGK---------- ----------------------------------D--V-VI-NL--------DLKDFF-PTVS----------------- -YAR--IKGA-F-------RHLGYSE---Q------------------IATIL---------GLLCSQ------------ -------------------------------------------------------------------------------- ------------------PKTQEVEMDG---------QKWFV-------SEGER-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------FLP-----QGAPT------------------S- PAISNVIC-----------------------------------------------------------RK----------- --------LD--RR-LQSMAA----------------------------------------------------------- ----------------------K------------LG----FTYTR----------YADDVTF----------------- -----SAD-----G-KSDDD-----VK---------------RLLWRC-R---------------------SIIK----- ----D--E----GFVV------H-P------DK----T---RIMRKH-RRQ----------------------------- ------------------EVTGVVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|22548316|locus|VBIMetNod76414_3882|_extraction hypothetical protein [Methylobacterium nodulans ORS 2060] ------QRHARALAWHERRAEELVFLGDGVSAGLRADAAAPPAPRPGLPV-LATPKALADAIGIPLAELR-----FL--- ------AYDR---------------------------ALSR----IS--------------------------------- -----------------HYWRF-TI--PK--------KAG-------------------G----VRLI------------ ------SAPM----PR-LKRAQYWILDN-L----LAH--------AP-VH------------------------------ ---DAAHGFVP------------------G--R--------------------SIV--TNAAAH---VGR---------- ----------------------------------A--V-VV-NL--------DLKDFF-PTLS----------------- -FRR--VKGK-F-------RGLGYAE---P------------------VATVL---------ALLCTE------------ -------------------------------------------------------------------------------- ------------------PDVDEVEIDG---------ERLFA-------ARGPR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------RLP-----QGAPT------------------S- PLFTNLIC-----------------------------------------------------------AR----------- --------LD--AR-LSGLAR----------------------------------------------------------- ----------------------S------------MG----FTYTR----------YADDLTF----------------- -----S-------G-AAAEK-----IG---------------ALIGLV-G---------------------EIVA----- ----A--E----GFVV------H-P------DK----T---RIMRRG-SRQ----------------------------- ------------------EVTGLTVNERVAVPRDVLRRFRAL-------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|23988607|locus|VBITriEry99848_2732|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Trichodesmium erythraeum IMS101] --------------------------------------------IYNLPI-LNTAQDIANAMDITVGELR-----FL--- ------AFSR---------------------------PTST----VS--------------------------------- -----------------HYIRF-KI--PK--------KTG-------------------G----ERQI------------ ------SAPM----PR-LKNVQTWILDN-I----LCN--------VP-LH------------------------------ ---EAVHGFRA------------------G--H--------------------SIL--TNAEPH---VAK---------- ----------------------------------D--V-II-NF--------DLKNFF-PSIC----------------- -YKR--VKGL-F-------FSLGYSE---A------------------AATIF---------ALLCTE------------ -------------------------------------------------------------------------------- ------------------PNVVVVELDG---------QVYYV-------AQSDR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGSPA------------------S- PAITNIMC-----------------------------------------------------------RR----------- --------LD--KR-LSEMAA----------------------------------------------------------- ----------------------K------------LD----FVYTR----------YADDLTF----------------- -----SSS-----G-DSLRH-----IC---------------NVFRRT-E---------------------SIVS----- ----H--E----GFTV------N-R------EK----T---RLLRGK-SS------------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115666271|locus|VBIChaMin231992_6191|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Chamaesiphon minutus PCC 6605] --------------------------------------------QHRLPI-LHAAADLANAMGITVGQLR-----FL--- ------AFSR---------------------------RIAT----VS--------------------------------- -----------------HYVRF-QI--PK--------KTG-------------------G----FRAI------------ ------SAPL----PR-LKQAQQWILDN-I----LEG--------VV-LH------------------------------ ---PTAHGFRR------------------G--R--------------------SIV--TNAEPH---VGA---------- ----------------------------------L--V-VI-NM--------DLQDFF-PSIS----------------- -YAR--VKGI-F-------RSLGYSE---A------------------IATIL---------GLICTE------------ -------------------------------------------------------------------------------- ------------------SDVTEIELDG---------RSYYI-------AQELR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAPT------------------S- PALTNLLC-----------------------------------------------------------RR----------- --------LD--RR-LDRMAK----------------------------------------------------------- ----------------------S------------RG----FIYTR----------YADDLTF----------------- -----STT-----DRERLRD-----IG---------------NILKGT-H---------------------GIVT----- ----H--E----GLTI------H-P------DK----T---RVLRQS-Q------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|87270320|locus|VBIFleLit174749_3500|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Flexibacter litoralis DSM 6794] -----------------------------------------KLEKLKLPY-FENIVEFSEKINKPISELR-----FL--- ------AFQR---------------------------SVSK----VN--------------------------------- -----------------QYHNY-YV--PK--------KSG-------------------G----KRLI------------ ------SAPK----PK-LKHTQNWIKTN-V----LDK--------IE-IN------------------------------ ---ENVHGFVK------------------E--R--------------------SIL--TNAEPH---QNK---------- ----------------------------------N--L-VI-SL--------DLKDFF-PSIS----------------- -YKR--VKGL-F-------LKFGYSE---Q------------------LSTLF---------GLLTTH------------ -------------------------------------------------------------------------------- ------------------NETDKLNVDG---------EIYYAQKVDKETGKTNR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------FLP-----QGSPA------------------S- PAITTLIA-----------------------------------------------------------YK----------- --------MD--KR-LEGLAK----------------------------------------------------------- ----------------------K------------MG----FTYTR----------YADDLTF----------------- -----SSDL----DLKKDTKATNKIIG---------------SLLYFV-K---------------------KVVT----- ----S--E----GFEI------H-P------DK----T---HIMRKG-NQQ----------------------------- ------------------KVTGIIV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|22752516|locus|VBINosPun48114_0472|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Nostoc punctiforme PCC 73102] --------------------------------------------SHGLAE-YNTAEEIAFAMVISLEKLH-----FL--- ------TTST---------------------------SLTR--------------------------------------- -----------------HYLPF-KI--SK--------KTG-------------------G----KRII------------ ------SAPK----PE-LKAAQRWILEN-I----LEK--------LE-VH------------------------------ ---NAAHGFCK------------------N--R--------------------SIV--TNAKPH---VGA---------- ----------------------------------N--V-IV-NI--------DLQNFF-QSIS----------------- -YKR--IKEL-F-------SGFGYSE---S------------------TATIF---------GLICT------------- -------------------------------------------------------------------------------- --------------------TAEIAING---------QINHT-------ASENR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGSPA------------------S- PAISNLVC-----------------------------------------------------------RN----------- --------LD--IR-LAAIAE----------------------------------------------------------- ----------------------N------------LG----FCYTR----------YADDLTF----------------- -----STSE----DASSK-------IS---------------NLIKNT-K---------------------FIIH----- ----G--E----NFTV------N-D------NK----T---KISSKS-V------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|19396752|locus|VBICloBot123574_1885|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium botulinum B str. Eklund 17B] --------------------------------------------------IINDDKELASFLGIEYKKLR-----FL--- ------VYHR---------------------------DVVS----VD--------------------------------- -----------------HYYRY-TI--PK--------KRG-------------------G----VRNI------------ ------AAPK----SV-LKNSQRKILDD-I----LLK--------IP-TS------------------------------ ---DEAHGFLK------------------G--K--------------------SVV--SAAESH---FKQP--------- ----------------------------------E--L-LI-NI--------DLEDFF-PTIT----------------- -FER--VRGL-F-------KSFGYSG---Y------------------IASML---------AMICTY------------ -------------------------------------------------------------------------------- ------------------CERMKIEIRG---------EEKYI-------KTSNR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------ILP-----QGSPA------------------S- PMITNIIC-----------------------------------------------------------IK----------- --------LD--KR-LKGLSS----------------------------------------------------------- ----------------------K------------YN----FTYTR----------YADDMSF----------------- -----SFN-----GDDYELN-----IG---------------RFIGLV-S---------------------KIVK----- ----E--E----GFNI------N-K------DK----T---KFLKKN-NCQ----------------------------- ------------------CITGIVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|19511877|locus|VBICloTet101274_1986|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium tetani E88] --------------------------------------------------IIKDDKELAKFLGIEYKKLR-----FL--- ------VYHR---------------------------DVVS----VD--------------------------------- -----------------NYHRY-TI--PK--------KKG-------------------G----VRNI------------ ------AAPK----SL-LKNAQRKILEE-I----LSK--------LP-VS------------------------------ ---EYAHGFLK------------------E--K--------------------SVV--SGAKAH---KNKP--------- ----------------------------------E--L-LI-NI--------DLEDFF-PTIT----------------- -FER--VRGM-F-------KGFGYSG---Y------------------VASLL---------SMICTY------------ -------------------------------------------------------------------------------- ------------------CERMEVEVRG---------EIKYV-------KTSHR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------ILP-----QGSPA------------------S- PMITNIIC-----------------------------------------------------------RK----------- --------LD--KR-LSGLAS----------------------------------------------------------- ----------------------K------------YS----FTYTR----------YADDMSF----------------- -----SFI-----NENTDLT-----YG---------------RLIGLI-S---------------------KIVK----- ----E--E----GFNI------N-K------NK----T---KFLRQN-NRQ----------------------------- ------------------CITGIVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|33240062|ref|NP_875004.1_extraction_extraction -----------------------------------------KLELNGLPV-ITTFAELSSTLETTPNSLQ-----WL--- ------TYER---------------------------DSQK----ID--------------------------------- -----------------HYTRF-EI--PK--------KSG-------------------G----SRLI------------ ------SSPK----PA-LRNAQKWILES-I----LNK--------LD-IH------------------------------ ---SAATAFRP------------------G--K--------------------SIL--DNAKLH---ANS---------- ----------------------------------K--V-VL-RL--------DLKDFF-PSIT----------------- -FIR--IRGL-F-------ESLGYNP---G------------------ISTVF---------SLLCTD------------ -------------------------------------------------------------------------------- ------------------SPRIILKYHG---------ETHFV-------KVGPR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------NLP-----QGACT------------------S- PALANLIA-----------------------------------------------------------HK----------- --------LD--RR-LQKYTE----------------------------------------------------------- ----------------------K------------IG----WIYSR----------YADDLVF----------------- -----SSN-----AEEMPAY----------------------RLVKAA-S---------------------KIVA----- ----S--E----GFRI------N-K------HK----T---NIMRHP-HRQ----------------------------- ------------------TVTGLVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|23006148|locus|VBIProMar70153_1246|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Prochlorococcus marinus str. MIT 9312] --------------------------------------------SNKIPE-LNSFVDVANALEVTPSKLQ-----WL--- ------IYER---------------------------GASN----ID--------------------------------- -----------------HYLRY-EI--PK--------KSG-------------------G----KRLI------------ ------SSPK----KD-MKKAQKWILEN-I----LIN--------LD-VD------------------------------ ---KAAMAFQK------------------G--L--------------------SII--DNASLH---VKS---------- ----------------------------------K--I-IV-RI--------DIKDFF-PTIT----------------- -FPR--VRGF-F-------ESLGYNP---G------------------VATVF---------ALICTD------------ -------------------------------------------------------------------------------- ------------------SPKVILKQEAIDGDKNNKDYPHFI-------AISER-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------SLP-----QGACT------------------S- PSLANLIC-----------------------------------------------------------RK----------- --------ID--SR-LNGYSS----------------------------------------------------------- ----------------------K------------SG----WKYSR----------YADDLIF----------------- -----STT-----SEDSYPH----------------------RLIKSI-S---------------------SIIS----- ----E--E----GFKV------N-Q------SK----T---RLMRAP-N------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|20090946|ref|NP_617021.1_extraction_extraction ---------------------------------------------------------FCKLTNSSLKQVN-----LF--- ------LSNK---------------------------K--------K--------------------------------- -----------------GYITF-KL--PK--------KNG-------------------D----FREI------------ ------NAPS----KK-MKYIQRWILDN-I----LYK--------LN-SG------------------------------ ---DYAHGFIP------------------G--K--------------------TIF--TNAKVH---VNQ---------- ----------------------------------D--L-VL-GV--------DIKDFF-PSIN----------------- -FRS--VYYV-F-------KSAGYTK---K------------------IAWTL---------ADLCTY------------ -------------------------------------------------------------------------------- ------------------HWK----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PMLANLVA-----------------------------------------------------------LK----------- --------LD--KK-IAKYCA----------------------------------------------------------- ----------------------R------------RN----FRYSR----------YADDVTI----------------- -----SGSY----KLPMHKE--------------------------KI-I---------------------GIIE----- ----D--D----GFVV------N-H------EK----T---RMFSKG-SRQ----------------------------- ------------------KVTGLVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|23503472|locus|VBISheHal24697_0538|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Shewanella halifaxensis HAWEB4] -----------------------------------------------------------SLV----SRMTGV-------- ----------------------------------------S----NL--------------------------------- -----------------FYRQF-PL--KK--------RSG-------------------G----IRTI------------ ------ESPY----PK-LAYVQRWIKNH-I----LEI--------KP-IS------------------------------ ---SNALAYVQ------------------G--S--------------------SHI--ENAKRH---IGA---------- ----------------------------------K--E-LL-KI--------DLVDFF-EYIK----------------- -LST--VKSI-F-------TECGYTE---K------------------VSHQL---------AKLCTL------------ -------------------------------------------------------------------------------- ------------------RDR----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPS------------------S- PVISNLVL-----------------------------------------------------------VE----------- --------LD--KR-LQSISK----------------------------------------------------------- ----------------------K------------HE----LVYTR----------YADDLCF----------------- -----SGH--------------AI-SD---------------EFFSLV-K---------------------DEIE----- ----S--E----GFVV------N-Q------NK----S---GVVRGH-KRK----------------------------- ------------------LITGLVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|124818457|locus|VBIDesSul232581_2941|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfocapsa sulfexigens DSM 10523] ------------------------------------------------------IIFSQNHL----SRLVGYNIDYI--- ------RNAS---------------------------STKS----DQ--------------------------------- -----------------FYRCY-KI--KK--------KSS------------------DT----MRTI------------ ------CEPL----PS-LKEIQRWILDN-I----LSQ--------VT-IS------------------------------ ---KFAKAYVV------------------G--F--------------------SIK--DNARFH---LRQ---------- ----------------------------------K--Q-VL-RI--------DVKDFF-PSIK----------------- -GQN--VFHI-F-------KNIGYSA---E------------------VSAML---------TGLTTL------------ -------------------------------------------------------------------------------- ------------------KNC----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PTLSNIFM-----------------------------------------------------------NR----------- --------CD--AR-IAGYCL----------------------------------------------------------- ----------------------A------------RK----IRYTR----------YSDDLTF----------------- -----SGE--------------FD-VG---------------KLISFI-N---------------------MVFK----- ----D--S----GLLL------N-K------TK----T---KQMFKH-QRQ----------------------------- ------------------FTTGLVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|42744943|locus|VBIBacFra167533_1043|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacteroides fragilis 638R] ------------------------------------------------RFDDGAIRWCANLLATEESRLREV-LDYI--- ------PRQ----------------------------------------------------------------------- ------------------YTCF-HV--RK--------RSG-------------------G----FRYI------------ ------SAPA----GD-FRSMQQTIYHR-I----LLL--------AN-IH------------------------------ ---PAVTGFCP------------------G--K--------------------SVS--DNARVH---LGR---------- ----------------------------------K--N-VL-KV--------DLHDFF-PSIR----------------- -SPR--VRAA-F-------REMGYSR---S------------------IAKVL---------AELCCL------------ -------------------------------------------------------------------------------- ------------------RCC----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PALSNIIA-----------------------------------------------------------YP----------- --------MD--KK-MMALAG----------------------------------------------------------- ----------------------E------------YG----LVYTR----------YADDLTF----------------- -----SGD--------------YLPKD---------------EVLVRI-H---------------------RIIR----- ----E--E----GFTM------N-V------KK----T---RFLSEH-KRK----------------------------- ------------------IITGVSV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|22093614|locus|VBIHahChe29232_6135|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Hahella chejuensis KCTC 2396] ------------------------------------------------------------------EDLD-----WL--- TQARGAP-RK---------------------------AASA----LG--------------------------------- -----------------HYTYR-WL--DK--------PRG-------------------G----ARLL------------ ------EAPK----VW-LKGIQRRIYQE-I----LAP--------LP-LH------------------------------ ---EAVHGFRA------------------Q--R--------------------SSL--THARVH---VGQ---------- ----------------------------------A--L-LL-KF--------DLKDFF-PSVG----------------- -YPQ--VYRV-L-------RRLGYGH---E------------------VTRLL---------SRLCTQ------------ -------------------------------------------------------------------------------- ------------------VTPNEILRSPAARHL--------ERGEE---LFRRA-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAPT------------------S- PALANLVA-----------------------------------------------------------AH----------- --------LD--RR-LSALAD----------------------------------------------------------- ----------------------S------------MG----RRYSR----------YADDFVL----------------- -----SGAL----MSPGA-------IE---------------RLQALV-G---------------------AIAL----- ----E--E----GFVL------N-T------RK----S---AVIGQG-ARQ----------------------------- ------------------QVGGVVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|22653979|locus|VBIMyxXan43560_5634|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Myxococcus xanthus DK 1622] --------------------------------------------------SLSTQAELAEWLGVTGAQLD-----WL--- ADVPGIERTS---------------------------SPGR----WR--------------------------------- -----------------RYRYT-WI--PK--------RSG-------------------G----ERLL------------ ------ERPG----LD-LVLVQRKLLHD-I----LDH--------IP-PH------------------------------ ---DAAHGFVA------------------G--R--------------------SVR--RFAEPH---ARQ---------- ----------------------------------A--V-VV-RV--------DIEDFF-FAVR----------------- -PAR--IWAV-F-------RTAGYPD---G------------------VVRAL---------AGLCTN------------ -------------------------------------------------------------------------------- ------------------RTPSA-VVSQARRPDSLADASTRWRLAR---RLESR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAPT------------------S- PALANLAA-----------------------------------------------------------YR----------- --------LD--VR-LSALAE----------------------------------------------------------- ----------------------A------------MG----ARYTR----------YADDMAF----------------- -----SGGD----ALARR-------TH---------------KLLRYV-S---------------------QILR----- ----E--E----GFTP------R-T------GK----T---RVMHRS-TQQ----------------------------- ------------------RLAGVVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|45389180|locus|VBIVarPar156291_1643|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Variovorax paradoxus EPS] ------------------------------------------GLDHFALPRLPTLADLAQWLELEPDRLA-----WLTAA SQAFRTPDAS---------------------------SPRL----AS--------------------------------- -----------------HYRYQ-LQ--PK--------RLG-------------------G----LRLL------------ ------EIPK----AD-LKRAQRRILDD-L----LQR--------VP-VH------------------------------ ---EAAHGFVQ------------------E--R--------------------SVA--SHAAAH---VGK---------- ----------------------------------E--V-VI-GF--------DLRDFF-PSIR----------------- -ASR--VHAL-W-------RTLGYPE---G------------------VARAL---------TALCTH------------ -------------------------------------------------------------------------------- ------------------RTSAA--VIERLRDDGGLD----WMGAK---RLAEP-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGSPC------------------S- PALANLCA-----------------------------------------------------------FR----------- --------LD--LR-LEGLAW----------------------------------------------------------- ----------------------I------------FG----ATYTR----------YADDLVF----------------- -----SGPA----SLRGR-------FS---------------ALRAWV-D---------------------GISA----- ----D--E----GFAL------H-P------RK----V---RCMPRH-HQQ----------------------------- ------------------RVTGVVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|19209845|locus|VBIBurPhy117947_3716|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Burkholderia phytofirmans PsJN] ------------------------------------------WLHDVALPQLPTLGDLAAWLDIEPGDLG-----WF--- ADRWRVPTRG---------------------------AATP----LH--------------------------------- -----------------HYAYK-AI--EK--------RDG-------------------R----CRII------------ ------EIPK----PR-LRALQRKVLSG-L----LDR--------IP-AH------------------------------ ---ESVHGFRH------------------G--R--------------------NIV--TFAAPH---VGK---------- ----------------------------------A--V-VM-RF--------DLTDFF-ASVH----------------- -AGR--VYSA-F-------YALGYPQ---A------------------VARAL---------TALCTN------------ -------------------------------------------------------------------------------- ------------------RIPSGRLLAPDVRER--ID----WRERQ---RYRNR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAPT------------------S- PALANLCA-----------------------------------------------------------FR----------- --------LD--LR-LAGLAR----------------------------------------------------------- ----------------------S------------VG----ATYTR----------YADDLAF----------------- -----SGDE----ELARM-------AD---------------RLCIRV-A---------------------AIAL----- ----E--E----GFGV------N-L------RK----T---RVMRRS-ARQ----------------------------- ------------------HLAGVVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|19085378|locus|VBIBurCen118154_6802|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Burkholderia cenocepacia J2315] ------------------------------------------PLAGCDVPQWPTPGDLAGWLGVSAPELD-----WL--- SDHWRVDARS---------------------------GATP----LH--------------------------------- -----------------HYTYV-AV--DK--------RSG-------------------G----CRLV------------ ------EIPK----GR-LREAQRRILRG-L----LDR--------IA-PH------------------------------ ---GAVHGFRK------------------G--H--------------------GIV--SFATPH---ADR---------- ----------------------------------D--V-VV-RF--------DLADFF-VSVR----------------- -AAR--VHAL-F-------ATLGYPA---E------------------VARIL---------TGLCTN------------ -------------------------------------------------------------------------------- ------------------RVPSARLLAPDLRDR--FD----WIGRQ---RYRER-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAPT------------------S- PALANLCA-----------------------------------------------------------FR----------- --------LD--LR-LAALAR----------------------------------------------------------- ----------------------S------------VD----ATYTR----------YADDLAF----------------- -----SGGG----ALARD-------VE---------------RLQVRV-A---------------------AIAL----- ----E--E----GFAL------Q-L------RK----T---RVMRRG-TRQ----------------------------- ------------------QLAGVVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|23418760|locus|VBISacEry28377_5670|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Saccharopolyspora erythraea NRRL 2338] ---------------------------------------------RWPVARWHTAEQLAHGLDLDVGELM-----WF--- ADRGGWLRHA---------------------------HPP-----LR--------------------------------- -----------------HYRCR-WM--RT--------RSG-------------------G----IRLI------------ ------EQPK----AR-LAELQRRITRR-V----VDA--------MP-VH------------------------------ ---EAAHGFRR------------------G--R--------------------SAA--TCAAEH---AGR---------- ----------------------------------H--F-LV-RV--------DVEGFF-ASLT----------------- -FTR--ISRA-M-------RAAGYPG---A------------------VAGAI---------GGLLTT------------ -------------------------------------------------------------------------------- ------------------ATPRDVLAAAPRVRESEVDAR--RRLLG---RLAAA-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HVP-----QGAPS------------------S- PALANALM-----------------------------------------------------------HR----------- --------LD--RR-IDAYAG----------------------------------------------------------- ----------------------T------------LG----ATYTR----------YADDLAF----------------- -----SGDR----RLP---------VA---------------ALLRGV-T---------------------AIVR----- ----A--E----GLRL------R-Q------SK----T---RVLAPH--------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|161717503|locus|VBIAmyOri230246_0836|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Amycolatopsis orientalis HCCB10007] ---------------------------------------------RFQVARWTTFDECAAALDLTPGELS-----WF--- ADAKGWNRRA---------------------------AEP-----VR--------------------------------- -----------------HYTYR-WI--PT--------ASG-------------------G----VRLI------------ ------ERPK----PR-LAELQRRIVRH-V----VDA--------LP-VH------------------------------ ---EAAHGFRR------------------G--R--------------------SPL--SCAAPH---AGR---------- ----------------------------------E--T-VV-RM--------DLEGFF-PTVS----------------- -ARR--ISAL-L-------ALAGYPP---A------------------VAEAL---------AGVLTT------------ -------------------------------------------------------------------------------- ------------------AVPPAVLATVPGGRRD--PAR--TRLLS---NLAAT-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAPS------------------S- PSVANAVT-----------------------------------------------------------HH----------- --------LD--RR-LGGLAR----------------------------------------------------------- ----------------------A------------LG----ATYTR----------YADDLAF----------------- -----SGD-----SLP---------LH---------------RLLPGV-R---------------------RIVT----- ----D--E----GFRL------R-D------DK----T---SIAGAH--------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|54397388|locus|VBIPseDio141225_2974|_extraction retrontype reverse transcriptase [Pseudonocardia dioxanivorans CB1190] ---------------------------------------------RWPVQPWEDVDALAAGLDLRAGELD-----WF--- ADPGGWLDRS---------------------------ASPR----LA--------------------------------- -----------------HYRRR-W---------------H-------------------G----QRLI------------ ------ETPA----PR-LAELQRRVGRR-V----LAH--------IP-VH------------------------------ ---DAAHGFVA------------------G--R--------------------SPL--TLAREH---SGR---------- ----------------------------------Q--W-VL-RL--------DVEGFF-SRIG----------------- -PAR--IAGL-L-------GAAGYPA---A------------------VAGAL---------AGLLVT------------ -------------------------------------------------------------------------------- ------------------STPAAVLRRAPERPADAVRDR--LR---------AP-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAPT------------------S- PAVANVLA-----------------------------------------------------------HG----------- --------LD--RR-LSALAA----------------------------------------------------------- ----------------------A------------AG----ARYGR----------YADDLVL----------------- -----SGDG----PLP---------VQ---------------GLLRRA-R---------------------EIAA----- ----D--E----GFDV------R-P------AK----T---RVMPAH--------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|23978013|locus|VBITolAue42623_0706|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Tolumonas auensis DSM 9187] --------------------------------------------------------------------LE-----LL--- --------KM---------------------------IHSP----DS--------------------------------- -----------------FYREF-DI--PK--------KKG-------------------G----VRHI------------ ------VSPY----PS-LLSCQKWIYNN-I----LKK--------IS-IH------------------------------ ---PAAHGFNL------------------N--H--------------------SIV--TNAQSH---LNK---------- ----------------------------------K--C-LL-QM--------DVKDFF-PSLP----------------- -INW--VINL-F-------SSLGYSH---N------------------VSFNL---------ASLCCL------------ -------------------------------------------------------------------------------- ------------------NDK----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAAT------------------S- PYLSNILL-----------------------------------------------------------VG----------- --------LD--KR-LSLLST----------------------------------------------------------- ----------------------S------------YQ----LTYTR----------YADDLCF----------------- -----SGE---------------YIPH---------------ELIQTI-E---------------------SIII----- ----D--Y----NLTP------N-K------NK----T---RLQLNN-NKR----------------------------- ------------------IVTGISV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|134078|sp|P23070_extraction_extraction ------------------------------------------RLRNLGLPVMNNLHDMSKATRISVETLR-----LLIYT ADF----------------------------------------------------------------------------- -----------------RYRIY-TV--EK--------K---------------------GPEKRMRTI------------ ------YQPS----RE-LKALQGWVLRN-I----LDK--------LS-SS------------------------------ ---PFSIGFEK------------------H--Q--------------------SIL--NNATPH---IGA---------- ----------------------------------N--F-IL-NI--------DLEDFF-PSLT----------------- -ANK--VFGV-F-------HSLGYNR---L------------------ISSVL---------TKICCY------------ -------------------------------------------------------------------------------- ------------------KNL----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPS------------------S- PKLANLIC-----------------------------------------------------------SK----------- --------LD--YR-IQGYAG----------------------------------------------------------- ----------------------S------------RG----LIYTR----------YADDLTL----------------- -----SAQS----MKK---------VV---------------KARDFL-F---------------------SIIP----- ----S--E----GLVI------N-S------KK----T---CISGPR-SQR----------------------------- ------------------KVTGLVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|32132182|locus|VBIEscFer122920_3781|_extraction RNAdirected DNA polymerase (EC 2.7.7.49), msDNA specific [Escherichia fergusonii ATCC 35469] -------------------------------------------------------------------------------- ---------------------------------------HA----GR--------------------------------- -----------------HYRRI-IL--SK--------RHG-------------------G----QRLV------------ ------LAPD----YL-LKTVQRNILKN-V----LSQ--------FP-LS------------------------------ ---SFATAYRP------------------G--C--------------------PIV--SNAQPH---CQQ---------- ----------------------------------P--Q-IL-KL--------DIENFF-DSIS----------------- -WLQ--VWRV-F-------RQAQLPR---N------------------VVTML---------TWLCCY------------ -------------------------------------------------------------------------------- ------------------NDA----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PAISNLVM-----------------------------------------------------------CR----------- --------FD--ER-IGEWCQ----------------------------------------------------------- ----------------------A------------RG----ITYTR----------YCDDMTF----------------- -----SGHF----N-----------AR---------------LVKNKV-C---------------------GLLA----- ----E--L----GLNL------N-Q------RK----S---CLVAAC-KHQ----------------------------- ------------------QVTGIVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42840291|locus|VBICloCf158569_1292|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium cf. saccharolyticum K10] -------------------------------------------------EMILSMKLLGYASLTEEQELS-----VL--- -------YEL---------------------------SNHA----SR--------------------------------- -----------------HYREA-VI--PK--------RTG-------------------G----VRHL------------ ------MVPD----ST-LKAVQRRILRN-V----LSG--------LE-LS------------------------------ ---EQAYAYRK------------------G--V--------------------GIR--ENAAVH---TGQ---------- ----------------------------------E--K-IL-KL--------DIHDFF-GSIT----------------- -SSS--VYGLAF-------PGTVFPP---Q------------------VRGLL---------TSLCCL------------ -------------------------------------------------------------------------------- ------------------RGR----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGSPA------------------S- PAISNLVM-----------------------------------------------------------RP----------- --------FD--EH-MGLWCR----------------------------------------------------------- ----------------------E------------RN----IRYSR----------YCDDMTF----------------- -----SGSF----D-----------AG---------------EVIRKV-S---------------------GFLS----- ----E--R----GMRL------N-W------KK----T---GVYGKN-CRQ----------------------------- ------------------EVTGLTV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|191810597|locus|VBIGorPol218398_4913|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Gordonia polyisoprenivorans VH2] ---------------------------------------------PEGLPDLYDVTQLAHWLDFTVPELE-----WF--- -ADRGGWLRT---------------------------SRPP----LR--------------------------------- -----------------HYRIW-RR--SK--------RDG------------------------FRVI------------ ------EAPK----PR-MRETQRRLLRR-L----VEH--------VP-AH------------------------------ ---PCARGFVA------------------G--S--------------------STT--AFAWPH---SDR---------- ----------------------------------P--V-VL-RA--------DLRHCF-ETIT----------------- -TTR--VRRV-F-------HDVGYPA---H------------------IARIL---------AELCTT------------ -------------------------------------------------------------------------------- ------------------ATPIDELGGIDLAHRA---------------LLRDR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAPT------------------S- PHLANLVM-----------------------------------------------------------RG----------- --------LD--RR-LDGYAR----------------------------------------------------------- ----------------------A------------NG----LRYTR----------YGDDLAL----------------- -----SGD--------------SMDAD---------------RTLWVV-L---------------------KIIE----- ----T--E----GFTA------H-P------DK----V---RVMYRH--------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|96825515|locus|VBIGorSp50678_2729|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Gordonia sp. KTR9] ---------------------------------------------LDDLPDLYDVEQFAHWLDFTVPELE-----WF--- -ADRGQWLRT---------------------------ARLP----LR--------------------------------- -----------------HYRIW-RR--EK--------RDG------------------------VRVI------------ ------EAPK----PR-MRETQRRLLRR-L----VER--------IP-AH------------------------------ ---PAARGFVP------------------G--S--------------------SPA--AFAWPH---TDR---------- ----------------------------------P--A-VV-RV--------DLRHCF-ETIT----------------- -VQR--VRAV-F-------RDAGYQP---H------------------IARLL---------AELCTT------------ -------------------------------------------------------------------------------- ------------------ATPVDELQGLDREHAV---------------LLRDR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAPT------------------S- PHLANLVM-----------------------------------------------------------RP----------- --------LD--RR-LNGYAR----------------------------------------------------------- ----------------------R------------NG----LRYTR----------YGDDLAI----------------- -----SGD--------------AINAD---------------RALWTM-L---------------------RIVE----- ----D--E----RFTV------H-P------GK----V---QIMYSH--------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|58974322|locus|VBIStaAur171735_1995|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Staphylococcus aureus subsp. aureus ECTR 2] ----------------------------------------------------------------------GIKSDYF--- -------YKC---------------------------LYVN----DH--------------------------------- -----------------FYNVI-KI--PK--------RKK------------------DE----YREL------------ ------MIPN----MA-LKNIQRWILDN-V----LYR--------RQ-VH------------------------------ ---KCATGFVP------------------R--K--------------------SIV--NNAIPH---VGQ---------- ----------------------------------K--Y-IL-KM--------DIENFF-PSIT----------------- -FKQ--VRKI-F-------SEMGYKF---E------------------LATAL---------ANLCTV------------ -------------------------------------------------------------------------------- ------------------NNQ----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PYIANIIF-----------------------------------------------------------YN----------- --------ID--KR-IFSYCQ----------------------------------------------------------- ----------------------K------------NN----LRYTR----------YADDITI----------------- -----SGSN----K-----------VS---------------FSKEII-R---------------------EIVN----- ----Q--Y----NFRI------N-E------SK----T---IMFKPG-DRK----------------------------- ------------------KLLVL--------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|61385079|locus|VBIRahAqu175761_1636|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Rahnella aquatilis CIP 78.65 = ATCC 33071] ------------------------------------------------------------------SALT-----HC--- ------LYIV---------------------------KP------EN--------------------------------- -----------------QYVQF-EI--PK--------RKG-------------------G----KRII------------ ------SAPS----GM-LKNLQSSLSDL-L----LDC--------LD-EIIFLKFPDSEITRQKAKNSLFLKVKCSGSEI KQPSLSHGFER------------------K--R--------------------SII--TNAMMH---LGK---------- ----------------------------------K--H-VF-NI--------DLENFF-GSFN----------------- -FGR--VRGF-FIKN----GNFLLEP---E------------------IATVI---------AKIACY------------ -------------------------------------------------------------------------------- ------------------NN-----------------------------E------------------------------ -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGSPC------------------S- PVISNLIT-----------------------------------------------------------HA----------- --------LD--IK-LAAVAA----------------------------------------------------------- ----------------------K------------HS----CTYTR----------YADDITF----------------- -----S-------TRKESLPSSVAKSD-----NN-TF-----VAGKVI-R---------------------REIA----- ----R--S----GFSI------N-E------SK----T---RDQYKD-SRQ----------------------------- ------------------EVTGLVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|23598545|locus|VBISheSp103602_2506|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Shewanella sp. W3181] ------------------------------------------------------------------AFLT-----NT--- ------LY-----------------------------KPGVNSHVNS--------------------------------- -----------------HYHQF-DI--TK--------KSG-------------------G----VRTI------------ ------SAPS----DE-LKDLQRRLSDL-L----LDC--------KA-VIQFDK-----------------KIEC----- ---TLSHGFER------------------E--K--------------------SII--TNARIH---RGK---------- ----------------------------------K--N-VL-NL--------DLADFF-GSFN----------------- -FGR--VRGY-FIAN----KDFKLNP---H------------------IATVI---------AQIACY------------ -------------------------------------------------------------------------------- ------------------KD-----------------------------T------------------------------ -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGSPC------------------S- PVIANLIT-----------------------------------------------------------NS----------- --------LD--IK-LSKLAK----------------------------------------------------------- ----------------------K------------NG----CSYTR----------YADDITF----------------- -----S-------TRKKCFPIAIVK-N-----VE-SL-----TLGSKL-L---------------------GEIK----- ----R--A----GFSI------N-T------SK----T---RLQFKD-SRQ----------------------------- ------------------EATGLVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|61324279|locus|VBIPanAna213218_2623|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Pantoea ananatis PA13] ------------------------------------------------------------------VFLT-----RV--- ------VYMR---------------------------DT------EK--------------------------------- -----------------LYTDF-TI--LK--------KNK-------------------T----QRVI------------ ------SAPD----EE-LKEIQRKISDL-L----LDC--------LT-TIRNEN-----------------KSNN----- ---KLSHGFEL------------------G--K--------------------NII--TNAERH---KSK---------- ----------------------------------K--W-VL-NI--------DLLNFF-DQFN----------------- -YGR--VSGY-FIKN----KFFELHS---N------------------IANLI---------AKIACY------------ -------------------------------------------------------------------------------- ------------------KN-----------------------------K------------------------------ -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGSPC------------------S- PVISNLIL-----------------------------------------------------------QS----------- --------LD--QR-LNNICK----------------------------------------------------------- ----------------------K------------NG----CTYSR----------YADDITI----------------- -----S-------TNKKKFPKAIVSNH-----SDKEI-----NLNNGF-I---------------------KEIL----- ----R--A----GFSI------N-N------DK----T---RLRFNT-LRQ----------------------------- ------------------EVTGLTV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|58718285|locus|VBIMicAer161757_1975|_extraction RNAdirected DNA polymerase [Micavibrio aeruginosavorus ARL13] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ----AAHGFTK------------------D--R--------------------SII--TNARPH---RRK---------- ----------------------------------R--W-VL-NL--------DLKDFF-PSIN----------------- -FGR--IRGF-FIKN----KNFNLDP---D------------------VASAI---------AHICCY------------ -------------------------------------------------------------------------------- ------------------EG-----------------------------K------------------------------ -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGSPC------------------S- PIISNLIG-----------------------------------------------------------HL----------- --------LD--VR-LIKIAK----------------------------------------------------------- ----------------------K------------YS----CTYTR----------YADDITF----------------- -----S-------SDLRKFPKEIAHKK---LFSKKSW-----ELSNSI-K---------------------QEIE----- ----K--S----GFFV------N-E------KK----T---RMQYQE-SRQ----------------------------- ------------------DVTSLVVNKKVNVKRELYKTLRAQCHTLFRTGGCYEIAYQRPENEKISYLQRILDKIFYRPK KIKEKKEDRKYKTLDQIQGMLNFVYQVRHDRDKKLGIAERKEIESSIKSLYRKFLFFINFYYIDRPLIICEGKTDPIYLK CALKSLHLKYPGLIEKTSEGFDFRIK >Gfid|108026150|locus|VBILegPne122099_4451|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Legionella pneumophila subsp. pneumophila] -------------------------------------------------------------------NLA-----KL--- ------IYPT---------------------------T-------NN--------------------------------- -----------------LYHSF-SI--PK--------KSG-------------------E----LREI------------ ------NAPN----RI-LKEIQYKLLNE-L----QSY--------YS-KR------------------------------ ---NCTHGFIN------------------E--R--------------------NVV--TNARPH---VKK---------- ----------------------------------N--I-IL-NL--------DLKEFF-QSIH----------------- -FGR--VRNL-FMS-----NPFNFNK---N------------------VATVL---------AQICCH------------ -------------------------------------------------------------------------------- ------------------KG-----------------------------H------------------------------ -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PLITNLIC-----------------------------------------------------------YK----------- --------LD--KE-LSKLAF----------------------------------------------------------- ----------------------N------------NN----CTVTR----------YVDDITF----------------- -----SF-N----CSHDKIPRDILDGI-----SGESI-----SISKKL-V---------------------NIIN----- ----K--N----GFEI------N-Y------KK----V---RLSSSN-QRQ----------------------------- ------------------QVTGIVT------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42796352|locus|VBIButBac135163_1575|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [butyrateproducing bacterium SSC/2] ------------------------------------------------------REDVANILEIEDKSLR-----YF--- ------LYCV---------------------------KP------DN--------------------------------- -----------------MYKNF-EI--NK--------RNG-------------------G----TRII------------ ------SAPN----KK-LKNIQRKLLQI-L----ENV--------YT-PK------------------------------ ---ICAYGFIN------------------G--K--------------------SIY--DNASIH---LKR---------- ----------------------------------R--Q-IL-NL--------DLKDYF-LQIN----------------- -FGR--VRGM-LLK-----KPYELGE---E------------------AATVI---------AQIVCY------------ -------------------------------------------------------------------------------- ------------------KG-----------------------------K------------------------------ -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PIIANMIC-----------------------------------------------------------AP----------- --------MD--NH-FMKLAK----------------------------------------------------------- ----------------------A------------NN----MKYTR----------YADDLTF----------------- -----S--------TRTEFPTSIVR-I-----ENDTV-----VVGKKI-L---------------------QILK----- ----K--D----GFLL------N-E------EK----I---YLRSKD-KRQ----------------------------- ------------------EVTGLVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|23820590|locus|VBISynSp37135_3215|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Synechococcus sp. PCC 7002] ---------------------------------------------VKSFRSLQTPQNVAVLLDVEYDKLI-----YH--- ------IYSS---------------------------NV------DE--------------------------------- -----------------RYKKF-EV--YK--------KSG-------------------G----IRTI------------ ------TTPV----TS-LKFIQWKLNQV-L----NAV--------YQ-PK------------------------------ ---PAVHGFTL------------------N--K--------------------NIL--TNAQAH---VGK---------- ----------------------------------R--F-VL-NL--------DLEDFF-PSIN----------------- -FGR--VRGL-FMA-----PPYQLPA---G------------------VATVL---------AQICCY------------ -------------------------------------------------------------------------------- ------------------DN-----------------------------Q------------------------------ -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PIVSNMIC-----------------------------------------------------------AK----------- --------MD--TQ-LQRLAK----------------------------------------------------------- ----------------------E------------CR----ATYTR----------YADDITF----------------- -----S-------TTLREFPEDL--AYPVKTEEGTQF-----VLGDHL-L---------------------QIIA----- ----E--N----GFKI------N-N------QK----T---RLQTKG-S------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115416315|locus|VBICylSta108647_1281|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Cylindrospermum stagnale PCC 7417] ---------------------------------------------RERFYSLTKPIDIAHLLGISYRRLV-----YH--- ------IYLV---------------------------EP------EK--------------------------------- -----------------RYKTF-DI--PK--------KSG-------------------G----IRQI------------ ------STPI----TA-LKLIQRKLNQV-L----QAV--------YQ-TK------------------------------ ---PSVHGFVS------------------G--K--------------------NIV--SNAQAH---AKK---------- ----------------------------------R--Y-VL-NL--------DIKDFF-PSVN----------------- -FGR--VRGM-FMA-----KPYNLHP---D------------------VATVL---------AQICCH------------ -------------------------------------------------------------------------------- ------------------NN-----------------------------Q------------------------------ -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PIITNMIC-----------------------------------------------------------AK----------- --------MD--SQ-LQRLAK----------------------------------------------------------- ----------------------D------------CK----ATYTR----------YADDMTF----------------- -----S-------TTLPKFPEEL--AYIVAEEECKKI-----VIGNRL-T---------------------AVIN----- ----E--N----GFEV------N-Q------QK----N---RLQVKG-N------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|35427561|locus|VBIAnaVar43351_4315|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Anabaena variabilis ATCC 29413] ---------------------------------------------SKKFSELKSARDVAQILQVPYNYLI-----YY--- ------IYRT---------------------------SE------NS--------------------------------- -----------------KYQTF-NL--EK--------KSG-------------------G----YRTI------------ ------HSPN----NS-LEILQRKLSQI-L----YSV--------YS-PK------------------------------ ---PCVHGFAA------------------N--R--------------------SIV--TNAKVH---TKK---------- ----------------------------------K--F-VL-NI--------DIKDFF-DVIN----------------- -FGR--VRGL-FIA-----KPYQLNE---E------------------VATIL---------AQICCF------------ -------------------------------------------------------------------------------- ------------------QN-----------------------------K------------------------------ -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PIISNLVC-----------------------------------------------------------AR----------- --------MD--KE-LQKFAK----------------------------------------------------------- ----------------------E------------NG----IFYTR----------YADDITF----------------- -----S-------INREDLPVGLVSSY---SKGFSKV-----ILGDDL-R---------------------SIIE----- ----N--N----GFKI------N-E------AK----V---RLAYRT-Q------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|190416239|locus|VBIVarPar264937_4482|_extraction Retrontype RNAdirected DNA polymerase [Variovorax paradoxus B4] -----------------------------------------------FLLRADVPYFSRHIFGAEYWKIE-----GL--- ------LYP------------------------------------TP--------------------------------- -----------------RYTNF-VI--AK--------ANG-------------------K----NRHI------------ ------AEPS----KK-IKALQYRALAY-L----KGT--------VP-PAK----------------------------- ---PCVHGFVE------------------G--R--------------------SIL--TNAEKH---LER---------- ----------------------------------RPYH-IL-NL--------DLSDFF-PTIT----------------- -FFR--VRGA-LMA-----PPMKFSF---E------------------MATML---------AHLCTH------------ -------------------------------------------------------------------------------- ------------------EG-----------------------------S------------------------------ -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PFLANLIC-----------------------------------------------------------RT----------- --------LD--SQ-LTLLSK----------------------------------------------------------- ----------------------R------------HR----ATYTR----------YADDLTF----------------- -----SFSH----RSAERLPANVV------AFDGGTV-----SIGSEL-R---------------------SIIE----- ----S--N----SFHI------N-E------GK----T---RISTRL-RRM----------------------------- ------------------EITGVVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|42429773|locus|VBIPreMel47739_0018|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Prevotella melaninogenica ATCC 25845] --------------------------------------------------MLNELKLPWYYSDFRGKQLS-----FL--- ------ADTN---------------------------NV------QR--------------------------------- -----------------RCKTF-RL--RK--------KHG-------------------G----YREI------------ ------TAPK----GS-LRGILNALNIL-L----QTY--------DE-PT------------------------------ ---PWAFGFVC------------------G--R--------------------SVV--DNARPH---VGK---------- ----------------------------------R--Y-IL-NL--------DLKDFF-PTIT----------------- -RQQ--VADC-LTA-----EPFGFSS---L------------------AVKLI---------SGLATV------------ -------------------------------------------------------------------------------- ------------------RT-----------------------------KNNKE-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------VLA-----QGFAT------------------S- PTLSNFIC-----------------------------------------------------------RE----------- --------MD--KE-IAGVAA----------------------------------------------------------- ----------------------A------------QG----ITFTR----------YADDLTF----------------- -----S--------------------------SDTDILRPQGELVQQV-K---------------------AIVE----- ----R--Y----GFRL------N-E------EK----T---HLQRRG-RRQ----------------------------- ------------------EVTGLMV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|54406438|locus|VBIPreDen163057_0487|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Prevotella denticola F0289] --------------------------------------------------LLNEVLDAFCFTKSRPAQLR-----FF--- ------ANTY---------------------------NT------QR--------------------------------- -----------------RCRTF-RL--RK--------KHG-------------------G----YREI------------ ------IAPK----GK-LRDILHALNIV-L----QTF--------DD-PT------------------------------ ---PWAYGFVC------------------G--R--------------------SVV--DNARPH---VGK---------- ----------------------------------R--Y-VL-GL--------DLKDFF-PSIT----------------- -RRQ--VADC-LTT-----EPLGFSS---V------------------AADLV---------AGLASV------------ -------------------------------------------------------------------------------- ------------------RT-----------------------------DEGQE-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------VLA-----QGFAT------------------S- PTLSNFVC-----------------------------------------------------------RE----------- --------MD--RE-IADVAA----------------------------------------------------------- ----------------------A------------QS----ITFTR----------YADDLTF----------------- -----S--------------------------SDADILRPQGAFVQQI-K---------------------TIVE----- ----R--H----GFRL------N-E------AK----T---HLQRRG-RRQ----------------------------- ------------------EVTGLMV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|61051028|locus|VBICloCla155345_0188|_extraction retrontype reverse transcriptase [Clostridium clariflavum DSM 19732] ------------------------------------------LPTISDLDSLSDAFAISKRLIFLLTKKT---------- ---------------------------------------------EN--------------------------------- -----------------YYKRF-YI--KK--------RDG-------------------T----SREI------------ ------LSPT----YS-LKLIQRWILKE-I----LEK--------VS-LS------------------------------ ---EQAMAYKK------------------GKNN--------------------GIK--KNAMHH---RYS---------- ----------------------------------L--Y-IL-EM--------DIKDFF-HSIK----------------- -RER--VFYL-F-------KNLGYNN---M------------------VSNIL---------TNLCTY------------ -------------------------------------------------------------------------------- ------------------NG-Y---------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGGVC------------------S- PYISNLIC-----------------------------------------------------------YR----------- --------LD--NR-LSGLCG----------------------------------------------------------- ----------------------K------------RD----ILYTR----------YADDLTF----------------- -----SS-------DNKQTLT---------------------KAINII-I---------------------NIIE----- ----D--E----GFKV------N-K------NK----T---RLLSPG-SHK----------------------------- ------------------KVTGITV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42882548|locus|VBICopCat158046_0287|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Coprococcus catus GD/7] --------------------------------------------TI-----------IASR------------------- ---------------------------------------------NN--------------------------------- -----------------LYAKY-YI--PK--------KNG-------------------R----YRLI------------ ------LQPS----KE-LKVLQRWLLRN-I----FAY--------FP-VS------------------------------ ---EYSSAYSK------------------G--N--------------------SVR--KNAAVH---KEG---------- ----------------------------------R--Y-LL-HT--------DITNFF-PTIS----------------- -RTM--LKQY-FQSNESLTRKLGMAD---E------------------DIELI---------LDICLY------------ -------------------------------------------------------------------------------- ------------------RGEN---------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LV-----VGSVA------------------S- PQIANMLM-----------------------------------------------------------YA----------- --------FD--LE-LKQMLD----------------------------------------------------------- ----------------------DF-----------GS----FRYTR----------YADDIVI----------------- -----SSMS----FIDEQVLK---------------------QTEQLM-I---------------------KY------- ------------GFKM------N-H------EK----T---YYMGKN-GKR----------------------------- ------------------QVTGIVL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|42755620|locus|VBIBacXyl109951_0368|_extraction hypothetical protein [Bacteroides xylanisolvens XB1A] ---------------------------------------LMNEVWSYLCKGVHKRIPLKDVTYFSNYKLA---------- ---------------------------------------------KD--------------------------------- -----------------AYYKF-LI--PK--------KSG-------------------K----TREI------------ ------QAPI----KD-LKRLQICLNFI-L----SSL--------YH-PH------------------------------ ---PSAKGFIL------------------G--Q--------------------NIG--DAAKPH---VRM---------- ----------------------------------P--Y-VF-HL--------DLKDFF-TSIS----------------- -LYR--VKAC-LTLPPF--NLNGDKE---R------------------IAYCI---------ANICCT------------ -------------------------------------------------------------------------------- ------------------N------------------------------DGNRA-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------FLP-----QGAPT------------------S- PILSNIVS-----------------------------------------------------------LR----------- --------LD--RK-LTGLAK----------------------------------------------------------- ----------------------R------------FS----ARYTR----------YADDITF----------------- -----SSYQ----DIANNT-----------------------EFQQEL-A---------------------RIIS----- ----G--Q----NFQI------Q-P------SK----T---RAEGRG-YRQ----------------------------- ------------------TVCGLTI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|179034165|locus|VBIDesSp185686_1262|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Desulfovibrio sp. X2] -----------------------------------------------------SIDELNAAVGTECDRNSLSLIHLALSI GVSSKLLFAI---------------------------SNNS----RR--------------------------------- -----------------YYRSF-EI--RK--------ASG-------------------G----MRTI------------ ------HAPR----IF-LKTIQRWIVDY-F----LFQ--------LP-CH------------------------------ ---PGCHSFQR------------------G--K--------------------SII--SNAAIH---VGK---------- ----------------------------------K--Y-VG-NI--------DIADYF-KSIK----------------- -SEH--LVHK---------LSKFFGI---G------------------LSRAI---------SLICTH------------ -------------------------------------------------------------------------------- ------------------DDF----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPS------------------S- PILSNFYL-----------------------------------------------------------FD----------- --------FD--KE-ISLLCE----------------------------------------------------------- ----------------------Q------------NE----LSYSR----------YADDITI----------------- -----SGKE----KNSIY------------------------NAINYS-K---------------------YLLK----- ----K--H----NLTL------N-Y------QK----T---RVSSRG-GQQ----------------------------- ------------------NVTGVVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|186794270|locus|VBIPseSyr250047_3804|_extraction Reverse transcriptase [Pseudomonas syringae pv. actinidiae ICMP 18883] -------------------------------------------------------------------------------- ------TFLH---------------------------ETVSRDNEEE--------------------------------- -----------------AYKVF-RL--RK--------PNV------------------GHSPDRFRWI------------ ------CAPT----ED-LLRVQRWINAH-I----LSK--------IE-PH------------------------------ ---EASYAYDN------------------R--H--------------------GVL--EAAELH---CEA---------- ----------------------------------R--W-LI-KL--------DLTNFF-ESIL----------------- -EPQ--VYEL-F-------KSLGYQP---L------------------VAFEL---------ARLCT------------- -------------------------------------------------------------------------------- -------------------RLRVSGNPDRRYKNKTPPEGLPYTG-----DSRIG-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAAS------------------S- PRIANLVM-----------------------------------------------------------QS----------- --------LD--EN-LQAYAT----------------------------------------------------------- ----------------------E------------ND----LAYSR----------YADDLVF----------------- -----SSK-----GAFDRSEAT--------------------QHIKSI-N---------------------NCLV----- ----D--E----GFWL------N-K------AK----T---KIITPG-TRK----------------------------- ------------------IVLGLLV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|187047349|locus|VBIPseSyr301829_4700|_extraction Reverse transcriptase [Pseudomonas syringae pv. pisi str. PP1] -------------------------------------------------------------------------------- ------PFLH---------------------------RTISRKYRAE--------------------------------- -----------------AYNVF-KL--KK--------QNA------------------GHSPDRFRWI------------ ------CAPT----ED-LLRAQRWINTH-I----LSK--------IQ-PH------------------------------ ---EASFAYDN------------------R--N--------------------GVL--ETAQLH---CGA---------- ----------------------------------T--W-LI-KL--------DLTNFF-ESIL----------------- -EPQ--VYEL-F-------RSFGYQP---L------------------VAFEL---------SRLCT------------- -------------------------------------------------------------------------------- -------------------RVRASGNPDQRHGNKPLPPGLPYSG-----DTRIG-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAAT------------------S- PRIANLVM-----------------------------------------------------------QS----------- --------LD--ET-LQTYAT----------------------------------------------------------- ----------------------E------------NN----LMYSR----------YADDLVF----------------- -----SSK-----GVFDRLQAT--------------------QHIKAI-N---------------------KCLS----- ----E--E----GFWL------N-T------AK----T---KIVTPG-ARK----------------------------- ------------------IVLGLLI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|186596167|locus|VBIPseSyr309376_2646|_extraction Reverse transcriptase [Pseudomonas syringae CC1416] -------------------------------------------------------------------------------- ------PFLH---------------------------RTIDRTYQTE--------------------------------- -----------------AYTVF-RL--QK--------ASV------------------GHSPDRFRYI------------ ------CAPC----PE-LLRAQRLINKY-I----LSN--------LT-PH------------------------------ ---HASYAYDG------------------R--N--------------------GVL--EAASVH---CQA---------- ----------------------------------T--W-LI-KL--------DLTNFF-ESIL----------------- -EPQ--VFKL-F-------RSIGYQP---L------------------VAFEL---------SRLCT------------- -------------------------------------------------------------------------------- -------------------RVRVRGNPDTKFWSGELSSSFPYK------DSRIG-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAAT------------------S- PKIANLVM-----------------------------------------------------------RS----------- --------LD--EK-LYDLAS----------------------------------------------------------- ----------------------K------------NE----LIYTR----------YADDLIF----------------- -----SSR-----QTFDRALAS--------------------RHIRAV-N---------------------QVLR----- ----E--E----GYWL------N-R------AK----T---KIVPPG-TRK----------------------------- ------------------VVLGLLV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|19898038|locus|VBIPseFlu98510_2127|_extraction Reverse transcriptase [Pseudomonas fluorescens SBW25] -------------------------------------------------------------------------------- ------GFLR---------------------------MVVARNKQVE--------------------------------- -----------------PYRVF-KL--KK--------QVR------------------SFGRERFRFV------------ ------CAPH----PY-LLKAQRWINRE-I----LAK--------IP-GH------------------------------ ---EASYAYSP------------------G--S--------------------SVY--DAASMH---AGC---------- ----------------------------------R--W-LI-KL--------DATNFF-ESIL----------------- -EPK--IYEI-F-------RTIGYQP---L------------------VAFEL---------ARVCT------------- -------------------------------------------------------------------------------- -------------------RTRPSGNPVYLNRADPNKKGLPYN------SVQIG-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLP-----QGAAT------------------S- PHLSNLAA-----------------------------------------------------------RD----------- --------LD--ES-LKNFAL----------------------------------------------------------- ----------------------L------------NQ----LIYTR----------YADDLTF----------------- -----SSI-----EEFVREDVV--------------------KKIHKI-Y---------------------GFMR----- ----A--N----GLWP------N-K------SK----T---QIVPPG-ARK----------------------------- ------------------IVLGLLV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|182722260|locus|VBIEscCol236749_3924|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Escherichia coli TW07509] -------------------------------------------------------------------------------- ------SFLR---------------------------KVINREH-FH--------------------------------- -----------------PYRFF-LI--RK--------RRS------------------ISKRKIF--------------- -------VPE----PQ-LLKVQKFIHDN-V----LKH--------IL-PH------------------------------ ---DASFAYTP------------------Y--S--------------------SIY--DAAHVH---INS---------- ----------------------------------R--W-LI-KI--------DITNFF-ESIS----------------- -EID--AYHV-F-------KYNGYSN---L------------------ISFEL---------ARICTW------------ -------------------------------------------------------------------------------- ------------------PIPRSRNERLHRLQNKYLKEYKFYDR-----KQKLG-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------STP-----QGAPT------------------S- PCLSNLVC-----------------------------------------------------------KN----------- --------LD--AE-ISFLCE----------------------------------------------------------- ----------------------K------------LQ----VRYSR----------YSDDITI----------------- -----SSTN----KSFSRHNAQ--------------------HIIDSI-Y---------------------RILN----- ----K--Y----GFNP------N-K------YK----T---KIIPPG-AKK----------------------------- ------------------IVLGLNV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|32538707|locus|VBIVibSp203755_2696|_extraction putative reverse transcriptase [Vibrio sp. Ex25 (Prj:41601)] -------------------------------------------------------------------------------- ----DLSIQD---------------------------VIKGLSTAPL--------------------------------- -----------------AYKVY-TI--PK--------RTK-------------------G----ERVI------------ ------AQPS----PF-VKEVQRTLISV-F----LTK--------YK-PS------------------------------ ---EVSTAYVE------------------G--K--------------------SII--DNAEAH---KDN---------- ----------------------------------D--W-IL-KL--------DFKNFF-PSLK----------------- -PND--LFTF-LER----EGVVIGEF---D-------------------KKIL---------SSYLFR------------ -------------------------------------------------------------------------------- ----------------------R--------------------------NNRKL-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------ELS-----IGAPS------------------S- PLVSNLIM-----------------------------------------------------------KS----------- --------ID--NK-IEGYCN----------------------------------------------------------- ----------------------D------------NS----IVYTR----------YADDLTF----------------- -----STMD----LDKID------------------------ALKQYI-A---------------------AVLS----- ----E--TKSP-KLSI------N-D------SK----T---KVIGRG-RSR----------------------------- ------------------RVTGIVL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20658100|locus|VBIAciFer6930_0276|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Acidithiobacillus ferrooxidans ATCC 53993] -------------------------------------------------------------------------------- ----PFSEHE---------------------------LVVLIATAPS--------------------------------- -----------------RYKAY-YI--EK--------RGG------------------RG----QREI------------ ------SQPT----KE-IKFLQRLLASK-E----LRE--------LP-IH------------------------------ ---DVAVGYRS------------------G--R--------------------SIL--DHAQPH---ASA---------- ----------------------------------R--Y-LL-KL--------DFTNFF-PSLK----------------- -SKA--LDHR-L-S----RDTAYSTA---E-------------------RWIL---------RNLLCR------------ -------------------------------------------------------------------------------- --------------------RTP--------------------------GTGNY-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------QLS-----IGAPS------------------S- PHISNYLL-----------------------------------------------------------CE----------- --------FD--QL-MSDYCG----------------------------------------------------------- ----------------------I------------RV----VRYTR----------YADDLAF----------------- -----STSI----PSVLN------------------------DIETEV-R---------------------RLIQ----- ----E--LDYL-GLSL------N-E------AK----T---INVSTK-HRR----------------------------- ------------------TLVGLTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20075292|locus|VBIVibCho20143_1988|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Vibrio cholerae MJ1236] -------------------------------------------------------------------------------- ----VIMPQE---------------------------FERLEVRGSH--------------------------------- -----------------AYKVY-SI--PK--------RKA-------------------G----RRTI------------ ------AHPS----SK-LKICQRHLNAI-L----NPL--------LK-VH------------------------------ ---DSSYAYVK------------------G--R--------------------SIK--DNALVH---SHS---------- ----------------------------------A--Y-VL-KM--------DFQNFF-NSIT----------------- -PTI--LRQC-LIQ----NDILLSVN---E-------------------LEKL---------EQLIFW------------ -------------------------------------------------------------------------------- ------------------NPSKK--------------------------RNGKL-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------ILS-----VGSPI------------------S- PLISNAIM-----------------------------------------------------------YP----------- --------FD--KI-INDICT----------------------------------------------------------- ----------------------K------------HG----INYTR----------YADDITF----------------- -----STNI----KNTLN------------------------KLPEIV-E---------------------QLII----- ----Q--TYAG-RIII------N-K------RK----T---VFSSKK-HNR----------------------------- ------------------HVTGITL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|45328477|locus|VBIPseSp136886_1162|_extraction hypothetical protein [Pseudoalteromonas sp. SM9913] -------------------------------------------------------------------------------- --------KE---------------------------LLNFSSTAPR--------------------------------- -----------------RYKKY-TI--AK--------RNS------------------DE----RRLI------------ ------AHPS----KE-VKFLQRLLVTH-L----EDK--------LT-IH------------------------------ ---ASANAYVK------------------Q--K--------------------GIK--SNALAH---KDN---------- ----------------------------------Q--Y-LL-KM--------DFKNFF-LSIT----------------- -PSI--LIEQ-MLA----FGIGLDDR---N-------------------IEFI---------SGILFW------------ -------------------------------------------------------------------------------- --------------------KLR--------------------------RNSPL-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------RLS-----IGAPS------------------S- PFISNVIM-----------------------------------------------------------YN----------- --------FD--RL-IEKECK----------------------------------------------------------- ----------------------V------------MG----ITYTR----------YADDLAF----------------- -----STNI----KDILF------------------------EIPKLV-K---------------------NTLK----- ----K--LYGS-KIRV------N-T------KK----T---VFSSKK-FNR----------------------------- ------------------HITGITL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|36739607|locus|VBISalEnt101322_4228|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S] -------------------------------------------------------------------------------- --------SE---------------------------IISFSLTAPY--------------------------------- -----------------RYKIY-KI--AK--------RNS------------------DK----KRTI------------ ------AHPS----KE-LKFIQREITEY-L----TDK--------LP-VH------------------------------ ---ECAFAYKK------------------G--S--------------------SIK--TNAQVH---LHT---------- ----------------------------------K--Y-LL-KM--------DFENFF-PSIT----------------- -PRL--FFSK-LRL----ANIDLTAD---D-------------------KVLL---------ENILFF------------ -------------------------------------------------------------------------------- --------------------KSK--------------------------RNSNL-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------RLS-----IGAPS------------------S- PLISNFVM-----------------------------------------------------------YF----------- --------WD--IE-VQEICS----------------------------------------------------------- ----------------------K------------IG----VNYTR----------YADDLTF----------------- -----STNN----KDVLF------------------------DIPDML-E---------------------NVLP----- ----K--YSLG-RIRI------N-H------EK----T---VFSSKG-HNR----------------------------- ------------------HVTGITL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|22468281|locus|VBIMarSp124341_2509|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Marinomonas sp. MWYL1] -------------------------------------------------------------------------------- --------LD---------------------------ILVFSLSSPH--------------------------------- -----------------RYKVY-EI--PK--------RNS------------------EK----TRVI------------ ------AHPS----KE-LKFFQRILIEI-L----DEI--------LP-VH------------------------------ ---KASFAYQK------------------G--V--------------------GIK--DNAQQH---TNS---------- ----------------------------------K--Y-LL-KM--------DFKDFF-PSIT----------------- -PAL--FFEV-AEK----HGVALNER---D-------------------KMVL---------KGLLFW------------ -------------------------------------------------------------------------------- --------------------KRK--------------------------GVEGL-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------VLS-----IGAPS------------------S- PLISNFIM-----------------------------------------------------------SF----------- --------FD--EA-ISEECS----------------------------------------------------------- ----------------------K------------RK----IKYTR----------YADDVTF----------------- -----STRV----KNNLF------------------------EIPNLV-S---------------------NLL------ ----K--SNVS-GVYI------N-S------AK----T---IFTSRA-HNR----------------------------- ------------------HVTGVTL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|54694388|locus|VBIAltSp152916_1743|_extraction putative reverse transcriptase [Alteromonas sp. SN2] -------------------------------------------------------------------------------- --------VE---------------------------IVSYINKAPH--------------------------------- -----------------RYKVY-KI--PK--------RNG------------------SG----SRVI------------ ------AQPA----KE-LKVLQKSALNM-P----LLD--------LP-VH------------------------------ ---KSAFAYME------------------G--I--------------------GIK--QNAQKH---SKN---------- ----------------------------------Q--Y-LL-KM--------DFSDFF-PSIV----------------- -SSD--LLSH-VEK----HKGKLEIK---E-------------------KVAL---------KKLFFW------------ -------------------------------------------------------------------------------- --------------------CVK--------------------------GSTEH-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------RLS-----IGAPS------------------S- PFLSNTIL-----------------------------------------------------------FD----------- --------FD--TI-VNDYCT----------------------------------------------------------- ----------------------Q------------EK----ITYTR----------YADDLTF----------------- -----TTNV----SGVLF------------------------ELPTYI-E---------------------GVLK----- ----K--LEYP-TLKI------N-H------SK----T---VFSSKK-NNR----------------------------- ------------------HVTGLVL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|189991747|locus|VBIHaeInf313847_0653|_extraction Retrontype RNAdirected DNA polymerase [Haemophilus influenzae KR494] -------------------------------------------------------------------------------- --------KE---------------------------ILSFINSSPS--------------------------------- -----------------RYKSY-NI--KK--------RHG-------------------G----TREI------------ ------AEPT----RS-LKILQSWALNK-Y----LSK--------YK-IH------------------------------ ---PSAIAYVK------------------N--K--------------------NIK--DFVLPH---SNN---------- ----------------------------------K--Y-LL-KI--------DFKNFF-NSIK----------------- -GID--FLHF-LED----KKSDLSNE---E-------------------RHLL---------TNIFFC------------ -------------------------------------------------------------------------------- ------------------KNKTS--------------------------ESKEL-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------YLS-----IGAPS------------------S- PFISNIIM-----------------------------------------------------------ID----------- --------FD--DQ-ISQLCA----------------------------------------------------------- ----------------------N------------IG----VTYTR----------YADDLAF----------------- -----STNN----PNILD------------------------ELLEEI-E---------------------IICK----- ----N--LNYPKKLEI------N-S------EK----T---VFTSRK-HNR----------------------------- ------------------TLTGLVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|42737677|locus|VBIBacMar168520_0745|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Bacteriovorax marinus SJ] -------------------------------------------------------------------------------- -KFYNIPQEE---------------------------VQDFISSNKT--------------------------------- -----------------KYFRF-EI--PK--------RNG-------------------K----FRTI------------ ------THPR----HE-LKIVQYWLVEK-V----FSR--------MNYVS------------------------------ ---CSSYAYIK------------------E--R--------------------DIL--KNARKH---QRN---------- ----------------------------------S--H-FL-SV--------DFSNFF-ESIT----------------- -FKL--LEPF-LEKFYHSTEVEYSLE---E------------------LKNVV---------KRSCFD------------ -------------------------------------------------------------------------------- ---------------------------------------------------SYG-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------RLP-----VGFVT------------------S- SIISNIVM-----------------------------------------------------------FD----------- --------FD--LR-INVLLM----------------------------------------------------------- ----------------------S------------RKMLGRVVYTR----------YADDVTI----------------- -----STNK----EGAIR------------------------EIYELFCK---------------------GLIS----- ----E--FNS--EITL------N-A------SK----T---KVRHKRCGNV----------------------------- ------------------LVTGLRI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|21794453|locus|VBIDicDad95084_3620|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Dickeya dadantii Ech703] -------------------------------------------------------------------------------- --------HF---------------------------LNSIIKNKCK--------------------------------- -----------------YYRSF-TI--SK--------GKG------------------KK----KRLI------------ ------EAPR----IS-LKLIQAWLAYH-LSRNSADY--------IS--------------------------------- ---ENSYAFIP------------------GV-N--------------------GIY--EAAKRH---CGS---------- ----------------------------------Q--W-VL-SI--------DLKDFF-HYVN----------------- -SNK--IMPA-L-------LDLGYRT---E------------------QAAKI---------IDIVTL------------ -------------------------------------------------------------------------------- ------------------QDR----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPS------------------S- PYISNIAF-----------------------------------------------------------KP----------- --------TD--EL-IERIIT----------------------------------------------------------- ----------------------G------------RG----VSYTR----------YADDLTF----------------- -----SGCD----AEFNID-----------------------GFKGEI-I---------------------NALK----- ----S--H----GWIV------A-L------DK----V---KISKKP-NRL----------------------------- ------------------KVHGFLV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|22912385|locus|VBIPelThe8413_2780|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Pelotomaculum thermopropionicum SI] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------MKIL-KI--PK--------KNG-------------------K----YRTI------------ ------YAPD----AE-EKRALRGIVGI-L----NQK--------CQHVCDP---------------------------- ---AAVHGFMP------------------L--K--------------------SPV--TNALAH---VGR---------- ----------------------------------K--Y-TV-SF--------DLEDFF-DTVT----------------- -PEK--ASKC-LTKEQ---KELVFVD---G------------------AAR----------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------QGLPT------------------S- PAVANLAA-----------------------------------------------------------TD----------- --------MD--RA-ILKWIE----------------------------------------------------------- ----------------------K------------SG--KSVVYTR----------YADDLAF----------------- -----SFDD----PELIP------------------------VIQKKV-P---------------------EIIR----- ----R--S----GFRV------N-T------DK----T---TVQAAVAGRR----------------------------- ------------------IICGVAV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|23585887|locus|VBISheSp85603_0521|_extraction Retron reverse transcriptase [Shewanella sp. MR7] -------------------------------------------------------------------------------- --------QF---------------------------EQFKNLNDLD--------------------------------- -----------------KYKPS-SV--PK--------S-----------------------DGKSRTV------------ ------YNPA----QL-IRLFQRRINTR-I----FHPKHSQGGL-ISWPS------------------------------ --------YLF------------------GSIPNNPQSPE-NSNK--------NYI--TCAGMH---CGA---------- ----------------------------------K--S-IL-KM--------DISDFF-DNIH----------------- -HRE--VINI-FE------GLLKFPN---D------------------VSQTL---------ADICCY------------ -------------------------------------------------------------------------------- ------------------KGN----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------VI-----QGALT------------------S- SYIASAVL-----------------------------------------------------------FD----------- --------VE--PN-VVRRMH----------------------------------------------------------- ----------------------E------------KH----LVYTR----------LVDDITI----------------- -----SSK------IYDYDFS---------------------YAKNIV-I---------------------DMLR----- ----K--K----GLPT------N-E------DK----T---IVINSS-TKE----------------------------- -----------------GLVHGLRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|189627687|locus|VBIVibPar262249_1037|_extraction Retron reverse transcriptase [Vibrio parahaemolyticus 10290] -------------------------------------------------------------------------------- -------KEI---------------------------QSIISLPESQ--------------------------------- -----------------KYFEK-KV--PK--------S-----------------------DGSIRLV------------ ------YCPH----PQ-VRRAQRKINNN-I----FKK------L-VKWPS------------------------------ --------YIF------------------GSIP------NTKISK--EQIERKDYI--SCAGQH---CGA---------- ----------------------------------K--S-LL-KM--------DIKSFF-DNIH----------------- -FDH--VLDM-FV------NFFHYDK---D------------------VSFTL---------AKLCCK------------ -------------------------------------------------------------------------------- ------------------SDY----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------IV-----QGGLT------------------S- SYVASLIL-----------------------------------------------------------WK----------- --------HE--PE-LVKRLQ----------------------------------------------------------- ----------------------R------------KN----LTYTR----------LVDDISI----------------- -----SSH------VSNFDFS---------------------MVQQLV-T---------------------NMLY----- ----E--I----DLPV------N-N------DK----T---KVYHLS-TEP----------------------------- -----------------LTIHGIRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20112680|locus|VBIVibFis127983_1043|_extraction Retron reverse transcriptase [Vibrio fischeri ES114] -------------------------------------------------------------------------------- -------DEL---------------------------NKALSLSSEE--------------------------------- -----------------RYVQG-KV--SK--------S-----------------------DGSYRDV------------ ------YNPH----NL-IRKIQRRINRR-I----FIK------L-VGWPS------------------------------ --------YIF------------------GSIP------N-NIDKNGKVLEAKDYV--NCAAQH---CLA---------- ----------------------------------K--S-LL-KM--------DLKDFF-NNIH----------------- -IDH--VKEM-FE------KFFNYPK---D------------------VSEAL---------SNICCR------------ -------------------------------------------------------------------------------- ------------------GDS----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LV-----QGALT------------------S- SYIASLCL-----------------------------------------------------------WD----------- --------LE--PH-LVKRLK----------------------------------------------------------- ----------------------R------------KQ----LIYTR----------LVDDITV----------------- -----SSK------ISNFQFD---------------------MTIEHI-T---------------------SMLH----- ----E--M----DLPI------N-N------SK----T---KVAYTS-IEP----------------------------- -----------------LTVHGMRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|180089752|locus|VBIEscCol233378_0214|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Escherichia coli C295_10] -------------------------------------------------------------------------------- -------EEL---------------------------KAIAELSLDE--------------------------------- -----------------KYTLK-EI--PK--------I-----------------------DGSKRIV------------ ------YSLH----PK-MRLLQSRINKR-I----FKE------L-VVFPS------------------------------ --------FLF------------------GSVPSKNDVLNSNVKR--------DYV--SCAKAH---CGA---------- ----------------------------------K--T-VL-KV--------DISNFF-DNIH----------------- -RDL--VRSV-FE------EILHIKD---E------------------ALEYL---------VDICTK------------ -------------------------------------------------------------------------------- ------------------DDF----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------VV-----QGALT------------------S- SYIATLCL-----------------------------------------------------------FA----------- --------VE--GD-VVRRAQ----------------------------------------------------------- ----------------------R------------KG----LVYTR----------LVDDITV----------------- -----SSK------ISNYDFS---------------------QMQSHI-E---------------------RMLS----- ----E--H----DLPI------N-K------HK----T---KIFHCS-SEP----------------------------- -----------------IKVHGLRV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|181671864|locus|VBIEscCol269450_2693|_extraction retron reverse transcriptase [Escherichia coli HVH 221 (43136817)] -------------------------------------------------------------------------------- -------DKF---------------------------EKLISDGVEK--------------------------------- -----------------AYYIKPPI--EK--------K-----------------------SGGHRIV------------ ------YAPN----RM-LKSILRKINNK-I----FSQ--------INFPD------------------------------ --------YLY------------------GSIP------DKENPR--------DYI--LCAQQH---CKS---------- ----------------------------------K--I-LV-KM--------DIENFF-PTMK----------------- -SKF--VYQI-FK------ELFRFSD---E------------------VSNIL---------TTLTTY------------ -------------------------------------------------------------------------------- ------------------EGF----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------VP-----QGAPT------------------S- TYLANLYF-----------------------------------------------------------YD----------- --------CE--PS-KVSYLR----------------------------------------------------------- ----------------------N------------LG----FRYTR----------LIDDITI----------------- -----SRLN------KDGDWK---------------------FVESVI-S---------------------NFIK----- ----Q--K----ELTV------N-N------EK--------TKLLSA-KSP-------Q--------------------- ----------------SFKVHGLCI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20514590|locus|VBIPhoLum48522_5509|_extraction retron reverse transcriptase [Photorhabdus luminescens subsp. laumondii TTO1] -------------------------------------------------------------------------------- -------EKF---------------------------ESLLRDGVEN--------------------------------- -----------------AYYIKPPI--KK--------K-----------------------NGGERIV------------ ------YAPN----RM-LKSILRKINNR-I----FNQ--------INFPD------------------------------ --------YLY------------------GSIP------DKENPR--------DYI--LCAHQH---CKA---------- ----------------------------------K--I-LI-KL--------DIENFF-PTMK----------------- -TKF--VFNI-FK------DLFKFSD---E------------------VSNIL---------TKLTTY------------ -------------------------------------------------------------------------------- ------------------DGF----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------VP-----QGAPT------------------S- TYLANLYF-----------------------------------------------------------YD----------- --------CE--PN-KVNYLR----------------------------------------------------------- ----------------------S------------LG----FRYTR----------LIDDITV----------------- -----SRLK------KEGDWK---------------------FVETII-S---------------------EFIT----- ----Q--K----ELSV------N-K------DK--------TQLLSA-NSP-------Q--------------------- ----------------SFKVHGLCI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|191105765|locus|VBIErwAmy195016_0061|_extraction retron reverse transcriptase [Erwinia amylovora ACW56400] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------M-LKSLLRKINQK-I----FTK--------INYPD------------------------------ --------YLF------------------GSIP------DRENPR--------DYI--LCAEQH---CKA---------- ----------------------------------K--V-LI-KI--------DIENFF-PTMR----------------- -EEY--VYSI-FN------KLLKCSE---E------------------VSAIL---------TKLTTF------------ -------------------------------------------------------------------------------- ------------------DGY----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------VP-----QGAPT------------------S- TYLANLYF-----------------------------------------------------------FD----------- --------SE--PK-KVQHLR----------------------------------------------------------- ----------------------N------------NG----FRYTR----------LIDDITI----------------- -----SRAT------KEGSWK---------------------YVESYI-G---------------------EFIT----- ----Q--K----HLSI------N-K------DK--------TISISS-SSP-------Q--------------------- ----------------QFKVHSLSI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|58531404|locus|VBIColFun187779_2668|_extraction hypothetical protein [Collimonas fungivorans Ter331] ---------------------------------------------------------------------------SVAAL AKMLDVTENF---------------------------LTLVASKPDD--------------------------------- -----------------FYSIS-KI--PK--------K-----------------------TDGFRTI------------ ------SDPV----KE-LKIVQRRIVRR-I----FSK--------CSFPS------------------------------ --------YLF------------------GSIR------DEINPR--------DFV--RNAQYH---SQA---------- ----------------------------------R--E-VM-AF--------DVESFF-PSVR----------------- -PQF--VKKV-LK------FLFNLPN---E------------------VAEML---------VSLMTL------------ -------------------------------------------------------------------------------- ------------------QDG----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- SYVANLIF-----------------------------------------------------------YD----------- --------CE--HK-IVKTLN----------------------------------------------------------- ----------------------A------------MG----FTYSR----------LVDDITV----------------- -----SSKT----IIKNEERR---------------------FIYEQI-S---------------------KMLG----- ----E--K----KLKI------S-Q------RKYGVTN---TTVIGK-KT------------------------------ ------------------VVTGLVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|19999333|locus|VBIPseSyr93040_4030|_extraction retron reverse transcriptase [Pseudomonas syringae pv. tomato str. DC3000] -------------------------------------------------------------------------------- --------KL---------------------------LNDLAGRATD--------------------------------- -----------------SYTHF-VI--RT--------K-----------------------GDKERNV------------ ------YEPK----YE-LKKLQKRINSR-L----FEK--------VHYPF------------------------------ --------YLQ------------------GGVR------DEDHPR--------DYI--ENSRIH---AGS---------- ----------------------------------K--S-LI-SL--------DIRNFY-DNIP----------------- -YES--VVSI-FK------YFFNFPD---E------------------VSDLL---------SKIVTR------------ -------------------------------------------------------------------------------- ------------------DGK----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------VP-----QGACT------------------S- SYVANLIF-----------------------------------------------------------HN----------- --------SE--YN-FVSRLR----------------------------------------------------------- ----------------------N------------QG----VTYSR----------LLDDVTL----------------- -----SSSR----LLSQDEVS---------------------GYIKYV-A---------------------GLFS----- ----E--H----KLRI------K-K------SK----T---KIERSD-DLS-------A--------------------- ----------------EYTVTGVWV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|86956996|locus|VBIPseStu243261_4122|_extraction retron reverse transcriptase [Pseudomonas stutzeri CCUG 29243] -------------------------------------------------------------------------------- --------KV---------------------------MLSLAKNSSN--------------------------------- -----------------SYTQF-TV--PS--------K-----------------------G-KDRVV------------ ------YEPK----LN-LKKIQKRINSR-I----FEH--------VQFPP------------------------------ --------YLQ------------------GGIK------DNLSPR--------DYV--ENSKFH---AGS---------- ----------------------------------S--V-LV-SL--------DIRNFY-DNIK----------------- -YDS--VYDV-FL------YFFKFEP---S------------------VCTVL---------TDLVTL------------ -------------------------------------------------------------------------------- ------------------NGK----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------VP-----QGACT------------------S- SYIANLVF-----------------------------------------------------------HN----------- --------HE--YS-LVSYFR----------------------------------------------------------- ----------------------S------------KG----ITYSR----------LLDDVTL----------------- -----SSSK----EIPPQEIE---------------------EAISKV-A---------------------EMFK----- ----R--H----KLRI------H-P------KK----K---KIEVSS-DTR-------S--------------------- ----------------EYKVTGVWV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|177021178|locus|VBIAerSal291263_3734|_extraction retron reverse transcriptase [Aeromonas salmonicida subsp. pectinolytica 34mel] -------------------------------------------------------------------------------- --------KV---------------------------LLDIAKNSSN--------------------------------- -----------------SYTEF-IV--PS--------K-----------------------N-KDRLV------------ ------YEPK----YE-LKKIQKRINAR-L----FEK--------VQYPL------------------------------ --------YLQ------------------GGIK------DNLVKR--------DYV--ENAKLH---VKS---------- ----------------------------------K--H-LI-NL--------DIKAFY-DNIK----------------- -PHH--VFSV-YK------YLFKFPD---D------------------VCDVL---------TQLTTY------------ -------------------------------------------------------------------------------- ------------------KNR----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------VP-----QGACT------------------S- SYIANLIF-----------------------------------------------------------FN----------- --------DE--YT-LVSSFR----------------------------------------------------------- ----------------------Q------------QG----ISYSR----------LLDDVTL----------------- -----SSNK----ELSSEDVT---------------------NAIKKV-A---------------------ALFK----- ----K--Y----DLRL------K-N------SK----T---KIEVKS-NKE-------A--------------------- ----------------EYKVTGLWI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|54540754|locus|VBIAerVer186715_3956|_extraction hypothetical protein [Aeromonas veronii B565] -------------------------------------------------------------------------------- --------DA---------------------------LVDIAKKSSS--------------------------------- -----------------SYTEF-IV--PS--------K-----------------------N-KDRLV------------ ------YEPK----HD-LKRIQKRINTR-I----FEK--------VEYPR------------------------------ --------YLQ------------------GGIK------DALIKR--------DYV--ENASLH---TTS---------- ----------------------------------K--H-LI-NL--------DIKSFY-DNIK----------------- -SCH--VFSV-YK------YFFKFPD---D------------------VCEIL---------TSLTTY------------ -------------------------------------------------------------------------------- ------------------KSK----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------VP-----QGACT------------------S- SYIANLIF-----------------------------------------------------------FN----------- --------SE--YS-LVSSFR----------------------------------------------------------- ----------------------Q------------QG----VIYTR----------LLDDVTL----------------- -----SSNK----ELTEEEIT---------------------KAIKDV-S---------------------ALFR----- ----K--Y----DLRL------K-N------NK----T---KIELKA-NKD-------A--------------------- ----------------DYKVTGLWI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|19957061|locus|VBIPsePut93764_3618|_extraction hypothetical protein [Pseudomonas putida W619] -------------------------------------------------------------------------------- --------AL---------------------------LERLAARADR--------------------------------- -----------------MYRHV-PQ--EK--------KVK------------------PGQPPETRDT------------ ------YDAH----EP-LKRIQRKLVDR-V----LSK--------AIFPT------------------------------ --------YLH------------------GGIR------DCKSPR--------SIH--SNAAVH---AGA---------- ----------------------------------R--F-VI-LQ--------DIRNFY-PSIS----------------- -KSH--VHAM-FR------GLFGFGA---G------------------VAELL---------SSLCTR------------ -------------------------------------------------------------------------------- ------------------SGS----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------TP-----QGAST------------------S- GYIANLVF-----------------------------------------------------------WD----------- --------VE--PK-VVSDLS----------------------------------------------------------- ----------------------G------------KG----FQYSR----------FADDITI----------------- -----SCTR----EPEADELT---------------------GIVSAI-T---------------------GMLA----- ----S--K----GCHQ------K-R------SK----L---HVRIRG-QALKSEGTTFQ--------------------- ----------------PITVTGLSI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|32277000|locus|VBIPecWas23660_1333|_extraction retron reverse transcriptase [Pectobacterium wasabiae WPP163] -------------------------------------------------------------------------------- --------NE---------------------------LSSLARRADM--------------------------------- -----------------MYRLA-SS--------------L------------------PKADGSVRQT------------ ------WDAH----EP-LKKIHRRIRHN-I----LDH--------VTYPS------------------------------ --------YLT------------------GSLK------GC------------DYK--VNASLH---AGA---------- ----------------------------------T--I-VI-NE--------DITGFF-PATS----------------- -ATV--VHGI-WR------SFFCFGQ---D------------------VADCL---------TRLTTR------------ -------------------------------------------------------------------------------- ------------------HGE----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAIT------------------S- SFLANLAF-----------------------------------------------------------WN----------- --------TE--PV-LYNSFK----------------------------------------------------------- ----------------------A------------RG----LIYSR----------YVDDIAV----------------- -----SSGT----FLDNSAKT---------------------EVIASL-Y---------------------GMLF----- ----R--H----GYRP------K-R------NK----H---EIRTSG-ERM----------------------------- ------------------GVTTLSV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|20287556|locus|VBIRalMet4734_1880|_extraction retron reverse transcriptase [Cupriavidus metallidurans CH34] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------------------------------------MNYPA------------------------------ --------YLT------------------GSIK------GC------------DYK--VNASLH---ARA---------- ----------------------------------R--I-VI-NE--------DISGFF-PSTS----------------- -ADR--VFSI-WR------GFFGFSE---D------------------VARCL---------TQLTTR------------ -------------------------------------------------------------------------------- ------------------HGE----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAIT------------------S- SFLANLVF-----------------------------------------------------------WW----------- --------DE--PE-LFGKFA----------------------------------------------------------- ----------------------A------------QG----LVYSR----------YVDDIAV----------------- -----SSKT----FLTNEAKT---------------------NVVRQV-Y---------------------GMLL----- ----K--H----GYKA------K-R------AK----H---EIATSG-SRM----------------------------- ------------------AVTKLAV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|125119977|locus|VBIThaOle281250_0332|_extraction hypothetical protein [Thalassolituus oleivorans MIL1] -------------------------------------------------------------------------------- --------DS---------------------------LKSIGDAAQK--------------------------------- -----------------MYREV-PQ--KK--------K-----------------------DGSLRMT------------ ------YDAH----PS-LKKIQDKISKL-I----LKK--------VVYPD------------------------------ --------YLQ------------------GGLP------KK------------DQV--SNAAKH---SKA---------- ----------------------------------I--I-LI-QD--------DIEDFY-PSLS----------------- -DTT--VRGV-WQ------QFFKFSP---E------------------VAKLL---------TSLTTL------------ -------------------------------------------------------------------------------- ------------------NGK----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------VP-----QGAKT------------------S- SYLANLAF-----------------------------------------------------------WD----------- --------CE--HL-LVQKLL----------------------------------------------------------- ----------------------D------------KD----LTYTR----------FADDIIV----------------- -----ST-R----AIISNSEI---------------------AVVRASIY---------------------NMLA----- ----L--K----SCKP------K-R------SK----S---RVCRKG-TRL----------------------------- ------------------SVTGLNT------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|54728660|locus|VBIMetMet172210_3286|_extraction hypothetical protein [Methylomonas methanica MC09] -------------------------------------------------------------------------------- -------EDL---------------------------IELASNSNEY--------------------------------- -----------------FFIAK-KV--EK--------P-----------------------DKSIRLT------------ ------YDVK----PR-LKQIHEKICCN-L----LKK--------VNYPD------------------------------ --------YIQ------------------GGVR------GK------------SYL--SNCQNH---THK---------- ----------------------------------K--I-VI-KE--------DVSNFF-PSIS----------------- -KKI--IHEV-WA------GFFHFPS---D------------------VSELL---------SELVTF------------ -------------------------------------------------------------------------------- ------------------NGY----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LV-----QGGKA------------------S- GFLCNLVL-----------------------------------------------------------YD----------- --------RE--SK-LVEEFS----------------------------------------------------------- ----------------------K------------KG----FKYTR----------FVDDITI----------------- -----SCLR----NITKDEQT---------------------YIIRKT-Y---------------------GLLK----- ----S--I----EVNP------N-K------RK----H---KIMSNG-VQQ----------------------------- ------------------QLHGVNL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|124741923|locus|VBIBraOli259415_1160|_extraction RNAdirected DNA polymerase (Reverse transcriptase)( EC:2.7.7.49 ) [Bradyrhizobium oligotrophicum S58] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ----PVHGFVK------------------Q--R--------------------GAI--TNAGEH---QRR---------- ----------------------------------P--Y-LL-NI--------DVRNFF-GVIS----------------- -RRR--VRGM-L-------ASMGLPD---E------------------TAEAI---------CSICVT------------ -------------------------------------------------------------------------------- ------------------ANQ----------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------------------LP-----QGAPT------------------S- PILSNLVA-----------------------------------------------------------YR----------- --------LD--RD-LMTFAK----------------------------------------------------------- ----------------------A------------YR----LRYTR----------YADDISF----------------- -----SSYA----PPLALFNAGLPIPGRVKPD----------QLSVAL-A---------------------AAFS----- ----S--N----GFEV------A-A------DK----V---WYAGPK-TRK----------------------------- ------------------EVTGLVVNEFTNVRRTFIRNLRAALYKTEKLGLAAAQSDYQKKYKTQSTLEQILRGRLEWVA QVRGRSFGPYRTLAKRFNQLFPNSPIPISPTYDEVAERSVWVVEFWID-------------------------------- -------------------------- >Gfid|42938822|locus|VBIErwSp41759_1124|_extraction hypothetical protein [Erwinia sp. Ejp617] -------------------------------------------------------------------------------- ---------------------------------------KNFADFHT--------------------------------- -----------------LRTED-EV--RL--------IPL------------------SR-----RKA------------ ------ELYQ-CS-AR-LKLIHTFLSRF-V----FSE--------MPVKK------------------------------ ---DIVFSYRK------------------D--V--------------------NIT--DAVRPH---CKS---------- ----------------------------------E--F-IF-KT--------DISNFF-PSIS----------------- -GDA--LSDK-LSKYCGDLRTVDIEEVR-D------------------NIRRI---------IYLCTL------------ -------------------------------------------------------------------------------- ----------------------------------------------------DN-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------RLP-----IGFSS------------------S- PSVSNFCF-----------------------------------------------------------YD----------- --------YD--NL-IESYCN----------------------------------------------------------- ----------------------D------------KD----YIYSR----------YADDLII----------------- -----SSQN---------------------EIAKD-------RITADL-T---------------------AILS----- ----S--DPLL-KLSI------N-H------KK----T---KIITKK-YER----------------------------- ------------------KILGISI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20519408|locus|VBIProMir120933_2305|_extraction Reverse transcriptase (EC 2.7.7.49) [Proteus mirabilis HI4320] -------------------------------------------------------------------------------- ---------------------------------------ESFEQFYT--------------------------------- -----------------LSLND-HI--KE--------FNF------------------NN-----KKC------------ --------YS-IS-DE-LKTIQLFLSKF-I----FEN--------IEFRK------------------------------ ---DLVYSYRK------------------G--V--------------------NVV--DCISPH---RFN---------- ----------------------------------N--Y-IY-KT--------DIENFF-PSIG----------------- -YDL--IKNK-IIEKVKDISFLDTNDVM-V------------------HLSRI---------LELVTI------------ -------------------------------------------------------------------------------- ----------------------------------------------------DK-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------KLP-----IGFSS------------------S- PAISNFVM-----------------------------------------------------------YD----------- --------ID--CK-IDCFAK----------------------------------------------------------- ----------------------N------------ND----LIYTR----------YADDIIL----------------- -----SGKN---------------------NLDKN-------SINQEI-N---------------------NILN----- ----Y--NQDN-VFHL------N-D------KK----T---KIITKK-FER----------------------------- ------------------NILGISI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|21792235|locus|VBIDicDad95084_2538|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Dickeya dadantii Ech703] -------------------------------------------------------------------------------- ---------------------------------------MDFNDFFN--------------------------------- -----------------LNKME-YL--EK--------KIF------------------RG-----REI------------ ------VAYK-KGFNK-LRLLHRFLNEA-I----FNR--------VDIMD------------------------------ ---DVVFSYRK------------------K--V--------------------NVF--DCVYPH---RAN---------- ----------------------------------P--F-IF-KT--------DIKNFF-PSFN----------------- -RDF--IESK-LESVFKEFIISDINE----------------------YLFKI---------VELVSY------------ -------------------------------------------------------------------------------- ----------------------------------------------------ND-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------FLP-----VGFST------------------S- PSLSNILF-----------------------------------------------------------SD----------- --------AD--KK-LKSFSI----------------------------------------------------------- ----------------------S------------NG----YIYTR----------YSDDIII----------------- -----SSES---------------------EIHKK-------DIFTTV-Q---------------------GIIN----- ----S--ECD--SFVL------N-F------DK----T---KLLKKG-GVR----------------------------- ------------------KIMGVSI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|23554142|locus|VBIShePut135485_2726|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Shewanella putrefaciens CN32] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------NI--KE--------IKI------------------GK-----KFA------------ ------YSHQDNNKPT-HQKLSKIIFEN-F----LSN--------IPLNQ------------------------------ ---SAI-AYVK------------------K--K--------------------SYF--DFIEPH---RNN---------- ----------------------------------Y--F-FL-RI--------DLKDFF-HSIS----------------- -EDL--LKRT-LSDYFSSESLSE--TIKQS------------------NIDAI---------FTFLTV------------ -------------------------------------------------------------------------------- ------------------NLKSDSSNVK---------------------FLDKK-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------ILP-----IGFPL------------------S- PNLANIVF-----------------------------------------------------------RK----------- --------TD--LL-LEKLCD----------------------------------------------------------- ----------------------M------------HG----VTYTR----------YADDMLF----------------- -----SSRG----IMEKNLLFRKNNNYKKPYIHSD-------NFLSEI-K---------------------YLVS----- ----I--D----GFFI------N-H------NK----T-----IKSV-NTL----------------------------- ------------------SLNGYTI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20170334|locus|VBIVibVul40472_1752|_extraction retrontype reverse transcriptase [Vibrio vulnificus YJ016] -------------------------------------------------------------------------------- ----------------------------------------LLKKPIE--------------------------------- -----------------PHGIS-DV--KQ--------GKL------------------GT-----KSI------------ ------FKLT-PAKSQ-Y--LGR-LNNK-F----FCH--------IQVNN------------------------------ ---SAV-AYVE------------------K--K--------------------SYL--DMFEPH---RKN---------- ----------------------------------T--N-FL-RI--------DIKSFF-HSIN----------------- -REI--LAEA-LSPYVTNEIFFESGKVKQS------------------LLDAL---------LNLVSL------------ -------------------------------------------------------------------------------- ------------------RVSTEYNDQS---------------------LHNKD-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------ILP-----IGFKS------------------S- PVISNIVF-----------------------------------------------------------RR----------- --------FD--II-IQELCA----------------------------------------------------------- ----------------------R------------SD----IIYTR----------YADDMLF----------------- -----SS--------------PEHSK----YLHTN-------KFLSEI-S---------------------YTLS----- ----L--S----GFKL------N-Q------AK----T-----IKDK-NMI----------------------------- ------------------SVNGYVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|18443875|locus|VBIEscCol32010_2494|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Escherichia coli UMN026] -------------------------------------------------------------------------------- ----------------------------------------GVIPPIA--------------------------------- -----------------KNQVS-TI--SN-----------------------------KN-----KTF------------ ------YSLA-HSSPH-YSIQTR-IEKF-L----LKN--------IPLSA------------------------------ ---SSF-AFRK------------------E--R--------------------SYL--HYLEPH---TQN---------- ----------------------------------V--K-YC-HL--------DIVSFF-HSID----------------- -VNI--VRDT-FSVYFSDEFLV---KEKQS------------------LLDAF---------MASVTL------------ -------------------------------------------------------------------------------- ------------------TAELDGVE--------------------------KT-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------FIP-----MGFKS------------------S- PSISNIIF-----------------------------------------------------------RK----------- --------ID--IL-IQKFCD----------------------------------------------------------- ----------------------K------------NK----ITYTR----------YADDLLF----------------- -----ST--------------KKENN----ILSST-------FFINEI-S---------------------SILS----- ----I--N----KFKL------N-K------SK----Y-----LYKE-GTI----------------------------- ------------------SLGGYVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|87317163|locus|VBICloSp224589_0973|_extraction hypothetical protein [Clostridium sp. BNL1100] -------------------------------------------------------------------------------- --------ET---------------------------VLKGLGAKSE--------------------------------- -----------------AYFKF-DM--PK--------ANN------------------EK-----RTI------------ ------SALD-KDNLI-HELQSN-LNNN-F----FSL--------QPLPI------------------------------ ---CVK-GFVK------------------G--N--------------------SYL--DYLNAHIYDNSS---------- ----------------------------------T--Y-FV-RL--------DIKDFF-DSIS----------------- -KEV--LIST-LREFVGIE----------D------------------VINVI---------YDICTL------------ -------------------------------------------------------------------------------- ----------------------------------------------------EE-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------KLP-----QGAVT------------------S- PLLSNLVL-----------------------------------------------------------RR----------- --------ID--QR-ITLYCQ----------------------------------------------------------- ----------------------S------------LG----VTYTR----------YADDLLF----------------- -----SSNE----ID---------------FRLKK-------WFVKKI-K---------------------YILC----- ----S--I----DLKI------N-Y------KK----V-----KYGK-GLI----------------------------- ------------------SINGYVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42840763|locus|VBICloCf158569_1526|_extraction hypothetical protein [Clostridium cf. saccharolyticum K10] -------------------------------------------------------------------------------- --------LF---------------------------IENSLAEKEN--------------------------------- -----------------SYIVF-DI--RD---------RG------------------KE-----RTI------------ ------CQVD-KTSGL-YQLQNN-LNRN-L----LAK--------IPLSK------------------------------ ---AAA-GFVR------------------G--L--------------------SYQ--DYLRPH---CGK---------- ----------------------------------K--F-HM-RL--------DIHHFF-DHVT----------------- -EEQ--VIKS-LEEFVQDE----------K------------------IRENI---------GEITTF------------ -------------------------------------------------------------------------------- ----------------------------------------------------NG-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------KLP-----QGAVT------------------S- PAVSNIVF-----------------------------------------------------------RR----------- --------ID--QR-ILKYCQ----------------------------------------------------------- ----------------------SVRKVREGKKFYQED----LIYTR----------YADDMMF----------------- -----SSDY----LD---------------FRRDL-------FVYRMI-K---------------------HILK----- ----E--N----GFSL------N-E------KK----T-----YMAE-GEI----------------------------- ------------------SLSGFVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|61057433|locus|VBICloCla155345_3324|_extraction RNAdirected DNA polymerase (Reverse transcriptase) [Clostridium clariflavum DSM 19732] ------------------------------------------------------------------GHIGKRDLRMLREY IDRRKYSIVL---------------------------DNIKNAVPFP--------------------------------- -----------------YPKKY-IL--NK--------IGV------------------DK----KRIV------------ ------YTYD---------EAEKYVLKI-IAFLLHEK--------DKIFA------------------------------ ---DNLFSFRH------------------D-----------------------QGV--RKAIASLVNKPDI--------- ----------------------------------SEYY-SY-KL--------DIHDYF-NSVN----------------- -VER--ILPR-LQ------KVLIKEE---A------------------TYNLI---------KEIL-------------- -------------------------------------------------------------------------------- -------------------------------------LNPFVIDEEGEIVEERK-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GIM-----AGVPI------------------S- SFLANLYL-----------------------------------------------------------MD----------- --------MD--WY-FQN-------------------------------------------------------------- -----------------------------------KR----ISYAR----------YSDDIIV----------------- -----FAKS----RAELE------------------------GHREYI-E---------------------SYLF----- ----R--E----GLTI------N-P------KK----I---YYSEPY-GEW----------------------------- ------------------NFLGIKY------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|32326467|locus|VBISalEnt136302_3200|_extraction RNAdirected DNA polymerase [Salmonella enterica subsp. enterica serovar Choleraesuis str. SCB67] -------------------------------------------------------------------------------- -------------------------------------KQADKLMLDS--------------------------------- -----------------AYKEF---------------VND-------AG----------------RTI------------ ------QEPI----AQ-LKAVHRKIGVL-LSRI-ELP------------------------------------------- ---PYLHSGRK------------------K--H--------------------STL--TNVESH---K-LA--------- ----------------------------------T--E-LL-KL--------DIHKFF-PSTR----------------- -AAK--VYKA-FV------EKFEMSP---D------------------VAYIM---------TNLSTF------------ -------------------------------------------------------------------------------- ------------------A----------------------------------G-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------KVP-----TGSPI------------------S- MAMAFWAN-----------------------------------------------------------KD----------- --------MF--DE-LSNLAA----------------------------------------------------------- ----------------------S------------NS----LVFTA----------YVDDVAF----------------- -----SGS-----KIP-K------------------------GFAAQA-K---------------------KCIR----- ----S--H----GLTS------K-D------KK----E---RFYPSS-EGK----------------------------- ------------------LLTGIVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|182730028|locus|VBIEscCol247449_2916|_extraction hypothetical protein [Escherichia coli Tx1686] -------------------------------------------------------------------------------- ---------------------------------------------DD--------------------------------- -----------------LY-SL---------------RKD-------EGNYS--VFEQLSKKGKARKI------------ ------QKPL----EK-LELVHTRIASL-LSRI-ALP------------------------------------------- ---EYLHSGKK------------------K--C--------------------SNV--TNAKAH---L-NN--------- ----------------------------------E--K-MM-TT--------DIKAFF-SSTT----------------- -RGM--IFSF-FF------SVMKMSS---D------------------VADVL---------SHICTC------------ -------------------------------------------------------------------------------- ------------------H----------------------------------D-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------RLP-----TGSRI------------------S- MPLAYFAN-----------------------------------------------------------SR----------- --------MF--GE-IYQLCQ----------------------------------------------------------- ----------------------K------------LR----VNMTV----------YVDDLTF----------------- -----SGS-----NVN-R------------------------LFCAVI-R---------------------KIVN----- ----K--H----GHVI------H-P------TK----T---KLYARD-KPK----------------------------- ------------------LVTGVIV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|20708600|locus|VBIAciBau103538_0032|_extraction hypothetical protein [Acinetobacter baumannii ACICU] -------------------------------------------------------------------------------- ---------E---------------------------LNHIVRSKNQ--------------------------------- -----------------MYQYF-NE--NK--------TDD-------SGNII-----------KVRPI------------ ------QNPH----DR-LKQIHSRIGKF-LGNL-KAP------------------------------------------- ---DYLHSKRS------------------K-----------------------SAI--SNAKAH---VGLK--------- ----------------------------------G--H-TL-NI--------DITDFY-PSTS----------------- -KAK--VQSF-FG------YTLQYPT---D------------------IAKYI---------SEVCTV------------ -------------------------------------------------------------------------------- ------------------N----------------------------------N-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------CLP-----TGSPL------------------S- SALAFWAN-----------------------------------------------------------KS----------- --------MF--DE-IYRMAK----------------------------------------------------------- ----------------------S------------RS----ITMTV----------YVDDISF----------------- -----TGK-----AVN-Q------------------------NFLNKI-I---------------------QIIE----- ----K--Y----QHNI------K-Q------EK----I---KFFPEN-SIK----------------------------- ------------------FVTGVAI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|43009263|locus|VBIFaePra154692_0451|_extraction hypothetical protein [Faecalibacterium prausnitzii L26] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------MLKYV-LQ--IK--------NDDM----LKQSNITSKIHPYIDKTKKPRLI------------ ------EAPQ----AE-LKIVQRRIKTL-LGKI-QTP------------------------------------------- ---DNVFSGIK------------------G--K--------------------SYP--ENARMH---IGGR--------- ----------------------------------K--R-NLYKT--------DLTAFF-PSIS----------------- -RED--VYQF-FN------NELCCSP---D------------------VSEVL---------TNLTTV------------ -------------------------------------------------------------------------------- ------------------SLERFPKEELTEVYDFLGQKG----------VHCYN-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLI-----SGAPT------------------S- QILSYLVN-----------------------------------------------------------HK----------- --------MF--DE-LHTLSA----------------------------------------------------------- ----------------------K------------ND----MVMTV----------YVDDMVF----------------- -----SSEH----KIS-S------------------------HFRNSV-K---------------------SIIK----- ----K--Y----RYKL------S-H------NK----V---KGYSKG-YPK----------------------------- ------------------LVTGVVI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|17969030|locus|VBIFraTul58433_0831|_extraction Reverse transcriptase [Francisella tularensis subsp. tularensis WY963418] -------------------------------------------------------------------------------- ---------A---------------------------NIKKCLANPK--------------------------------- -----------------RYRNF-EK--PV--------AN------------------------KVRSF------------ ------EPPV----GL-NKKIHTRIFQL-LTRIDDIP------------------------------------------- ---EYLHSGVK------------------G--R--------------------SYV--TNAKQH---Q-LS--------- ----------------------------------Q--Y-FL-KM--------DIKDFY-PSTG----------------- -KDK--VFLF-FY------EYLQCSP---D------------------VANML---------ALLLTN------------ -------------------------------------------------------------------------------- ------------------K-------------------N----------LTNGR-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------HLV-----QGSCV------------------S- QILSFYCN-----------------------------------------------------------KE----------- --------MF--DE-VYKYSK----------------------------------------------------------- ----------------------E------------RN----ITFTL----------YVDDLSF----------------- -----SSS------------------------------------------------------------------------ -------------------------------------Q---NFNAKD-NNK----------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|43032562|locus|VBIGorPam38406_0536|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Gordonibacter pamelaeae 7101b] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------MLWL-LSAV-EKP------------------------------------------- ---DWVMSATP------------------G--K--------------------CHK--DNAVFH---R-SN--------- ----------------------------------A--Y-MV-TM--------DIESFY-DKCE----------------- -RER--VYQF-FK------RKLRQPG---D------------------VAKAL---------TDLSTY------------ -------------------------------------------------------------------------------- ------------------R----------------------------------Q-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------SIP-----TGAPT------------------S- QIVAYFAY-----------------------------------------------------------ES----------- --------MF--AE-INQVAK----------------------------------------------------------- ----------------------S------------HG----CIFTL----------YVDDMTF----------------- -----SSGE----PFSPD------------------------KLASEV-D---------------------AVLR----- ----K--Y----GHKS------K-M------SK----T---RYYPKG--------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|42745249|locus|VBIBacFra167533_1197|_extraction Reverse transcriptase (EC 2.7.7.49) [Bacteroides fragilis 638R] -------------------------------------------------------------------------------- -------------------------------------LDYLLTHIED--------------------------------- -----------------YYYSF-ER--VK--------FNK----------FTDKPKKNSSGEIATRQI------------ ------NSSK----GK-LKEVQTRLYDF-MSKQVEIP------------------------------------------- ---QYVYGGVL------------------R--K--------------------NNV--RNARLH---Q-GN--------- ----------------------------------K--Y-IF-TT--------DLKSFF-PSIS----------------- -HKQ--VFQM-FL------R-EGCTP---A------------------IARIL---------TKLTTH------------ -------------------------------------------------------------------------------- ------------------K----------------------------------Y-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------QVP-----QGIPT------------------S- TLIANLVF-----------------------------------------------------------KP----------- --------IG--ME-IDQLAK----------------------------------------------------------- ----------------------E------------HH----IKFSM----------FVDDITL----------------- -----SSK------VDFK------------------------NLVPQF-L---------------------AIIK----- ----K--F----GFRI------S-H------KK----T---HYQTKN---P----------------------------- ------------------IITGVIC------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|21040698|locus|VBIBacFra29119_2092|_extraction Reverse transcriptase (EC 2.7.7.49) [Bacteroides fragilis NCTC 9343] -------------------------------------------------------------------------------- -------------------------------------LDSIIENIDK--------------------------------- -----------------YYSTW-EK--PK--------LNK----------DTNEPLYNSDGTIKKRTI------------ ------NSTN----KD-LKVIQKRLYNY-LLSKTTLP------------------------------------------- ---NYFFGGIP------------------K--K--------------------DNI--LNAKYH---Q-GN--------- ----------------------------------K--Y-VF-TT--------DLKSFF-PSIN----------------- -HKM--VFYM-FL------K-LGCTP---E------------------IARTL---------TKLTTH------------ -------------------------------------------------------------------------------- ------------------N----------------------------------Y-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------QVP-----QGVPT------------------S- TLIANLVF-----------------------------------------------------------KP----------- --------VG--DR-IQALAK----------------------------------------------------------- ----------------------E------------NN----IKFSI----------FVDDITM----------------- -----SSS------IDFH------------------------KKIPEI-L---------------------SIIT----- ----T--S----GYKI------S-H------SK----T---FYKTKN---P----------------------------- ------------------IVTGVIC------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|67759973|locus|VBIPanAna225404_2867|_extraction putative reverse transcriptase [Pantoea ananatis LMG 5342] -------------------------------------------------------------------------------- -------------------------------------------PASD--------------------------------- -----------------WLHKF-E-------------------------------------IKENRWV------------ ------FIPN----ED-TLTLGRKIHKY-IKSKWKFP------------------------------------------- ---LYMFHLRD------------------G-----------------------GHV--AAANYH---I-EN--------- ----------------------------------T--Y-FC-LI--------DISDFF-GATS----------------- -QSR--ITRE-LN------KFI--PY---D------------------RAREI---------AKLSTV------------ -------------------------------------------------------------------------------- ------------------K----------------NLNS----------NGLKK-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------VLP-----YGYPQ------------------S- PILASFCF-----------------------------------------------------------RQ----------- --------SYCGKV-IHTLSK----------------------------------------------------------- ----------------------S------------GN----ISISV----------YMDDILL----------------- -----SSDD----IEQLV------------------------YAFNSI-K---------------------TALK----- ----K--S----GYTV------N-E------AK----T---Q-SPST-SVN----------------------------- ------------------VFNLVL-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|180865020|locus|VBIEscCol264023_2783|_extraction Putative reverse transcriptase [Escherichia coli HVH 139 (43192644)] -------------------------------------------------------------------------------- -------------------------------------------EVQR--------------------------------- -----------------WEDKF-E-------------------------------------IKPGVWV------------ ------YVPS----VE-ARKVGGKILQA-VRNKWIPP------------------------------------------- ---LYFYHLRT------------------G-----------------------GHL--KAARLH---L-KS--------- ----------------------------------D--F-FA-VV--------DIKQFF-QSTS----------------- -RSR--ITRD-LK------SYF--TY---S------------------QAREI---------STFSTV------------ -------------------------------------------------------------------------------- ------------------R----------------NLSH----------SPHKH-------------------------- -------------------------------------------------------------------------------- -----------------------------------------------VLP-----FGFVQ------------------S- PILATLCL-----------------------------------------------------------DK----------- --------SYFGSL-LRRLNK----------------------------------------------------------- ----------------------H------------HD----LKLSV----------FMDDVII----------------- -----SSNN----LAQLQ------------------------AAYDEA-L---------------------VAMR----- ----K--S----GYQA------N-M------SK----T---Q-APSS-KIS----------------------------- ------------------VFNLTL-------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Actinobacteriafid|43034598|locus|VBIGorPam38406_1610|_extraction hypothetical protein [Gordonibacter pamelaeae 7101b] -------------------------------------------------------------------------------- --------YIKKPHTIKSYVHFDFRTSFAAKANLVMDPGFVKSYAFY--------------------------------- -----------------PLIQR-DLRRMK--------P------------D-----GHGKFFNDPRPI------------ ------KYAA----HL-DRCIYQYYSGL-LNER-YNDVATEIG--IG--------------------------------- ---DCAIAYRA------------------H------LKGQSNCDFALRAFSKMRDL--EA--CF---------------- ---------------------------------------AY-AG--------DFEDFF-ETLD----------------- -HAY--LKKQ-VRRLFP-G--GAIPDDYYH------------------VLKNA---------TRHSVW-------DIEKL LDHYGL---------------PYTKSGVKRL-------------NNRGRVLSSDEFKNM--VGASVERPW---------- -RENG-------------EK------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGLPI------------------S- GTLANIYM-----------------------------------------------------------LE----------- --------FD--AA-VKAVAE----------------------------------------------------------- ------------------------------------RH--DGLYMR----------YSDDFIFV------------VPT- -----EKGFEEGRDE---------------------------F----------------------------SRLA----- ----GTMP----SLKI------H-P------RK----THSFRM--VD---------------------------GGVYL- -------LDSEDEPKCAIDYLGFSFD------------------------------------------------------ -------------------------------------------------------------------------------- -------------------------- >gi|89893427|ref|YP_516914.1_extraction_extraction -------------------------------------------------------------------------------- ---------------------------LRDIWSYINDPQNIKTHGFY--------------------------------- -----------------PFIHY-QLIYKK--------F------------N-----KNNGISPKTREL------------ ------CYSA----HI-DRYIFQYYGYK-LNQL-YNERVVNDG--IN--------------------------------- ---DSVIAYRD------------------N------LK-KSNIHFAKKAIDFIRDT--KS--CF---------------- ---------------------------------------IV-VG--------DFTNFF-DNLD----------------- -HKY--LKKM-LKEL---I--GGLPEDYYA------------------VFKNI---------TRYSTW-------DMEHI LKLNGL---------------PNNKKGIQEL-------------NQKDLALTLSQFKKY--KSKYQK------------- -PNTK-------------GY------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGSAI------------------S- AVLSNIYM-----------------------------------------------------------LY----------- --------FD--NM-INDYVK----------------------------------------------------------- ------------------------------------KH--NGLYMR----------YSDDFIII------------LPE- -----NNKSL--------------------------------FVEQYGFL---------------------VKTI----- ----NSVD----RLNL------Q-P------DK----TQVFQY--DN---------------------------NQILSC NEIVSPGVINGKD---ILDYLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|120400339|gb|ABM21395.1_extraction_extraction -------------------------------------------------------------------------------- ---------------------------LAKCKKYITNSKNIERHGFY--------------------------------- -----------------PFIKY-DLEYHK--------Y------------N-----RAEGRKTKKRTI------------ ------CYAS----HI-DSCIFQYYSFL-INKK-YNARLKQDG--IY--------------------------------- ---EVPIAYRT------------------D------LH-TDTIAEFRKMHQFMIQH--PS--SY---------------- ---------------------------------------VM-IG--------DFTSFF-DKID----------------- -HQY--LKER-LCDLLQ-V--DKLNSDYYA------------------IFKRI---------TKYDYW-------DLTDL YRLNNL---------------PKKRKDKIKI-------------NSKVRILSQSDYKKY--RSHIQH------------- -QDNK--------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGSSI------------------S- ACLANVYM-----------------------------------------------------------LE----------- --------ID--RL-INEFVV----------------------------------------------------------- ------------------------------------AR--GGIYRK----------YSDDFIII------------LPM- -----DTSSTDDIEK---------------------------IILKFGFK---------------------DKGI----- -------------LEL------Q-P------EK----TQVYKL--QD---------------------------HSVVNI GHLFTPSLNEKNK---TINFIGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42796364|locus|VBIButBac135163_1581|_extraction hypothetical protein [butyrateproducing bacterium SSC/2] -------------------------------------------------------------------------------- ------------------------------------NIKWVSKHGFY--------------------------------- -----------------PFIHF-QMNCSK--------Y------------T-NDLKGHKFLKEKVRDI------------ ------YYAA----HI-DRFIYEYYGNR-LNNQ-YNNYVKSKG--IG--------------------------------- ---KVSTAYRN------------------C------MPGKCNIDFAKEVFEYIVKC--KS--AY---------------- ---------------------------------------IF-VG--------DFSKFF-DKLD----------------- -HKY--LKEK-IKCVIG-Q--ESLDPADYA------------------IYKNI---------TRFTYI--EAADIEFE-- -----------------------KDKLQRDM-------------RQLDKYFQTQEFQQF--KKKYLH------------- -KNVK-------------DY------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------QIP-----QGSSI------------------S- AVYANVYM-----------------------------------------------------------ID----------- --------FD--KK-INDYVT----------------------------------------------------------- ------------------------------------SH--KGLYRR----------YCDDIIIV------------IPM- -----TKKEVSNGRT---------------------------NKIS-KFI---------------------YNVR----- ----DDIP----NLEL------N-E------DK----TEHFFY--GN---------------------------GKIRK- -------LKGQSN---LVNYLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|42185853|locus|VBIBacCer111781_5866|_extraction hypothetical protein [Bacillus cereus biovar anthracis str. CI] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------------F-EIKFQK--------Y------------S----RKEKQAKEKVRKI------------ ------FYAS----HI-DSYIYKYYGDE-LNNH-YNCIADELG--IS--------------------------------- ---DIATAYRN------------------N------LSGKSNIDFSKEVIDFIKSQ--KA--AY---------------- ---------------------------------------IF-VA--------DFTNFF-DTLD----------------- -HKY--LKNK-IRQVLK-E--DTLPDDYYN------------------VFKNI---------TRFSYFFKDAIEMDLE-- -----------------------TKYTESEI-------------KGSYKYFTEEEFRDF--KHKNIY------------- -RNTK-------------GY------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGAGI------------------S- SVCSNIYL-----------------------------------------------------------LD----------- --------FD--KE-IQNYIN----------------------------------------------------------- ------------------------------------EQ--NGIYRR----------YCDDLIIV------------IPI- ------EGEFKDYDY---------------------------TIPQ-TIV---------------------ENIK----- ----MKIP----NLKI------Q-P------EK----TGNYFY--TN---------------------------DKIID- -------LEFKNT---KLDYLGFSF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|46908547|ref|YP_014936.1_extraction_extraction -------------------------------------------------------------------------------- ---------------------------IENVESYVTDHSKIGNHSFL--------------------------------- -----------------PLIRY-VSSFEKRIEEKNPEF------------D------NRPIKTKDRVI------------ ------MYAG----HM-DNFIYKYYAEV-LNKDFYNKFCMEKG--ID--------------------------------- ---DCVSAYRN------------------K------V-GKSNIDFAAEIINQMVNY--KE--AY---------------- ---------------------------------------IL-VG--------DFTNYF-DKIN----------------- -HEL--LKKH-LAEVLN-Q--PRLSKDWFN------------------VFRSI---------TKYGYYEKSFLNEEYG-- -----------------------SDESIKRS-------------NKKSYFENISKFREF--QKNNKT--L---------- -CNKN-------------KF------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGSAI------------------S- AVFANIYA-----------------------------------------------------------SE----------- --------FD--LK-LKEIAD----------------------------------------------------------- ------------------------------------EF--SGIYRR----------YSDDFILV------------IPK- -----SDIVN---EQ---------------------------KIRRIETD---------------------TRRV----- ----ASEY----KIEL------H-K------DK----TGLYLY--EN---------------------------DKIFDI -------ISNEVS---HLDYLGFVF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|149116005|ref|ZP_01842739.1_extraction_extraction -------------------------------------------------------------------------------- ---------------------------QEKATLLVTSPSRVAKHPFF--------------------------------- -----------------PLI--------S--------Y-VVLSKKISKDQS-TG---TLNVVTKDRPI------------ ------AYSA----HK-DSHIYAYYSQI-LSEK-YEKELIKNN--LQ--------------------------------- ---DSILAFRE------------------L--------GKSNIDFAYEAFCRIKSL--GE--CS---------------- ---------------------------------------AV-AL--------DFSKFF-DTLD----------------- -HDL--LKKS-WANLLG-K--QKLPTDHFN------------------VYKSL---------TKYSKV-------DKATL YKALS------ISIN------NP-KNGRFRV------------------C-NASEFRSM--V-----RDK-GLIE----- -TNKS-------------RY------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGSPI------------------S- ALLSNIYM-----------------------------------------------------------LE----------- --------FD--KE-MKRFVD----------------------------------------------------------- ------------------------------------IY--NGYYYR----------YCDDMLFI------------VPT- -----ELR---NTVA---------------------------GFAA-------------------------KHVK----- -----A-L----KVSI------N-P------KK----TELRSF------------------------------------- KKVG--DTLISEQ---MLQYLGFMF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|148871195|gb|EDL70073.1_extraction_extraction -------------------------------------------------------------------------------- ---------------------------LKKATSIVTNPELVSRHAFY--------------------------------- -----------------PLI--------N--------Y-EIEIQKVSKDKL-TKR--VIKRPIKLRPI------------ ------AYPA----HL-DSQIYSYYSYI-ISKE-YENLLSESN--LS--------------------------------- ---DSILAFRK------------------L--------GKSNIDFAFDAFCKIKDF--GE--CS---------------- ---------------------------------------VV-AL--------DITGFF-DNLN----------------- -HEK--LKKM-WSKTIG-E--DRLPQDHFN------------------VYKSL---------TKYSKV-------DRNTL YKLLG------ISLN------YR-NHTKVKL------------------C-EPKEFRES--V-----RKS-KLIK----- -VNKD-------------CF------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGTPI------------------S- AMLSNLYM-----------------------------------------------------------LD----------- --------FD--KN-IKAFVD----------------------------------------------------------- ------------------------------------GF--SGKYFR----------YCDDILII------------SPQ- -----DKK---DAVI---------------------------QHVK-------------------------KEIE----- -----L-I----KLSI------N-E------AK----TEVRDF------------------------------------- VLVD--GNLYTPH---HLQYLGFMF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|109898056|ref|YP_661311.1_extraction_extraction -------------------------------------------------------------------------------- ---------------------------KRKAISIATNPEAVAKHAFY--------------------------------- -----------------PLI--------N--------F-EVTKIKVRSDDN-GK----LIKHKKPRPI------------ ------AYAA----HS-DAAIYSYYAQL-LSKE-YEVKVAELN--LD--------------------------------- ---DNVLAFRS------------------K--------GKSNIHFANDAFEAINQV--GP--CT---------------- ---------------------------------------AI-GY--------DVSKFF-DTLD----------------- -HGV--LKSQ-WQTLLG-V--KTLPDDHYK------------------VFKSL---------TKFTQV-------DKCKL FGSLG------LSIH------NP-KVQNKRL------------------C-TAEQFRKY--V-----REN-NLIS----- -KNRP-------------NK------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGTPI------------------S- ALLSNIYM-----------------------------------------------------------LD----------- --------FD--QR-VKEALE----------------------------------------------------------- ------------------------------------GQ--GGVYFR----------YCDDMLFI------------IPD- -----AAKQFVSSID---------------------------QFVT-------------------------DAIE----- -----S-L----KIEI------N-H------GK----TEKRQF------------------------------------- VLKG--SELFSNK---PLQYLGFLF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|146281094|ref|YP_001171247.1_extraction_extraction -------------------------------------------------------------------------------- ---------------------------QAAALEIVSSPKTVAKHAFY--------------------------------- -----------------PFI--------R--------Y-VAQTQKVFFDKS-IGK--VVKKDPKQRPI------------ ------SYAA----HV-DSHIYSYYCEL-LNRA-YEDHLAKVQ--WS--------------------------------- ---SAILAFRA------------------L--------GKSNIDFARDAFLDIATR--DS--CC---------------- ---------------------------------------VI-AI--------DIKGFF-DNLD----------------- -HVH--LKNA-WQALLG-S--SQLPDDHYA------------------VYRSL---------TKFSFV-------YRDQV YEALG------LSKS------NP-KQGRKRI------------------C-EPHEFRAK--V-----REG-GLIE----- -TNKD-------------KK------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGSPI------------------S- AMLSNVYM-----------------------------------------------------------MG----------- --------FD--EQ-IHAHVE----------------------------------------------------------- ------------------------------------SC--GGAYYR----------YCDDVLLI------------VPL- -----EKE---AEAK---------------------------ALVD-------------------------LRVN----- -----E-I----GLEI------Q-A------AK----TETCKF------------------------------------T RSAK--G-LRSDR---PLQYLGFIF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|38163126|locus|VBIDesAlk70802_1028|_extraction conserved domain protein [Desulfurivibrio alkaliphilus AHT2] -------------------------------------------------------------------------------- ------------------------------------------RHSFY--------------------------------- -----------------PFI--------A--------Y-EIKSKKVRKGED--GK--SLAIKEKKRPV------------ ------SYAS----HV-DSHIYTYYTYM-LNEA-YEKFIKDIG--VE--------------------------------- ---ENVMAFRK------------------L--------GKSNIDFANDAFNKIIER--QF--CA---------------- ---------------------------------------AI-AF--------DIEGFF-DNLN----------------- -HSI--LKES-WSRVLG-L--KKLPDDHYN------------------VFKSL---------TKFSKV-------YKDPL YEAFA------ISKN------NP-KKKNKRV------------------C-EPRDFRKV--V-----RKN-GLVE----- -VNNT-------------QK------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGSPI------------------S- ALLSNIYM-----------------------------------------------------------VE----------- --------FD--QI-VSSVMG----------------------------------------------------------- ------------------------------------EI--NGCYLR----------YCDDILCI------------VPL- -----ARK---NEII---------------------------DRIN-------------------------KEIK----- -----K-L----NLNI------N-T------DK----TKTSEY------------------------------------- RVVK--GQLTCDK---PLQYLGFVF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|161894677|locus|VBIManHae297097_0336|_extraction conserved domain protein [Mannheimia haemolytica M42548] -------------------------------------------------------------------------------- ---------------------------------------------FY--------------------------------- -----------------PFL--------S--------Y-TLSVAKIKKDKN-EGK---LVRLIKNRGI------------ ------SYSS----HK-DSHIFSYYAYL-LNEK-YENLLKNKYYILD--------------------------------- ---DSILAFRK------------------L--------NKSNIDFARIAFDSIRSA--QN--CT---------------- ---------------------------------------VL-GL--------DISKFF-DHID----------------- -HKI--LKDM-WCKVLG-V--PSLPEDHHS------------------VFKAI---------TKFVKV-------DRDTV FEFFN------ISLH------NPRKNGKNRI------------------C-SPREFREYRKL-----KKP-DLFNDNPAF FINKG-------------QK------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGSPI------------------S- ALLSNIYM-----------------------------------------------------------LE----------- --------FD--VL-VKEKIM----------------------------------------------------------- ------------------------------------EC--KGSYFR----------YCDDILCI------------IPN- -----EYE---EFIL---------------------------DYIT-------------------------GEIK----- -----SKL----KLEI------N-K------DK----TEVVKF------------------------------------- QYCNRTKKIINEK---KLQYLGFIL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|120586953|ref|YP_961298.1_extraction_extraction -------------------------------------------------------------------------------- ---------------------------FEKASRIVKNPRKVASWNFF--------------------------------- -----------------PFL--------Q--------T-TVKTSKITRNDE-GE----IVPKNKSRPI------------ ------SYAA----HT-DSHIYSYYATL-LQPI-YEKFIEKHG--LG--------------------------------- ---TNITGFRK------------------LD-------GECNIDFAHRAFNAIRSM--TP--CI---------------- ---------------------------------------AL-SF--------DVKSFF-DEID----------------- -HSI--LKQA-WCTILE-K--TLLPEDHFA------------------IFKSL---------TTYSYV-------DRDDA FNAFG------ITKS------SK-KNGIRRI------------------C-NPLEFRSI--L-----RPA-GLIK----- -RNKN-------------SY------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGSPI------------------S- GLLSNIYL-----------------------------------------------------------FE----------- --------FD--KA-ISYFAS----------------------------------------------------------- ------------------------------------DT--KSHYYR----------YCDDIIII------------CNE- -----EHE---ELFK---------------------------NLVS-------------------------DELK----- -----K-L----NLRT------N-E-------K----NVIRKF------------------------------------F MGCDGPE---CDK---PIQYLGFVF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115663330|locus|VBIChaMin231992_2030|_extraction hypothetical protein [Chamaesiphon minutus PCC 6605] -------------------------------------------------------------------------------- ----------------------------DYAKSKVETPENIINHSFW--------------------------------- -----------------PFLRK-IQCTPK--------Y-KRELKKVSS---------------KDRPI------------ ------MYAS----HI-DSHIYSWYTHL-LNKR-YEERIKETH--LN--------------------------------- ---SSVLAYRA------------------L--------GKSNIDFAKDVFDEVEKR--EH--CI---------------- ---------------------------------------VL-TF--------DISKFF-DCID----------------- -HKK--LKLS-WCRLLD-LNNSKLPKDHYK------------------VYKSI---------TKYSYI-------NMSDV CQELN------IKDF------RE-LQSRRRI------------------C-SSQEFRSK--L-----KSS---IK----- -INTT-------------EH------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGSPI------------------S- ALLSNIFL-----------------------------------------------------------WE----------- --------FD--IL-MSNHAL----------------------------------------------------------- ------------------------------------ET--GGIYRR----------YCDDILWI------------CHP- -----EQS---DKTM---------------------------RKVH-------------------------QEIQ----- ----NSGS----NLTI------N-E------DK----SEKSEF------------------------------------I RENGLLYYHPETQ---PLQYLGFIF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|91216797|ref|ZP_01253761.1_extraction_extraction -------------------------------------------------------------------------------- ---------------------------ISKLESYIMDEDKIAKHSFL--------------------------------- -----------------PFLHR-SIFQRK--------F-RPR-KDNLRNKK-SKKRRRGKLKPKAREI------------ ------YFAS----HF-DAQIYSYYSYL-LSKK-YNALLETKT--FD--------------------------------- ---KAVVAYRK------------------IEMEGKKGKHKCNVDFAHETFQFIKDN--QDKELT---------------- ---------------------------------------VI-VA--------DVTKFF-DSLD----------------- -HKI--LKQK-WSQVWDGS--TTLPKDHFR------------------VYKSL---------INMRYV-------NESLL FRNYKDQIWV-KTRE------ENDPKSFDRLQKPIKQKRFLKDNNAIAYC-EKTEF-----L-----KNSLDLIT----- -KSKA-------------TK------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGTAL------------------S- ATLANIYM-----------------------------------------------------------LD----------- --------FD--QE-VQDFID----------------------------------------------------------- ------------------------------------SDSVKGFYQR----------YSDDLIIV------------VPR- -----DQQ---EEAI---------------------------RHLR-------------------------SLVD----- -----DKV----NLEI------H-P------DK----TKVYHFQNED---------------------------EKFIGF EVDEHTGERK-NK---TLEYLGFDY------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|146300088|ref|YP_001194679.1_extraction_extraction -------------------------------------------------------------------------------- ---------------------------YKWIKRYVQDENCIKKHSFL--------------------------------- -----------------PLVHK-CIVQRK--------Y-RADLSTNTRN-P-SGKRRRIIGTPKIRNI------------ ------YYSS----HL-DSLIFSYYNYL-LSEN-YKELMKLKN--FN--------------------------------- ---NSIVAYRK------------------IPLFEGSEKNKCNIDFAKDTFEYIEKN--KEKKLS---------------- ---------------------------------------VI-VA--------DVTSFF-DNLN----------------- -HKI--LKKQ-WSRLLN-E--KTLPDCHFN------------------VFKAL---------TNLRYV-------ESDQL FNSYFGTMIVEKGIP------NSDKKEYKRIK--INSNKYFKEKNAVAYC-SKSDF-----L-----KNNLNLII----- -SANS-------------TK------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGSPI------------------S- ATLANVYM-----------------------------------------------------------MD----------- --------FD--QE-VYDKIV----------------------------------------------------------- ------------------------------------SN--KGFYQR----------YSDDLIII------------CEQ- -----EFE---DDII---------------------------KFVR-------------------------DRIQ----- -----NLV----KLEI------S-E------SK----TKVYRFEELN---------------------------GKFLGF EIDEKTKEPNFNK---TLEYLGFEY------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|54172681|locus|VBIRunSli161982_3444|_extraction hypothetical protein [Runella slithyformis DSM 19594] -------------------------------------------------------------------------------- ------------------------------AKEFVSSEKKIKTYSFY--------------------------------- -----------------PLIHR-VIIQRR--------Y-KKLTDGKRSHFD-STKKESTA---KERQI------------ ------YYAN----HL-DTLIYAYYTKVKLEPK-YEGKIKEIDG-LS--------------------------------- ---NCISAYRT------------------IKVHPNTKSGKSNIHFANDVFNCIKQY--DE--CA---------------- ---------------------------------------VL-TF--------DIEKFF-DSLN----------------- -HQH--LKKA-WCNLLN-S--NKLPDDHYN------------------IYKSL---------TNFSFV-------EESEL LNELGL-----LNIG------N--RFEIKRI---------IKNKNIKSYCKNNKEFRLK--VCGRENKKCKSLVKPHPFE YKKNQIDYLELKSYKKRHLK------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGTAI------------------S- AFLANLYL-----------------------------------------------------------LE----------- --------FD--TK-VFNEVT----------------------------------------------------------- ------------------------------------NF--EGIYRR----------YSDDIVIV------------CKL- -----KDE---THLK---------------------------NFVI-------------------------NAIE----- -----D-Y----KLII------N-K------DK----TEVSYF------------------------------------- -KRNLNKELVLDKDSMPLRYLGFEF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|110679506|ref|YP_682513.1_extraction_extraction -------------------------------------------------------------------------------- ---------------------------ENRRERTIDFGSEIKPHRFW--------------------------------- -----------------PLLGF-TDLTRR--------YVRVKDADGNWLRD-DKGKFVRKEKAKPRPI------------ ------RFAG----HE-DAAYLQGYAAH-LNGF-YEYALSADG--SS--------------------------------- ---GSVLAYRK------------------G--------GGTNIHHAKSLFDEIVKR--QN--CS---------------- ---------------------------------------VF-AM--------DISGFF-DCLD----------------- -HKL--LRDE-IAGLLG-V--SRLQSHHGR------------------VWANI---------TKYAWV-------ETDDL DKLLG-----------------RKRNGHGRV------------------C-SPQDFKAH--VQG---RKS-GLIR----- -RHDR-------------DY------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGTPV------------------S- GLYANIYL-----------------------------------------------------------RS----------- --------FD--RE-MIAMCA----------------------------------------------------------- ------------------------------------KY--GGSYRR----------YSDDIAVV------------LPL- -----GAK--VRHVV---------------------------AIVE-------------------------KMLS----- -----D-F----GLAM------S-V------EK----TETADF------------------------------------- ----ANGQLISEK---PIQYLGFTF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|35505344|locus|VBIRhoCap134200_2618|_extraction hypothetical protein [Rhodobacter capsulatus SB 1003] -------------------------------------------------------------------------------- ---------------------FDAPLSLREIRRLVTSPERVAANSFY--------------------------------- -----------------PFFLY-EESWQP--------YRSADAAKPDK---------------KTRPI------------ ------RYGA----RR-DAYIFAFYRRK-LSRL-YEARLRTLG--IE--------------------------------- ---DCPIAYRQ------------------V-SKSGLGGGKCNIDFAKDAFDEIDRL--GD--CV---------------- ---------------------------------------AV-AL--------DIKGYF-ENLD----------------- -HRR--IKQI-WCDLLG-V--AELPPDHYA------------------VFKNI---------TKYHFV-------DQRTT YRRLGYFGLRERNGEMIDGFLRPYRDMPKQL------------------C-SNADFRAK--ICGGDPAYP-SIIK----- -KNDK-------------PH------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GVP-----QGAPI------------------S- DLIANFYL-----------------------------------------------------------LE----------- --------FD--VV-MAAYAR----------------------------------------------------------- ------------------------------------AR--GGRYMR----------YSDDILLI------------LPG- -----GAS-EASAAV---------------------------AFAT-------------------------AEML----- ----NHGP----ELRI------K-D------SK----TCVAQFERVA---------------------------GAL--- ----------------RFQHL----------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|84687336|ref|ZP_01015215.1_extraction -------------------------------------------------------------------------------- ------------------------------------DPDFVLKHSFL--------------------------------- -----------------PLLHY-TKSEKR--------Y-----------KK-CPKTGKRTITSKDRPI------------ ------KYAS----HR-DACILSFYASE-MNKL-LDAHYNAAG--LS--------------------------------- ---DSVLAYRA------------------L--------GRGNYDFSAEVLAFAKTH--AP--VT---------------- ---------------------------------------IL-AF--------DVSSFF-DNLD----------------- -HTL--LKRR-LKAVLG-V--TSLPEHWMR------------------VFRAI---------TAFHYV-------DMEEL KANATFSS-------------RLKEKSRDRI------------------A-SVEELKAN------------GIFHPNPEL -AKGH-------------RR------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGTPI------------------S- AAASNLYM-----------------------------------------------------------ID----------- --------FD--AA-AHAFCD----------------------------------------------------------- ------------------------------------SI--GALYRR----------YSDDILVI------------CAP- -----AHA---AAAE---------------------------AKIM-------------------------DLIK----- ----AE------KLDI------S-P------HK----TERTEF------------------------------------- ----TGTGVVAGK---AAQYLGFSL------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|146342070|ref|YP_001207118.1_extraction_extraction -------------------------------------------------------------------------------- ----------------------------EQYAIKVADEGFVERHAWL--------------------------------- -----------------PLIRY-QKRVKR--------Y-----------K---PKLGKTVF--KQRPI------------ ------MYAS----HR-DSCILSKYAWD-LSKR-LDAYYERKG--LD--------------------------------- ---RNVIAYRR------------------L--------GKSNYHFSAEAFQFAKSR--PG--CV---------------- ---------------------------------------VL-CF--------DISGFF-DNLD----------------- -HRI--LKRR-LKFILE-V--EELGRDWFA------------------VFRHV---------TRFSTV-------DKSAL AAHPRFSS-------------QLSGDSREPI------------------A-TIAEVKLE------------GI----PIL -VNDE-------------RF------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGTPI------------------S- SALSNLYM-----------------------------------------------------------LE----------- --------LD--ER-MVECCR----------------------------------------------------------- ------------------------------------RC--GALYQR----------YSDDILIV------------CNI- -----GDE---VQLR---------------------------SAFL-------------------------TELK----- ----KH------QLEI------N-E------DK----TERVVF------------------------------------- -------GSVGAR---EFQYLGFNI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|54419991|locus|VBIFluTaf147081_2830|_extraction hypothetical protein [Fluviicola taffensis DSM 16823] -------------------------------------------------------------------------------- ------------------------------IEAKLKSKEFVANYAFY--------------------------------- -----------------PLLHT-IIDERK--------FKKIPNETSKRAHSFVDENGVIKKNVKKRPL------------ ------HYAN----HF-DALIMAYYAEK-LQEK-YDEQLKKNLE-LD--------------------------------- ---KSVTAYRK------------------IPVDDTPDKNKGNIHFAKEVFDEIKMRVCDSGDTI---------------- ---------------------------------------VM-AF--------DIKSFF-STLN----------------- -HGF--LYKK-WAELIN-E--KDLPADHLN------------------VFKAA---------TKFSFI-------YKNDL KRLGKG---------------PQRKYDEKRL------AEIRNKNGFRAYFESPKDFRTS--VKK-------GEIHIFQNN FKNEA-----------GEQV------------------------------------------------------------ -------------------------------------------------------------------------------- -----------------------------------------------GIP-----QGLPI------------------S- ALLANLYL-----------------------------------------------------------LD----------- --------FD--KTIIQKLVD----------------------------------------------------------- ------------------------------------QK--RCYYRR----------YSDDIIVI------------CSP- -----EQT---ELVN---------------------------QVVT-------------------------EEMK----- ----RQ------EVII------S-T------EK----TEVFRFQNTN---------------------------TGISVF KKSG--EEWIKNQ---PLNYLGFEF------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|71734230|ref|YP_276337.1_extraction_extraction -------------------------------------------------------------------------------- --------FVDAIQHRDLFHNKPALYETLRTLLSTDNGRY------L--------------------------------- -----------------GCERK-IYDIPK-------------------------------KGLGIRYS------------ ------LETD----FY-DRFIYQAIC-SYLMP-----------YFDPLLS------------------------------ ---HRVLGHRY------------------NKNRTSEKYIFKNRIDLWKTFEGVTKTALKNNQ------------------ ------------------------------------SLLVT-----------DLLNYF-ENIS----------------- -IAS--IKNAFENLLQKVDATGPEKSLIRN------------------AIQTL---------CELLAR------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------WSYNDLH-----------GLP-----Q-NRD------------------AS SFVANIVL-----------------------------------------------------------NS----------- --------VD--QTMV-NLGH----------------------------------------------------------- -----------------------------------------D-YYR----------YVDDIRI-------------ICA- -----SPRA----AK---------------------------KVLTELIS---------------------QLRT----- ----V-------GMNI------N-S------GK----T---IILTSSNSEAEIAEHF----------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|121606813|ref|YP_984142.1_extraction_extraction -------------------------------------------------------------------------------- --------FFDAIQYKDLLSTKKDLKSTLSHLLLEGNGSY------T--------------------------------- -----------------GDNRI-VYDIPK-------------------------------KGLGIRYA------------ ------LETD----FY-DRFIYQAVC-SYLIP-----------FFDPLLS------------------------------ ---HRVLGHRY------------------NKNRLEEKYLFKNRIELWKTFEGVTYTAFRDKK------------------ ------------------------------------ALLAT-----------DLINYF-ENIT----------------- -TEK--IKEAFESKLPNIHASGKEKLKIRN------------------AITTL---------CDLLVK------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------WSYSEKH-----------GLP-----Q-NRD------------------SS SFIANMVL-----------------------------------------------------------ND----------- --------ID--HEMK-RRGY----------------------------------------------------------- -----------------------------------------D-YYR----------YVDDIKI-------------ICD- -----SPRH----AR---------------------------KILSELIK---------------------ELRK----- ----V-------GMNI------N-S------SK----T---KVLTAEEKPDVLAEFF----------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|85708303|ref|ZP_01039369.1_extraction_extraction -------------------------------------------------------------------------------- --------YYDCLQYDDLFKDPSEAKRIIISLLQEWNGEY------R--------------------------------- -----------------GTRSV-VRNIPK-------------------------------QGYGERYG------------ ------LETD----FF-DRFVYQAIC-SFLIP-----------FYDPLLG------------------------------ ---HRVLSYRY------------------EPTPIKAKYLFKNKIDRWFTFEGVTLTFRKSGL------------------ ------------------------------------YLLIT-----------DLSNFF-ENVS----------------- -REQ--IIKALEQAVPNLLATGPQKLHVRN------------------AIATL---------DRLLGQ------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------WTYSGDH-----------GLP-----Q-NRD------------------AS AFLSNILL-----------------------------------------------------------SN----------- --------VD--RKMA-EKGY----------------------------------------------------------- -----------------------------------------D-YYR----------YVDDIRI-------------ITD- -----SETH----AR---------------------------RGLQDLIR---------------------ELRT----- ----V-------GLNI------N-A------KK----T---EILAPDVSDEKVAKYF----------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|118739127|ref|ZP_01587199.1_extraction_extraction -------------------------------------------------------------------------------- --------YFDCLQYDDIFKNPDEAKRIVLSLLQEWNGVY------L--------------------------------- -----------------GTRSV-VRNIPK-------------------------------KGYGERYG------------ ------LETD----FF-DRFVYQAIC-TFLIP-----------YYDSLLS------------------------------ ---HRVLSYRH------------------DSAPQNSKYLFKNKIDRWFTFEGITLTFARSNQ------------------ ------------------------------------HLLVT-----------DLSNFF-ENIS----------------- -REQ--IIAALEKAIPEIVATGPEKLQIRN------------------AIRTL---------DRLLEQ------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------WTFSRDH-----------GLP-----Q-NRD------------------AS SFLSNILL-----------------------------------------------------------SS----------- --------VD--REMA-KKGY----------------------------------------------------------- -----------------------------------------D-YYR----------YVDDIRV-------------LAD- -----TEIH----AR---------------------------RALQDIIR---------------------ELRK----- ----V-------GLNI------N-A------SK----T---EILPPNASLEKLVAHF----------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|89890656|ref|ZP_01202165.1_extraction_extraction -------------------------------------------------------------------------------- ------VDILDTFDYYDFNYNIDERALLLRTVLL--NGNY------Q--------------------------------- -----------------PSQPL-IYRIEK-------------------------------KFGICRHL------------ ------VIPH----PL-DALVLQVIT-ENISQQILNNQPSKNSYYS---------------------------------- ----RDKHNLR------------------KPHEIDE-YGYHWR-RLWKKMQKQIYQFKEEKE------------------ ------------------------------------LIIVT-----------DLSNYY-DSIY----------------- -IPE--LRKVISGFID----------KKES------------------VLDIL---------FKIIER------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------ISWLPDYLPYTGR-----GLP-----TTNLE------------------GV RLLAHSFV-----------------------------------------------------------FE----------- --------ID--EVLK-SKSN----------------------------------------------------------- -----------------------------------------ESFTR----------WMDDIII-------------GVN- -----SRTE----AV---------------------------NVLSSTSD---------------------MLKS----- ----R-------GLAL------N-L------KK----T---NIYSSKEAE----FHFQIEEN------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|114567587|ref|YP_754741.1_extraction_extraction -------------------------------------------------------------------------------- ------QDILDLHDYYYFHRNRKDRIKIIKTEVL--EGIY------K--------------------------------- -----------------PKSPY-IVRMEK-------------------------------THGICRHI------------ ------EIPS----AE-DAVILQTIV-ECLAPIIEKAQPSDRAFFS---------------------------------- ----RSHKRFK------------------TEADIDESFPYDWK-ELWPQFQNRIYEFTTIFN------------------ ------------------------------------YVVVT-----------DIANYF-DNIS----------------- -FSQ--LRNVLSGYGQ----------IDEG------------------LLDFL---------FFMLES------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------WIWRPDYLPLSGL-----GLP-----QVNFD------------------AP RLLAHSFL-----------------------------------------------------------FE----------- --------ID--KYLT-EKTN----------------------------------------------------------- -----------------------------------------NNFVR----------WMDDIDF-------------GTN- -----SIED----AK---------------------------EILRGLDE---------------------MLLT----- ----R-------GLRL------N-I------GK----T---KILSSTEAR----KFFLPDEN------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42787653|locus|VBIButBac106850_0529|_extraction FIG00623195: hypothetical protein [butyrateproducing bacterium SS3/4] -------------------------------------------------------------------------------- -----GSDWKPQVQRYEMAYLLDLSK--MQRELKEHTYEF------Q--------------------------------- -----------------PCSSF-PLNERG-------------------------------K---TRFI------------ ------TGEQ----IR-DRIAKHSLCDEVLTPAIKDHL-----IYDNGAS------------------------------ ---QKGKG-----------------------------IDFTRR-----RLEAHLHKFFRENQSN---------------- ----------------------------------DGYILLM-----------DFSKYY-DNIR----------------- -HDK--LMELFEKYVD-----------DDT------------------ALWFL---------EKIVDN------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------EKVDVSYMNDEEYESAMDDVFNSLE-----HEKVD------------------KN LLTGKKFL-----------------------------------------------------------RKHLNIGDQVAQD AGIAYPIPID--NYIKIVKSV----------------------------------------------------------- -----------------------------------------KFYGR----------YMDDSYV-------------IHK- -----DKEF----LK---------------------------GLLIEIVE---------------------IAHG----- ----L-------GITV------N-L------RK----T---RICKLSEMWRFLQIQY----------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|78063598|ref|YP_373506.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------SSSE-----------------------------HFLRPL------------ ------AHVG----IR-EQTIATAAMLC-LA---------------DCVESAQGD-T--SLDALDAQKAGVFSYGNRLFC SWTD--QGAR-------------------ARFSWGNSNVYSRYFQDYQSFVERPLLIAQSAVLSGQDALTL--------- ---------------------------------------FVIKL--------DLSAFY-DNIN----------------- -IEG--LVEK-L--------------TELYWRYSETIAPTA-KTSSARFWATL---------AKSLSI------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------GWQVEDAKWAPYL-KG--QKLPS-----GLP-----QGLVS------------------S- GFFANAYL-----------------------------------------------------------VD----------- --------FD--EA-VGESIG----------------------------------------------------------- -----------------------------RSFNRRGVKFRLHDYCR----------YVDDVRLV------------VSCD KQVPSEEELGLALTE---------------------------WVQARLDS---------------------KAND----- ----R--------LVV------N-E------QK----TEVQPFASLGGESGTAARM------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|113866906|ref|YP_725395.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------PGEA-----------------------------PVLRPL------------ ------AHLP----VR-EQTVASALMLC-LA---------------DCIETLQGP-TIGHADFAEAQRMGVYSYGNRLLC SWTELLPGTKH------------------ARFRWGNSDIYTRYFTDYQRFIDRPKYFAEQARDKGG---TV--------- ---------------------------------------LLVKL--------DLSAFY-DNID----------------- -VGR--LIES-L--------------RRHYAKFCETF-PDY-PKHDEDFLAVA---------REALTL------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------RWDPQDSTYAPLL-RD--RALPR-----GLA-----QGLAA------------------S- GFFANAYL-----------------------------------------------------------ID----------- --------FD--KE-IGRQLN----------------------------------------------------------- -----------------------------TAIHFEGDEFTLVDYCR----------YVDDLRIV------------VSVD ARLPHTHIPGV-ITQ---------------------------WVQRILDV---------------------TVNE----- ----R--N----VLKL------N-A------RK----TEHEEFAAVSAESGDVAAM------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|90580000|ref|ZP_01235808.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------EKKE-----------------------------HRLRPL------------ ------ADIS----IR-EQTFATAVTMC-IA---------------DSLETKQRDCSMNNSDYSEHIKNKVYNYGNRLLC DWES-----DN------------------ARFRWGGSEFYRKFSTDYKNFLQRPISIGREIHQKVSEIDDV--------- ---------------------------------------FIVSL--------DLTNFY-GCIK----------------- -KDL--LIRK-L--------------KEIAS--EHHD--IE-YGETDEFWELI---------KNILDW------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------EWSEESLSLLKKL-NITTSDI-T-----GLP-----QGLAS------------------A- GALANAYL-----------------------------------------------------------VK----------- --------FD--EK-LGTKLN----------------------------------------------------------- -----------------------------TKINDS--TIMLHDYCR----------YVDDIRLV------------ISGE SLSSKD------IKR---------------------------SVRFFIQG---------------------ILNETLE-- ----E--Q----YLEV------N-D------DK----TNILPLSELDNASSLANRV------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|115183372|locus|VBISerMar265383_1286|_extraction hypothetical protein [Serratia marcescens FGI94] -------------------------------------------------------------------------------R THNWYADLLALDKCAFDISDEVTKWSNEVKNATFHKSDIELILAPKG--------------------------------- -----------------ANWFFKKGRWITNKED-----------------------------RKLRPL------------ ------ANIS----IK-EQSFATAVMMC-IA---------------DSLETRQKDCSLSSLGYAEHVKNKVVSYGNRLIC DWNN-----DK------------------ARFRWGGNEYYRKFSTDYRNFLQRPIYVGRETVNKVSGIDDV--------- ---------------------------------------YIVSL--------DLKNFF-SSIK----------------- -IDL--LIKK-L--------------KEISS--KHYD--CS-VSDNNDFWSLA---------TQILNW------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------TWPKDTLSILGSL-DL--KKI-V-----GLP-----QGLAS------------------A- GALANAYL-----------------------------------------------------------ID----------- --------FD--ES-IISKFR----------------------------------------------------------- --------------------------------DGS--HIILHDYCR----------YVDDIRLV------------ISGE ALNNKE------IKE---------------------------FVHGLVQC---------------------VLDETLKQD KLDGE--P----YLEI------N-D------KK----TSILILSDIDNGSGLTNRI-----------------NEIQHEI GTSSIPERNGLDNNIPALQQLLLTEQDNFFDDANDSFPGFNNDKSIKVESLRRFSAHRLETSLTKKSKLISPEERKQFDN ESELIAKKLFKAWLKDPSIMVIFRKAIAINPNLDAYKTILDDIFKRIKSNRDKCDKYIMIYLL----------------- -------------------------- >gi|163748973|ref|ZP_02156224.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------RNKK-----------------------------SPLRPL------------ ------AHLT----IR-DQTWASAAMLC-LA---------------DAVETVQGNCSNKNQSFEMTRASKVYSYGNRLVC DWKN----KDK------------------AWFRWGNSETYRKFFTDYQNFLKRPLELGRLVSNQALGSENV--------- ---------------------------------------FIVNL--------DLSKFY-NTID----------------- -VDV--LIER-L--------------QDISSGFGHEI--C------DDFWVAF---------KRITNW------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------QWSQEDIDLAKEL-KL--GDIET-----GLP-----QGLVS------------------A- GFYANAYL-----------------------------------------------------------AD----------- --------FD--QK-IGDKIG----------------------------------------------------------- -----------------------------GVLSKTG-NIVLHDYCR----------YVDDLRLV------------ISTD DITPQK------IAD---------------------------EINKVIGK---------------------LLDK----- -------------LAL------N-T------EK----TKVTHLSDLDNSGSMANRI------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|53718205|ref|YP_107191.1_extraction_extraction -------------------------------------------------------------------------------- -----------------------------SDAG----------------------------------------------- --------------------------------------------------------------QKLRPL------------ ------AHVA----IR-DQTLATAVMMC-LA---------------EAIESHQGDTT--EGDLTKLRKLGVVSYGNRLQC RWVEKGDGTKR------------------AHFSWGNSRTYRQYFEDYRAFLSRPRRVCAELSPQTATRKEL--------- ---------------------------------------FVVSL--------DIKSFF-DCVD----------------- -SKA--LVHE-L--------------YRLQTTYQKHEGLSENDSTDVPFWELA---------SLILSW------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------RWRESDHQSAPLI-RGQFEHLAV-----GLP-----QGLVA------------------S- GFFANAYL-----------------------------------------------------------VG----------- --------FD--TE-M-WRVC----------------------------------------------------------- -----------------------------EEQQQLPEDIRILDYCR----------YVDDLRIV------------VEAP LAMGAAGSER--TLQ---------------------------RVQQFVSE---------------------MLAN--HCN VSLT-----------L------S-L------QK----SSITPYRSISAQSNVSSLM------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|87181716|locus|VBIPseSyn11986_0007|_extraction hypothetical protein [Pseudomonas synxantha BG33R] -------------------------------------------------------------------------------R RHNSYADVLELDSSTINLECQLKAWGKAVGEPGFQTEDLKLVPAPKN--------------------------------- -----------------GKWDFHEHVGFPQAAFLDIPINEIDSLFHQWCPSSEQVGGAQPELQKLRPL------------ ------AHLS----IR-DQSLATAVMMC-LA---------------EAIETAQGDPE--ETDVLKARERGLVSYGNRLHC RWSETGGTQAR------------------AHFSWGNSQTYRKYFQDYRAFLSRPRQICAEFSPRLSKGREL--------- ---------------------------------------FVVSL--------DLKSFY-DRVD----------------- -IKA--LLAE-L--------------KHLEAEYQQRFHLQAEVGADDDFWNKS---------ARIMAW------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------RWRAADKIHAGLFNEAEGQELEL-----GLP-----QGLVA------------------S- GFLANAYL-----------------------------------------------------------VR----------- --------FD--RS-V-DQAA----------------------------------------------------------- -----------------------------KEAHDLGSEIKVLDYCR----------YVDDMRIV------------VEAP SRCAA-------LLE---------------------------QVQKFVLD---------------------LLDA--HCK LLGTT--K----LIGL------S-E------GK----CSVTPYRSMSAQNNVSTLMGVLGAELSGTFDLDSLAQAAGGLE GLLWITEQLEGADKPPASQLKLATIAAPATDVRDDTVKRFVATRLADLMRQRLAMTDISAPDNTGESLGERVTSGMALAH EFESTARKLIKCWAENPALVLLLRCGLDLFPHPRLLDPVLEALDAKLFNVPIKWLRPKQDREIRAAEYVAADL------- -------------------------- >gi|37913078|gb|AAR05370.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------------------------------------RPL------------ ------AHVS----IK-DQTIFTALMML-LA---------------NHVETEQGDTS---TSFYDVHAKGLINYGNRLHC KYSD-----NN------------------AIYSWGNSNTYSKFFTDYQRFLERPIHFGREAKRVKTSKEEI--------- ---------------------------------------YEIHL--------DFSKFY-DSVN----------------- -RGI--LTKK-I--------------SALVEKITG-------AETDECISHVL---------SKFRNW------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------KWTEKSKELYTGVCKN--KHIETLKDNRGIP-----QGLVA------------------G- GFLANIYM-----------------------------------------------------------LD----------- --------FD--KA-ISKLIG----------------------------------------------------------- -----------------------------QYLDDN-ETILLIDACR----------YVDDLRLI------------IKAD KNEVSENK----IRE---------------------------VITTRFKS---------------------YYDE----- -------------LIL------Q-P------QK----TKVKKFSSKDGA--ISSKL------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|32473346|ref|NP_866340.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ----------------------------------------------------------QKVNSKMRPL------------ ------AHPS----VR-DQIISTAMMIL-VA---------------NAVETAQGDPR---QSIRSANEKGMVSYGHRLFC DNED-----DL------------------LNYRWGNSTVYREYFQDYQTFIRRPQAIVTDTFPDGDTAWAV--------- -----------------------------------------ITA--------DLSQFY-DRVR----------------- -PSL--LHSK-L--------------RNLLG-----------DVADSKLLDAL---------STFFNW------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------SWHADDKAEAIKYATNSSPEIAG-YDQIALP-----QGLVS------------------S- GFFANLVL-----------------------------------------------------------ID----------- --------FD--RD-MVSKYR----------------------------------------------------------- -----------------------------ETRVDD--CFQYVDYCR----------YVDDMRFV------------VRLD SRFAAKSSAA--QEQ---------------------------ILKESFSD---------------------MLKT----- ----A--P----GLLA------K-E------SK----LSVLLGTNAGGGSTRFSMAM----------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|87213279|locus|VBITheSac207413_0611|_extraction hypothetical protein [Thermoanaerobacterium saccharolyticum JW/SLYS485] KILIEVNIYMINNYFPEDECIRKIYEILNTINNDFLIISMCNILKWVKDEFKERRIDIVKLIDPIIHQIEPLHTILLSVE PAKFDDLGRL-----QRLVQSILNRIYNRYDYLKSNKYNYCMYNFPNSYFYKERRGKIYESYYGSIVTYNTTTSKMADSK ELFSSFFKARMSLKKEFPYDEI-AIKLFEYNLEDNINRFSKEILKGYKFNTDFIGYKVPKNEKDDRQK------------ ------VMDN----IF-NTIAGASFLDI-IG-IVIDREF-------S--------------------------------- ---SNCCGNRL------------------NKKLNTEYSYEYFWYGWYYKFMKKAFNKVLNKNNY---------------- ---------------------------------------YL-KL--------DIKSFY-TNIN----------------- -QNI--LYDKIIKLIPY---------KDSR------------------LKEFI---------NSLIKR------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------HIPYVNNGK-------------GLP-----QGSLT------------------S- GFLANLYL-----------------------------------------------------------DD----------- --------FD--KYFISKTND----------------------------------------------------------- ------------------------------------------GYMR----------YVDDIFI-------------FGK- -----TEEQ----IK---------------------------ELGKEAEN---------------------KLKD----- -------L----YLEI------N-K------EK----T---SMGDKSSLKNIYYDDKELDDFQKRLDRILHSI------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|47031718|locus|VBISynGly105927_0590|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Syntrophobotulus glycolicus DSM 8271] -------------------------------------------------------------------GARKFQKEAVIFD MCRERNLVHLWRDLKDEEYEV----------------GKYIR--FKV--------------------------------- -----------------------F---EP----------------------------------KERNI------------ ------SAPH----IR-DKTVQ----FA-VH-SVLKEVYKPVFIKG---------------------------------- ---SFA--CQE------------------DK----------------------------GNHRAVEHLQHNM--RLCKWK H---------------------------------GGGW-IL-KI--------DVKKFF-YSID----------------- -RDI--LKRI-LQKKIKD-------EK---------------------LLRLL---------NKIIDS------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------SP----------------EGEKGIP-----LGNVT------------------S- QDMANIYL-----------------------------------------------------------DK----------- --------LD--QYCVRFLKV----------------------------------------------------------- -----------------------------------------KYYTR----------YMDDVCI-------------VT-- ------------------------------------------PTKEQAQE-------------YLKKIKT-FLEE----- ----R--L----GLET------N--------QK----T---KIFPLE--------------------------------- ---------------QGVNAYGFKI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|42754073|locus|VBIBacXyl109951_4388|_extraction hypothetical protein [Bacteroides xylanisolvens XB1A] -----------------------------------------------------------------------TSTDCVEFY NDYQSALVRLW-------YSIIY--------------GEYVPDFSKV--------------------------------- -----------------------FIRTYP----------------------------------VYREV------------ ------FAAA----FI-DRVVHHWIALR-IE-PILEERFRE---QG---------------------------------- ---NVSKNCRK------------------GE----------------------------GCLSAVHYL-NNMIVEVSENY T---------------------------------ADAY-IF-KD--------DLFSFF-MSIS----------------- -KSL--VWEM-LNIFVRDNYKGDDIEC---------------------LLYLLAVTIFHCPQNKCIRR------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------SPVSMWDRLPSNKSLFHNDPDRGVA-----IGNLP------------------S- QLIANFLA-----------------------------------------------------------SV----------- --------YD--YFVMEILGF----------------------------------------------------------- -----------------------------------------MYYVR----------FVDDFCI-------------VV-- ------------------------------------------KSPEEILS-------------KVHLLDG-FLKE----- ----Q--L----LLRL------H-P------RK----L---YLQHYK--------------------------------- ---------------KGVLFVGAFI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|124485751|ref|YP_001030367.1_extraction_extraction ----------------------------------------------------------------------------IPLH M-----LDVLWI------MKV----------------GAYLDEELGD--------------------------------- -----------------------CCFGNR----------------------------------VHENL------------ ------CERK----NK-DTSSH------------LMKIYGK---QY---------------------------------- ---AFW----Q------------------NK----------------------------ALDFANKAIENNEIVD----- ---------------------------------------IL-TL--------DFKQFY-YHIE----------------- -GDFGEIEAH-ITKKSEEATFKGNLQT---------------------NLFL----------TNILLR------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------IHEKYFEIVKNYRTIKDREINKFLP-----IGLFS------------------S- GILANWHL-----------------------------------------------------------KE----------- --------FD--KAVIKDVSP----------------------------------------------------------- -----------------------------------------RYYGR----------YVDD-CI-------------FVFA RTQYNTKDKVNDNDS---------------------------PSKEEIIKKYLVDRDVQLTNYKIRLNDSRFTNS----- ----R--I----AVQN------N---------K----V---MLYHVD--------------------------------- ---------------PNHSTAILRLFK----------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >DEfid|184283092|locus|VBIHelPyl300427_0117|_extraction FIG00712798: hypothetical protein [Helicobacter pylori UM065] -------------------------------------------------------------------------------- -------------------------------------------GTPR--------------------------------- -----------------------YFAIPN------------------PLAYYNQCKILWDNWDKLKEYFKEKTDGNTHKI SRIHIQKIPK------------------------NKKIFQNDYSKDRVA------------------------------- --------------------------------------MRKSIFDMGHKDI---FCEGKLGRSI---------------- ----------------------------------RIGARYKLKA--------DISTCF-PSIY----------------- -THS--IPWA-IRGKEIAKKDKNH------------------------WSDEI---------DTQTRN------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------MNHKETT-----------GIL-----IGPHS------------------S- NLISEIIL-----------------------------------------------------------VA----------- --------VD--ED-LKNLRK----------------------------------------------------------- ---------------------------GQNKT--------EYRYIR----------HIDDYTC-------------YVN- -----SRDE----AE---------------------------QFTIDLAK---------------------CLKK----- -------Y----NISL------N-H------KK----T---KIFELPLMYEEEWINQLKIFKMDEYKGKIK--------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|32455447|ref|NP_862563.1_extraction_extraction -----------------------------------------------------------------------FNMESFADY LRSHD--LKTHFN------------------------GKKPLSTDPV--------------------------------- -----------------------YFNIPK-------------------------------NIEARRQY------------ ------KMPN-------LYSYMALNYYI----CDNKKEFIEVFIDNKFS------------------------------- --------------------------------------TSK-FFNQLNFDY---PKTQEITQTL---------------- ----------------------------------LYGGIKKLHL--------DLSNFY-HTLY----------------- -THS--IPWM-IDGKSASKQNRKK---G--------------------FSNTL---------DTLITA------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------CQYDETH-----------GIP-----TGNLL------------------S- RIITELYM-----------------------------------------------------------CH----------- --------FD--KQ-MEY--K----------------------------------------------------------- ---------------------------K-------------FVYSR----------YVDDFIF-------------PFT- -----FENE----KQ---------------------------EFLNEFNL---------------------ICRE----- -------N----NLII------N-D------NK----T---KVDNFPFVDKSSKSDIFSFF------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|24636606|dbj|BAC22947.1_extraction_extraction -----------------------------------------------------------------------FGLQDLYFL MKSED--LRGFFNFS-------D--------------FKKNVDTEPI--------------------------------- -----------------------YFNTPK-------------------------------NNYVRREY------------ ------KMPN-------VYSYLHLCFFI----EDNKEEFINIFENNVQS------------------------------- --------------------------------------TSK-YFNELNFNF---KFTKKIEQRL---------------- ----------------------------------LFGGNSILSL--------DLSNFY-HTLY----------------- -THS--IPWV-IHGKQNSKDNRYK---G--------------------FANNL---------DSLIQK------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------CQYGETH-----------GIP-----VGNII------------------S- RIIAELYM-----------------------------------------------------------CY----------- --------ID--KK-LIE--K----------------------------------------------------------- ---------------------------G-------------YKYAR----------YVDDIKY-------------PFV- -----SNTD----KE---------------------------GFLMEFNS---------------------ICRE----- -------Y----NLIL------N-D------KK----T---DVQTFPYRNNMQKVEIFSYL------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|17227201|ref|NP_478367.1_extraction_extraction -----------------------------------------------------------------------FNMSKLAEY ITYNPFYIKKFFN------------------------DRNIRSTEPI--------------------------------- -----------------------IFTIPK-------------------------------NNTSRREY------------ ------KIPN-------IYSYLNLMFFM----QENKKEFKEVFLTNKFS------------------------------- --------------------------------------TSK-FFSLPDFDF---KFTDNLKKTL---------------- ----------------------------------LYGGNHILNV--------DLSNFY-HSLY----------------- -THS--IPWV-IMGKKNAKKERNK---G--------------------FSNQL---------DKLITS------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------CQYNQTH-----------GIP-----TGNIL------------------S- RIISELYM-----------------------------------------------------------CY----------- --------ID--SE-MEN--K----------------------------------------------------------- ---------------------------G-------------YRYAR----------YVDDISF-------------SFN- -----FEEE----KD---------------------------KFYRDFNK---------------------LCMK----- -------Y----ELKI------N-D------KK----T---EVNDFPYIHPQNKDFIFNYF------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|110679729|ref|YP_682736.1_extraction_extraction -------------------------------------------------------------FTNENLFLSELRLDQMTGV QQ--NYLNRLRRPHN--------------------------NYTKPY--------------------------------- -----------------------LYSINR-------------------------------SRRSKNTL------------ ------GLIH-------PAVQLRIATFY----SEFEQTIIQACGRSTFSIRHPYEALRIYSKDSAKDVRKRWKLALPGEN V------GHAI------------------KTS------YTSSYFAYRKYLLLDKFFSSNEIIRL---------------- ----------------------------------EGKYSRLRML--------DVSKCF-FNIY----------------- -THS--ISWS-LKDKDFSKKNAKN-Y-S--------------------FEQQF---------DTLMQH------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------SNYNETA-----------GIL-----VGPEV------------------S- RIFAEIIL-----------------------------------------------------------QR----------- --------VD--VE-LERAV------------------------------------------------------------ ---------------------------SKRLK---LECGRDYDIRR----------YVDDFHL-------------FAN- -----DEDV----LD---------------------------KVEGVLAE---------------------ILET----- -------Y----KLFL------N-T------GK----S---EEVERPFVTGISRLKFEV--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|123441512|ref|YP_001005498.1_extraction_extraction -------------------------------------------------------------FNN----------DYFHGI LQLPNFF------FN--------------------------EVTIPY--------------------------------- -----------------------NYRIRK-------------------------------GENSFRTL------------ ------SIIH-------PIYQVKICDFY----KKYEHVMIHSCTKSPLSLRYPNRVGNYYYEKDFSSNRVSFK-----EG VVEFNRDGFEV------------------QSQ------TSSSHFSYKKYPFVYKFYESYEFHRL---------------- ----------------------------------ERKFSKLMKL--------DISKCF-GHIY----------------- -THS--ISWA-VKSKEYAKKNTSY-N-H--------------------FEGLF---------DKIFQN------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------SNYGETN-----------GIL-----IGPEV------------------S- RIFAETIF-----------------------------------------------------------QR----------- --------ID--LN-IINKL------------------------------------------------------------ ---------------------------SDVFE---LELEKDYSIRR----------YVDDFFV-------------FST- -----DNIN----LD---------------------------KIEKVVST---------------------ELEK----- -------Y----KLYL------N-E------SK----K---EIMDRPFITGTTIAKSEI--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|50119997|ref|YP_049164.1_extraction_extraction -------------------------------------------------------------FSNDGFYINSHLSIQSNG- -QYNIFFRKIIAPKNLSNDAERA--------------KKQAEQTYPL--------------------------------- -----------------------KFKIFK-------------------------------DESKLRTL------------ ------SLIH-------PRSQYNYVEFY----HYFSSAITSLCSQSKLSIRAPVKIANSFYTKKKIVEKIAYK------- KTKIDTLDSEI------------------WSK------HASSFFSYKGYDRIYKLFENGQYITL---------------- ----------------------------------EKKFNVMWSL--------DIANCF-DSIY----------------- -THT--ISWA-IKNKEFIKANLKKGK-Q--------------------FGDEL---------DSIMQR------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------SNNNETN-----------GIP-----IGAEF------------------S- RVFAELIF-----------------------------------------------------------QN----------- --------ID--DC-IVKKL------------------------------------------------------------ ---------------------------EEVYQ---YKLKTHFEILR----------YVDDYFI-------------FAI- -----NESV----AG---------------------------RVHGIISD---------------------ELGK----- -------Y----NLYL------G-D------KK----L---SKLERPFLTNKSEMVIEA--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|151560524|gb|ABS14022.1_extraction_extraction -------------------------------------------------------------VSNDGFYRNLHKLDKLGSL AK--SLVERL------------V--------------IPAKEFTIPY--------------------------------- -----------------------NFKIVK-------------------------------DEKSARRL------------ ------SLPH-------PRSQAEICLFY----KNNTALILHYCSTGDLSIRRPTRDAAKYYISNPRENDSSYK------- SDKVTTLNVEL------------------LEK------NPASYYSYSGFNRLHQFFNSRHYNTL---------------- ----------------------------------ERKFSTLASF--------DIARCF-DSVY----------------- -THS--IAWA-TKSKEEAKETAHSA--T--------------------FGGMF---------DRLMQS------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------LNYNETN-----------GIV-----VGPEV------------------S- RIFAEIIL-----------------------------------------------------------NK----------- --------ID--SN-LVQRL------------------------------------------------------------ ----------------------------QDLS---LIKDIDYSCYR----------YVDNYYV-------------FCN- -----ENSV----LE---------------------------KIKSELQE---------------------CLDQ----- -------F----KLSL------N-D------AK----T---EIISRPFFTRKSMA------------------------- -----------------------IYEA----------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|53718224|ref|YP_107210.1_extraction_extraction -----------------------------------------------------------------------FGIKPDVGI AT--EVVTEWGRQKT--RRFVPI--------------GKCEMATIPF--------------------------------- -----------------------NFRVTH-------------------------------NL-DGRIL------------ ------SVVH-------PRNQVAVASFY----ATHSALIIYHTSVSEFSIRRPVSVSRYAYF-----KDKLHE------- ERMD---AVAG------------------LEEEDREYEQLGSYFVYRKFRNIHRFFESYEYHEC---------------- ----------------------------------EKKYDAMVQF--------DVSKCF-DSIY----------------- -THS--LPWA-VLGKDQTKFSLKQSA-T--------------------FGGQF---------DALMQN------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------LNQKETN-----------GIV-----TGPEF------------------S- RIFAEIIL-----------------------------------------------------------QS----------- --------VD--VE-LMTRL------------------------------------------------------------ ---------------------------AQESN---LTHKIDYEIFR----------YVDDFFV-------------FYN- -----AEFA----QL---------------------------KIFETLQE---------------------VLKS----- -------K----KLSV------N-T------SK----I---KRYQKPIITEITIAKEWI--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|87348283|locus|VBIMetSp147316_1517|_extraction hypothetical protein [Methylophaga sp. JAM7] ------------------------------------------------------------------TMQLLFGIEDKGSI ET--EILSEWGRTKT--RRYVRL--------------KKCNMTTIPF--------------------------------- -----------------------NFQISH-------------------------------NL-DGRTL------------ ------SVVH-------PRNQVAVASFY----AAQSALIVYYTSLSDFSIRRPVSVSRHAYF-----NDRLHQ------- EKLD---SVAG------------------VEEEDKEYEQLGSYFVYKKYRNIHQFFESYKYHRC---------------- ----------------------------------EKKYDAMVQI--------DVNKCF-DSIY----------------- -THS--LPWA-VLGKSQTKFSLSESKHT--------------------FGGLF---------DTLMQN------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------LNHNETN-----------GIV-----IGPEF------------------S- RIFSEIIL-----------------------------------------------------------QA----------- --------VD--VE-LSKRL------------------------------------------------------------ ---------------------------SEDSN---LTHKVDYEIFR----------YVDDYFV-------------FYN- -----EEST----QL---------------------------KIIETLQA---------------------FLKV----- -------N----KLSI------N-T------SK----I---KHYQKPIITEITIAKNRISTLLNNEINPATEEVVVEDPE EAGAIKLACPVNSNRLIIRFKTVIKETSVTYGELLNYTLAITENKIEKLFQSYLACDK---------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|87263722|locus|VBIFleLit174749_0251|_extraction hypothetical protein [Flexibacter litoralis DSM 6794] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------------------------------------------------------DFREL------------ ------CVIH-------PFNQLKMVDFY----EEYKSLIIYYSSLSPFSIRKPSKVANFTFY-----NDILHK------- KNEDKDLQYSQ------------------IEQDDSEYENLKSFFSYRKYSNIHKFYESYQFHRS---------------- ----------------------------------EKKYNKLLKI--------DVTKCF-DSIY----------------- -THS--IAWA-LFNKDIVKSNIDDSLST--------------------FGGEF---------DRLMQD------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------LNANETN-----------GII-----IGPEF------------------S- RIFAELIL-----------------------------------------------------------QQ----------- --------ID--IN-IHESLK----------------------------------------------------------- ---------------------------SDDSNGKYLVHKVDYEIFR----------YVDDYFI-------------FYN- -----KEED----KE---------------------------RITETFKL---------------------ALKE----- -------Y----KMYL------N-D------KK----S---IQYSKPIITEISIAKQKINDLFNNHL------------- -----------------------ILKEKENQGEPTNPILYFSSNNV---------------------------------- -------------------------------------------------------------------------------- -------------------------- >Gfid|35291935|locus|VBILegLon159544_1945|_extraction hypothetical protein [Legionella longbeachae NSW150] -------------------------------------------------------------------------------- -----------HLLARTINEWLPQGIQSLIE------GTY------T--------------------------------- -----------------P-RFL-KRYYFK-----------------------------------DEML------------ ------DQLY----LA-DRVLQNLLLQQ-LK-PTFPYVM-------N--------------------------------- ---PNCY---H------------------VH----------------------------GPSGVQLAAQRIRETLATK-- ----------------------------------HYKY-II-RA--------DIKSYY-KSIQ----------------- -HHV--LIED-IKRYY----------FDTK------------------VQLML---------EQIVRN------------ -------------------------------------------------------------------------------- ------------------PIETPRGYKNPDN------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GIA-----LRGPL------------------S- QFFSALYL------------------------------------------------KPLDDAF----DT----------- --------LD---------------------------------------------------------------------- -----------------------------------------VTYLR----------YQDDIII-------------LCH- -----SKRQLERCKR---------------------------RLMTILKE------------------------------ ----R-------HLQL------S-R------KK----T---RIGAIE--------------------------------- ---------------GGFHFLGINY------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacteroidetesfid|22844934|locus|VBIParDis29947_0874|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Parabacteroides distasonis ATCC 8503] -------------------------------------------------------------------------------- --GKKAIIKFEADLEKNLSDLL----YSFEN------GTF----VTS--------------------------------- -----------------PYRFM-TVHEPK-----------------------------------KRLI------------ ------GMLP----FP-DHVQHWAMLNE-VE-DYFTRSF-------S--------------------------------- ---AYTYGGVK------------------GR----------------------------GPHAYMRMIRKVLRKYPER-- ----------------------------------TTDY-LL--C--------DIHHFY-PTVN----------------- -HPV--LKSQ-LRTRI----------KDNH------------------LLRRL---------DEI--------------- -------------------------------------------------------------------------------- -------------------IDSVEG----DT------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------------------GMF-----PGTKL------------------A- QFFSLVYLYLFDHDLKRCFHVGECPALVEYYTKRYIEESIATAKTEHDYEELSKGIQYLSDRFKGYLNR----------- --------LD---------------------------------------------------------------------- -----------------------------------------FCY-R----------LADDVLI-------------L-H- -----EDTVFLHLVI---------------------------EWIGLYYA------------------------------ ----N-------ELRI------GLN------PR----W---KIGHVT--------------------------------- ---------------DGVDTGGYVH------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|149369910|ref|ZP_01889761.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------SIDYYTA-----PK----------------KFSPI----RSEKQ------LVISNNFTASNYKLS ----------KVNYFI-QASVEIHLISI-LWILYEGYSLHKMYSK----------------------------------- ---NN-YAYNL--EL--ES----------GNGE--VVDGLRLFKPYFEQYQKWRDNGIEMAKNILDEDIDV--------- ---------------------------------------VILSI--------DIKEYY-YNINID--------------- -KVF--Q--DKLSKDIKGSLNKRKLFFTDL------------------LFDIM---------RAYQKI------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------CP--H---DSNNILP-----IGLLS------------------S- GILANYYL-----------------------------------------------------------KD----------- --------FD--EE-IKSELA----------------------------------------------------------- ----------------------------------------PAYYGR----------YVDDIQIV------------LSNT KLDF-------DSSS---------------------------NDSNVIDKYLNKFLVKDKSFC----------YK----- ---------LP-KIKI------Q-Q------DK----I---LLYAFESKESKAVLEMFK--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|60682764|ref|YP_212908.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------SDNYQTD-----LD----------------KLVSIGYKLKKKES------TIVTNIPHTGKIEVE ----------SYNILI-DAPIEIHIISV-LWLIVAGKELCKYVNE----------------------------------- ---NN-YAYKL--LLLDESYLPMTDIKIETNKENYVVTGLQLYEPYFIGYQNWRDNALNSATKLLDDNKDA--------- ---------------------------------------TILSL--------DIQRYF-YSVRIDLDSIKNRCSHSNKQI EKCF--HLLQIINKTYTSKINK--------------------------LLDI---------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ------------------------------------PLTN---DELNILP-----IGLLS------------------S- GLLGNLYL-----------------------------------------------------------ED----------- --------FD--KT-IKEELN----------------------------------------------------------- ----------------------------------------PAYYGR----------YVDDILFV------------FSDR KVKL-------EVNN---------------------------PIHDFIDRY---FIKLDENIT----------YL----- ---------LPVKLKI------Q-S------EK----V---ILEHFNHKESRAAINIFK--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|160944232|ref|ZP_02091461.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------SISCRLL-----PK----------------KIMDT----KEEKNCDGTKGVLISGVP-AKQVKVD ----------EIQYFI-DMKIEGMILGV-LWIMMIGWQLDKSYT------------------------------------ ---NC-YGNRIRKTLYNE-----------FSKE--PTFSPALFEPYFQNYESWRDQALTLAQKCMQQD-DV--------- ---------------------------------------LIFTL--------DFKRYY-YSVDVSEEFMKEILTGVKGK- ---------SDYNEEYLSRIN-------DF------------------VYCVI---------KRYSQI------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------YK-EH---SGERILP-----IGFLP------------------S- GVLANECL-----------------------------------------------------------KK----------- --------FD--TA-ILDGWN----------------------------------------------------------- ----------------------------------------PLYYGR----------YVDDILLVEKVEEGSRIAQKAQDG KLEFYEAFEYYTVNN---------------------------------SRWTGKENRNDECGT----------YI----- ---------LPEKIQL------Q-A------SK----I---KLFYFQRGQSDALITCFK--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|163723024|ref|ZP_02130561.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------KISCWCL-----PK----------------AINNS----EPEQNGDS---CFISNIRTERQYEVN ----------SVNYFI-KAPIELFILDV-IWSMKVGVLLDRRLEE----------------------------------- ---YC-LGNRL------EY----------AGHELPMDENGKLFKIYHRQYSMWRDTAIDKAKKALENGTDI--------- ---------------------------------------MIIGL--------DIMQCY-YHIKVDWNSIEAVVANDYFGM SLCT--IL-KRISTKYIKKLRK-----------------------------------------NTEIT------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------HPTVM---AEDEGLP-----IGLSS------------------S- GIIANWAL-----------------------------------------------------------NI----------- --------FD--QA-VRSKLS----------------------------------------------------------- ----------------------------------------PLYYGR----------YVDDILIV------------LQSP QI---------NAPD---------------------------GCTSAIDKLFIETQLLDDNIV----------YS----- -------LELP-SLKI------Q-P------NK----L---IFQYFDAKHSHAGLKEFS--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|54293035|ref|YP_125450.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------LGIPRLI-----PK----------------KMTPNLKGEEINGYLINGHIHFSNP-----AREFI SLKENYNLDAEFRVVG-DFPIDTHIISA-LWINMIGYKFDAVLSN----------------------------------- ---NV-YSSRLRRINSE----D-------CQSNLFHISAINSFEPYFHRYQKWREDGLKAIRDNLHNSRNV--------- ---------------------------------------IAASL--------DLENFF-HNID----------------- -PSF--LSSEAFVKELIGDLSEDEAEFNVQ------------------MSQFL---------KRWSDK------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------ASSLK---PFKGGLV-----VGLTA------------------T- RVISNIIL-----------------------------------------------------------YK----------- --------FD--KL-VKEKIT----------------------------------------------------------- ----------------------------------------PIHYGR----------YVDDLFLV------------ILDP GNVR-------NIVE---------------------------FMDYLKNRLGEDYLKLGKDYK----------------- ----------KSIFQF------Q-T------EK----Q---RLFVLEGQAGCDLLDSIE--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|114045853|ref|YP_736403.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------LGNFRLL-----PK----------------KLSSDKKSDTTE----NGHVHFSKP-----ERAVE SLFNNYDIVPEFRIVG-DFPVETHIISA-LWINMIGRQFDAKLDY----------------------------------- ---SC-YGARLKRIRNDELFTD-------GDEKPFHISSIGSFVPYFQPYQKWRSDGLKAIRDELEKDRDI--------- ---------------------------------------IAVSL--------DLKCYY-HFID----------------- -PLA--ISGETL--EL--KLTDEEEAFTEQ------------------LSKFL---------SNWAEE------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------ASEFKTVGKINGGLV-----IGLTC------------------S- RIISNALL-----------------------------------------------------------YK----------- --------WD--RL-VLEKLS----------------------------------------------------------- ----------------------------------------PVHYGR----------YVDDMFLV------------LRDT GTIS-------NSLD---------------------------LMNLIQARMGNK--KVWENQG----------------- ----------K--ITL------Q-S------DK----Q---KLFILQGRAGLDLLDSIE--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|160876603|ref|YP_001555919.1_extraction_extraction -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------IGDFRYI-----PK----------------ELDDNNWNN--K-----VNYRSVDPITDWKQRFKE NSNKRLN--VKYRQIL-CPSVEYQILSA-LWILKVGHLFEAKLDK----------------------------------- ---ELSYGNRLRRRSGLVPDSD-------SLQDQLNLDASGLFSPYFSAYKNWRGNGLNAMKKLISEGHDV--------- ---------------------------------------TAITM--------DLAGFY-HNAS----------------- -PNF--LLRPSFLRKLGLSLSSDERKFTRL------------------ILESI---------NNWYLT------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------TPDY----RTDGALP-----VGSSI------------------S- KIISNVLL-----------------------------------------------------------YE----------- --------LD--KQ-ISEGLE----------------------------------------------------------- ----------------------------------------PEYYGR----------YVDDIFLV------------FKTP DEEL-------TGDS---------------------------ILSHMSKHV-E-CLKINRVKG-----RLRFVY------ ---------QDSDLKF------T-A------SK----Q---KIFSLSSKHGLDFINQIS--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bfid|42358657|locus|VBIHerSer153339_0006|_extraction hypothetical protein [Herbaspirillum seropedicae SmR1] -------------------------------------------------------------------------------- ----------------------------------------------------------HFHALAFTEYEQKLDENLRGLL TILLDDASPWSRLDIQGDYAYL-----PK----------------SVDCEPWENGHE-----GHFRALNPLLDWQQRFQE ---KRIPAIAKLRLVI-RPTVNFQIISA-LWIIKVGHKFDAVINT----------------------------------- ---EVSHGNRLRRRSRKIDEKW-------SARGPLNMTAAGLFAPYFSAYRKWRETGLNRMEESLKQGKDI--------- ---------------------------------------LAITM--------DLEQFY-HRVA----------------- -PTF--LLRKSFLQSIRLKLTTFERQFTTD------------------LLDAI---------ALWYEQ------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------TPDFKV--RPQGALP-----VGLSA------------------S- KIISNVLL-----------------------------------------------------------AN----------- --------FD--NA-ILQKIK----------------------------------------------------------- ----------------------------------------PIYYGR----------YVDDIFLV------------FENT KLAV-------NAKE---------------------------VTQRIAKAM-HPMLTIPENQEGSPSIRLKIPYA----- ---------MDSELIF------A-G------TK----Q---KIFSLSSPHGLDLIQHIR--------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115583420|locus|VBIMicSp236384_0688|_extraction hypothetical protein [Microcoleus sp. PCC 7113] ------------------------------------------------------------------------------SM FYYKQLASQATALETEEYFNKRVANNLFYGLEREFAVYDY---------------------------------------- -----------------------VI--PK--------------------------------------------------- ---ASLGLRNQKFFTYPMRVLYYSIGLYLLK---------------LSQEL---LQNYVKNNERFKCYYGGNLLFNKDSL VIKHETTFYKP------------------------------------------------NYKEFKKQVRKQATNDV---- ----------------------------------DKKL-VI-KL--------DIQNYF-DNIS----------------- -ITT--LLNK-LDRLIKPSIKESLRFDAST--------------------------------KEQITF------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------YFNYISGKQGGIPQS---DNDII------------------S- GFIGHVYL-----------------------------------------------------------LFSDLI------- --------IDTEISKYPTEIK----------------------------------------------------------- -------------------------------------EHKVI---R----------FVDDIFI-------------IINF -LESTKQKKKEAIAD---------------------------SLTSQIAD---------------------VLHY----- ----N--S----GLRL------N--------TK----TKLYWLNNLEQKEELLKDL------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >cianobacteriafid|115662334|locus|VBIChaMin231992_0824|_extraction hypothetical protein [Chamaesiphon minutus PCC 6605] -------------------------------------------------------------------------------D FEIIRRESSLDIITPEEFYRNYIKNDICFNYPYLFSSIPY---------------------------------------- -----------------------SV--PK--------------------------------------------------- ---GDGGVRKFHFLETHLRILYYSLGFYFLD---------------LTKDVRIELKG-IQDRSFMYTHYGADI--NPASL --GRSDIDYKK------------------------------------------------DYQKFTSKIRKTARNAVK--- ----------------------------------DGKIAVL-HL--------DIQDFF-HSIE----------------- -HSL--LTQV-LREQALPEAQLRLKYNEQT--------------------------------RLTIRE------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------ILFLIMQRSEGLPIS---QHNII------------------S- NLLSHLFL-----------------------------------------------------------HH----------- --------LDCYIREIQLELG----------------------------------------------------------- -------------------------------------SSFLLTYHR----------YVDDMFI-------------TVKF PIGESNQSIGTKMLD----------------------------ISTRIGE---------------------YLSG----- ----N--L----ALSL------N-P------LK----TRLDIISSEDEVDSLIERS------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|19469398|locus|VBICloKlu111549_3999|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Clostridium kluyveri DSM 555] -------------------------------------------------------------------------------- ---GENIGNVVTEK------------------------------------------------------------------ ------------------VHSFNN-----------------------------EGVHKIIEKLKNESYCPESLEKSDKQN KKHSQIKGLY-------DNLIQQIIVEI-LQ-SIYNVNFS---------------------------------------- ---VNSHAFIP------------------NK----------------------------NCHTALYKI----KTTCS--- ----------------------------------GARW-AV-KG--------NIESCF-YNIN----------------- -YDF--VIKS--------------------------------------LCEKISDGR----FINLIRK------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------------------------FLAAGYTKEKKNCDTWSGISQRESLA- NILINIYL-----------------------------------------------------------DK----------- --------FD--KYINKEFGQ----------------------------------------------------------- -----------------------------------------VKYTR----------YLDNFIIF---------------- --------I---------------------------------SGTKDLAE---------------------YMIE----- ----K--I----KVFL------K--------D-------KLNIETTE-------------------------------EE IFIIDLNKQRVK-------FLGYEI------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|42885534|locus|VBICopCat158046_1802|_extraction Retrontype RNAdirected DNA polymerase (EC 2.7.7.49) [Coprococcus catus GD/7] -------------------------------------------------------------------------NKDFKRT LEAESEVAKMLTQRIINRDLQLK--------------------------------------------------------- -----------------PIRQFQRI----------------------------DGL-----TQKLRDICQESPEQ----- ------------------QVFEYIGVYA-LK-PLFRAKIL---------------------------------------- ---PIQYGSIP------------------NK----------------------------GGVAGKRKIERLLRKKFH--- ----------------------------------GKVV-AL-KG--------DVTKAY-PSVT----------------- -IPV--VMEM--------------------------------------LRRDIGKNKVLLWFLGALMS------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------------------NYPGNHLCIGGYL------PAW----------- --LFNYVM-----------------------------------------------------------SY----------- --------VL--RYI-YEQAQ----------------------------------------------------------- -----------------------------------------IRRGKRNRLVYAVVCYADDFTIYGDL------------- -----SKLK---------------------------------KAMKKATS---------------------WAHD----- ----K--F----GLKI------K--------DI----WQFYQVASFD-------------------------------EE RENLEERKKGSKKRTPGVDMMGYVV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >clostridiafid|43016558|locus|VBIFaePra148460_2585|_extraction hypothetical protein [Faecalibacterium prausnitzii SL3/3] -------------------------------------------------------------------------NHECRKI VDAIDAVAERETQKIRDECLDLK--------------------------------------------------------- -----------------PVRQFKRI----------------------------DGI-----KMKERDLCQESPEQ----- ------------------QVHEYILVHA-LQ-PLLHAKLL---------------------------------------- ---PMQFGSIP------------------GK----------------------------GQVAGTRQIERIVRKKIL--- ----------------------------------GKLD-AV-KG--------DVHKAY-PSTT----------------- -IVC--VITL--------------------------------------LKRDIGKNKKLIWYAGAVTE------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------------------NYPDGVLLIGGYF------STW----------- --AFNYVM-----------------------------------------------------------SY----------- --------VL--RYL-LSLKQ----------------------------------------------------------- -----------------------------------------VRRGTGTRLVREIVCYADDFVIIGHA------------- -----SQLM---------------------------------KAMKKATH---------------------WVKS----- ----T--L----GLEL------K--------QA----WQQVRFASFE-------------------------------EE KRVKAARVQGSKHRTPALDMMGFAV------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|33595199|ref|NP_882842.1_extraction_extraction -----------------------------------------------------------------DIWIDEADLAHFE-- ---------------VHLGHELRGLGDDLLSGR------F---------------------------------------- -----------------RMSPIRPMVFPK--------------------------NPDGDGNPRVRQY--FHFTVRDQ-A ------------W-VAVVNVLGRYIDEQ-MPVWSYGNRLFRSAWIEE--------------------------------- ---DIHGNKIRK--IGP-----------------YRHSSGRIYRLFQQSWPLFRRHIALAVSAAAHGYSKVDSLDDDERE ELGFQRR--MHRANQCPFVLADYWTNLPTGPNESDVYWASV-----------DLEKFY-PSIP----------------- -LTA--C----VDAISQFVPAELRPEVQRL------------------LKTLTQLPLNLDGWT--DAE------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- ---------------------------------LKHIELDESRKTFHKIP-----TGLMV------------------S- GFLANAAL-----------------------------------------------------------LP----------- --------VD--QE-VQK-TL----------------------------------------------------------- -------------------------------------PRGRVAHFR----------YVDDHVI----------------- -----LTKTFD---------------------DLITWIDHYKDVIDNLGS------------------------------ ------------GASI------N-P------AK----TEPKALGELLGTSDTSKRF------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|119357506|ref|YP_912150.1_extraction_extraction -----------------------------------------------------------------DAWINEIELAEFE-- ---------------LELESNLKSIAEEMKQGR------Y---------------------------------------- -----------------KLRPLKPMAFPK--------------------------NPSKDGKQQVRQY--FNVAVRDQVA ------------W-TAVVNIIGPFLDQK-MPTWSYGNRLFRSIWTES--------------------------------- ---DEKGVRRQK--IGR-----------------YRDSSGQIYLPFLQSWPRFRRHVFLSTLAMTGQTGKYSTTKED-VE ELECQES--LPEKYKCPFIQKKHWAIKRPREGKKKLFWCSL-----------DLEKFY-PSLK----------------- -LDI--I----LRNITEFLPAELRVDADNL------------------IESMMLFEVDTANWSTDDLE------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------------------ALNEIGLSAKERDFKGIP-----TGLYV------------------A- GFLANAGL-----------------------------------------------------------FK----------- --------VD--LE-VDK-LL----------------------------------------------------------- -------------------------------------KNRNVAHFR----------FVDDHIV----------------- -----LAYSFN---------------------ELVKWVEEYLKLLDSLET------------------------------ ------------GATV------N-R------DK----IEPEALAQYFGAKENPEK------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >gi|69935276|ref|ZP_00630256.1_extraction_extraction -----------------------------------------------------------------DGLFDQAEIAAFE-- ---------------LNLEAELTAIRHDFETGN------W---------------------------------------- -----------------RNRPIRLVPQPK--------------------------KPDKEGKPRLRQY--FEISVRDQVA ------------W-IALVNVLGPELDQR-MPAWSYGNRLYRAAWYEE--------------------------------- ---ETAEGQNSKLNIGP-----------------YRHSAGYLYRHFKHSWPLFRRHISLTARKMVGK-----SIEED--- DLDAGERLALEQGDELAYFHPTFWN--RPASGDNTLYAASL-----------DLSKFY-PSVQ----------------- -VSA--IQRG-FDTLVEGFSDEPRLSA--L------------------LSAMLRFEVDDGGL---DQA------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------------------LKAKVDPIAPVGSFDGIP-----TGLFV------------------G- GFLANVAM-----------------------------------------------------------LP----------- --------ID--RE-VEKLLL----------------------------------------------------------- -------------------------------------QHRNIAHFR----------FVDDHEF----------------- -----LAYDFD---------------------RLLEWMTEYAHLLARHGI------------------------------ ------------GVEI------E-R------EK----YSPAELKWLLHPDETVEPK------------------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Afid|115125822|locus|VBISinMel272116_2443|_extraction hypothetical protein [Sinorhizobium meliloti GR4] ----------------------------------------------------------RIFGIPHPAFIRDAGLF-YE-- ---------------KHWPSLLPLVDGSFGSASKPIFQLV---------------------------------------- -----------------GNRHVRITPHSE--------------------------LP----RIRLRAFSRFKYCLITDVA RFFPSVYTHSFPWAINGKNAAKQDMNSQ--SSFVFGNRL---DFILR--------------------------------- ---NSQSKQTIGIAVGPDTSKVASELLMAAVDREFVRRSGR-------ARPTYVRHV------------------DD--- ----------------------YWIAGNTHE---------------------ECEKHL-QNLR----------------- -ASL--REYE-LDI----------NEAKTK------------------IVSMRQVFADDWPFEF-DKE------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- --------------------------------IVASLELGANPNEALGVL-----TSMIE------------------R- ATTTN--------------------------------------------------------------------------- ----------------DDGLI----------------------------------------------------------- -------------------------------------RHA----IR----------VIDERRL----------------- -----WSRDWDLLQHFLAQCAVQFPHTTDYVARVIAW------------------------------------------- ------------------------R------HR----VWPDDVDAGLW-------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------- >Bacillifid|18848629|locus|VBIBacCer122868_5788| FIG01226649: hypothetical protein [Bacillus cereus AH820] --------------------------------------------------------------------MDYLSLEEVCD- -------------------------------------------------------------------------------- -----------------------RVGLTK--------------------------------------------------- -----------------NQLGYLIKYKH-IE-PINMATW----------------------------------------- ---KADGGYRF--------------------------------------------------------------------- ----------------------------------EQED-VK-----------KLEDLYKDSLT----------------- ------LKEA----------------------------------------------------AEFLNK------------ -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- -----------------------------------SKTYVHNAAKDGILPFKEIAKGKST------------------E- ----RLYL----------------------------------------------------------KKD----------- --------LEVFKEKIENKSK----------------------------------------------------------- ----------------------------------EESKEKKQHLSL----------YLDEKVV----------------- ------------------------------------------EAIKKKAA------------------------------ ----KKGY--------------NGY------KK----F------------------------------------------ ----------------AEDILSAEVKEEIEE------------------------------------------------- -------------------------------------------------------------------------------- --------------------------